Detecting Speech and Music in Audio Content
The Netflix TechBlog
NOVEMBER 13, 2023
To evaluate and benchmark our dataset, we manually labeled 20 audio tracks from various TV shows which do not overlap with our training data. Results We evaluated our models on four open datasets comprising audio data from TV programs, YouTube clips and various content such as concert, radio broadcasts, and low-fidelity folk music.
Let's personalize your content