Detecting Speech and Music in Audio Content
The Netflix TechBlog
NOVEMBER 13, 2023
To evaluate and benchmark our dataset, we manually labeled 20 audio tracks from various TV shows that do not overlap with our training data. One of the fundamental issues we encountered while annotating our manually labeled TVSM-test set was how to define music and speech. What constitutes music or speech?