Speaker Clustering via Bayesian Information Criterion using a Global Similarity Constraint
- Main information
-
In this paper we proposed a global similarity constraint that improves speaker clustering as standardly performed using the Bayesian Information Criterion (BIC). The novelty of our approach lies in the fact that it exploits the hypothesis that audio segments belonging to the same speaker cluster should demonstrate similar global behavior, i.e. exhibit approximately the same pattern of similarity and dissimilarity with the all other segments. Every segment is represented by a global similarity vector whose components encode the BIC-based local similarity between that segment and each of the other segments to be clustered. Speaker clustering is performed bottom up using the BIC to compare each pair of segments and determine if their similarity is high enough to merge them. We use the global similarity vectors to constrain merging to segment pairs that have approximately the same patterns of global similarity. The evaluation, performed on audio data from 4 different German-language radio programs, shows that the proposed approach represents an improvement on the standard BIC clustering. WP5: Detection, Extraction and Annotation of Knowledge. IAIS Konstantin Biatov, Martha Larson 2007-03-15 17:38 Request for more detail
- Access and Use Rights
-
Condition of use defined in response to "need to access request". Copyright Fraunhofer Institut Intelligente Analyse- und Informationssysteme. Closed, attachment is not public
This item is not available for public download. For further information click on Request for more details above