Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System

Open Access

Author(s) - Xavier Anguera, Chuck Wooters, Barbara Peskin, Mateu Aguiló

Publication year - 2006

Publication title - Lecture Notes in Computer Science

Language(s) - English

Resource type - Book series

SCImago Journal Rank - 0.249

H-Index - 400

eISSN - 1611-3349

pISSN - 0302-9743

ISBN - 3-540-32549-2

DOI - 10.1007/11677482_34

Subject(s) - speaker diarisation, computer science, speech recognition, segmentation, artificial intelligence, speaker recognition

In this paper, we describe the ICSI-SRI entry in the Rich Transcription 2005 Spring Meeting Recognition Evaluation. The current system is based on the ICSI-SRI clustering system for Broadcast News (BN), with extra modules to handle the different meeting tasks in which we participated. Our base system uses agglomerative clustering with a modified Bayesian Information Criterion (BIC) measure both to decide which pairs of clusters to merge and to determine when to stop merging. This approach does not require any pre-trained models, which increases robustness and simplifies the port from BN to the meetings domain. For the meetings domain, we added several features to our baseline clustering system, including a “purification” module that tries to keep the clusters acoustically homogeneous throughout the clustering process and a delay-and-sum beamforming algorithm that enhances signal quality for the multiple distant microphones (MDM) sub-task. In post-evaluation work, we further improved the delay-and-sum algorithm, experimented with a new speech/non-speech detector, and proposed a new system for the lecture room environment.
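The abstract gives no details of the modified BIC measure, so the sketch below illustrates only the general idea of BIC-driven agglomerative clustering, where one score both selects the pair to merge and decides when to stop. It rests on assumptions that are not taken from the paper: each cluster is modeled by a single diagonal-covariance Gaussian, the score is the standard delta-BIC with a penalty weight `lam`, and the names `log_det_diag`, `delta_bic`, and `agglomerate` are hypothetical.

```python
import math
import random


def log_det_diag(frames):
    """Sum of log per-dimension variances (log-determinant of a diagonal covariance)."""
    n, d = len(frames), len(frames[0])
    total = 0.0
    for k in range(d):
        mean = sum(f[k] for f in frames) / n
        var = sum((f[k] - mean) ** 2 for f in frames) / n
        total += math.log(var + 1e-6)  # small floor avoids log(0)
    return total


def delta_bic(a, b, lam=1.0):
    """Standard delta-BIC (assumption, not the paper's modified measure).

    Negative values mean one shared Gaussian explains both clusters well
    enough, so merging is favoured.
    """
    n_a, n_b = len(a), len(b)
    n, d = n_a + n_b, len(a[0])
    gain = 0.5 * (n * log_det_diag(a + b)
                  - n_a * log_det_diag(a)
                  - n_b * log_det_diag(b))
    # Two models carry 2*d more parameters than one (mean and variance per dim).
    penalty = 0.5 * lam * (2 * d) * math.log(n)
    return gain - penalty


def agglomerate(segments, lam=1.0):
    """Merge the best-scoring pair repeatedly; stop when no pair favours merging."""
    clusters = [list(s) for s in segments]
    members = [[i] for i in range(len(clusters))]
    while len(clusters) > 1:
        score, i, j = min(
            (delta_bic(clusters[i], clusters[j], lam), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        if score >= 0:  # stopping criterion: every remaining pair looks distinct
            break
        clusters[i] = clusters[i] + clusters[j]
        members[i] = members[i] + members[j]
        del clusters[j], members[j]
    return members


# Toy data (synthetic, for illustration only): two "speakers" alternate
# across four segments of 200 two-dimensional frames each.
rng = random.Random(0)
segments = [
    [[rng.gauss(m, 1.0), rng.gauss(m, 1.0)] for _ in range(200)]
    for m in (0.0, 5.0, 0.0, 5.0)
]
print(agglomerate(segments))
```

On this toy data the alternating segments should end up grouped by speaker, because the same score ranks candidate merges and halts the loop, and no speaker models are trained in advance. A real diarization system would operate on acoustic features and richer cluster models than this sketch assumes.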
