Two‐stage spoken term detection system for under‐resourced languages

Open Access

Author(s) - G Deekshitha, Mary Leena

Publication year - 2020

Publication title - IET Signal Processing

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.384

H-Index - 42

ISSN - 1751-9683

DOI - 10.1049/iet-spr.2019.0131

Subject(s) - computer science, matching (statistics), template matching, feature (linguistics), false positive paradox, term (time), spoken language, artificial intelligence, speech recognition, query by example, natural language processing, process (computing), dynamic time warping, pattern matching, pattern recognition (psychology), search engine, image (mathematics), information retrieval, mathematics, physics, quantum mechanics, web search query, operating system, linguistics, statistics, philosophy

Spoken Term Detection (STD) is the process of locating occurrences of a spoken query in a given speech database. Two approaches are generally adopted for STD: sequence matching based on automatic speech recognition (ASR), and ASR‐free, feature‐based template matching. If a well‐performing ASR system is available, the former is accurate. However, building an ASR system with consistent performance requires several hours of labelled speech. Template matching methods work well for short or chopped utterances. In practice, however, the search database can be huge and contain sentences of varying lengths, so the time complexity of template matching is high, which makes it impractical for realistic search applications. In this work, a two‐stage STD system is proposed that combines ASR‐based phoneme sequence matching in the first stage with feature‐sequence template matching at selected locations in the second stage. The time complexity of the second stage is reduced by performing dynamic time warping (DTW)‐based template matching only at the probable query locations identified by the first stage. A ‘split and match’ approach helps to reduce false positives for longer query words. The effectiveness of the proposed method is demonstrated on English and Malayalam datasets.
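The two‐stage idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that stage 1 is a sliding‐window edit‐distance match over decoded phoneme labels, that stage 2 is a plain length‐normalised DTW over feature frames, and that each phoneme spans a fixed number of frames. The function names, the `frames_per_phone` mapping, and both thresholds are hypothetical choices made only for illustration.

```python
from math import inf, sqrt

def edit_distance(a, b):
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        curr = [i]
        for j, pb in enumerate(b, start=1):
            cost = 0 if pa == pb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]

def stage1_candidates(query_ph, utt_ph, max_errors=1):
    """Stage 1 (assumed form): slide a query-length window over the decoded
    phonemes and keep window starts within max_errors edits of the query."""
    n = len(query_ph)
    return [s for s in range(len(utt_ph) - n + 1)
            if edit_distance(query_ph, utt_ph[s:s + n]) <= max_errors]

def dtw_cost(x, y):
    """Length-normalised DTW cost between two sequences of feature vectors."""
    def dist(u, v):
        return sqrt(sum((p - q) ** 2 for p, q in zip(u, v)))
    D = [[inf] * (len(y) + 1) for _ in range(len(x) + 1)]
    D[0][0] = 0.0
    for i in range(1, len(x) + 1):
        for j in range(1, len(y) + 1):
            step = min(D[i - 1][j], D[i][j - 1], D[i - 1][j - 1])
            D[i][j] = dist(x[i - 1], y[j - 1]) + step
    return D[len(x)][len(y)] / (len(x) + len(y))

def two_stage_search(query_ph, query_feat, utt_ph, utt_feat,
                     frames_per_phone, max_errors=1, dtw_threshold=0.5):
    """Stage 2 runs DTW only at the candidate locations from stage 1.

    Assumption: every phoneme covers exactly frames_per_phone frames, so a
    phoneme index maps to a frame index by multiplication. A real system
    would more likely use time stamps from the recogniser.
    """
    hits = []
    for s in stage1_candidates(query_ph, utt_ph, max_errors):
        lo = s * frames_per_phone
        hi = (s + len(query_ph)) * frames_per_phone
        cost = dtw_cost(query_feat, utt_feat[lo:hi])
        if cost <= dtw_threshold:
            hits.append((s, cost))
    return hits
```

In this sketch DTW runs once per stage‐1 candidate rather than at every frame offset of the utterance, which is the source of the time saving the abstract describes. The ‘split and match’ step is not shown, since the abstract does not specify how the query is split.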
