Sequential Linefeed Insertion into Lecture Transcriptions for Real‐Time Captioning

Ohno Tomohiro, Murata Masaki, Matsubara Shigeki

Publication year - 2015

Publication title - Electronics and Communications in Japan

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.131

H-Index - 13

eISSN - 1942-9541

pISSN - 1942-9533

DOI - 10.1002/ecj.11616

Subject(s) - closed captioning, computer science, sentence, phrase, speech recognition, natural language processing, artificial intelligence, image (mathematics)

SUMMARY To generate readable captions in real time for Japanese spoken monologues such as lectures, captions must be displayed sequentially with linefeeds inserted at proper positions. This paper proposes a technique that sequentially inserts linefeeds into a lecture transcript each time a bunsetsu is identified. A bunsetsu is a Japanese linguistic unit that is shorter than a sentence and roughly corresponds to a basic phrase in English. Under the assumption that linefeeds are inserted at bunsetsu boundaries, this technique reduces the captioning delay to the minimum possible. It statistically judges whether a linefeed should be inserted at each bunsetsu boundary, using the information available at that time. We conducted linefeed insertion experiments on a Japanese lecture corpus. The results confirmed that our bunsetsu‐based method was almost as accurate as the sentence‐based linefeed insertion method. In comparative evaluations against four baseline methods, our method also inserted linefeeds more accurately than the simple methods thought to have the same delay time.
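
The boundary-by-boundary decision described in the summary can be illustrated with a minimal Python sketch. It is not the authors' model: the scoring function `toy_linefeed_probability`, its two features (current line length and trailing punctuation), the weights, and the 0.5 threshold are all assumptions made for illustration. The sketch only shows the control flow, in which a linefeed decision is made as soon as each bunsetsu arrives, without waiting for the end of the sentence.

```python
import math
from typing import Callable, Iterable, Iterator, List


def toy_linefeed_probability(line: List[str]) -> float:
    """Hypothetical stand-in for a statistical model (NOT the paper's model).

    `line` holds the bunsetsu on the current caption line, including the
    one that has just been identified. Returns an assumed probability
    that a linefeed should follow it.
    """
    length = sum(len(b) for b in line)
    score = 0.3 * (length - 12)          # assumed: longer lines favour a break
    if line[-1].endswith(("、", "。")):   # assumed: punctuation favours a break
        score += 3.0
    return 1.0 / (1.0 + math.exp(-score))


def insert_linefeeds(
    bunsetsu_stream: Iterable[str],
    prob: Callable[[List[str]], float] = toy_linefeed_probability,
    threshold: float = 0.5,
) -> Iterator[str]:
    """Yield caption lines, deciding at each bunsetsu boundary as it arrives."""
    line: List[str] = []
    for bunsetsu in bunsetsu_stream:
        line.append(bunsetsu)
        # Decide using only what has been seen so far on this line.
        if prob(line) >= threshold:
            yield "".join(line)
            line = []
    if line:  # flush whatever remains when the stream ends
        yield "".join(line)


if __name__ == "__main__":
    stream = ["今日は", "音声認識の", "話を", "します。", "まず", "背景を", "説明します。"]
    for caption_line in insert_linefeeds(stream):
        print(caption_line)
```

Because `insert_linefeeds` is a generator, each caption line can be displayed as soon as the decision at its final boundary is made. Swapping in a trained classifier only requires passing a different `prob` callable.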
