z-logo
Premium
Sequential Linefeed Insertion into Lecture Transcriptions for Real‐Time Captioning
Author(s) -
Ohno Tomohiro,
Murata Masaki,
Matsubara Shigeki
Publication year - 2015
Publication title -
electronics and communications in japan
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.131
H-Index - 13
eISSN - 1942-9541
pISSN - 1942-9533
DOI - 10.1002/ecj.11616
Subject(s) - closed captioning , computer science , sentence , phrase , speech recognition , natural language processing , artificial intelligence , image (mathematics)
SUMMARY To generate readable captions for Japanese spoken monologues such as lectures in real time, it is necessary to sequentially display captions that have proper linefeeds inserted. This paper proposes a technique for sequentially inserting proper linefeeds into a lecture transcript whenever a bunsetsu, which is a linguistic unit shorter than a sentence in Japanese and that roughly corresponds to a basic phrase in English, is identified. Under the assumption that linefeeds are inserted at bunsetsu boundaries, this technique can reduce the delay time of captioning to the utmost possible. This technique statistically judges whether or not a linefeed should be inserted into each bunsetsu boundary by using the information that is available at the time. We conducted experiments on linefeed insertion using a Japanese lecture corpus. The experimental results confirmed that our method, which is a bunsetsu‐based linefeed insertion method, was almost as accurate as the sentence‐based linefeed insertion method. In addition, we conducted comparative evaluations using four baseline methods. The results confirmed that our method could insert linefeeds more accurately than the simple methods that are thought to have the same delay time as our method.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here