Indonesian Sentence Boundary Detection using Deep Learning Approaches | Zendy

Joan Santoso | Zendy; Esther Irawati Setiawan | Zendy; Christian Nathaniel Purwanto | Zendy; Fachrul Kurniawan | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Indonesian Sentence Boundary Detection using Deep Learning Approaches

Author(s) -

Joan Santoso,

Esther Irawati Setiawan,

Christian Nathaniel Purwanto,

Fachrul Kurniawan

Publication year - 2021

Publication title -

knowledge engineering and data science

Language(s) - English

Resource type - Journals

eISSN - 2597-4637

pISSN - 2597-4602

DOI - 10.17977/um018v4i12021p38-48

Subject(s) - sentence , computer science , natural language processing , artificial intelligence , security token , indonesian , boundary (topology) , speech recognition , sequence (biology) , linguistics , mathematics , mathematical analysis , philosophy , computer security , biology , genetics

Detecting the sentence boundary is one of the crucial pre-processing steps in natural language processing. It can define the boundary of a sentence since the border between a sentence, and another sentence might be ambiguous. Because there are multiple separators and dynamic sentence patterns, using a full stop at the end of a sentence is sometimes inappropriate. This research uses a deep learning approach to split each sentence from an Indonesian news document. Hence, there is no need to define any handcrafted features or rules. In Part of Speech Tagging and Named Entity Recognition, we use sequence labeling to determine sentence boundaries. Two labels will be used, namely O as a non-boundary token and E as the last token marker in the sentence. To do this, we used the Bi-LSTM approach, which has been widely used in sequence labeling. We have proved that our approach works for Indonesian text using pre-trained embedding in Indonesian, as in previous studies. This study achieved an F1-Score value of 98.49 percent. When compared to previous studies, the achieved performance represents a significant increase in outcomes..

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore