z-logo
open-access-imgOpen Access
Text segmentation by language
Author(s) -
Robin Cabeza Ruiz
Publication year - 2016
Publication title -
sistemas and telematica
Language(s) - English
Resource type - Journals
ISSN - 1692-5238
DOI - 10.18046/syt.v14i38.2289
Subject(s) - sentence , computer science , natural language processing , segmentation , artificial intelligence , task (project management) , language model , linguistics , hidden markov model , text segmentation , philosophy , management , economics
There are two approaches for text segmentation by language: first, assuming that language changes happen in the “border” between sentences (never within a sentence); second, assuming that language changes can happen anyplace in the text. This work presents methods for both types of text’s segmentation by languages. On the first proposal, the text is initially segmented by sentence, then the language of each sentence is obtained; the second proposal is an adaptation of hidden Markov model to this task. Both cases, according to results obtained in experimental proofs, exceed the state of art.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here