Ancient Chinese Sentence Segmentation Based on Bidirectional LSTM+CRF Model | Zendy

Hongbin Wang | Zendy; Haibing Wei | Zendy; Jianyi Guo | Zendy; Liang Cheng | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Ancient Chinese Sentence Segmentation Based on Bidirectional LSTM+CRF Model

Author(s) -

Hongbin Wang,

Haibing Wei,

Jianyi Guo,

Liang Cheng

Publication year - 2019

Publication title -

journal of advanced computational intelligence and intelligent informatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.172

H-Index - 20

eISSN - 1343-0130

pISSN - 1883-8014

DOI - 10.20965/jaciii.2019.p0719

Subject(s) - computer science , conditional random field , sentence , viterbi algorithm , artificial intelligence , sequence (biology) , value (mathematics) , segmentation , recurrent neural network , natural language processing , speech recognition , artificial neural network , hidden markov model , machine learning , genetics , biology

This study proposes a novel method for the segmentation of Archaic Chinese sentences based on a bidirectional long short-term memory (LSTM) + conditional random field (CRF) model. The method added a layer of linear statistical model to the traditional bidirectional LSTM neural network; it can be used for sequence annotation from the sentence level. In addition, this model introduced the stochastic gradient descent (SGD) to prevent excessive fitting, and the viterbi algorithm was used to calculate the optimal sequence of the sentences. In the experiment, this study tests the performance of the proposed method using the History of the Han Dynasty, the History of the later Han Dynasty, Three Kingdoms, and the Book of Jin, amongst others. The results show that the precision value, recall value, and F1 value are 0.77, 0.75, and 0.76, respectively, in the open test, and 0.90, 0.88, and 0.76, respectively, in the closed test.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research