
A self-attention based neural architecture for Chinese medical named entity recognition
Author(s) -
Qian Wan,
Jie Liu,
Luona Wei,
Bin Ji
Publication year - 2020
Publication title -
Mathematical Biosciences and Engineering
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.451
H-Index - 45
eISSN - 1551-0018
pISSN - 1547-1063
DOI - 10.3934/mbe.2020197
Subject(s) - computer science , named entity recognition , artificial intelligence , machine learning , artificial neural network , transformer , encoder , autoencoder , word2vec , embedding , raw data , long-range dependency , context
The combination of the medical field and big data has led to explosive growth in the volume of electronic medical records (EMRs), whose contents offer guidance for diagnosis, and how to extract this information from EMRs has become a hot research topic. In this paper, we propose an ELMo-ET-CRF model to extract medical named entities from Chinese electronic medical records (CEMRs). First, a domain-specific ELMo model is fine-tuned from a general-domain ELMo model on 4679 raw CEMRs. Then the encoder from Transformer (ET) is used as the model's encoder to alleviate the long-range context dependency problem, and a CRF is used as the decoder. Finally, we compare BiLSTM-CRF and ET-CRF models with word2vec and ELMo embeddings on CEMRs to validate the effectiveness of the ELMo-ET-CRF model. With the same training and test data, ELMo-ET-CRF outperforms all the other model architectures mentioned in this paper with an 85.59% F1-score, which indicates the effectiveness of the proposed architecture, and its performance is also competitive on the CCKS2019 leaderboard.
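At inference time, the CRF decoder described in the abstract selects the best tag sequence via Viterbi decoding over token-level emission scores (here standing in for the Transformer encoder's output) and tag-transition scores. The sketch below is a minimal illustration of that decoding step only; the tag set, emission scores, and transition scores are illustrative assumptions, not values from the paper.

```python
# Minimal Viterbi decoding, as used by a CRF decoder at inference time.
# All numeric values below are made-up toy scores, not the paper's.

def viterbi_decode(emissions, transitions):
    """Return (best_path, best_score) for one sentence.

    emissions:   T x K token-level tag scores (e.g., from the encoder)
    transitions: K x K scores for moving from tag j to tag k
    """
    num_tags = len(emissions[0])
    scores = list(emissions[0])        # best score ending in each tag so far
    backpointers = []
    for step in emissions[1:]:
        new_scores, pointers = [], []
        for k in range(num_tags):
            prev = max(range(num_tags),
                       key=lambda j: scores[j] + transitions[j][k])
            new_scores.append(scores[prev] + transitions[prev][k] + step[k])
            pointers.append(prev)
        scores = new_scores
        backpointers.append(pointers)
    best_last = max(range(num_tags), key=lambda k: scores[k])
    path = [best_last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, scores[best_last]

# Toy BIO example with tags O=0, B=1, I=2: the O->I transition is
# penalized (an "I" tag cannot follow "O"), B->I and I->I are rewarded.
emissions = [[2.0, 1.0, 0.0],
             [0.0, 3.0, 0.0],
             [0.0, 0.0, 3.0]]
transitions = [[0.0, 0.0, -10.0],
               [0.0, 0.0, 1.0],
               [0.0, 0.0, 1.0]]
path, score = viterbi_decode(emissions, transitions)
print(path, score)  # -> [0, 1, 2] 9.0
```

The transition matrix is what lets the CRF enforce valid tag orderings (e.g., an I-tag must follow a B- or I-tag), which per-token softmax decoding cannot guarantee.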