Open Access
Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text
Author(s) - Sumei Ruan, Xusheng Sun, Ruanxingchen Yao, Wei Li
Publication year - 2021
Publication title - Computational Intelligence and Neuroscience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.605
H-Index - 52
eISSN - 1687-5273
pISSN - 1687-5265
DOI - 10.1155/2021/1165296
Subject(s) - computer science, benchmark (surveying), word embedding, natural language processing, artificial intelligence, sentence, word (group theory), machine learning, embedding, linguistics, philosophy, geodesy, geography
To detect comprehensive clues and provide more accurate forecasts in the early stages of financial distress, researchers have emphasized, in addition to financial indicators, the digitization of lengthy but indispensable textual disclosures such as Management Discussion and Analysis (MD&A). However, most studies split the long text into words and treat it as word-count vectors, introducing massive amounts of invalid information while ignoring meaningful context. To represent large-scale text efficiently, this study proposes an end-to-end neural network model based on hierarchical self-attention, with a state-of-the-art pretrained model introduced for context-aware text embedding. The proposed model has two notable characteristics. First, the hierarchical self-attention assigns high weights only to essential content at the word level and sentence level and automatically discards the large amount of information irrelevant to risk prediction, making it suitable for extracting the effective parts of large-scale text. Second, after fine-tuning, the word embeddings adapt to the specific contexts of the samples and convey the original expression more accurately without excessive manual preprocessing. Experiments confirm that adding text improves the accuracy of financial distress forecasting and that the proposed model outperforms benchmark models on AUC and F2-score. For visualization, the elements of the hierarchical self-attention weight matrices serve as importance scores for each word and sentence. In this way, "red-flag" statements that signal financial risk are identified and highlighted in the original text, providing effective references for decision-makers.
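As a rough illustration of the two-level attention described above, the sketch below assumes a standard word-level then sentence-level attention-pooling design over pretrained contextual embeddings (e.g., BERT output vectors). The paper's actual architecture, dimensions, encoder, and training details are not given on this page, so all names, shapes, and hyperparameters here are hypothetical.

```python
# Minimal sketch of hierarchical self-attention pooling for document-level
# risk classification (assumed design, not the authors' exact model).
import torch
import torch.nn as nn


class AttentionPool(nn.Module):
    """Scores each element of a sequence and returns their weighted sum.

    The softmax weights can be inspected afterwards to highlight
    important words or sentences ("red-flag" statements)."""

    def __init__(self, dim: int):
        super().__init__()
        self.score = nn.Sequential(nn.Linear(dim, dim), nn.Tanh(), nn.Linear(dim, 1))

    def forward(self, x):                                            # x: (batch, seq_len, dim)
        weights = torch.softmax(self.score(x).squeeze(-1), dim=-1)   # (batch, seq_len)
        pooled = torch.einsum("bs,bsd->bd", weights, x)              # (batch, dim)
        return pooled, weights


class HierarchicalSelfAttention(nn.Module):
    """Word-level attention inside each sentence, sentence-level attention
    over the document, then a linear classification head."""

    def __init__(self, embed_dim: int = 768, num_classes: int = 2):
        super().__init__()
        self.word_attn = AttentionPool(embed_dim)
        self.sent_attn = AttentionPool(embed_dim)
        self.classifier = nn.Linear(embed_dim, num_classes)

    def forward(self, word_embeddings):
        # word_embeddings: (batch, n_sentences, n_words, embed_dim),
        # e.g. contextual vectors produced by a pretrained encoder.
        b, s, w, d = word_embeddings.shape
        sent_vecs, word_w = self.word_attn(word_embeddings.view(b * s, w, d))
        doc_vec, sent_w = self.sent_attn(sent_vecs.view(b, s, d))
        logits = self.classifier(doc_vec)
        # word_w and sent_w act as importance scores for visualization.
        return logits, word_w.view(b, s, w), sent_w


# Usage with random tensors standing in for pretrained embeddings:
model = HierarchicalSelfAttention()
dummy = torch.randn(4, 10, 30, 768)          # 4 documents, 10 sentences, 30 words each
logits, word_weights, sent_weights = model(dummy)
print(logits.shape, word_weights.shape, sent_weights.shape)
```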
