| Literature DB >> 35719688 |
Paola Cerchiello1, Giancarlo Nicola1, Samuel Rönnqvist2, Peter Sarlin3.
Abstract
In this paper, we focus our attention on leveraging the information contained in financial news to enhance the performance of a bank distress classifier. The news information should be analyzed and inserted into the predictive model in the most efficient way and this task deals with the issues related to Natural Language interpretation and to the analysis of news media. Among the different models proposed for such purpose, we investigate a deep learning approach. The methodology is based on a distributed representation of textual data obtained from a model (Doc2Vec) that maps the documents and the words contained within a text onto a reduced latent semantic space. Afterwards, a second supervised feed forward fully connected neural network is trained combining news data distributed representations with standard financial figures in input. The goal of the model is to classify the corresponding banks in distressed or tranquil state. The final aim is to comprehend both the improvement of the predictive performance of the classifier and to assess the importance of news data in the classification process. This to understand if news data really bring useful information not contained in standard financial variables.Entities:
Keywords: Doc2Vec; classification model; credit risk; deep learning; text analysis
Year: 2022 PMID: 35719688 PMCID: PMC9200951 DOI: 10.3389/frai.2022.871863
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Figure 1Structure of the model.
Figure 2Textual data—sensitivity analysis on the number of nodes of the hidden layer.
Figure 4Numerical and textual data—sensitivity analysis on the number of nodes of the hidden layer.
List of available numerical variables.
|
|
|
|
|---|---|---|
| Capital to asset | Mortgages to loans | House price gap (Deviation from trend of the real residential property price index) |
| Interest to liabilities | Securities to liabilities d4 | Macroeconomic Imbalance Procedure (MIP), international investment position |
| Reserves to asset | Financial assets to gdp | Private debt |
| - | - | Government bond yield |
| - | - | Credit to gdp |
| - | - | Credit to gdp delta over 12 months |
Summary statistics of available numerical variables.
|
|
|
|
|
|
|---|---|---|---|---|
| Capital to asset | 2.5 | 10.2 | 3.2 | 21.5 |
| Reserves to asset | 4.2 | 8.5 | 2.9 | 4.3 |
| Interest to liab | 3.4 | 8.5 | 2.9 | 104.6 |
| Financial assets to gdp | 385.0 | 134365.2 | 366.6 | 33.2 |
| Mortgages to loans d4 | 0.2 | 1.7 | 1.3 | 0.1 |
| Securities to liab d4 | −12.0 | 1342234.9 | 1158.5 | 105.7 |
| Credit to gdp | 140.2 | 2623.4 | 51.2 | 0.0 |
| Credit to gdp d12 | 13.7 | 479.1 | 21.9 | 0.5 |
| House price index rt16 gap | −2.5 | 33.7 | 5.8 | 6.8 |
| International investment position | −21.0 | 2967.7 | 54.5 | 0.0 |
| Private debt | 188.2 | 4938.7 | 70.3 | 0.1 |
| Gov bold yield d4 | 0.0 | 11.4 | 3.4 | 23.5 |
List of considered financial institutions.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Aareal Bank | DE | Carnegie Investment Bank | SE | Kommunalkredit | AT |
| ABN Amro | NL | Commerzbank | DE | LBBW | DE |
| Agricultural Bank of Greece | GR | Credit Mutuel | FR | Lloyds TSB | UK |
| Allied Irish Banks | IE | Credito Valtellinese | IT | Max Bank | DK |
| Alpha Bank | GR | Cyprus Popular | CY | Monte dei Paschi di Siena | IT |
| Amagerbanken | DK | Danske Bank | DK | National Bank of Greece | GR |
| ATE Bank | GR | Dexia | FR | Nordea | SE |
| Attica Bank | GR | EBH | DK | NordLB | DE |
| Banca Popolare di Milano | IT | EFG Eurobank | GR | Nova ljubljanska banka Group (NLB) | SI |
| Banco Popolare | IT | Erste Bank | HU | OTP Bank Nyrt | HU |
| Bank of Cyprus Public Co Ltd | CY | Fionia (Nova Bank) | DK | Piraeus Bank | GR, CY |
| Bank of Ireland | IE | Fortis Bank | LU, NL, BE | Pronton Bank | GR |
| Banque Populaire | FR | HBOS | UK | RBS | UK |
| Bawag | AT | Hellenic | GR | Roskilde Bank | DK |
| BayernLB | DE | HSH Nordbank | DE | Societe Generale | FR |
| BBK | ES | Hypo Real Estate | DE | Swedbank | SE |
| BNP Paribas | FR | Hypo Tirol Bank | AT | T-Bank | GR |
| BPCE | FR | IKB | DE | UNNIM | ES |
| Caixa General de Depositos | PT | ING | NL | Vestjysk | DK |
| Caja Castilla-La Mancha | ES | Irish Nationwide Building Society | IE | ||
| CAM | ES | KBC | BE |
Figure 5Comparison of the relative usefulness obtained with the textual financial data (left), numerical financial dataset (center), and with their combination (right).