| Literature DB >> 35136451 |
Takanori Matsui1, Kanoko Suzuki1, Kyota Ando1, Yuya Kitai2, Chihiro Haga1, Naoki Masuhara3, Shun Kawakubo2.
Abstract
Sharing successful practices with other stakeholders is important for achieving SDGs. In this study, with a deep-learning natural language processing model, bidirectional encoder representations from transformers (BERT), the authors aimed to build (1) a classifier that enables semantic mapping of practices and issues in the SDGs context, (2) a visualizing method of SDGs nexus based on co-occurrence of goals (3) a matchmaking process between local issues and initiatives that may embody solutions. A data frame was built using documents published by official organizations and multi-labels corresponding to SDGs. A pretrained Japanese BERT model was fine-tuned on a multi-label text classification task, while nested cross-validation was conducted to optimize the hyperparameters and estimate cross-validation accuracy. A system was then developed to visualize the co-occurrence of SDGs and to couple the stakeholders by evaluating embedded vectors of local challenges and solutions. The paper concludes with a discussion of four future perspectives to improve the natural language processing system. This intelligent information system is expected to help stakeholders take action to achieve the sustainable development goals. Supplementary Information: The online version contains supplementary material available at 10.1007/s11625-022-01093-3.Entities:
Keywords: Artificial intelligence technology; BERT model; Matchmaking stakeholders; Nexus and interlinkages; Sustainable development goals; Text classification
Year: 2022 PMID: 35136451 PMCID: PMC8815292 DOI: 10.1007/s11625-022-01093-3
Source DB: PubMed Journal: Sustain Sci ISSN: 1862-4057 Impact factor: 7.196
Fig. 1An overall structure in building corpus database, defining the model structure, training best model, and applying the best model to solutions. The numbers in each process correspond to section numbers
Samples of corpus database
| ID | SDGs multi label | Original sentences in Japanese | Translated text in English |
|---|---|---|---|
| 1 | [0, 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 1, 0] | 学校の授業は「持続可能な開発目標」を実現させる大きな力になる!、岐阜県のある中学校では、中学校2年生の英語の学習で "Landmines and AkiRa"を学んだ際、日本ユニセフ協会に出前授業を依頼し、地雷についての授業を行いました。生徒たちは地雷や不発弾のレプリカを実際に見て手に取り、内戦以後も地雷や不発弾によって死傷するカンボジアの多くの子どもたちのこと、自分たちができることに ついて深く考えさせられました。その学習がきっかけとなり、のちに生徒たちは募金活動を行い、集まった募金をユニセフに贈呈するなど、世界の子どもが直面している問題について学ぶだけでなく、自分たちに何ができるのかを考え、行動に繋げることができました。日本ユニセフ協会では、地雷の木製レプリカセット地雷教育用のポスタ指導用のパワーポイントが入った学習キットを貸し出しています。(length = 378) | School lessons will be a great force to achieve the “Sustainable development goals”! At a junior high school in Gifu prefecture, when I learned “Landmines and Aki Ra” in the second year of junior high school, I asked the Japan Committee for UNICEF to give a class about land mines. The students actually saw and picked up replicas of landmines and unexploded ordnance, and were made to think deeply about many Cambodian children who were killed or injured by landmines and unexploded ordnance even after the civil war, and what they could do. The learning triggered the students to raise funds and donate the collected funds to UNICEF, not only to learn about the problems facing children around the world, but also to learn what they can do. I was able to think and act. The Japan Committee for UNICEF rents out a wooden replica set of landmines, a power for teaching posters for landmine education, and a learning kit containing her points. (length = 942) |
| 2 | [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0] | 気候変動債、グリーンボンド、その他の債券や証券などを通じ、気候変動リスクの軽減、気候変動へのレジリエンスおよび気候変動への適応のために投資や資金提供を行う。国および地域の自然災害保険スキームの補償範囲を拡大する。保険引受業務、投資分析および意思決定に気候変動リスクを組み入れる。。設定した目標に照らして気候変動によるリスクを測定、軽減、報告し、気候変動に立ち向かうための対策を進展させると同時に、産業セクター全体の報告の透明性と一貫性のレベルを継続的に向上させていく。(length = 234) | Invest and fund climate change risk mitigation, climate change resilience and climate change adaptation through climate change bonds, green bonds, and other bonds and securities. Expand coverage of national and regional natural disaster insurance schemes. Incorporate climate change risk into underwriting, investment analysis, and decision making. Measure, mitigate and report on the risks of climate change against the goals set and develop measures to combat climate change while continually improving the level of transparency and consistency of reporting across the industrial sector. (length = 590) |
| 3 | [0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1] | 「新たなスポーツ施策の振興及びスポーツを活用した地域活性化」。地域活性化、健康福祉、観光客の誘致・地域PR (インバウンドを含む) 、情報化 (ICT・IoT・AIの利活用等) 。社団法人・財団法人、NPO・NGO、大学・教育機関・研究機関・国機関等、宿泊・飲食サービス業、運輸・通信業、サービス業。連携のイメージは、平塚海洋エネルギー研究会のウェブをご覧ください。情報収集・共有及び意見交換等を行いたい。自動車関連や化学系の大手・中小の工場、研究所が多数立地。商業、農業、漁業が一定規模ある。。東京大学生産技術研究所林研究室との共同研究(3年間)を計画している。テレワークの増加やCO2排出削減の世界的な動きに対応するため、国産技術での再生可能エネルギーの普及を目指しています。(length = 336) | “Promotion of new sports measures and regional revitalization utilizing sports.” Regional revitalization, health and welfare, attracting tourists/regional PR (including inbound), computerization (utilization of ICT/IoT/AI, etc.). Incorporated associations/foundations, NPOs/NGOs, universities/educational institutions/research institutes/national institutions, etc., accommodation/food service industry, transportation/communication industry, service industry. For an image of the collaboration, please see the website of the Hiratsuka Marine Energy Study Group. I would like to collect and share information and exchange opinions. There are many large, small, and medium-sized factories and research institutes in the automobile and chemical fields. There is a certain scale of commerce, agriculture, and fishing. We are planning a joint research (3 years) with Hayashi Laboratory, Institute of Industrial Science, University of Tokyo. We are aiming to popularize renewable energy using domestic technology in order to respond to the increase in telework and the global movement to reduce CO2 emissions. (length = 1104) |
| 4 | [0, 0, 0, 0, 0, 1, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 1] | 「湯の丸高原天然水のブランド化に向けた取り組み」。地域活性、観光客の誘致・地域PR (インバウンドを含む) 、その他 (地域資源の有効活用、水道事業の経営安定化) 。宿泊・飲食サービス業、卸売・小売業、飲食店、電気・ガス・水道・熱供給業水販売戦略を有し、販売実績のある事業者を希望する。・湯の丸水源の天然水の販売のためのコンセプトや販売先へ"強み”や他地域との差別化の検討・具体的な販路開拓。情報収集・共有及び意見交換等を行いたい。自動車関連や化学系の大手・中小の工場、研究所が多数立地。商業、農業、漁業が一定規模ある。令和3年度の予算計上を検討している。テレワークの増加やCO2排出削減の世界的な動きに対応するため、国産技術での再生可能エネルギーの普及を目指しています。(length = 332) | “Efforts to brand Yunomaru Kogen natural water.” Regional revitalization, attracting tourists/regional PR (including inbound), etc. (effective utilization of regional resources, stabilization of water supply business management). Accommodation/restaurant service industry, wholesale/retail industry, restaurant, electricity/gas/water/heat supply industry we would like to have a business with a water sales strategy and a sales record.・ Consideration of “strengths” and differentiation from other regions to the concept and sales destinations for selling natural water from Yunomaru water source.・ Development of specific sales channels. I would like to collect and share information and exchange opinions. There are many large, small, and medium-sized factories and research institutes in the automobile and chemical fields. There is a certain scale of commerce, agriculture, and fishing. We are considering budgeting for the third year of Reiwa. We are aiming to popularize renewable energy using domestic technology in order to respond to the increase in telework and the global movement to reduce CO2 emissions. (length = 1116) |
| 5 | [1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0] | 廃棄物処理 (再生燃料製造) 、環境修復事業。SDGs貢献に向けた取り組みの概要。【海外事業のきっかけは、Team-E KANSAI】。海外事業は、Team-E KANSAI (海外展開支援機関、経済団体、自治体、企業等で構成) の歩みとともにある。。Team-E KANSAIのミッションなどで、アジア各国を訪問し、社会課題とともに、ビジネスチャンスがあることがわかった。。【海外と日本をつなぎ、さまざまな社会課題を解決】。マレーシア、インドネシアは、世界の80%以上のパームオイルを生産している。。それに伴う未利用物、排水が社会問題になっている。。そこで、未利用物であるパーム房 (EFB) を自社技術で、木質ペレットに加工し、発電所燃料として利用し、その燃焼灰はセメント工場でセメント原料にしている。このように、アジアの環境問題の解決は、他の社会課題の解決にもつながっている。(length = 385) | Waste treatment (recycled fuel production), environmental restoration business. Outline of efforts to contribute to the SDGs. [Team-E KANSAI was the catalyst for overseas business]. The overseas business is in line with the progress of Team-E KANSAI (composed of overseas expansion support organizations, economic organizations, local governments, companies, etc.). I visited Asian countries on the mission of Team-E KANSAI and found out that there are business opportunities as well as social issues. [Connecting overseas and Japan to solve various social issues]. Malaysia and Indonesia produce more than 80% of the world’s palm oil. Along with this, unused materials and wastewater have become a social problem. Therefore, the unused palm bunch (EFB) is processed into wood pellets using our own technology and used as fuel for power plants, and the combustion ash is used as a raw material for cement at a cement factory. In this way, the solution of environmental problems in Asia has led to the solution of other social issues. (length = 1034) |
Note: The column 1 is sentence id, column 2 is the 17-dimensional multi hot vectors which mean the correspondence to the SDGs (correspondent 1, not correspondent 0), column 3 and column 4 are the original Japanese sentences and the English sentences translated by Google translate. The original Japanese sentences were used for the model training and validation
Performance of nested cross-validation
Fig. 2SDGs mapping by text classification and attention visualization: a sample of SDG 5 Gender equality. Note: The red markers in the panel a and b mean the BERT model pays attention to the tokens. The color intensity is consistent with the level of attention. And the red markers in panel c mean the high probability of the prediction. The ## in the panel a indicates the sub word division by WordPiese algorithm
Fig. 3Visualization of SDGs nexus by analyzing the co-occurrences of predicted SDGs multi-labels with the Inventory of Business Indicators from SDG compass
Cases of a match making between municipalities and private sector
Fig. 4Matchmaking map by dimension reduction algorithm. Note: the small plots mean players which have potential solutions, and the large plots are players who have needs to be solved. The positions of the points were the two-dimensional coordinates obtained from original 768-dimensional vectors by dimension reduction using t-SNE. The color of points means the goal of SDGs that the players have the highest probability to be related to
Corpus statistics and classification performance of Inventory of Business Indicators from SDG compass (Global Reporting Initiative, UN Global Compact, and WBCSD 2015)
| SDGs | Number of indicator | Character/sentence | Token/sentence | Mean predicted probability | Recall | Precision | f1-score | ROC-AUC | PR-AUC |
|---|---|---|---|---|---|---|---|---|---|
| Overall | 1479 | 49.4 | 30.3 | 0.10 | 0.26 | 0.19 | 0.21 | 0.70 | 0.17 |
| Goal 01: No poverty | 77 | 56.6 | 35.1 | 0.07 | 0.17 | 0.16 | 0.17 | 0.68 | 0.11 |
| Goal 02: Zero hunger | 43 | 49.1 | 29.7 | 0.05 | 0.16 | 0.15 | 0.16 | 0.57 | 0.05 |
| Goal 03: Good health and well-being | 96 | 55.7 | 34.2 | 0.06 | 0.15 | 0.23 | 0.18 | 0.64 | 0.15 |
| Goal 04: Quality education | 19 | 44.3 | 27.2 | 0.04 | 0.21 | 0.08 | 0.11 | 0.77 | 0.06 |
| Goal 05: Gender equality | 85 | 56.7 | 35.4 | 0.05 | 0.19 | 0.23 | 0.21 | 0.68 | 0.17 |
| Goal 06: Clean water and sanitation | 131 | 53.3 | 33.8 | 0.10 | 0.47 | 0.47 | 0.47 | 0.83 | 0.48 |
| Goal 07: Affordable and clean energy | 85 | 42.0 | 24.9 | 0.21 | 0.65 | 0.18 | 0.28 | 0.79 | 0.27 |
| Goal 08: Decent work and economic growth | 243 | 49.4 | 30.8 | 0.16 | 0.33 | 0.34 | 0.33 | 0.65 | 0.28 |
| Goal 09: Industry, innovation and infrastructure | 81 | 24.3 | 14.2 | 0.05 | 0.06 | 0.09 | 0.07 | 0.61 | 0.10 |
| Goal 10: Reduced inequalities | 53 | 37.9 | 22.8 | 0.06 | 0.13 | 0.09 | 0.10 | 0.62 | 0.03 |
| Goal 11: Sustainable cities and communities | 25 | 31.1 | 18.8 | 0.17 | 0.20 | 0.02 | 0.04 | 0.64 | 0.02 |
| Goal 12: Responsible consumption and production | 116 | 39.5 | 23.9 | 0.09 | 0.21 | 0.19 | 0.20 | 0.66 | 0.15 |
| Goal 13: Climate action | 96 | 55.1 | 33.4 | 0.25 | 0.55 | 0.15 | 0.24 | 0.76 | 0.18 |
| Goal 14: Life below water | 76 | 42.9 | 27.0 | 0.06 | 0.12 | 0.12 | 0.12 | 0.68 | 0.09 |
| Goal 15: Life on land | 131 | 65.4 | 40.5 | 0.10 | 0.36 | 0.37 | 0.36 | 0.78 | 0.32 |
| Goal 16: Peace, justice and strong institutions | 109 | 55.5 | 32.4 | 0.11 | 0.54 | 0.39 | 0.45 | 0.79 | 0.38 |
| Goal 17: Partnerships for the goals | 13 | 41.8 | 23.4 | 0.07 | 0.00 | 0.00 | 0.00 | 0.71 | 0.02 |
Note: recall, precision and f1-score were calculated by setting the threshold of the banalization of predicted probability in 0.5. ROC-AUC and PR-AUC denote the Area under Curve of ROC curve and Precision-Recall curve, which is a metric ranged 0 to 1 in machine learning field. 0 and means poor and excellent, respectively. ROC/PR-AUC in Overall means macro AUC, which is the mean of AUC by goal. All metrics were calculated by scikit-learn (== 0.22.1) (Pedregosa et al. 2011)