| Literature DB >> 32939161 |
Luchun Yan1, Yupeng Diao1, Zhaoyang Lang1, Kewei Gao1,2.
Abstract
The empirical modeling methods are widely used in corrosion behavior analysis. But due to the limited regression ability of conventional algorithms, modeling objects are often limited to individual factors and specific env<span class="Chemical">ironments. This study proposed a modeling method based on machine learning to simulate the marine atmospheric corrosion behavior of low-alloy <span class="Chemical">steels. The correlations between material, environmental factors and corrosion rate were evaluated, and their influences on the corrosion behavior of steels were analyzed intuitively. By using the selected dominating factors as input variables, an optimized random forest model was established with a high prediction accuracy of corrosion rate (R2 values, 0.94 and 0.73 to the training set and testing set) to different low-alloy steel samples in several typical marine atmospheric environments. The results demonstrated that machine learning was efficient in corrosion behavior analysis, which usually involves a regression analysis of multiple factors.Entities:
Keywords: 106 Metallic materials; 212 Surface and interfaces; 404 Materials informatics / Genomics; Atmospheric corrosion; corrosion; data mining; materials informatics; random forest; regression analysis
Year: 2020 PMID: 32939161 PMCID: PMC7476538 DOI: 10.1080/14686996.2020.1746196
Source DB: PubMed Journal: Sci Technol Adv Mater ISSN: 1468-6996 Impact factor: 8.090
Chemical compositions (wt.%) of selected low-alloy steels from CoDS of MatNavi.
| Steels | %C | %Si | %Mn | %P | %S | %Cu | %Cr | %Ni |
|---|---|---|---|---|---|---|---|---|
| Fe-1Ni | 0.001 | <0.003 | 0.01 | 0.0003 | 0.0001 | <0.01 | <0.01 | 0.98 |
| Fe-3Ni | 0.001 | <0.003 | 0.01 | 0.0005 | 0.0002 | <0.01 | <0.01 | 3.02 |
| Fe-5Ni | 0.001 | <0.003 | 0.11 | 0.0006 | 0.0003 | <0.01 | <0.01 | 5.01 |
| Fe-9Ni | 0.001 | <0.003 | 0.12 | 0.0005 | 0.0003 | <0.01 | <0.01 | 9.06 |
| Fe-1Cr | 0.005 | <0.003 | 0.07 | 0.0010 | 0.0002 | <0.01 | 1.01 | <0.01 |
| Fe-3 Cr | 0.006 | <0.003 | 0.05 | 0.0007 | 0.0001 | <0.01 | 3.05 | <0.01 |
| Fe-5Cr | 0.003 | <0.003 | 0.11 | 0.0003 | 0.0010 | <0.01 | 5.03 | <0.01 |
| Fe-9Cr | 0.003 | <0.003 | 0.12 | 0.0002 | 0.0003 | <0.01 | 9.03 | <0.01 |
| Fe-0.5P | 0.0012 | <0.01 | 0.011 | 0.50 | 0.0004 | 0.009 | <0.005 | <0.003 |
| Fe-1.0P | 0.0022 | <0.01 | 0.031 | 0.99 | 0.0005 | 0.015 | <0.005 | <0.003 |
| Fe-1.5P | 0.0024 | <0.01 | 0.054 | 1.48 | 0.0004 | 0.021 | <0.005 | <0.003 |
| Fe-0.4Cu | 0.001 | <0.01 | <0.01 | 0.0006 | 0.0007 | 0.43 | <0.005 | <0.01 |
| Fe-1Cu | 0.0011 | <0.01 | <0.03 | <0.001 | <0.0003 | 1.00 | <0.005 | <0.003 |
| Fe-2Cu | 0.0011 | <0.01 | <0.03 | <0.001 | <0.0003 | 1.98 | <0.005 | <0.003 |
| Fe-3Cu | 0.0013 | <0.01 | <0.03 | <0.001 | <0.0003 | 2.97 | <0.005 | <0.003 |
| SPA-H | 0.089 | 0.22 | 0.39 | 0.10 | 0.0044 | 0.31 | 0.39 | 0.11 |
| 0.09 | 0.43 | 0.38 | 0.102 | 0.005 | 0.30 | 0.67 | 0.18 | |
| SMA490 | 0.12 | 0.36 | 1.08 | 0.013 | 0.0076 | 0.34 | 0.51 | 0.08 |
| 0.13 | 0.26 | 1.01 | 0.011 | 0.005 | 0.32 | 0.48 | 0.10 | |
| SM490A | 0.15 | 0.285 | 1.45 | 0.020 | 0.0039 | <0.01 | 0.05 | <0.01 |
| 0.14 | 0.25 | 1.35 | 0.012 | 0.003 | <0.01 | 0.04 | <0.01 |
For each of low-alloy steels SPA-H, SMA490 and SM490A, corresponding specimens may have two different chemical compositions as recorded in the CoDS database [28].
List of considered material and environmental features.
| Features | Data range | Descriptions | |
|---|---|---|---|
| Material | ELEMENTS | 0.5–9.2 wt.% | Total content of alloying elements |
| Environmental | T_MAX | 31.0–37.0 °C | Maximum air temperature |
| T_MIN | −8.0–9.5 °C | Minimum air temperature | |
| T_AVE | 14.2–24.0 °C | Mean air temperature | |
| RH_MIN | 15.0–55.0% | Minimum relative humidity | |
| RH_AVE | 72.5–79.5% | Mean relative humidity | |
| SUNSHINE | 1450 – 1990 h | Duration of sunshine | |
| TOW | 3700 – 5300 h | Time of wetness | |
| PRECIPIT | 1100 – 2300 mm | Precipitation | |
| WIND_MAX | 5.5–39.5 m/s | Maximum velocity of wind | |
| WIND_AVE | 1.1–4.7 m/s | Mean velocity of wind | |
| SOLAR | 4100 – 6600 MJ/m2 | Solar radiation | |
| UV | 180 – 370 MJ/m2 | Ultraviolet radiation | |
| CHLORIDE | 2 – 55 mg NaCl/m2·d | Chloride deposition rate | |
| SO2 | 1.8–6.1 mg SO2/m2·d | SO2 deposition rate | |
| TIME | 1, 2, 3, 5, 7, 10 years | Exposure period | |
| Target property | Corrosion rate | 0.0003–0.1995 mm/a | Annual corrosion depth, millimeter per year |
Figure 1.Pearson correlation map for environmental features. Pearson correlation coefficient is shown in each box, and the values indicating extremely strong correlation (>0.8) are marked with blue squares.
Figure 2.Correlation coefficient between each feature and the corrosion rate from (a) Pearson correlation coefficient method and (b) maximal information coefficient (MIC) method.
Figure 3.The predictive accuracy (predicted corrosion rates vs. measured corrosion rates) of the random forest model and the feature importance of corresponding input variables. The random forest models were separately built for samples with (a-b) 1, (c-d) 2, (e-f) 3, (g-h) 5, (i-j) 7 and (k-l) 10 years of exposure.
Figure 4.The SHAP variable importance plot of marine atmospheric corrosion data in different exposure periods. Each dot refers to a sample, and the dots were colored by the corresponding feature value from low (blue) to high (red). The positive SHAP value represents the ability to increase the corrosion rate, and the negative SHAP value represents the ability to decrease the corrosion rate. The features were listed on the vertical axis from top to bottom in a sequence of their feature importance.
Figure 5.The performance of the machine learning models using (a) Multiple Linear Regression (MLR), (b) Ridge Regression (RR), (c) Support Vector Regression (SVR), (d) Random Forest (RF), (e) Gradient Boosting Decision Tree (GBDT) and (f) eXtreme Gradient Boosting (XGBoost) algorithms.
The predictive accuracy of machine learning models. The coefficient of determination (R) and mean absolute error (MAE) were individually calculated for samples in the training set and testing set.
| Models | Training set | Testing set | Training set | Testing set |
|---|---|---|---|---|
| Multiple Linear Regression | 0.54 | 0.43 | 0.015 | 0.018 |
| Ridge Regression | 0.56 | 0.33 | 0.014 | 0.019 |
| Support Vector Regression | 0.90 | 0.48 | 0.006 | 0.014 |
| Random Forest | 0.94 | 0.73 | 0.004 | 0.010 |
| Gradient Boosting Decision Tree | 0.96 | 0.69 | 0.005 | 0.009 |
| eXtreme Gradient Boosting | 0.93 | 0.77 | 0.006 | 0.009 |
Figure 6.The application of machine learning models for corrosion resistance evaluation on the basis of very limited material and environmental information. (a) illustration of three different atmospheric exposure sites [28]; the measured corrosion rate (star icons, three parallel samples) vs. predicted corrosion rate (bar plot) of typical low-alloy steel SPA-H, SMA490 and SM490A after (b) one and (c) ten years of exposure.