Literature DB >> 35312706

Developing random forest hybridization models for estimating the axial bearing capacity of pile.

Tuan Anh Pham1, Van Quan Tran1.   

Abstract

Accurate determination of the axial load capacity of the pile is of utmost importance when designing the pile foundation. However, the methods of determining the axial load capacity of the pile in the field are often costly and time-consuming. Therefore, the purpose of this study is to develop a hybrid machine-learning to predict the axial load capacity of the pile. In particular, two powerful optimization algorithms named Herd Optimization (PSO) and Genetic Algorithm (GA) were used to evolve the Random Forest (RF) model architecture. For the research, the data set including 472 results of pile load tests in Ha Nam province-Vietnam was used to build and test the machine-learning models. The data set was divided into training and testing parts with ratio of 80% and 20%, respectively. Various performance indicators, namely absolute mean error (MAE), mean square root error (RMSE), and coefficient of determination (R2) are used to evaluate the performance of RF models. The results showed that, between the two optimization algorithms, GA gave superior performance compared to PSO in finding the best RF model architecture. In addition, the RF-GA model is also compared with the default RF model, the results show that the RF-GA model gives the best performance, with the balance on training and testing set, meaning avoiding the phenomenon of overfitting. The results of the study suggest a potential direction in the development of machine learning models in engineering in general and geotechnical engineering in particular.

Entities:  

Mesh:

Year:  2022        PMID: 35312706      PMCID: PMC8936477          DOI: 10.1371/journal.pone.0265747

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


1. Introduction

In engineering, piles have been used to support the building, in which, the axial load capacity of the pile is considered the most important parameter in the pile foundation design. Typically, the axial load capacity of a pile can be determined using static and dynamic load tests on the construction sites. However, these methods are not only time-consuming and expensive but also often difficult to apply to small-scale projects [1, 2]. Therefore, several other approaches have been proposed in predicting the axial load capacity of piles and improving the prediction accuracy. These methods include the use of empirical approaches based on in-situ test results, such as SPT (Standard Penetration Test), CPT (Cone Penetration Test), and geometrical parameters of the pile [3-8]. The empirical formulas include a few key parameters, so it is not enough to accurately predict the pile load capacity [8]. In addition, the use of several experimental coefficients, which have a wide range for different types of soil, further deviates from the actual results [9]. Over the past two decades, artificial intelligence (AI) and machine learning (ML) have made great progress, being applied to solve many real-world problems in general and engineering in particular. In details, some ML techniques have been used in solving many engineering problems such as geotechnical problems [10-13], mechanism properties of materials [14-20], rock blasting [21]. In details, Armaghani et al. [13] developed a hybrid ML model including artificial neural network (ANN) and particle swarm optimization (PSO) in predicting settlement of pile. Using hybrid Ensembling of Surrogate ML models, Asteris et al. [14, 18, 22] improved the accuracy of ML model in predicting compressive strength of concrete. Apostolopoulou et al. [16] develop ANN model in designing natural hydraulic lime mortars. Using an ANN model, Armaghani et al. [17] can predict the unconfined compressive strength of granite with only two input variables. Armaghani et Asteris [19] propose ANN and ANFIS models in predicting the compressive strength of cement based mortar with high performance. Developing hybrid ML model including meta-heuristic search of sociopolitical algorithm and Extreme Gradient Boosting to predict compressive strength of recycled aggregate concrete. The axial load capacity of concrete filled steel tube columns can be estimated by ANN model in the investigation of Le et al. [15]. In rock blasting, the peak particle velocity can be successfully predicted by support vector machine (SVM) [21] which is famous ML technique. In this clear trend, many studies apply artificial intelligence to solve the problem of estimating the bearing capacity of piles. For example, Kumar [23] developed a k-nearest neighbor (KNN) model to predict the soil parameters required for foundation design. Goh [24, 25] presented an ANN model to predict the bearing capacity of driven piles in clays. Besides, Shahin [26] developed an ANN model to estimate the bearing capacity of driven piles and drilled shafts using a series of in-situ load tests. Nawari [27] showed an ANN algorithm to predict the deflection of drilled shafts based on (SPT) data and the shaft geometry. Momeni [28] developed ANN models to predict the shaft and tip resistance of concrete piles. Pham et al. [10] presented two models, including ANN and RF to estimate the ultimate bearing capacity of the driven pile. Shahin and Jaksa [29] presented an ANN model to predict the bearing capacity of the drilled shaft using CPT data. The published literatures show that AI has good potential to accurately predict the load capacity of piles. However, it must be said that ML models have a very complex architecture, including many hyperparameters. These hyperparameters are particularly sensitive and greatly affect the model’s forecast results [11, 30–32]. The above studies did not show that how the model architecture model is selected to predict the pile load capacity. The choice of model architecture is usually done manually, which takes a lot of time and resources. As mentioned above, various studies have been carried out to evaluate the performance of ML algorithms in predicting pile bearing capacity. However, creating hybrid models using optimization algorithms to choose the best model is a matter of concern. RF model has been proving to be one of the best ML algorithms, achieving excellent performance in previous studies [10, 33, 34]. As a matter of course, there are many optimization algorithms used to solve the problem in techniques such as gradient descent [35], quasi-newton [36], hill climb [37], simulated annealing [38], particle swarm optimization (PSO) [39], and genetic algorithm (GA) [40]. Among those algorithms, GA and PSO do not use problem gradients to be optimized, it does not require optimization problems to be as distinct as standard optimization methods such as gradient descent and quasi-newton [41]. Therefore, these are two of the most powerful and popular algorithms today in solving general engineering problems. From the above analysis, the main objective of this present investigation is to apply the two-hybrid soft computing model RF-GA and RF-PSO for the better and quick prediction of axial bearing capacity of piles based on the 10 parameters of piles geometry and soil properties. To acquire this aim, a database consisting of 472 pile load tests collected from the available literature [11]. Various performance criteria including the coefficient of determination (R2), root mean squared error (RMSE), and the mean squared error (MAE) are considered to evaluate the prediction capability of the two-hybrid RF models and individual model RF. Furthermore, 1000 simulations taking into account the randomness of the model inputs were performed to fully evaluate the feasibility of these models.

2. Research significance

High performance estimation of axial bearing capacity of pile is meaningful due to foundation design and contributions to building design. Although some machine learning models were developed to predict the axial bearing capacity of pile. For instance, ANN model in the investigation of Shahin et al. [26, 29], however low number of data containing 80 samples, that were used to develop the ANN model, reduces the performance and reliability of ML model in predicting the axial bearing capacity of pile. The Random Forest model developed by Pham et al. [10] has performance values of prediction as following R2 = 0.866, RMSE = 0.0982 MN, MAE = 0.2924 MN. This performance of RF model can be improved. Thus, the following might emphasize various contributions of the current investigation: 0.9331, 0.0929, 0.0675 A database containing 10 input variables and 472 samples is presented; The hybrid models RF-GA and RF-PSO are developed to find the best hyperparameters of Random Forest model for predicting the axial bearing capacity of pile; Monte Carlo simulations are introduced to evaluate the performance and reliability of single RF, RF-GA and RF-PSO; The performance of axial bearing capacity prediction is increased by using hybrid model RF-GA; A sensitivity analysis is performed with aided Shapley Additive Explanations to reveal the effects of input variables on both magnitude of axial bearing capacity of pile and performance prediction of RF-GA model.

3. Database construction

The data used for this study were obtained from published literature [11]. To correctly predict the bearing capacity of piles, a thorough understanding of the factors that affect the bearing capacity of the pile is needed. Most traditional pile bearing capacity determination methods include the following parameters: pile geometry, pile material properties, and soil properties [3, 42, 43]. Since SPT is one of the most popular in-situ tests, the soil properties were characterized through SPT results. In this study, the average of SPT values along the pile shaft and pile tip is taken as the main input to determine the bearing capacity of the pile. In addition, information on pile geometry and thickness of soil layers are also collected to ensure sufficient factors are used for determining pile bearing capacity [3]. More specifically, the input parameters for the model include (i) Pile diameter (X1); (ii) length of pile tip segment (X2); (iii) length of 2nd pile segment (X3); (iv) length of pile top segment (X4); (v) the natural ground elevation (X5); (vi) pile top elevation (X6); (vii) guide pile segment stop driving elevation (X7); (viii) pile tip elevation (X8); (ix) the average SPT blow along the embedded length of the pile (X9) and (x) the average SPT blow at the tip of the pile (X10). The diagram of pile parameters was shown in Fig 1. The bearing capacity is the single output variable in this study (Pu).
Fig 1

Diagram of pile parameters.

As observed in Table 1, the pile diameter (X1) ranged from 0.3 to 0.4 m. The length of the pile tip section (X2) ranged from 3.4 to 5.7 m. The length of the second pile segment (X3) ranged from 1.5 to 8 m. The length of the pile top segment (X4) ranged from 0 to 1.69 m which 0 value means that the segment does not exist. The natural ground elevation (X5) varied from 0.68 to 3.4m. The pile top elevation (X6) varied from 3.04 to 4.12 m. The guide piles’ stop driving elevation (X7) varied from 1.03 to 4.35m. The pile tip elevation (X8) varied from 8.3 to 16.09 m. The average SPT blow along the embedded length of the pile (X9) ranged from 5.6 to 15.41. The average SPT blow at the tip of the pile (X10) ranged from 4.38 to 7.75. The bearing capacity load (Pu), ranged from 0.407 MN to 1.551 MN with a mean value of 0.984 MN and a standard deviation of 0.353 MN.
Table 1

Inputs and output of the present study.

X1X2X3X4X5X6X7X8X9X10Pu
Unitmmmmmmmm--MN
Count472472472472472472472472472472472
SD(*)0.0480.4821.6380.4570.6160.0800.5991.7982.2640.6600.353
Min0.33.41.500.683.041.038.35.64.380.407
Mean0.3643.8266.5790.3312.8043.4952.91813.53810.7437.0560.984
Median0.43.457.3102.953.483.27514.1110.87.1751.069
Max0.45.7081.693.44.124.3516.0915.417.751.551

SD(*) = Standard deviation.

SD(*) = Standard deviation. The data distribution between the input variables and axial bearing capacity is plotted in Fig 2, the linear correlation coefficients are shown in Fig 3. As Fig 1 clearly shows, some input variables are significantly correlated such as X3 and X8, X9 and X8. However, all input variables are considered in this investigation to increase the accuracy of the proposed model.
Fig 2

Data distribution of input variables and output Pu.

Fig 3

Relation between input and output via matrix Pearson correlation.

In this investigation, the collected dataset was divided into the training and testing datasets. The number of samples used for training should not be too small, so it will be difficult for the model to learn the generality of the data. In addition, because the number of samples is quite large, the selected sample ratio is 80% for training and 20% for testing in this study, still ensuring that the number of test samples is enough to confirm the model performance. Different from the original data, the training dataset (including 10 inputs and 1 output) was normalized in the [0; 1] range to help variables have the same importance. A normalization process of parameters, such as the minimum and maximum values of the training data were performed to scale the testing dataset.

4. Methods used

4.1. Random forest (RF)

Randomized Forest (RF) belongs to the family of ML methods, which includes different algorithms for generating a set of decision trees. The random forest method was first proposed by Ho [44], and quickly became one of the powerful ML algorithms, commonly used to solve various problems [10, 34, 45]. In essence, RF was a bagging ensemble method that can improve variable selection [46]. Breiman [47] showed that random forests which are grown using random vectors in the tree construction are equivalent to a kernel acting on the true margin. In this algorithm, two principles of "randomization" are used: Bagging and Random Feature Selection [48]. That is, each decision tree in a random forest was built based on a random number of input features. Therefore, the RF model adjusts the decision tree’s over-fitting habits into their training set, or in other words, the RF generally outperforms the decision tree. A general randomized forest model is shown in Fig 4.
Fig 4

Random forest model.

When Breiman introduced the RF model in [47], the author also demonstrated that when the number of trees exceeds a certain value, adding other trees does not systematically improve the performance of the RF. This result suggests that the number of trees in RF does not need to be too large to achieve a high-efficiency performance [45, 49]. In this study, the RF model used from the scikit-learn library [50], and five model hyper-parameters that have the greatest influence on the predicted results of the RF model are considered. To be more specific, these include H1—the maximum depth of the tree; H2—the maximum features used for random bagging in each decision tree; H3—the minimum number of samples required to be at a leaf node; H4—the minimum number of samples required to split a node; H5 –the number of the decision tree. While the number of trees does not need to be too large, parameters such as H1, H2, H3, H4 affect the complexity of the tree. Trees that are too complex can cause the model to over-fitting and not achieve high generalization.

4.2. Particle swarm optimization (PSO)

Particle Swarm Optimization (PSO) was one of the most widely used optimization techniques. J. Kennedy and R. Eberhart [39] were the first to present it. It became famous due to the fact that it was a type of continuous optimization procedure. PSO employs shifting the position of the particles in the herd at a constant velocity that is updated with each iteration to find the optimal solution. Each particle’s mobility is influenced by the swarm’s personal best position and global best (cf. Fig 5). PSO is widely employed for optimization issues in various domains of engineering, particularly geotechnical [51, 52].
Fig 5

Particle movement by the swarm direction.

The pseudo-code of the algorithm is presented below: FOR each particle i in swarm FOR each dimension j Initialize position Gij randomly Initialize velocity Vij randomly END FOR END FOR Iteration k = 1 DO FOR each particle i in swarm Calculate fitness value P(i) IF P(i) > P_best(i) THEN P_best(i) = P(i) END IF IF P(i) > G_best THEN G_best = P(i) END IF END FOR FOR each particle i in swarm FOR each dimension j Calculate new velocity: Vij(k+1) = wVij(k) + c1rand1(P_best(i)—Gij(i)) + c2rand1 (G_best–Gij(i)) Update particle positon: Gij(k+1) = Gij(k) + Vij(k+1) END FOR END FOR w = w.wd k = k + 1 WHILE k < maximum_Iteration In which: w is an inertial parameter; c₁, c₂ are the acceleration coefficients; wd is the reduction coefficient of w.

4.3. Genetic algorithm (GA)

GA is one of the most powerful global optimization algorithms, used to solve various problems. This Algorithm was first introduced by Holland [40]. The origins of this approach are based on the Darwinian theory, in which an evolved and adaptable population rests on the most powerful individuals. In the GA algorithm, the population size is one of the most important factors reflecting the total number of solutions and significantly affects the results of the problem [53], while the number of generations refers to the maximum number of iterations of the algorithm [54]. Same as PSO, GA does not use gradient descent, so GA allows finding the minimum of a function even in the absence of a derivative. Moreover, other studies used the GA method whose effectiveness has been proven [6, 53, 55–57]. In this study, using the GA algorithm, an optimization technique was developed to find the optimal model architecture for RF. The pseudo-code of the GA algorithm is presented below: FOR each chromosome i in Population FOR each gene j Initialize Gij randomly END FOR END FOR Generation k = 1 DO FOR each chromosome i in Population Calculate the fitness value Pi End FOR Mating the best chromosomes Mutates some children randomly Remove the weakest chromosomes k = k + 1 WHILE maximum generation

4.4. Modeling and hyper-parameters tuning

In this investigation, the RF model is proposed in modeling the nonlinear relationship between the inputs and the output. To get high performance, the hyper-parameters of RF will be tuned using optimization algorithms including GA and PSO. Five parameters of the RF model are tuned as suggested in the literature [33]. Table 2 showed the tuned hyper-parameters, the explanation, and the value of tuning ranges. Thus, the architecture of the population in GA (or the swarm in PSO) was illustrated in Fig 6. It can be seen that the population (or the swarm) has many members, and each member has five dimensions corresponding to the 5 hyperparameters of the RF model. The flowchart of hybrid RF models was illustrated in Fig 7. In these models, RF was used as the fitness function and the member with the best fitness value was considered as the best individual. The dimension value of the best individual are selected as the best hyperparameters of the RF model.
Table 2

Hyper-parameters description and tuning range.

NoDenoteHyperparametersExplanationRange
1H1Max_depthThe maximum depth of decision tree2–20
2H2Max_featuresThe maximum features which random chosen for bagging.1–10
3H3Min_samples_leafThe minimum number of samples required to be at a leaf node2–20
4H4Min_samples_splitThe minimum number of samples required to split an internal node2–20
5H5n_estimatorsThe number of trees in the forest2–200
Fig 6

The architecture of the population (the swarm).

Fig 7

The flowchart of hybrid RF models.

For more objective results, 20 models RF-GA and RF-PSO were developed, taking into account the random initialization of the population. To make the comparison between the optimization algorithms, the maximum number of iterations of the two-optimization algorithms is 100 and the number of the population is 30. It is important to note that, to avoid overfitting of the models to the data, the 10-fold CV technique on the training set was used in this step. In this technique, the training data set is divided into 10 folds, 9 folds for training, and 1-fold for verification. The average results of 10 such times were compared for each optimization iteration step to confirm the performance of the hybrid models. All initial parameter setting in the GA and PSO was determined by trial tests [33]. The best initial parameter settings for GA and PSO were given in Table 3.
Table 3

The initial values of optimization algorithms.

RF-GARF-PSO
ParameterValueParameterValue
Population30Number of particles30
Number of children12C11.4
Mutation rate0.4C22
Generation100w1
Fitness valueR2wd0.99
DataTraining set/10-Fold CVFitness valueR2
DataTraining set/10-Fold CV
Iteration100

4.4. Performance evaluation

In this paper, three indicators accounting for the error between the actual and predicted values were used, namely the mean absolute error (MAE), root mean square error (RMSE), squared correlation coefficient (R2). The R2 measures the squared correlation between the predicted and actual values, having values in the range of [0, 1]. Low RMSE and MAE show better accuracy of the proposed ML algorithms. On the other hand, RMSE calculates the squared root average difference, whereas MAE calculates the difference between the predicted and actual values. These values can be calculated using the following equations [58-60]: where k infers the number of the samples, vi, and are the actual and predicted outputs, respectively, and is the average value of the v.

5. Results and discussion

5.1. Hyperparameters tuning

Fig 8 showed the performance of the RF-GA and RF-PSO models after 20 runs with the random initiation population. It can be seen that the performance of the models after each run is different. Specifically, the best result of the RF-GA R2 model is 0.937 in the 12th round while the RF-PSO model gives the best result with R2 reaching 0.933 in the 8th round. In addition, the RF-GA model gives the lowest result was R2 = 0.932 at the 3rd iteration, while the RF-PSO model achieved the worst result R2 = 0.929 at the 11th iteration. Overall, the RF-GA model seemed to be more efficient compared with the RF-PSO model.
Fig 8

Best value of R2 in 20 optimization runs using RF-GA and RF-PSO.

Fig 9 illustrated the best results of the RF-GA and RF-PSO models after 100 iterations. It can be seen that the RF-GA model converged quickly and achieved the best results after the 34th iteration with a performance index R2 = 0.937. On the other side, the RF-PSO model appeared to be slower in convergence and had the best result of only R2 = 0.933 at the 94th loop. The loop increase may continue to give better performance for both two-hybrid models, however, in the framework of this study, 100 loops is the limit to compare the performance of two hybrid models.
Fig 9

Hyper-parameters tuning using RF-GA and PSO-GA.

The best hyper-parameters combinations found through the RF-GA and RF-PSO models were given in Table 4. It is worth noting that the min_sample_leaf value of the two optimal models is equal to 2, the other hyper-parameters were not the same.
Table 4

Best parameters proposed by GA and PSO algorithms.

Max_depthMax_featuresMin_samples_leafMin_samples_splitn_estimators
RF-GA121212144
RF-PSO822616

5.2. Performance comparison of RF, RF-PSO, RF-GA

From a statistical probability standpoint, the randomness of the division of training and test datasets should be carefully considered. In this step, 1000 random samplings of the training set and testing set were performed to verify the stability of the models. Specifically, three models were compared including RF-GA, RF-PSO, and RF, where the RF model is used with default parameters. Fig 10 showed the density graph of the models after 1000 runs for the performance indicators such as R2, RMSE, and MAE for training and testing part of 3 models while the results were summarized in Tables 5–7 for R2, RMSE, and MAE, respectively. The results showed that, on the training set, the RF-PSO model gives the best results with the average performance indicators reaching R2 = 0.982, RMSE = 0.04649 and MAE = 0.033, respectively. However, on the testing set, the RF-PSO model gave bad results when the performance indicators were R2 = 0.9242, RMSE = 0.0939, and MAE = 0.0679, respectively. This result is equivalent to the default-RF model when achieving the corresponding indicators R2 = 0.9240, RMSE = 0.0940, and MAE = 0.0680.
Fig 10

Density chart of models after 1000 runs for data set: (a) (c) (e)–training set; (b) (d) (f)–testing set.

Table 5

Summary of the 1000 simulations using R2 criteria.

ModelDatasetAverageMinMaxSD
RF-GATraining0.963110.95390.969470.00223
Testing0.930430.85870.964870.01574
RF-PSOTraining0.982010.977760.987390.0015
Testing0.924170.798260.963670.02054
RFTraining0.9630.954740.971030.00239
Testing0.924040.791430.967650.01996
Table 7

Summary of the 1000 simulations using MAE criteria.

ModelDatasetAverageMinMaxSD
RF-GATraining0.049530.044720.054760.00133
Testing0.066030.050140.085670.00582
RF-PSOTraining0.0330.029130.035950.00105
Testing0.067880.050070.097250.00621
RFTraining0.049570.044540.054170.0014
Testing0.068010.049230.097080.00622
Density chart of models after 1000 runs for data set: (a) (c) (e)–training set; (b) (d) (f)–testing set. This implies that the RF-PSO and default-RF models proved too fit for the training set and not good in the testing set. In other words, these models are a bit overfitting and do not generalize well the data set. In the opposite direction, the RF-GA model showed better generality when the results were good on the test set with the best average performance indexes and achieved R2 = 0.93043, RMSE = 0, 08847, MAE = 0.06603, and the corresponding results achieved on the training set are R2 = 0.96311, RMSE = 0.06545, MAE = 0.04953 respectively. In addition, the standard deviation of the RF-GA model on the test set is also the smallest in all 3 criteria, proving that the model has the best stability. Generally, in terms of stability and best generalization, the RF-GA model was selected as the last model in this study.

5.3. Prediction performance of hybrid model RF-GA

The best architecture of the RF model determined by GA algorithms was applied for this section. In this section, the predictive capacity of the best-performance RF-GA model was presented. In especially, the best RF architecture’s prediction results were presented. A regression model in Fig 11 showed the correlation between the actual and predicted values for the training and testing datasets, respectively. A linear fit was also applied and plotted in each case. It is observed that the linear regression lines were very close to the diagonal lines, which confirms the close correlation between the actual and predicted axial bearing capacity of piles. The calculated values of R2, RMSE and MAE for the training dataset were 0.9639, 0.0661, 0.0511 and 0.9331, 0.0929, 0.0675 for the testing dataset, respectively. The results of the performance criteria show that the RF model with the tuned hyperparameters can accurately predict the axial bearing capacity of piles. Fig 12 showed the error values corresponding to the training and testing databases are low. Almost all the error values between the actual and predicted values were about 0 for the training and testing part confirmed that the RF model has been successful in estimating the axial load capacity of the pile.
Fig 11

Regression graphs for the case of the best parameters of RF-GA model (a) training dataset; and (b) testing dataset.

Fig 12

Error between target and output values plots for the case of the best model RF-GA (a) training dataset; and (b) testing dataset.

Regression graphs for the case of the best parameters of RF-GA model (a) training dataset; and (b) testing dataset. Error between target and output values plots for the case of the best model RF-GA (a) training dataset; and (b) testing dataset. It is important to note that due to the limitations of this study, the best RF model developed only achieves high prediction performance under the condition that the input parameter values are between the minimum and maximum values. Input values that are outside the recommended range will cause the model to be confused and incorrectly predict the bearing capacity of the piles. Moreover, the range of input and output values is crucial in improving performance of ML model [22]. In this investigation, the value of axial bearing capacity of pile varies about from 0.5 to 1.5 MN. However, the missing value of range (0.7;1.0) for the axial bearing capacity of pile (cf. Fig 11) seems to reduce the performance of RF-GA model. The prediction of axial bearing capacity of pile in this missing range needs to be careful and is not recommended. With the database containing 472 samples and 10 input variables, the prediction of axial bearing capacity by RF-GA model is recommended in range (0.40;0.70) and (1.00;1.55) MN of axial bearing capacity of pile. Therefore, the performance and reliability of prediction can be improved if the missing of range is completed in future research. In practical engineering application, the RF model can be illustrated via a large number of decision trees which is built in the form of if-else structures in EXCEL, so the user only needs to enter 10 input variables and get the output variable, which is the pile load capacity. The EXCEL file of load capacity estimation which contains the final RF model is attached in S1 Data.

5.4. Sensitivity analysis

The RF algorithm is capable of evaluating the importance of the input parameters. The importance of each input variable is measured by the change in the accuracy of the prediction when the input variable is not selected during the division for each decision tree [61]. The importance of the variables is represented by the Shapley Additive Explanations [62]. This type of plot aggregates SHAP values for all the features is shown in Fig 13. According to the SHAP value, the X8 input corresponding to pile tip elevation is the most important feature. The pile tip elevation has a positive impact on the axial bearing capacity of the pile, in fact, with higher pile tip elevation, the bearing capacity is increased. However, the X5 input corresponding to natural ground elevation has a negative impact on the bearing capacity of the pile. With the lower natural ground elevation, the bearing capacity is higher. These behavior are concluded by Coyle et Sulaiman [63] and Liu et al. [64]. The lowest impact on the pile bearing capacity is pile top elevation which has a positive effect. With a higher elevation of pile top, the pile bearing capacity is slightly increased.
Fig 13

Feature importance of 10 variables used in this investigation.

6. Conclusion

In this study, RF hybrid models were developed to predict the axial load capacity of the pile. Two global optimization algorithms, GA and PSO, were selected for the hyperparameters optimization of the RF model. For research purposes, the data set including 472 pile load test results were used to train and test the model. The results show that out of the 2 optimized algorithms selected, GA seems to provide better performance than PSO in optimizing the RF model. Specifically, the RF-GA model gives good results in the training set, while also providing expert performance on the test set. Meanwhile, the RF-PSO model appears to be overfitting when it comes to excellent performance on the training set, but poorly on the testing set. In addition, when compared to the default RF model, both the RF-GA model and the RF-PSO model yield better results demonstrating the efficiency when using the optimal algorithms. In addition, a sensitivity analysis using the RF-GA model showed that amongst 10 input variables used to predict the axial bearing capacity of the pile, the pile tip elevation was the most important feature, this feature has a positive effect on the axial bearing capacity of piles. Overall, the RF model optimized by GA provides expert performance in predicting the axial load capacity of the pile. This model could be used as a quick and accurate tool to predict the axial load capacity of the pile. In addition, the model also has great potential in solving other technical problems. In order to increase the performance and reliability of ML model in predicting axial bearing capacity of pile, the range of axial bearing capacity of pile in [0.70;1.00] and the associated input variable values need to be completed.

Load capacity estimation.

(XLSX) Click here for additional data file. 21 Jan 2022
PONE-D-21-34770
Developing random forest hybridization models for estimating the axial bearing capacity of pile
PLOS ONE Dear Dr. Tran, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Mar 07 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Wajid Mumtaz Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf  and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. 3. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: In this article, the authors present soft computing techniques to predict the axial bearing capacity of piles. Especially, artificial intelligent techniques such as Random Forest (RF) models for the prediction of the axial bearing capacity of piles are developed and proposed. The problem of the estimation of the axial bearing capacity of piles is an important significant issue in structural and geotechnical engineering, and such an attempt is of great interest. However the paper, in its present form, requires some substantial modifications in order to justify its publication in an International Journal such as Plos One. The following points should be further elaborated by the authors: 1. Authors are kindly requested in order to further strengthen their work to add/include a new short section after the Introduction titled Research Significance where they should to justify the need for further research on the subject. 2. The literature review is not adequately covered and complete as presented in the manuscript. Extensive and in-depth state-of-the-art reports can be found in the following works: [Armaghani, D.J.; Asteris, P.G.; Fatemi, S.A.; Hasanipanah, M.; Tarinejad, R.; Rashid, A.S.A.; Huynh, V.V. On the Use of Neuro-Swarm System to Forecast the Pile Settlement. Appl. Sci. 2020, 10, 1904. https://doi.org/10.3390/app10061904; Asteris, P.G., Skentou, A.D., Bardhan, A., Samui, P., and Pilakoutas, K. (2021). Predicting Concrete Compressive Strength using Hybrid Ensembling of Surrogate Machine Learning Models, Cement and Concrete Research, Volume 145, 106449; Le, T.-T., Asteris, P.G., Lemonis, M.E. (2020). Axial Load Capacity of Rectangular Concrete-filled Steel Tube Columns using Machnine Learning Techniques, Engineering with Computers, https://doi.org/10.1007/s00366-021-01461-0; Apostolopoulou, M., Asteris, P.G., Armaghani, D.J., Douvika, MG., Lourenço, P.B., Cavaleri, L., Bakolas, A., Moropoulou, A. (2020). Mapping and holistic design of natural hydraulic lime mortars, Cement and Concrete Research, 136, 106167, https://doi.org/10.1016/j.cemconres.2020.106167; Armaghani, D.J., Mamou, A., Maraveas, C., Roussis, P.C., Siorikis, V.G., Skentou, A.D., Asteris, P.G. (2021). Predicting the unconfined compressive strength of granite using only two non-destructive test indexes, Geomechanics and Engineering, 317-330]. An additional paragraph, containing the augmented literature review, should be added. 3. The authors are kindly requested to include a short paragraph about the limitations of their work. Namely, they should to make clear that their proposed models are valid for input parameters values among the minimum and maximum values of the ten input parameters. 4. Also, authors should include a comment about the reliability of the database used. As a general trend, it is noticed that, during the process of developing a forecast model, researchers pay particular attention to the computational model itself, while at the same time, not giving the same amount of attention to the database that is used for the development, training and validation of the model. Although research related to new computational models is of course of high importance and added value for the international scientific community, the authors believe that, since the ultimate goal is a reliable forecast, the reliability of the database should be of utmost importance and should be thoroughly investigated in this regard. In fact, a reliable database must comprise of not only reliable data, but also of a sufficient amount of data, that covers the full range of parameter values, regarding the parameters which influence the problem investigated [Asteris, P.G., Apostolopoulou, M., Armaghani, D.J., Cavaleri, L., Chountalas, A.T., Guney, D., Hajihassani, M., Hasanipanah, M., Khandelwal, M., Karamani, C., Koopialipoor, M., Kotsonis, E., Le, T-T., Lourenço, P.B., Ly, H-B., Moropoulou, A., Nguyen, H., Pham, B.T., Samui, P., Zhou, J. (2020). On the metaheuristic models for the prediction of cement-metakaolin mortars compressive strength, Metaheuristic Computing and Applications, 1(1), 63-99, DOI: http://dx.doi.org/10.12989/mca.2020.1.1.063]. Based on the above their database which consists of only 99 datasets while at the same time their input parameters are 10 they should to include a comment about this issu. 5. It is well known the majority of authors present in their published articles only the architecture of NN model. Any architecture without the values of final values of NN model weights has very little value for others researchers and practicing engineers. In order to be useful, a proposed NN architecture should be accompanied by the (quantitative) values of weights. Authors are kindly requested to present their models final values of weights and bias [Asteris, P.G., Mokos, V.G. (2020). Concrete Compressive Strength using Artificial Neural Networks, Neural Computing and Applications, 32, 1807–11826, https://doi.org/10.1007/s00521-019-04663-2 ; Armaghani, D.J., Asteris, P.G. (2020). A comparative study of ANN and ANFIS models for the prediction of cement-based mortar materials compressive strength, Neural Computing and Applications, https://doi.org/10.1007/s00521-020-05244-4 ; Duan, J., Asteris, P.G., Nguyen, H. Bui, X.-N., Moayedi, H. (2020). A Novel Artificial Intelligence Technique to Predict Compressive Strength of Recycled Aggregate Concrete Using ICA-XGBoost Model, Engineering With Computers, https://doi.org/10.1007/s00366-020-01003-0; Zeng, J., Roussis, P.C., Mohammed, A.S., Maraveas C.,Fatemi S.A., Armaghani, D.J., Asteris, P.G. (2021).Prediction of peak particle velocity caused by blasting through the combinations of boosted-chaid and svm models with various kernels, Applied Sciences (Switzerland), 2021, 11(8), 3705; ]. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Panagiotis G. Asteris [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.
16 Feb 2022 RESPONSES OF THE ACADEMIC EDITOR AND REVIEWERS’ COMMENTS I. RESPONSE TO REVIEWER #1 Comment #1. Authors are kindly requested in order to further strengthen their work to add/include a new short section after the Introduction titled Research Significance where they should to justify the need for further research on the subject. Response: We thank Reviewer #1 for this recommendation. A new section “Research Significance” is added in the revised manuscript. Comment #2. The literature review is not adequately covered and complete as presented in the manuscript. Extensive and in-depth state-of-the-art reports can be found in the following works: Armaghani, D.J.; Asteris, P.G.; Fatemi, S.A.; Hasanipanah, M.; Tarinejad, R.; Rashid, A.S.A.; Huynh, V.V. On the Use of Neuro-Swarm System to Forecast the Pile Settlement. Appl. Sci. 2020, 10, 1904. https://doi.org/10.3390/app10061904; Asteris, P.G., Skentou, A.D., Bardhan, A., Samui, P., and Pilakoutas, K. (2021). Predicting Concrete Compressive Strength using Hybrid Ensembling of Surrogate Machine Learning Models, Cement and Concrete Research, Volume 145, 106449; Le, T.-T., Asteris, P.G., Lemonis, M.E. (2020). Axial Load Capacity of Rectangular Concrete-filled Steel Tube Columns using Machnine Learning Techniques, Engineering with Computers, https://doi.org/10.1007/s00366-021-01461-0; Apostolopoulou, M., Asteris, P.G., Armaghani, D.J., Douvika, MG., Lourenço, P.B., Cavaleri, L., Bakolas, A., Moropoulou, A. (2020). Mapping and holistic design of natural hydraulic lime mortars, Cement and Concrete Research, 136, 106167, https://doi.org/10.1016/j.cemconres.2020.106167; Armaghani, D.J., Mamou, A., Maraveas, C., Roussis, P.C., Siorikis, V.G., Skentou, A.D., Asteris, P.G. (2021). Predicting the unconfined compressive strength of granite using only two non-destructive test indexes, Geomechanics and Engineering, 317-330]. An additional paragraph, containing the augmented literature review, should be added. Response: We thank Reviewer #1 for this recommendation. A new paragraph containing the augmented literature review is added in the revised manuscript. Comment #3. The authors are kindly requested to include a short paragraph about the limitations of their work. Namely, they should to make clear that their proposed models are valid for input parameters values among the minimum and maximum values of the ten input parameters. Response: We appreciate reviewer comments. We appreciate reviewer comments. This comment is completely worthwhile because, by nature, decision tree-based models like Random Forest will not be able to predict outside of the trained range. Therefore, we added the following limitation of the study in the resubmission: “It is important to note that due to the limitations of this study, the best RF model developed only achieves high prediction performance under the condition that the input parameter values are between the minimum and maximum values. Input values that are outside the recommended range will cause the model to be confused and incorrectly predict the bearing capacity of the piles.” Comment #4. Also, authors should include a comment about the reliability of the database used. As a general trend, it is noticed that, during the process of developing a forecast model, researchers pay particular attention to the computational model itself, while at the same time, not giving the same amount of attention to the database that is used for the development, training and validation of the model. Although research related to new computational models is of course of high importance and added value for the international scientific community, the authors believe that, since the ultimate goal is a reliable forecast, the reliability of the database should be of utmost importance and should be thoroughly investigated in this regard. In fact, a reliable database must comprise of not only reliable data, but also of a sufficient amount of data, that covers the full range of parameter values, regarding the parameters which influence the problem investigated Asteris, P.G., Apostolopoulou, M., Armaghani, D.J., Cavaleri, L., Chountalas, A.T., Guney, D., Hajihassani, M., Hasanipanah, M., Khandelwal, M., Karamani, C., Koopialipoor, M., Kotsonis, E., Le, T-T., Lourenço, P.B., Ly, H-B., Moropoulou, A., Nguyen, H., Pham, B.T., Samui, P., Zhou, J. (2020). On the metaheuristic models for the prediction of cement-metakaolin mortars compressive strength, Metaheuristic Computing and Applications, 1(1), 63-99, DOI: http://dx.doi.org/10.12989/mca.2020.1.1.063. Based on the above their database which consists of only 99 datasets while at the same time their input parameters are 10 they should to include a comment about this issue. Response: We thank Reviewer #1 for this excellent recommendation and discussion about reliability of database. Based on the interesting comment of Reviewer #1, a new paragraph is added in section 5.3 and conclusion of the revised manuscript. Comment #5. It is well known the majority of authors present in their published articles only the architecture of NN model. Any architecture without the values of final values of NN model weights has very little value for others researchers and practicing engineers. In order to be useful, a proposed NN architecture should be accompanied by the (quantitative) values of weights. Authors are kindly requested to present their models final values of weights and bias Asteris, P.G., Mokos, V.G. (2020). Concrete Compressive Strength using Artificial Neural Networks, Neural Computing and Applications, 32, 1807–11826, https://doi.org/10.1007/s00521-019-04663-2 ; Armaghani, D.J., Asteris, P.G. (2020). A comparative study of ANN and ANFIS models for the prediction of cement-based mortar materials compressive strength, Neural Computing and Applications, https://doi.org/10.1007/s00521-020-05244-4 ; Duan, J., Asteris, P.G., Nguyen, H. Bui, X.-N., Moayedi, H. (2020). A Novel Artificial Intelligence Technique to Predict Compressive Strength of Recycled Aggregate Concrete Using ICA-XGBoost Model, Engineering With Computers, https://doi.org/10.1007/s00366-020-01003-0; Zeng, J., Roussis, P.C., Mohammed, A.S., Maraveas C.,Fatemi S.A., Armaghani, D.J., Asteris, P.G. (2021). Prediction of peak particle velocity caused by blasting through the combinations of boosted-chaid and svm models with various kernels, Applied Sciences (Switzerland), 2021, 11(8), 3705; Response: We appreciate the reviewer's comment. We agree that in practice, engineers and researchers will find it difficult to apply studies if only information about the architecture of the models is available. However, due to the nature of the Random Forest model containing a large number of decision trees, a complete illustration of decision trees is difficult within the framework of the manuscript. Therefore, we suggest moving the final RF model to an open spreadsheet (specifically, the one in MS EXCEL). The decision trees model is built in the form of if-else structures in EXCEL, so the user only needs to enter 10 input variables and get the output variable, which is the pile load capacity. The EXCEL file which contains the final RF model is attached with the revised manuscript. Submitted filename: Response to reviewers_Final.docx Click here for additional data file. 8 Mar 2022 Developing random forest hybridization models for estimating the axial bearing capacity of pile PONE-D-21-34770R1 Dear Dr. Van Quan Tran, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Wajid Mumtaz Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: No The authors have adequately addressed the comments of the reviewers and the manuscript is ready to be sent to the production editor for a final check to see that everything is in order. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No 10 Mar 2022 PONE-D-21-34770R1 Developing random forest hybridization models for estimating the axial bearing capacity of pile Dear Dr. Tran: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Wajid Mumtaz Academic Editor PLOS ONE
Table 6

Summary of the 1000 simulations using RMSE criteria.

ModelDatasetAverageMinMaxSD
RF-GATraining0.065450.059660.072260.0018
Testing0.088470.064720.124420.00889
RF-PSOTraining0.046490.039820.051320.00188
Testing0.093910.067680.152440.01117
RFTraining0.06560.058780.071480.00188
Testing0.094030.063290.159290.01097
  3 in total

1.  Gradient descent learning algorithm overview: a general dynamical systems perspective.

Authors:  P Baldi
Journal:  IEEE Trans Neural Netw       Date:  1995

2.  From Local Explanations to Global Understanding with Explainable AI for Trees.

Authors:  Scott M Lundberg; Gabriel Erion; Hugh Chen; Alex DeGrave; Jordan M Prutkin; Bala Nair; Ronit Katz; Jonathan Himmelfarb; Nisha Bansal; Su-In Lee
Journal:  Nat Mach Intell       Date:  2020-01-17

3.  Design deep neural network architecture using a genetic algorithm for estimation of pile bearing capacity.

Authors:  Tuan Anh Pham; Van Quan Tran; Huong-Lan Thi Vu; Hai-Bang Ly
Journal:  PLoS One       Date:  2020-12-17       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.