Literature DB >> 33561957

Efficient Feature Selection for Static Analysis Vulnerability Prediction.

Katarzyna Filus1, Paweł Boryszko1, Joanna Domańska1, Miltiadis Siavvas2, Erol Gelenbe1.   

Abstract

Common software vulnerabilities can result in severe security breaches, financial losses, and reputation deterioration and require research effort to improve software security. The acceleration of the software production cycle, limited testing resources, and the lack of security expertise among programmers require the identification of efficient software vulnerability predictors to highlight the system components on which testing should be focused. Although static code analyzers are often used to improve software quality together with machine learning and data mining for software vulnerability prediction, the work regarding the selection and evaluation of different types of relevant vulnerability features is still limited. Thus, in this paper, we examine features generated by SonarQube and CCCC tools, to identify those that can be used for software vulnerability prediction. We investigate the suitability of thirty-three different features to train thirteen distinct machine learning algorithms to design vulnerability predictors and identify the most relevant features that should be used for training. Our evaluation is based on a comprehensive feature selection process based on the correlation analysis of the features, together with four well-known feature selection techniques. Our experiments, using a large publicly available dataset, facilitate the evaluation and result in the identification of small, but efficient sets of features for software vulnerability prediction.

Entities:  

Keywords:  feature selection; machine learning; software vulnerability prediction; static analysis

Year:  2021        PMID: 33561957      PMCID: PMC7915846          DOI: 10.3390/s21041133

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  5 in total

1.  Reducing Bias and Error in the Correlation Coefficient Due to Nonnormality.

Authors:  Anthony J Bishara; James B Hittner
Journal:  Educ Psychol Meas       Date:  2014-11-11       Impact factor: 2.821

2.  Diagnostic tests. 1: Sensitivity and specificity.

Authors:  D G Altman; J M Bland
Journal:  BMJ       Date:  1994-06-11

3.  Detecting stable distributed patterns of brain activation using Gini contrast.

Authors:  Georg Langs; Bjoern H Menze; Danial Lashkari; Polina Golland
Journal:  Neuroimage       Date:  2010-08-13       Impact factor: 6.556

Review 4.  Investigating the Interaction between Energy Consumption, Quality of Service, Reliability, Security, and Maintainability of Computer Systems and Networks.

Authors:  Dionysios Kehagias; Marija Jankovic; Miltiadis Siavvas; Erol Gelenbe
Journal:  SN Comput Sci       Date:  2021-01-08

5.  The Effect of Faking on the Correlation Between Two Ordinal Variables: Some Population and Monte Carlo Results.

Authors:  Marco Bressan; Yves Rosseel; Luigi Lombardi
Journal:  Front Psychol       Date:  2018-10-12
  5 in total
  1 in total

1.  Analytical Modeling for Identification of the Machine Code Architecture of Cyberphysical Devices in Smart Homes.

Authors:  Igor Kotenko; Konstantin Izrailov; Mikhail Buinevich
Journal:  Sensors (Basel)       Date:  2022-01-28       Impact factor: 3.576

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.