Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 InDel marker detection by integration of multiple softwares using machine learning techniques.

Literature DB >> 27806691

InDel marker detection by integration of multiple softwares using machine learning techniques.

Jianqiu Yang¹, Xinyi Shi², Lun Hu¹, Daipeng Luo¹, Jing Peng¹, Shengwu Xiong¹, Fanjing Kong³, Baohui Liu³, Xiaohui Yuan⁴.

Abstract

BACKGROUND: In the biological experiments of soybean species, molecular markers are widely used to verify the soybean genome or construct its genetic map. Among a variety of molecular markers, insertions and deletions (InDels) are preferred with the advantages of wide distribution and high density at the whole-genome level. Hence, the problem of detecting InDels based on next-generation sequencing data is of great importance for the design of InDel markers. To tackle it, this paper integrated machine learning techniques with existing software and developed two algorithms for InDel detection, one is the best F-score method (BF-M) and the other is the Support Vector Machine (SVM) method (SVM-M), which is based on the classical SVM model.
RESULTS: The experimental results show that the performance of BF-M was promising as indicated by the high precision and recall scores, whereas SVM-M yielded the best performance in terms of recall and F-score. Moreover, based on the InDel markers detected by SVM-M from soybeans that were collected from 56 different regions, highly polymorphic loci were selected to construct an InDel marker database for soybean.
CONCLUSIONS: Compared to existing software tools, the two algorithms proposed in this work produced substantially higher precision and recall scores, and remained stable in various types of genomic regions. Moreover, based on SVM-M, we have constructed a database for soybean InDel markers and published it for academic research.

Entities: Chemical Species

Keywords: Evaluation; InDel detection; Insertions and deletions

Year: 2016 PMID： 27806691 DOI： 10.1186/s12859-016-1312-2

Source DB: PubMed Journal: BMC Bioinformatics ISSN： 1471-2105 Impact factor: 3.169

1 in total

1. Genome-wide identification and characterization of InDels and SNPs in Glycine max and Glycine soja for contrasting seed permeability traits.

Authors: G Ramakrishna; Parampreet Kaur; Deepti Nigam; Pavan K Chaduvula; Sangita Yadav; Akshay Talukdar; Nagendra Kumar Singh; Kishor Gaikwad
Journal: BMC Plant Biol Date: 2018-07-09 Impact factor: 4.215

1 in total