| Literature DB >> 27079510 |
Yuanda Lv1, Yuhe Liu2, Han Zhao3.
Abstract
BACKGROUND: Rich in genetic information and cost-effective to genotype, the Insertion-Deletion (InDel) molecular marker system is an important tool for studies in genetics, genomics and for marker-assisted breeding. Advent of next-generation sequencing (NGS) revolutionized the speed and throughput of sequence data generation, and enabled genome-wide identification of insertion and deletion variation. However, current NGS-based InDel mining tools, such as Samtools, GATK and Atlas2, all rely on a reference genome for variant calling which hinders their application on unsequenced organisms and the output of short InDels compromised their use on gel-based genotyping platforms. To address these issues, an enhanced platform is needed to identify longer InDels and develop markers in absence of a reference genome.Entities:
Keywords: Genome-wide marker discovery; Insertions and deletions (InDels); Marker platform; Marker-assisted breeding; Next-generation sequencing (NGS)
Mesh:
Substances:
Year: 2016 PMID: 27079510 PMCID: PMC4832496 DOI: 10.1186/s12864-016-2614-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Overview of mInDel. mInDel is designed in five steps and implemented by five independent modules, which forms a pipeline of sequence analysis from raw sequences to InDel marker development. Modular design allows flexible choices of analysis steps based on users’ needs
Comparison between mInDel with other InDel calling tools
| Method | mInDel | SAMtools | GATK | Atlas2 |
|---|---|---|---|---|
|
| Yes | No | No | No |
| Reference genome-independent | Yes | No | No | No |
| Different aligners for different read lengths | Yes | No | No | No |
| InDel sized called | Short, Long | Short | Short | Short |
| High-throughput genotyping compatibility | Yes | Yes | Yes | Yes |
| PCR/electrophoresis genotyping amiability | High | Low | Low | Low |
Summary of three genotypes genome data
| Inbred Line | B73 genome | Mo17 | Zheng58 | Qi319 |
|---|---|---|---|---|
| Raw data | / | 24 Gb | 36.9 Gb | 2.7 Gb |
| Sequencing platform | / | Illumina | Illumina | Illumina |
| Cleaned data | / | 21.2Gb | 32.6Gb | 2.1Gb |
| Assembled size | 2.1G | 1.5G | 408.9 M | 183.6 M |
| No. of Contigs | / | 1,351,472 | 476,470 | 395,075 |
| Longest contig | / | 33,817 bp | 18,068 bp | 7,286 bp |
| N50 size | / | 2,012 bp | 1,157 bp | 457 bp |
Summary of InDel markers developed from two experiments
| Types | Mo17 vs. B73 (with reference genome) | Zheng58 vs. Qi319 (without reference genome) |
|---|---|---|
| Total InDels | 739,250 | 223,084 |
| Mean size | 71 bp | 66 bp |
| 1 ~ 10 bp | 76,376 | 25,144 |
| 11 ~ 20 bp | 63,714 | 21,291 |
| 21 ~ 100 bp | 374,847 | 123,788 |
| >100 bp | 213,698 | 52,861 |
| Discovery accuracy | 86.8 % (276 out of 318) | 72.6 % (437 out of 602) |