| Literature DB >> 27379109 |
Cécile Berthouly-Salazar1, Cédric Mariac1, Marie Couderc1, Juliette Pouzadoux1, Jean-Baptiste Floc'h1, Yves Vigouroux1.
Abstract
Next-generation sequencing opens the way for genomic studies of diversity even for non-model crops and animals. Genome reduction techniques are becoming progressively more popular as they allow a fraction of the genome to be sequenced for multiple individuals and/or populations. These techniques are an efficient way to explore genome diversity in non-model crops and animals for which no reference genome is available. Genome reduction techniques emerged with the development of specific pipelines such as UNEAK (Universal Network Enabled Analysis Kit) and Stacks. However, even for non-model crops and animals, transcriptomes are easier to obtain, thereby making it possible to directly map reads. We investigate the direct use of transcriptome as an alternative strategy. Our specific objective was to compare SNPs obtained from the UNEAK pipeline as well as SNPs obtained by directly mapping genotyping-by-sequencing reads on a transcriptome. We assessed the feasibility of both SNP datasets, UNEAK and transcriptome mapping, to investigate the diversity of 91 samples of wild pearl millet sampled across its distribution area. Both approaches produced several tens of thousands of single nucleotide variants, but differed in the way the variants were identified, leading to differences in the frequency spectrum associated with marked differences in the assessment of diversity. Difference in the frequency spectrum significantly biased a large set of diversity analyses as well as detection of selection approaches. However, whatever the approach, we found very similar inference of genetic structure, with three major genetic groups from West, Central, and East Africa. For non-model crops, using transcriptome data as a reference is thus a particularly promising way to obtain a more thorough analysis of datasets generated using genome reduction techniques.Entities:
Keywords: GBS; SNP; UNEAK; pearl millet; site frequency spectrum; transcriptome
Year: 2016 PMID: 27379109 PMCID: PMC4908121 DOI: 10.3389/fpls.2016.00777
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of diversity statistics for the two SNPs datasets.
| UNEAK | TM | ||||||
|---|---|---|---|---|---|---|---|
| Mean | Median | Standard deviation | Mean | Median | Standard deviation | ||
| HExp | 0.28 | 0.25 | 0.13 | 0.12 | 0.04 | 0.15 | <0.0001 |
| HObs | 0.16 | 0.13 | 0.13 | 0.09 | 0.03 | 0.14 | <0.0001 |
| 0.22 | 0.2 | 0.18 | 0.1 | 0.04 | 0.18 | <0.0001 | |
| 0.34 | 0.37 | 0.28 | 0.39 | 0.44 | 0.4 | <0.0001 | |