| Literature DB >> 29902967 |
Peng Qi1,2, Davis Gimode1,3,4, Dipnarayan Saha1,2,5, Stephan Schröder1,2,6, Debkanta Chakraborty7, Xuewen Wang8, Mathews M Dida9, Russell L Malmberg2, Katrien M Devos10,11,12.
Abstract
BACKGROUND: Research on orphan crops is often hindered by a lack of genomic resources. With the advent of affordable sequencing technologies, genotyping an entire genome or, for large-genome species, a representative fraction of the genome has become feasible for any crop. Nevertheless, most genotyping-by-sequencing (GBS) methods are geared towards obtaining large numbers of markers at low sequence depth, which excludes their application in heterozygous individuals. Furthermore, bioinformatics pipelines often lack the flexibility to deal with paired-end reads or to be applied in polyploid species.Entities:
Keywords: Chromosomal rearrangements; E. indica; Eleusine coracana; Finger millet; GBS-pipeline; Genetic mapping; Genotyping-by-sequencing (GBS); Paired-end reads; Polyploid; SNP calling
Mesh:
Substances:
Year: 2018 PMID: 29902967 PMCID: PMC6003085 DOI: 10.1186/s12870-018-1316-3
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Fig. 1Schematic overview showing use of the UGbS-Flex pipeline to generate a GBS reference
Schematic representation of the approach used to consolidate SNPs within the same GBS reference tag
| S1b | S2 | S3 | S4 | S5 | S6 | S7 | S8 | S9 | S10 | S11 | S12 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SNP1a | – | Bc | – | C | D | A | B | H | A | B | H | A |
| SNP2a | A | – | – | – | – | D | C | C | A | D | A | A |
| SNP3a | – | – | H | – | – | A | C | D | C | B | A | B |
| After consolidation |
|
|
|
|
|
|
|
|
|
|
|
|
aSNP1, SNP2 and SNP3 are SNPs located on the same GBS tag
bS1 to S12 represent different situations
cA, B, H, C, D and – are genotypic scores
Summary statistics obtained for each of the three enzyme combinations for a subset of 1 M reads
| Enzyme combination | Accession | Total read number | Number of stacks (tags) within samplesa | Number of stacks (tags) common to all samplesb | Number of polymorphic tagsc | Number of polymorphic SNPsc | Average depth of scored SNPs |
|---|---|---|---|---|---|---|---|
| KNE 796 | 5,720,433 | 83,082 | 22,191 | 15.20 | |||
| MD-20 | 5,418,738 | 64,382 | 4440 | 4515 | 15.64 | ||
| Okhale-1 | 7,738,023 | 73,903 | 17.21 | ||||
| KNE 796 | 3,466,582 | 50,189 | 12,249 | 16.57 | |||
| MD-20 | 4,474,070 | 37,619 | 2475 | 2509 | 20.90 | ||
| Okhale-1 | 5,586,989 | 45,829 | 19.65 | ||||
| KNE 796 | 6,647,185 | 39,406 | 12,392 | 32.28 | |||
| MD-20 | 4,269,463 | 35,218 | 3232 | 3305 | 28.66 | ||
| Okhale-1 | 5,851,494 | 37,551 | 34.76 |
aDetermined using ‘ustacks’
bGBS tags that are common to all three accessions were selected from the ‘ASustacks’ output; if two or more tags had a level of homology ≥98%, only a single tag was retained
cOnly SNPs at a read depth ≥ 8× were scored
Number of SNPs identified with different combinations of SNP callers and GBS references
| Unified Genotyper (GATK) | Haplotype Caller (GATK) | |||||||
|---|---|---|---|---|---|---|---|---|
| Ref50 | Ref50_98 | Ref70 | Ref70_98 | Ref50 | Ref50_98 | Ref70 | Ref70_98 | |
| No. of GBS tags | 34,960 | 28,579 | 16,725 | 15,397 | 34,960 | 28,579 | 16,725 | 15,397 |
| No. of SNPs | 2358 | 5534 | 3939 | 4477 | 1766 | 4593 | 3413 | 3934 |
Fig. 2Venn-diagram showing the number of unique and common SNPs identified using GATK’s Unified Genotyper and Haplotype Caller in combination with GBS references Ref50_98 and Ref70_98
Fig. 3High-density genetic maps of finger millet (homoeologous groups 1, 2 and 3). Marker names are on the right-hand side, centiMorgan (Kosambi) distances on the left-hand side. For readability, only the first marker of each marker bin is represented on the map. Locations of all markers are available from Additional file 9: Table S6
Fig. 4High-density genetic maps of finger millet (homoeologous groups 4, 5 and 6). Marker names are on the right-hand side, centiMorgan (Kosambi) distances on the left-hand side. For readability, only the first marker of each marker bin is represented on the map. Locations of all markers are available from Additional file 9: Table S6
Fig. 5High-density genetic maps of finger millet (homoeologous groups 7, 8 and 9). Marker names are on the right-hand side, centiMorgan (Kosambi) distances on the left-hand side. For readability, only the first marker of each marker bin is represented on the map. Locations of all markers are available from Additional file 9: Table S6
Number of markers and map length of each of the nine A and nine B genome linkage groups
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ||
|---|---|---|---|---|---|---|---|---|---|---|
| A | Number of markers | 301 | 259 | 292 | 162 | 295 | 260 | 204 | 183 | 122 |
| Map length (cM) | 239.8 | 191.5 | 200.7 | 108.1 | 197.2 | 177.3 | 163.2 | 131.6 | 102.8 | |
| B | Number of markers | 148 | 112 | 295 | 198 | 289 | 251 | 39 | 183 | 179 |
| Map length (cM) | 188.6 | 85.4 | 167.1 | 144 | 195.5 | 173.7 | 50.5 | 131.4 | 131.8 | |
Fig. 6Scatterplots showing presence/absence in E. indica (AA genome) of GBS tags mapped in E. coracana (AABB genome). Linkage group designations are given on the right-hand side of each graph. GBS tags are ordered by map position (distance in cM). Tags that were present in at least two of the three E. indica accessions analyzed were placed at position ‘2’ on the Y-axis. Tags that were absent from all three E. indica accessions analyzed were placed at position ‘0’ on the Y-axis. GBS tags located on the B-genome of E. coracana are absent from E. indica. GBS tags on the A-genome of E. coracana are predominantly present (conserved tag) but can be absent from E. indica (polymorphism at restriction site, tag not amplified due to a large insertion, or tag not sequenced)
Fig. 7Box-and-whisker plot showing the read number distribution across pooled samples before optimization and after optimization of the GBS protocol
Fig. 8SNP distribution across PstI/MspI GBS tags for which forward and reverse reads were joined artificially during the GBS reference generation in three finger millet accessions (KNE 796, MD-20 and Okhale-1). Red bars flank the junction between forward and reverse reads
Fig. 9Box-and-whisker plots showing the read depth distribution at SNP positions along the length of the translocated (9AB and 9BA) and non-translocated regions (9AA and 9BB) in chromosomes 9A and 9B of five selected progeny. Progenies 16 and 56 carry a chromosome complement in which the 9BA region is absent and two copies of the 9AB region are present. Progeny 94 and 114 carry complete A and B genomes. Progeny 151 carries a chromosome complement in which the 9AB region is absent and two copies of the 9BA region are present