| Literature DB >> 27348435 |
Zhikai Liang1, James C Schnable1.
Abstract
Recent reports have shown than many identically named genetic lines used in research around the world actually contain large amounts of uncharacterized genetic variation as a result of cross contamination of stocks, unintentional crossing, residual heterozygosity within original stocks, or de novo mutation. 27 public, large scale, RNA-seq datasets from 20 independent research groups around the world were used to assess variation within the maize (Zea mays ssp. mays) inbred B73, a four decade old variety which served as the reference genotype for the original maize genome sequencing project and is widely used in genetic, genomic, and phenotypic research. Several clearly distinct clades were identified among putatively B73 samples. A number of these clades were defined by the presence of clearly defined genomic blocks containing a haplotype which did not match the published B73 reference genome. The overall proportion of the maize genotype where multiple distinct haplotypes were observed across different research groups was approximately 2.3%. In some cases the relationship among B73 samples generated by different research groups recapitulated mentor/mentee relationships within the maize genetics community.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27348435 PMCID: PMC4922647 DOI: 10.1371/journal.pone.0157942
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
B73 RNA-seq data sets sources.
| Sample Name | Run Accession | Library Layout (bp) | Institute |
|---|---|---|---|
| Control 1 | SRR3372478 | Paired (101) | University of Nebraska—Lincoln |
| Control 2 | SRR3371876 | Single (51) | University of Nebraska—Lincoln |
| USA 1 [ | SRR651051 | Paired (51) | University of Minnesota |
| USA 2 [ | SRR1819621 | Paired (52) | University of Minnesota |
| USA 3 [ | SRR404150 | Single (76) | University of Wisconsin—Madison |
| USA 4 [ | SRR514100 | Paired (151) | University of Wisconsin—Madison |
| USA 5 [ | SRR940300 | Single (101) | University of Wisconsin—Madison |
| USA 6 [ | Single (40) | Iowa State University | |
| USA 7 | SRR445245 | Paired (102) | Iowa State University |
| USA 8 [ | Single (35) | Danold Danforth Center | |
| USA 9 [ | Single (35) | Danold Danforth Center | |
| USA 10 [ | SRR1656746 | Single (101) | University of Nebraska—Lincoln |
| USA 11 [ | SRR1567899 | Paired (50) | Iowa State University |
| USA 12 | SRR504480 | Single (100) | University of California—Berkeley |
| USA 13 [ | SRR1587038 | Single (101) | University of Wisconsin—Madison |
| USA 14 [ | SRR1231518 | Single (100) | Cornell University |
| USA 15 [ | SRR1272115 | Paired (50) | DuPont Pioneer |
| USA 16 [ | SRR640263 | Single (35) | Yale University |
| USA 17 [ | Paired (51) | Cold Spring Harbor Laboratory | |
| USA 18 [ | SRR536834 | Single (76) | Virginia Tech |
| USA 19 [ | SRR999052 | Paired (50) | Cold Spring Harbor Laboratory |
| USA 20 [ | SRR248565 | Paired (81) | Stanford University |
| CHN 1 [ | SRR491307 | Paired (76) | China Agricultural University |
| CHN 2 [ | SRR1522119 | Paired (102) | China Agricultural University |
| CHN 3 [ | SRR910231 | Paired (91) | China Academy of Agricultural Sciences |
| DEU 1 [ | SRR924107 | Single (96) | MPIPZ |
| DEU 2 [ | SRR1030995 | Single (85) | University of Bonn |
* USA 12 harbors a long introgression on chromosome 2.
Fig 1Phylogenetic tree of 27 data sets.
(A) Distance-scaled branch lengths; (B) Unscaled tree. Only bootstrap values greater than or equal to 60 are displayed.
Fig 2SNP distribution pattern for each of the 27 samples on each of the first 6 chromosomes of maize.
Non-reference-like homozygous genotypes are indicated in blue and heterozygous genotypes in red. The sample order from top to bottom on Y-axis in each sub-figure is the same order displayed as in Fig 1B.
Relationship of Non-Reference-Genome Like SNP Blocks to Haplotypes Surveyed by HapMap2.
| Genomic blocks | Chr | Start (kb) | Stop (kb) | Closest haplotypes | Branch length | Present in |
|---|---|---|---|---|---|---|
| c2r1 | 2 | 40000 | 44300 | W22 | 0.00000018 | China |
| c2r2 | 2 | 212450 | 224250 | BKN010 | 0.41156403 | China |
| c4r1 | 4 | 169650 | 191550 | CAU178 | 0.64099035 | China |
| c5r1 | 5 | 201200 | 203000 | no single best match | - | China |
| c5r2 | 5 | 209732 | 211540 | B73 HapMap2 | 0.00000001 | China |
| c6r1 | 6 | 120 | 8800 | CML511 | 0.59542615 | China |
| c6r2 | 6 | 20900 | 24670 | OH7B | 0.08905230 | China |
Fig 3Zoom in on haplotype regions c2r1, c2r2 and c5r2.
(A) Haplotype region c2r1 on Chromosome 2; (B) Haplotype region c2r2 on Chromosome 2; (C) Haplotype region c5r2 on Chromosome 5. Non-reference-like homozygous genotypes are indicated in blue and heterozygous genotypes in red. Named haplotype regions are those between the green bars.
Fig 4Relationship of the China B73 version of haplotype region c2r1 to the maize HapMap2 varieties.