| Literature DB >> 29187731 |
Ning Yang1, Xi-Wen Xu1, Rui-Ru Wang1, Wen-Lei Peng1, Lichun Cai2, Jia-Ming Song1, Wenqiang Li1, Xin Luo1, Luyao Niu1, Yuebin Wang1, Min Jin1, Lu Chen1, Jingyun Luo1, Min Deng1, Long Wang1, Qingchun Pan1, Feng Liu1, David Jackson1,3, Xiaohong Yang2, Ling-Ling Chen4, Jianbing Yan5.
Abstract
Maize was domesticated from lowland teosinte (Zea mays ssp. parviglumis), but the contribution of highland teosinte (Zea mays ssp. mexicana, hereafter mexicana) to modern maize is not clear. Here, two genomes for Mo17 (a modern maize inbred) and mexicana are assembled using a meta-assembly strategy after sequencing of 10 lines derived from a maize-teosinte cross. Comparative analyses reveal a high level of diversity between Mo17, B73, and mexicana, including three Mb-size structural rearrangements. The maize spontaneous mutation rate is estimated to be 2.17 × 10-8 ~3.87 × 10-8 per site per generation with a nonrandom distribution across the genome. A higher deleterious mutation rate is observed in the pericentromeric regions, and might be caused by differences in recombination frequency. Over 10% of the maize genome shows evidence of introgression from the mexicana genome, suggesting that mexicana contributed to maize adaptation and improvement. Our data offer a rich resource for constructing the pan-genome of Zea mays and genetic improvement of modern maize varieties.Entities:
Mesh:
Year: 2017 PMID: 29187731 PMCID: PMC5707364 DOI: 10.1038/s41467-017-02063-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Mo17-mexicana population development and characterization of the assembled genomes. a The construction of Mo17-mexicana BC2F7 population. Only a single F1 seed was used. b The genetic design for meta-assembly of the Mo17 and mexicana genomes. Ten well-chosen TM lines that genetically divided the Mo17 and mexicana genomes into different fragments. The fragment indicated by the dash lines are used to estimate the mutation rate in mexicana. c Schematic illustration of the meta-assembly pipeline. The Illumina reads were filtered and adapters were removed for each library and the assembly can be divided into three major steps: Step 1, three strategies were used to assemble contigs, including de novo assembly of 10 individuals, reference-based assembly based on B73 genome and de novo assembly of unmapped reads. Step 2, the contigs obtained from the three strategies were merged, and further connected or extended with Pacbio long reads. Long mate-pair libraries were then added to assemble scaffolds. Step 3, the NRGene scaffolds were integrated, and the merged scaffolds were the final scaffolds. The number represents contig length before diagonal and scaffold length after diagonal
Characteristics of the Mo17 and mexicana genome sequences
| Mo17 |
| |
|---|---|---|
| Length of assembled scaffolds (bp) | 2,041,547,554 | 1,204,281,382 |
| Number of scaffolds ≥ 500 bp | 48,268 | 107,418 |
| Largest scaffolds (bp) | 26,086,894 | 26,384,332 |
| Scaffold N50 (bp) | 2,995,073 | 107,689 |
| Length of assembled contigs (bp) | 2,005,908,899 | 1,157,520,532 |
| Number of contigs ≥ 500 bp | 116,972 | 149,360 |
| Largest contig (bp) | 564,568 | 564,568 |
| Contig N50 (bp) | 60,508 | 26,638 |
| Sequences anchored to chromosomes (%) | 98.4% | 95.9% |
| Numbers of gene models/transcripts | 40,003/97,069 | 31,387/71,535 |
| Mean transcript length (bp) | 2219.67 | 2065.46 |
| Mean coding sequence length (bp) | 1310.16 | 1255.75 |
| Total size of transposable elements (bp) | 1,580,816,259 | 832,324,464 |
Fig. 2Comparative analysis of Mo17 and mexicana genomes. Tracks from outer to inner circles indicate: SNPs, PAV density (window size of 1 Mb, the outer and inner layers indicate Mo17 (Blue) and mexicana (Orange)respectively), and a heatmap of introgression, indicating number of lines containing the corresponding introgression region (low = blue; high = red), hotspots are labeled with red arrows
Fig. 3The relationship between Inv9d and phenotype. a LD reveals the large inversion in Zea mays. mexicana based on 56 K SNPs in 95 mexicana accessions. LD (r2 ≥ 0.1, red; r2 <0.1, pink; P < 0.1, black; P ≥ 0.1, gray) for pairs of SNPs are shown across chromosome 9. Dashed black lines delineate the likely boundaries of structural variants discussed in the text. b QTLs for ear-leaf width located in Inv9d have been detected in 6 environments. The dashed line indicates the cutoff of LOD value (LOD ≥ 2.5). c The dot-plot of gene co-linearity between B73 and mexicana on chromosome 9. The histogram indicated the gene density in a 1 Mb window. The green block covered the gene density in the Inv9d region. d The gene co-linearity between chromosome 9 of B73 and chromosome 6 of rice. e The gene co-linearity between chromosome 9 of mexicana and chromosome 6 of rice. f The phenotype of ear-leaf width of the 10 sequenced lines. The phenotype of mexicana haplotype with Inv9d (orange) and Mo17 haplotypes without Inv9d (green) are shown. The length of scale bar is 10 cm
Fig. 4The characteristics of spontaneous mutations. a The distribution of mutations was not random. The distances between adjacent mutations (observed, orange) against a random distribution of mutation across the genome (expected, gray). b The proportion of deleterious mutations in centromeres (the inner circle) and chromosome arms (the outer circle) are shown in orange. c The distribution of mutations by functional classification. The relative positions of mutations in gene regions were normalized referring to an average length (4 kb) based on B73 gene models[4]. d Variation in mutation rate of different triplets. e The mutation number of transitions and transversions