| Literature DB >> 32434565 |
Jianing Liu1, Arun S Seetharam2, Kapeel Chougule3, Shujun Ou4, Kyle W Swentowsky5, Jonathan I Gent5, Victor Llaca6, Margaret R Woodhouse7, Nancy Manchanda4, Gernot G Presting8, David A Kudrna9, Magdy Alabady5,10, Candice N Hirsch11, Kevin A Fengler6, Doreen Ware3,12, Todd P Michael13, Matthew B Hufford4, R Kelly Dawe14,15.
Abstract
Creating gapless telomere-to-telomere assemblies of complex genomes is one of the ultimate challenges in genomics. We use two independent assemblies and an optical map-based merging pipeline to produce a maize genome (B73-Ab10) composed of 63 contigs and a contig N50 of 162 Mb. This genome includes gapless assemblies of chromosome 3 (236 Mb) and chromosome 9 (162 Mb), and 53 Mb of the Ab10 meiotic drive haplotype. The data also reveal the internal structure of seven centromeres and five heterochromatic knobs, showing that the major tandem repeat arrays (CentC, knob180, and TR-1) are discontinuous and frequently interspersed with retroelements.Entities:
Keywords: Gapless assembly; Knob structure; Long-read technology; Maize genome; Meiotic drive
Mesh:
Year: 2020 PMID: 32434565 PMCID: PMC7238635 DOI: 10.1186/s13059-020-02029-9
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Assembly of the B73-Ab10 genome. a Whole-genome view. For each chromosome, the top to bottom tracks are gene density, Cinful-Zeon retrotransposon density, Gypsy superfamily retrotransposon density in 10 Kb sliding windows, repeat location (knob180 in blue, TR-1 in red, 45S rDNA in teal, CentC in magenta), and the distribution of gapless contigs. CENH3 ChIP-seq peaks identifying centromeres are marked by orange rectangles. The inset shows the centromere on chromosome 3, TR-1-rich knob on chromosome 4, and knob180-rich knob on chromosome 7. The five most common retroelement families are shown for each panel, along with centromeric retrotransposons (CRM) for the centromere. CENH3 enrichment in chromosome 3 is displayed in a heatmap. b The impact of assembly merging over a CentC-rich region on chromosome 9. Seven contigs (orange, above) from the PacBio assembly were originally misassembled, as can be seen in the alignment to the Bionano map (connecting lines show matching sites). CentC tracts and gaps are annotated. Assembly merging corrected the output, leaving an 11-Kb gap that was filled with nanopore reads. c Sequence alignment between normal chromosome 10 from B73 (N10) (140–152 Mb) and Ab10 (140–195 Mb) from B73-Ab10. Annotation is as in a, with Kindr genes marked with black bars in the top track. Links show homologous regions larger than 500 bp
Assembly metrics of the B73-Ab10 genome
| Contigs | Pseudomolecules | |||||
|---|---|---|---|---|---|---|
| N50 (Mb) | N90 (Mb) | Max Size (Mb) | Contig Number | Total Length (Mb) | Gapa Length (Mb) | |
| Nanopore | 2.0 | 0.5 | 8.3 | 1673 | 2161.1 | 93.2 |
| PacBio | 41.2 | 7.1 | 156.3 | 216 | 2162.7 | 2.6 |
| Merged | 162.0 | 20.4 | 235.9 | 63 | 2162.8 | 1.3 |
aGaps longer than 10 Ns