| Literature DB >> 30131865 |
Lei Zhang1,2, Xu Cai1, Jian Wu1, Min Liu3, Stefan Grob4, Feng Cheng1, Jianli Liang1, Chengcheng Cai1, Zhiyuan Liu1, Bo Liu1, Fan Wang3, Song Li3, Fuyan Liu3, Xuming Li3, Lin Cheng5, Wencai Yang2, Mai-He Li6, Ueli Grossniklaus4, Hongkun Zheng3, Xiaowu Wang1,5.
Abstract
Brassica rapa comprises several important cultivated vegetables and oil crops. Current reference genome assemblies of Brassica rapa are quite fragmented and not highly contiguous, thereby limiting extensive genetic and genomic analyses. Here, we report an improved assembly of the B. rapa genome (v3.0) using single-molecule sequencing, optical mapping, and chromosome conformation capture technologies (Hi-C). Relative to the previous reference genomes, our assembly features a contig N50 size of 1.45 Mb, representing a ~30-fold improvement. We also identified a new event that occurred in the B. rapa genome ~1.2 million years ago, when a long terminal repeat retrotransposon (LTR-RT) expanded. Further analysis refined the relationship of genome blocks and accurately located the centromeres in the B. rapa genome. The B. rapa genome v3.0 will serve as an important community resource for future genetic and genomic studies in B. rapa. This resource will facilitate breeding efforts in B. rapa, as well as comparative genomic analysis with other Brassica species.Entities:
Year: 2018 PMID: 30131865 PMCID: PMC6092429 DOI: 10.1038/s41438-018-0071-9
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Summary of comparisons of assembly and annotation for the three B. rapa genome assemblies
| Assembly | |||
|---|---|---|---|
| v3.0 | v2.5 | v1.5 | |
| Sequence genome size (Mb) | 353.14a | 389.19 | 283.81 |
| GC content (%) | 36.83 | 36.17 | 35.26 |
| Number of contigs | 1498 | 96,883 | 51,647 |
| Contig N50 size (kb) | 1446 | 53 | 46 |
| Number of scaffolds | 1301 | 86,986 | 40,576 |
| Scaffold N50 size (kb) | 4437 | 3378 | 1847 |
| Gaps total number | 396 | 10,158 | 11,426 |
| Gaps total length (kb) | 2078 | 22,776 | 10,710 |
| Gaps number per Mb | 1.12 | 25.98 | 40.09 |
| Gap length (kb) per Mb | 5.89 | 60.53 | 41.25 |
| Total gene models | 45,985 | 48,826 | 41,020 |
| Tandem arrays | 2077 | 3535 | 2077 |
| Tandem genes | 4963 | 8002 | 5004 |
| Redundancy removed | 43,099 | 44,359 | 38,093 |
| Syntenic genes | 39,858 | 40,442 | 35,464 |
| Nonsyntenic genes | 3241 | 3917 | 2629 |
aSee Supplementary Table S3
Fig. 1Integration of the physical and genetic maps of B. rapa genome v3.0.
The markers of the genetic map based on B. rapa genome v3.0 are shown on the x-axis; the markers of the physical map of B. rapa genome v3.0 are shown on the y-axis
Fig. 2Examples showing the invalid annotation of tandem genes in v2.5.
a An example of a 25 bp gap (thin yellow bar indicated by the red arrow) between the genes BraA01000818 and BraA01000819, indicating an invalid annotation in v2.5. b The genes BraA02003894 and BraA02003895 in v2.5 are annotated as a single gene in v3.0 (BraA02g039730.3C) and v1.5 (Bra020703). Figures were plotted using GEvo (https://genomevolution.org/coge/GEvo.pl)
Fig. 3The number of intact LTR-RTs birthed at different times (million years ago, MYA) in the three assemblies of the. genome and in the genome of.
Fig. 4Distribution of genomic blocks along ten chromosomes of the B. rapa genome v3.0.
Genome blocks on ten chromosomes were assigned to the subgenomes LF (red), MF1 (green), and MF2 (blue). Two or more segments of a single block were labeled using lowercase letters (a, b, etc.). The centromeres in the B. rapa genome are shown as black ovals, and the paleocentromeres are shown as gray triangles. Downward-pointing arrows are adjacent to GBs that are inverted relative to other blocks that originated from a single ACK chromosome
Fig. 5Circos plot of the features of centromeric regions on the ten chromosomes in B. rapa genome v3.0.
All the data are represented as heatmaps. The red color indicates low values, and the blue color indicates high values. a The ten chromosomes of the B. rapa genome v3.0. Centromeres are shown as black blocks. b TE density across the ten chromosomes of v3.0 (500 kb sliding window, 100 kb step). c Distribution of centromere-specific repeats along the ten chromosomes of v3.0 (2 Mb sliding window, 1 Mb step). d Gene density of the ten chromosomes of v3.0 (2 Mb sliding window, 1 Mb step). e The mean local recombination rate between markers along the ten chromosomes of v3.0 (5 Mb sliding window, 1 Mb step)