| Literature DB >> 31498838 |
Jee-Soo Park1, Ji-Hyun Park1, Young-Doo Park1.
Abstract
Chinese cabbage (Brassica rapa ssp. pekinensis) is a major crop that is widely cultivated, especially in Korea, Japan, and China. With the advent of next generation sequencing technology, the cost and time required for sequencing have decreased and the development of genome research accelerated. Genome sequencing of Chinese cabbage was completed in 2011 using the variety Chiifu-401-42, and since then the genome has been continuously updated. In the present study, we conducted whole-genome sequencing of Chinese cabbage inbred line CT001, a line widely used in traditional or molecular breeding, to improve the accuracy of genetic polymorphism analysis. The constructed CT001 pseudomolecule represented 85.4% (219.8 Mb) of the Chiifu reference genome, and a total of 38,567 gene models were annotated using RNA-Seq analysis. In addition, the spontaneous mutation rate of CT001 was estimated by resequencing DNA obtained from individual plants after sexual propagation for six generations to estimate the naturally occurring variations. The CT001 pseudomolecule constructed in this study will provide valuable resources for genomic studies on Chinese cabbage.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31498838 PMCID: PMC6733507 DOI: 10.1371/journal.pone.0222283
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Chromosomal mapping of Brassica rapa inbred line ‘CT001’ contigs to the B. rapa variety ‘Chiifu’ reference genome.
| Chromosome | Length of Chiifu genome (bp) | Contig count of CT001 | Covered CT001 sequences (bp) | Average coverage (%) |
|---|---|---|---|---|
| A01 | 26,791,028 | 2,990 | 21,988,864 | 82.1 |
| A02 | 26,939,826 | 2,849 | 22,748,939 | 84.4 |
| A03 | 31,765,688 | 2,751 | 28,297,308 | 89.1 |
| A04 | 19,269,589 | 2,006 | 16,628,331 | 86.3 |
| A05 | 25,303,532 | 2,370 | 21,181,456 | 83.7 |
| A06 | 25,210,368 | 2,599 | 21,417,483 | 85.0 |
| A07 | 25,876,096 | 2,416 | 22,352,648 | 86.4 |
| A08 | 20,826,945 | 2,047 | 18,034,844 | 86.6 |
| A09 | 38,884,800 | 3,917 | 32,748,158 | 84.2 |
| A10 | 16,405,180 | 1,430 | 14,365,407 | 87.6 |
| Genome | 257,273,052 | 25,375 | 219,763,438 | 85.4 |
Fig 1Dot plot comparison between Brassica rapa ‘CT001’ and ‘Chiifu’ genome by chromosome.
Whole-genome nucleotide sequence alignment of the CT001 scaffolds (plotted on y-axis) against the Chiifu reference sequence ver1.5 (plotted on x-axis). Red and blue lines represent sequences aligned in forward and reverse directions, respectively. The alignment was filtered by percentage identity ≥ 95 and alignment length ≥ 500 bp.
Gene annotation of Brassica rapa ‘CT001’.
| Feature | Value |
|---|---|
| Number of gene models | 38,567 |
| Total gene length (bp) | 72,036,748 |
| Number of single exon genes | 9,758 |
| Number of multiple exon genes | 28,809 |
| Average gene length (bp) | 1,868 |
| Total number of exon | 185,524 |
| Average number of exon/gene | 4.8 |
| Average exon length (bp) | 231.7 |
| Total number of intron | 146,967 |
| Average number of intron/gene | 3.8 |
| Average intron length (bp) | 182.8 |
| Number of genes with known plant protein homology (e-value <10−6) | 35,837 |
Fig 2Distribution of gene length in the Brassica rapa ‘CT001’ pseudomolecule.
The y-axis represents the number of genes with a certain length. Genes range in size from 0.1 kb to 39.2 kb.
Number of spontaneous mutations and their distribution in the Brassica rapa ‘CT001’ genome.
| Features | Transition | Transversion |
|---|---|---|
| Exon | 1 | 0 |
| Intron | 1 | 2 |
| Intergenic | 7 | 1 |
| Total mutation | 9 | 3 |
Fig 3Proportions of base substitution types of the 12 identified mutation.
Transitions made up the majority of the substitutions.
Distribution of mutations across the Brassica rapa ‘CT001’ chromosomes and related gene information.
| Chromosome | Location | Ref | Alt | SNP | Type | Gene ID | Gene description | |
|---|---|---|---|---|---|---|---|---|
| sm1 | A01_pseudo | 1447959 | C | T | Ts | Intergenic | ||
| sm2 | A01_pseudo | 8787755 | A | G | Ts | Exon | CT001_A01017300 | PREDICTED: Root meristem growth factor 6 |
| sm3 | A01_pseudo | 10776639 | A | G | Ts | Intergenic | ||
| sm4 | A02_pseudo | 16640494 | C | T | Ts | Intergenic | ||
| sm5 | A02_pseudo | 18875414 | A | G | Ts | Intron | CT001_A02067580 | PREDICTED: FACT complex subunit SSRP1 |
| sm6 | A03_pseudo | 14956214 | T | G | Tv | Intron | CT001_A03107270 | PREDICTED: B3 domain-containing protein REM8-like isoform X2 |
| sm7 | A04_pseudo | 10593814 | C | T | Ts | Intergenic | ||
| sm8 | A06_pseudo | 5646555 | T | G | Tv | Intron | CT001_A06206720 | PREDICTED: receptor like protein 30-like |
| sm9 | A07_pseudo | 16467555 | A | T | Tv | Intergenic | ||
| sm10 | A09_pseudo | 5066860 | A | G | Ts | Intergenic | ||
| sm11 | A09_pseudo | 10751579 | C | T | Ts | Intergenic | ||
| sm12 | A09_pseudo | 12032695 | C | T | Ts | Intergenic |
Notes
* Annotation of single nucleotide polymorphism (SNP) regions
Abbreviations: Ref, reference; Alt, altered; Ts, transition; Tv, transversion.