| Literature DB >> 32640982 |
Javaria Ashraf1,2, Dongyun Zuo1,3, Hailiang Cheng1,3, Waqas Malik2, Qiaolian Wang1,3, Youping Zhang1,3, Muhammad Ali Abid2, Qiuhong Yang4, Xiaoxu Feng1,3, John Z Yu5, Guoli Song6,7.
Abstract
BACKGROUND: Genome sequencing technologies have been improved at an exponential pace but precise chromosome-scale genome assembly still remains a great challenge. The draft genome of cultivated G. arboreum was sequenced and assembled with shotgun sequencing approach, however, it contains several misassemblies. To address this issue, we generated an improved reassembly of G. arboreum chromosome 12 using genetic mapping and reference-assisted approaches and evaluated this reconstruction by comparing with homologous chromosomes of G. raimondii and G. hirsutum.Entities:
Keywords: Gene loss; Genetic map; Reference-assisted assembly; Syntenic relationship; Transcription factor
Mesh:
Substances:
Year: 2020 PMID: 32640982 PMCID: PMC7346634 DOI: 10.1186/s12864-020-06814-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Schematic diagram for reassembling of G. arboreum chromosome 12 (A_A12). Each rectangle corresponded to procedures applied for chromosome reassembling steps. Genotypic data of 24,569 SNP markers used in previous study [27] was first filtered out for construction of linkage groups, which were then assigned to 13 chromosomes of G. arboreum. Afterwards, linkage group belong to G. arboreum chromosome 12 was used for re-assembling. We checked the alignments of scaffolds belonging to G. arboreum chromosome 12 for following levels: (i) Alignment of G. arboreum scaffolds (obtained by the genetic map) to G. raimondii scaffolds [7], (ii) Orientation of G. raimondii (obtained from the previous step) and G. arboreum scaffolds along G. raimondii chromosome (D_D08) [36], and (iii) adjacency of G. arboreum scaffolds within G. hirsutum chromosome (AD_A12) [8]
Global statistics of reassembled G. arboreum chromosome (A_A12)
| Category | Statistics |
|---|---|
| Total length of the assembly (Mb) | 94.64 |
| Number of oriented scaffolds | 144 |
| Oriented scaffolds (N50) (Mb) | 0.912 |
| Maximum scaffold length (Mb) | 2.360 |
| Minimum scaffold length (Mb) | 0.002 |
| Number of protein coding genes | 3361 |
| Average gene size (bp) | 2527 |
| Average transcript length (bp) | 1263 |
| Gene density (per Mb of chromosome) | 36 |
| Total gene region | 8,493,379 |
| Total coding Region | 3,796,446 |
| Maximum CDS length (bp) | 14,331 |
| Average CDS length (bp) | 1130 |
| Mean exon number | 4.7 |
Fig. 2Syntenic relationship between corresponding homologous chromosomes of different Gossypium species. Syntenic relationship between homologous chromosomes 12 of; aG. raimondii (D_D08) and G. arboreum (A_A12), bG. hirsutum (AD_A12) and G. arboreum (A_A12), and cG. hirsutum (AD_D12) and G. arboreum (A_A12). Syntenic blocks were required to match at least five genes per block after masking repeat regions. Good syntenic relationship was found when comparing the homologous chromosomes of G. raimondii (D_D08) and G. hirsutum (AD_A12 and AD_D12) with reassembled chromosome of G. arboreum (A_A12)
Fig. 3Collinearity of reassembled G. arboreum chromosome (A_A12) with 26 chromosomes of G. hirsutum. Collinear relationship of reassembled G. arboreum chromosome (A_A12) with 26 chromosomes of G. hirsutum was determined by MCScan. After masking the repeat regions, collinearity analysis of G. arboreum chromosome A_A12 was carried out with all 26 chromosomes of G. hirsutum. Results indicated good collinear relationship of reassembled G. arboreum chromosome A_A12 with its corresponding homologous chromosomes 12 (AD_A12 and AD_D12) of G. hirsutum as compare to others chromosomes. G. arboreum chromosome 12 was shown by ‘A_A12’ while, chromosomes belong to A and D sub-genomes of G. hirsutum were indicated by ‘AD_A’ and ‘AD_D’
Fig. 4Dotplot representation between homologous chromosomes of different cotton species. A BLASTP search (with an E-value cutoff of 1 × 10− 5) was performed to identify orthologous genes. Afterwards, dotplots representation among homologous chromosomes of three cotton species was carried out by MCScan. aG. arboreum chromosome A_A12 (Y-axis) vs G. raimondii chromosome D_D08 (X-axis), bG. arboreum chromosome A_A12 (Y-axis) vs G. hirsutum chromosome AD_A12 (X-axis), and cG. arboreum chromosome A_A12 (Y-axis) vs G. hirsutum chromosome AD_D12 (X-axis)
Fig. 5Syntenic relationship with previously assembled chromosome 12 of G. arboreum (A_Ca9). Previously assembled chromosome 12 (A_Ca9) of G. arboreum was used to explore the syntenic relationship with a re-assembled G. arboreum chromosome A_A12 and, bG. hirsutum chromosome AD_A12. Syntenic blocks were required to match at least five genes per block. Results indicated poor syntenic relationship of G. arboreum chromosome A_Ca9 with these two chromosomes
Gene loss in homologous chromosomes 12 of G. hirsutum
| Genes loss in AD_A12 chromosome | Genes loss in AD_D12 chromosome | ||||
|---|---|---|---|---|---|
| D_D08 | A_A12 | AD_D12 | D_D08 | A_A12 | AD_A12 |
| Gorai.008G015200 | Cotton_A_15792 | Gh_D12G0137 | Gorai.008G026700 | Cotton_A_10793 | Gh_A12G0236 |
| Gorai.008G041500 | Cotton_A_02090 | Gh_D12G0372 | Gorai.008G063700 | Cotton_A_11364 | Gh_A12G0558 |
| Gorai.008G063000 | Cotton_A_11373 | Gh_D12G0567 | Gorai.008G080200 | Cotton_A_34337 | Gh_A12G0688 |
| Gorai.008G106800 | Cotton_A_31201 | Gh_D12G0942 | Gorai.008G095800 | Cotton_A_35255 | Gh_A12G0798 |
| Gorai.008G110900 | Cotton_A_27718 | Gh_D12G0984 | Gorai.008G157500 | Cotton_A_23027 | Gh_A12G1304 |
| Gorai.008G133700 | Cotton_A_26243 | Gh_D12G1202 | Gorai.008G160900 | Cotton_A_35616 | Gh_A12G1336 |
| Gorai.008G136900 | Cotton_A_22647 | Gh_D12G1233 | Gorai.008G161100 | Cotton_A_30134 | Gh_A12G1338 |
| Gorai.008G138200 | Cotton_A_22060 | Gh_D12G1246 | Gorai.008G164600 | Cotton_A_21032 | Gh_A12G1366 |
| Gorai.008G141000 | Cotton_A_33185 | Gh_D12G1271 | Gorai.008G171300 | Cotton_A_31070 | Gh_A12G1433 |
| Gorai.008G159100 | Cotton_A_23046 | Gh_D12G1444 | Gorai.008G187700 | Cotton_A_38211 | Gh_A12G1570 |
| Gorai.008G165500 | Cotton_A_21019 | Gh_D12G1498 | Gorai.008G190100 | Cotton_A_25801 | Gh_A12G1593 |
| Gorai.008G178300 | Cotton_A_06177 | Gh_D12G1616 | Gorai.008G193900 | Cotton_A_13403 | Gh_A12G1616 |
| Gorai.008G182200 | Cotton_A_06137 | Gh_D12G1649 | Gorai.008G206100 | Cotton_A_08046 | Gh_A12G1715 |
| Gorai.008G188300 | Cotton_A_25782 | Gh_D12G1706 | Gorai.008G217000 | Cotton_A_13589 | Gh_A12G1810 |
| Gorai.008G194300 | Cotton_A_13398 | Gh_D12G1760 | Gorai.008G230800 | Cotton_A_07177 | Gh_A12G1938 |
| Gorai.008G196900 | Cotton_A_13365 | Gh_D12G1787 | Gorai.008G240500 | Cotton_A_07085 | Gh_A12G2029 |
| Gorai.008G202800 | Cotton_A_27500 | Gh_D12G1844 | Gorai.008G241600 | Cotton_A_07074 | Gh_A12G2040 |
| Gorai.008G203500 | Cotton_A_08073 | Gh_D12G1852 | Gorai.008G283400 | Cotton_A_01373 | Gh_A12G2388 |
| Gorai.008G231000 | Cotton_A_07174 | Gh_D12G2120 | Gorai.008G020900 | Cotton_A_24594 | Gh_A12G0175 |
| Gorai.008G235800 | Cotton_A_07128 | Gh_D12G2164 | Gorai.008G151100 | Cotton_A_30237 | Gh_A12G1241 |
| Gorai.008G242300 | Cotton_A_14421 | Gh_D12G2224 | Gorai.008G207500 | Cotton_A_8032 | Gh_A12G1729 |
| Gorai.008G268800 | Cotton_A_19242 | Gh_D12G2414 | Gorai.008G244100 | Cotton_A_14443 | Gh_A12G2062 |
| Gorai.008G017400 | Cotton_A_15816 | Gh_D12G0157 | |||
| Gorai.008G077500 | Cotton_A_31087 | Gh_D12G0672 | |||
| Gorai.008G170600 | Cotton_A_25559 | Gh_D12G1546 | |||
| Gorai.008G194300 | Cotton_A_13398 | Gh_D12G1760 | |||
A_A12, G. arboreum chromosome; D_D08, G. raimondii chromosome; AD_A12 & AD_D12, G. hirsutum chromosomes
Gene loss in homologous chromosomes 12 of G. arboreum and G. raimondii
| Genes loss in A_A12 chromosome | Genes loss in D_D08 chromosome | ||||
|---|---|---|---|---|---|
| AD_D12 | AD_A12 | D_D08 | AD_D12 | AD_A12 | A_A12 |
| Gh_D12G0154 | Gh_A12G0141 | Gorai.008G016900 | Gh_D12G0046 | Gh_A12G0031 | Cotton_A_21998 |
| Gh_D12G1069 | Gh_A12G0957 | Gorai.008G119400 | Gh_D12G0145 | Gh_A12G0131 | Cotton_A_15801 |
| Gh_D12G1172 | Gh_A12G1052 | Gorai.008G130700 | Gh_D12G0571 | Gh_A12G0555 | Cotton_A_11368 |
| Gh_D12G1313 | Gh_A12G1191 | Gorai.008G145300 | Gh_D12G0937 | Gh_A12G0857 | Cotton_A_29573 |
| Gh_D12G1414 | Gh_A12G1292 | Gorai.008G156000 | Gh_D12G1073 | Gh_A12G0961 | Cotton_A_20925 |
| Gh_D12G1862 | Gh_A12G1699 | Gorai.008G204500 | Gh_D12G1353 | Gh_A12G1228 | Cotton_A_14576 |
| Gh_D12G2015 | Gh_A12G1845 | Gorai.008G220400 | Gh_D12G1992 | Gh_A12G1821 | Cotton_A_13578 |
| Gh_D12G2032 | Gh_A12G1861 | Gorai.008G222300 | Gh_D12G2303 | Gh_A12G2123 | Cotton_A_23201 |
| Gh_D12G2315 | Gh_A12G2135 | Gorai.008G254600 | Gh_D12G2444 | Gh_A12G2310 | Cotton_A_01291 |
| Gh_D12G2573 | Gh_A12G2447 | Gorai.008G291600 | |||
| Gh_D12G2634 | Gh_A12G2507 | Gorai.008G297900 | |||
| Gh_D12G2440 | Gh_A12G2304 | Gorai.008G275000 | |||
| Gh_D12G0980 | Gh_A12G0894 | Gorai.008G110500 | |||
A_A12, G. arboreum chromosome; D_D08, G. raimondii chromosome; AD_A12 & AD_D12, G. hirsutum chromosomes
Fig. 6Chromosomal mapping of the TF-related genes on homologous chromosome 12 of three cotton species. Physical mapping of five major TF-related family members including (a) MYB, (b) C2H2, (c) WRKY, (d) bHLH, (e) ERF was performed in homologous chromosome 12 of G. arboreum (A_A12), G. raimondii (D_D08) and G. hirsutum (AD_A12 and AD_D12). Genes in the positive and negative strands were represented by blue and red colors, while lines signified the collinear genes