Fuguang Li, Guangyi Fan, Kunbo Wang, Fengming Sun, Youlu Yuan, Guoli Song, Qin Li, Zhiying Ma, Cairui Lu, Changsong Zou, Wenbin Chen, Xinming Liang, Haihong Shang, Weiqing Liu, Chengcheng Shi, Guanghui Xiao, Caiyun Gou, Wuwei Ye, Xun Xu, Xueyan Zhang, Hengling Wei, Zhifang Li, Guiyin Zhang, Junyi Wang, Kun Liu, Russell J Kohel, Richard G Percy, John Z Yu, Yu-Xian Zhu, Jun Wang, Shuxun Yu.
Abstract
The complex allotetraploid nature of the cotton genome (AADD; 2n = 52) makes genetic, genomic and functional analyses extremely challenging. Here we sequenced and assembled the Gossypium arboreum (AA; 2n = 26) genome, a putative contributor of the A subgenome. A total of 193.6 Gb of clean sequence covering the genome by 112.6-fold was obtained by paired-end sequencing. We further anchored and oriented 90.4% of the assembly on 13 pseudochromosomes and found that 68.5% of the genome is occupied by repetitive DNA sequences. We predicted 41,330 protein-coding genes in G. arboreum. Two whole-genome duplications were shared by G. arboreum and Gossypium raimondii before speciation. Insertions of long terminal repeats in the past 5 million years are responsible for the twofold difference in the sizes of these genomes. Comparative transcriptome studies showed the key role of the nucleotide binding site (NBS)-encoding gene family in resistance to Verticillium dahliae and the involvement of ethylene in the development of cotton fiber cells.
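The abstract reports the sequencing yield and the fold coverage but not the genome size itself. As a rough back-of-the-envelope sketch (not the authors' estimation method), the implied genome size is the clean sequence divided by the coverage. The function and variable names below are illustrative, and applying the 68.5% repeat fraction to this implied size, rather than to the actual assembly length, is an assumption made only for illustration.

```python
def implied_genome_size_gb(total_sequence_gb: float, fold_coverage: float) -> float:
    """Return the genome size (Gb) implied by sequencing yield and fold coverage."""
    if fold_coverage <= 0:
        raise ValueError("fold_coverage must be positive")
    return total_sequence_gb / fold_coverage


# Figures taken from the abstract.
clean_sequence_gb = 193.6   # total clean paired-end sequence, in Gb
fold_coverage = 112.6       # reported genome coverage
repeat_fraction = 0.685     # share of the genome reported as repetitive DNA

genome_gb = implied_genome_size_gb(clean_sequence_gb, fold_coverage)

# Assumption: the repeat fraction is applied to the implied genome size.
repeat_gb = genome_gb * repeat_fraction

print(f"Implied genome size: {genome_gb:.2f} Gb")
print(f"Implied repetitive DNA: {repeat_gb:.2f} Gb")
```

Under these assumptions the figures work out to roughly 1.72 Gb for the genome and about 1.18 Gb of repetitive sequence.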
Year: 2014 PMID: 24836287 DOI: 10.1038/ng.2987
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330