| Literature DB >> 28922347 |
Rajeev K Varshney1, Chengcheng Shi2, Mahendar Thudi1, Cedric Mariac3, Jason Wallace4, Peng Qi4, He Zhang2, Yusheng Zhao5, Xiyin Wang4, Abhishek Rathore1, Rakesh K Srivastava1, Annapurna Chitikineni1, Guangyi Fan2, Prasad Bajaj1, Somashekhar Punnuri6, S K Gupta1, Hao Wang7, Yong Jiang5, Marie Couderc3, Mohan A V S K Katta1, Dev R Paudel8, K D Mungra9, Wenbin Chen2, Karen R Harris-Shultz10, Vanika Garg1, Neetin Desai11,12, Dadakhalandar Doddamani1, Ndjido Ardo Kane13, Joann A Conner14, Arindam Ghatak11,15, Palak Chaturvedi11, Sabarinath Subramaniam16,17, Om Parkash Yadav18, Cécile Berthouly-Salazar3,19, Falalou Hamidou20,21, Jianping Wang8, Xinming Liang2, Jérémy Clotault3,22, Hari D Upadhyaya1, Philippe Cubry3, Bénédicte Rhoné3,23, Mame Codou Gueye13, Ramanjulu Sunkar24, Christian Dupuy25, Francesca Sparvoli26, Shifeng Cheng2, R S Mahala27, Bharat Singh6, Rattan S Yadav28, Eric Lyons16, Swapan K Datta29, C Tom Hash20, Katrien M Devos4, Edward Buckler7,30, Jeffrey L Bennetzen4, Andrew H Paterson4, Peggy Ozias-Akins14, Stefania Grando1, Jun Wang2, Trilochan Mohapatra31, Wolfram Weckwerth11,32, Jochen C Reif5, Xin Liu2,33, Yves Vigouroux3,22, Xun Xu2,33,34.
Abstract
Pearl millet [Cenchrus americanus (L.) Morrone] is a staple food for more than 90 million farmers in arid and semi-arid regions of sub-Saharan Africa, India and South Asia. We report the ∼1.79 Gb draft whole genome sequence of reference genotype Tift 23D2B1-P1-P5, which contains an estimated 38,579 genes. We highlight the substantial enrichment for wax biosynthesis genes, which may contribute to heat and drought tolerance in this crop. We resequenced and analyzed 994 pearl millet lines, enabling insights into population structure, genetic diversity and domestication. We use these resequencing data to establish marker trait associations for genomic selection, to define heterotic pools, and to predict hybrid performance. We believe that these resources should empower researchers and breeders to improve this important staple crop.Entities:
Mesh:
Year: 2017 PMID: 28922347 PMCID: PMC6871012 DOI: 10.1038/nbt.3943
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Figure 1Pearl millet genome.
Genome features in 1-Mb intervals across the seven pseudomolecules. Units on the circumference are megabase values of pseudomolecules. (1) Repeat density, (2) tandem repeat density, (3) gene density, (4) GC content and (5) SNPs identified by resequencing PMiGAP lines in 1-Mb bins. The genome assembly furnished an average GC content of 47.9% and contained 38,579 gene models with mean coding sequence length of 1,014.71 bp.
Statistics of genome assembly
| All scaffolds (≥1K) | Scaffold ≥ 2K | |
|---|---|---|
|
| ||
| Number of scaffolds | 25,241 | 10,605 |
| Total span | 1,793,241,529 bp | 1,773,407,327 bp |
| N50 (scaffolds) | 884,945 bp | 893,809 bp |
| Longest scaffold | 4,816,714 bp | 4,816,714 bp |
| Number of contigs | 175,708 | 160,430 |
| Total length of contigs | 1,556,180,121 bp | 1,536,443,592 bp |
| Longest contig | 282,901 bp | 282,901 bp |
| N50 (contigs) | 18,180 bp | 18,442 bp |
| GC content | 47.90% | 47.88% |
|
| ||
| Number of gene models | 38,579 | |
| Number of gene models (without transposable elements) | 38,542 | |
| Mean transcript length | 2,420.19 bp | |
| Mean coding sequence length | 1,014.71 bp | |
| Mean number of exons per gene | 4.09 | |
| Mean exon length | 248.06 bp | |
| Mean intron length | 454.77 bp | |
| Number of genes annotated | 29,344 (76.06%) | |
| Number of genes unannotated | 9,235 (23.94%) | |
|
| ||
| Number of miRNA genes | 183 | |
| Mean length of miRNA genes | 125.51 bp | |
| miRNA genes share in genome | 0.001% | |
| Number of rRNA fragments | 235 | |
| Mean length of rRNA fragments | 265.70 bp | |
| rRNA fragments share in genome | 0.003% | |
| Number of tRNA genes | 909 | |
| Mean length of tRNA genes | 75.86 bp | |
| tRNA genes share in genome | 0.004% | |
| Number of snRNA genes | 752 | |
| Mean length of snRNA genes | 119.04 bp | |
| snRNA genes share in genome | 0.005% |
Figure 2Gene conservation and gene family expansion and contraction in pearl millet.
(a) Venn diagrams show the number of genes shared between different grass species and among grass families; pearl millet shares 14,398 genes with sorghum and foxtail millet; 13,027 genes with maize and rice; 11,369 genes with barley and wheat. (b) 384 gene families are substantially expanded and 1,692 gene families are contracted in pearl millet compared with other plant genomes.
Figure 3Domestication and genetic diversity in elite and wild accessions of pearl millet.
(a) Principal component analysis of 376 pearl millet lines (345 PMiGAP lines and 31 wild accessions) using 450,000 high-quality SNPs. Four different groups were identified: cultivated lines (red) and wild lines from east (blue), west (orange) and central Africa (pink). (b) Neighbor joining (NJ) tree based on 450,000 high-quality SNPs. This analysis also identified separate groups of cultivated and wild lines from east, west and central parts of Africa. (c) Morphological differences between wild (i, ii) and cultivated accessions (iii, iv) of pearl millet. Wild accessions have numerous bristled spikes in the inflorescence and low seed density (i), and a plant architecture characterized by numerous basal and aerial branches (ii), with a plant height of around 1 m. Cultivated accessions have exposed seeds and a high seed density per spike (iii), with a few basal branches and no aerial branches (iv).
Figure 4Prediction of hybrid performance.
Grain yield of 64 different pearl millet hybrids, produced by crossing 20 male and 23 female lines, was evaluated at five locations (Jamnagar, Anand, SK Nagar, Mahuva, Kothara in India) during 2004–2013. Phenotyping data (Supplementary Data set 1), together with 302,110 high-quality SNP marker data obtained from 580 B and R- lines (Supplementary Table 27), were used to predict hybrid performance. Ridge regression-BLUP, which takes additive and dominance effects into account, was used to predict hybrid performance. (a) Prediction accuracy was studied using 500 cross-validation tests. In each cross-validation, 48 hybrids were randomly selected as a training set and the remaining 16 hybrids were used as a test set. (b) Promising hybrid combinations that include parental lines that have not been used in breeding efforts previously were identified for testing and release as better hybrids. (c) Heat map showing putative heterotic groups.