| Literature DB >> 24908277 |
G Albert Wu1, Simon Prochnik1, Jerry Jenkins2, Jerome Salse3, Uffe Hellsten4, Florent Murat3, Xavier Perrier5, Manuel Ruiz5, Simone Scalabrin6, Javier Terol7, Marco Aurélio Takita8, Karine Labadie9, Julie Poulain9, Arnaud Couloux9, Kamel Jabbari9, Federica Cattonaro6, Cristian Del Fabbro6, Sara Pinosio6, Andrea Zuccolo10, Jarrod Chapman4, Jane Grimwood2, Francisco R Tadeo7, Leandro H Estornell7, Juan V Muñoz-Sanz7, Victoria Ibanez7, Amparo Herrero-Ortega7, Pablo Aleza11, Julián Pérez-Pérez12, Daniel Ramón13, Dominique Brunel14, François Luro15, Chunxian Chen16, William G Farmerie17, Brian Desany18, Chinnappa Kodira18, Mohammed Mohiuddin18, Tim Harkins19, Karin Fredrikson18, Paul Burns20, Alexandre Lomsadze20, Mark Borodovsky21, Giuseppe Reforgiato22, Juliana Freitas-Astúa23, Francis Quetier24, Luis Navarro11, Mikeal Roose25, Patrick Wincker26, Jeremy Schmutz2, Michele Morgante27, Marcos Antonio Machado8, Manuel Talon7, Olivier Jaillon26, Patrick Ollitrault5, Frederick Gmitter28, Daniel Rokhsar29.
Abstract
Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes--a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-orange genomes--and show that cultivated types derive from two progenitor species. Although cultivated pummelos represent selections from one progenitor species, Citrus maxima, cultivated mandarins are introgressions of C. maxima into the ancestral mandarin species Citrus reticulata. The most widely cultivated citrus, sweet orange, is the offspring of previously admixed individuals, but sour orange is an F1 hybrid of pure C. maxima and C. reticulata parents, thus implying that wild mandarins were part of the early breeding germplasm. A Chinese wild 'mandarin' diverges substantially from C. reticulata, thus suggesting the possibility of other unrecognized wild citrus species. Understanding citrus phylogeny through genome analysis clarifies taxonomic relationships and facilitates sequence-directed genetic improvement.Entities:
Mesh:
Year: 2014 PMID: 24908277 PMCID: PMC4113729 DOI: 10.1038/nbt.2906
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Figure 1A selection of mandarin, pummelo and orange fruits, including cultivars sequenced in this study. Pummelos (numbered 1, 2 in outline, on left) are large trees that produce very large fruit, with white, pink or red flesh color (2) and yellow or pink rinds. Most cultivars have large leaves having petioles with prominent wings. Apomictic reproduction is absent and most selections are self-incompatible. Mandarins (3–7) are smaller trees bearing smaller fruit, with orange flesh (9, 11) and rind color. Mandarins have both apomictic and zygotic reproduction and some are self-compatible. Oranges (8, 10) are generally intermediate in tree and fruit size, flesh (10) and rind color is commonly orange, and apomictic reproduction is always present. (The sour orange shown (12) is immature.)
Sequenced cultivars and proportions derived from the ancestral species C. reticulata and C. maxima
Three letter abbreviations as used throughout this work and common systematic designation are shown. Sequence depth reported as count of aligned reads to reference, after removal of duplicate reads. Chloroplast genome type inferred from shotgun reads aligning to the sweet orange chloroplast genome[38], with M indicating mandarin type and P indicating pummelo type. Diploid nuclear genotype proportions refer to fraction of genome in megabases using the HCR physical map (proportions of unknown genotype are not shown but can be inferred by subtracting the three genotype proportions from 100%). The last two columns show proportions of C. maxima and C. reticulata haplotypes, and are derived from the three genotype proportions. max. = C. maxima; ret. = C.reticulata.
| Cultivar | Abbr. | Common | Sequence | Cp | ret./ | ret./ | max./ | ret. | max. |
|---|---|---|---|---|---|---|---|---|---|
| Haploid Clementine | HCR | 7× Sanger | M | n/a | n/a | n/a | 89% | 11% | |
| Clementine mandarin | CLM | 110× Illumina | M | 58% | 42% | 0% | 79% | 21% | |
| Ponkan mandarin | PKM | 55× Illumina | M | 85% | 14% | 0.7% | 92% | 8% | |
| Willowleaf mandarin | WLM | 110× Illumina | M | 91% | 8.8% | 0% | 95% | 4.4% | |
| W. Murcott mandarin | WMM | 25× Illumina | M | 69% | 30% | 0.4% | 85% | 15% | |
| Chandler pummelo | CHP | 22× Illumina | P | 0% | 0.4% | 99.6% | 0.2% | 99.8% | |
| Low acid pummelo | LAP | 17× Illumina | P | 0% | 0% | 100% | 0% | 100% | |
| Sweet orange | SWO | 80× Illumina | P | 14% | 82% | 3% | 55% | 44% | |
| Seville sour orange | SSO | 36× Illumina | P | 0% | 98% | 0% | 49% | 49% |
Ponkan mandarin is widely assumed to represent C. reticulata, but as shown here it has substantial admixture from C. maxima.
Figure 2Nucleotide diversity distribution in citrus.(a) Nucleotide heterozygosity distribution computed in overlapping 100kb windows (with 5 kb step size) across the Low acid (LAP) and Chandler (CHP) pummelo genomes and between the non-shared haplotypes of this parent-child pair (LAP/CHP) is shown. The peak at ~6 heterozygous sites/kb in all three pairwise comparisons represents the characteristic nucleotide diversity of the species C. maxima; the peak near ~1 heterozygous site/kb reflects a bottleneck in the ancestral C. maxima population after divergence from C. reticulata (Supplementary Note 10). (b) Nucleotide heterozygosity for the traditional Willowleaf mandarin (WLM) plotted along chromosome 6, computed in overlapping windows of 200 kb (with 100 kb step size). This chromosome shows an example of the clear discontinuity in single nucleotide variant heterozygosity levels between ~5/kb in the M/M segment (orange bar) and ~17/kb in the M/P segment (blue bar). (c) Nucleotide heterozygosity distribution computed in overlapping 500kb windows (with 5 kb step size) in Ponkan (PKM, solid line) and Willowleaf (WLM, dashed line) mandarins. Genomic segments are designated M/M, M/P or P/P based on a set of 1,537,264 SNPs that differentiate C. reticulata (M) from C. maxima (P). Both mandarins contain admixed segments from C. maxima introgression (M/P) as well as M/M segments, and these are plotted and normalized separately for easy comparison.. (d) Nucleotide heterozygosity distribution computed in overlapping windows of 500kb (5 kb offsets) for sweet orange (SWO) and sour orange (SSO). The three different genotypes of the SWO genome (M/M, P/P and M/P), and the SSO genotype M/P are normalized and plotted separately
Figure 3Admixture patterns and nucleotide diversity in cultivated citrus. For each of the three groups of sequenced citrus, variation in nucleotide diversity (averaged over 500kb windows with step size 250kb) is shown across the genome for one representative cultivar above genotype maps (horizontal bars: green = C. maxima/C. maxima; blue = C. maxima/C. reticulata; orange= C. reticulata/C. reticulata; grey=unknown; the 9 chromosomes are numbered at the top). (a) Sweet orange (SWO) nucleotide diversity with genotype maps for SWO and sour orange (SSO). Note the C. maxima/C. maxima genotype (green segments present on chromosomes 2 and 8) in SWO. (b) Willowleaf mandarin (WLM) nucleotide diversity and genotype maps for three traditional mandarins (Ponkan mandarin (PKM), WLM, Huanglingmiao (HLM)) and three recent mandarin types (Clementine (CLM), W. Murcott mandarin (WMM), haploid Clementine reference (HCR)). For the haploid Clementine reference sequence (HCR), red and green segments indicate C.reticulata and C. maxima haplotypes, respectively. All five mandarin types show pummelo introgressions (blue or green segments). (c) Low acid pummelo (LAP) nucleotide diversity and genotype maps for two pummelos (LAP, Chandler pummelo (CHP)).
Figure 4Mangshan mandarin is a species distinct from C. maxima and C. reticulata
(a) Midpoint-rooted neighbor-joining phylogenetic tree of citrus chloroplast genomes. (b) The frequency distributions of the pairwise sequence divergences (across 100 kb windows) between Mangshan mandarin (CMS) and C. maxima (green), CMS and C. reticulata (orange), C. reticulata and C. maxima (light blue), as well as the distinctly lower CMS intrinsic nucleotide diversity (dashed blue). (c) The first two coordinates of principal coordinate analysis of the citrus nuclear genomes, based on pairwise distances and the metric multidimensional scaling. The C. maxima - C. reticulata axis (Principle coordinate 1, 47.5% variance) separates pummelos (green) from mandarins (orange), with oranges (blue) lying in between; Principle coordinate 2 (19.6% of variance) separates CMS (purple) from the others.