Hayley Mangelson, David E Jarvis, Patricia Mollinedo, Oscar M Rollano-Penaloza, Valeria D Palma-Encinas, Luz Rayda Gomez-Pando, Eric N Jellen, Peter J Maughan.
Abstract
PREMISE: Cañahua is a semi-domesticated crop grown in high-altitude regions of the Andes. It is an A-genome diploid (2n = 2x = 18) relative of the allotetraploid (AABB) Chenopodium quinoa and shares many of its nutritional benefits. Cañahua seed contains complete protein, has a low glycemic index, and offers a wide variety of nutritionally important vitamins and minerals.
Keywords: Amaranthaceae; Andean crops; Chenopodium pallidicaule; Hi‐C; proximity‐guided assembly
Year: 2019 PMID: 31832282 PMCID: PMC6858295 DOI: 10.1002/aps3.11300
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Table 1. Passport and sequence archive information for plant materials used. Raw sequencing data for each accession are deposited in the Sequence Read Archive (SRA) at the National Center for Biotechnology Information (NCBI).
| Name | Collection | Accession ID | Collection location | Altitude (m a.s.l.) | Sequencing technology | SRA ID |
|---|---|---|---|---|---|---|
| WGS reference information | | | | | | |
| PI 478407 | USDA | PI 478407 | −17.2333, −67.9166 | 3800 | PacBio | SRR9661228 |
| PI 478407 | USDA | PI 478407 | −17.2333, −67.9166 | 3800 | Hi‐C (Illumina) | SRR9661229 |
| PI 478407 | USDA | PI 478407 | −17.2333, −67.9166 | 3800 | WGS (Illumina) | SRR4425239 |
| PI 478407 | USDA | PI 478407 | −17.2333, −67.9166 | 3800 | RNA‐Seq | SRR4425240–SRR4425243 |
| Diversity panel information | | | | | | |
| P1 | UNALM | BYU 1780 | −15.6967, −70.20510 | 3830 | WGS (Illumina) | SRR9620980 |
| P2 | UNALM | BYU 1781 | −15.7268, −70.23560 | 3838 | WGS (Illumina) | SRR9640749 |
| P4 | UNALM | BYU 1785 | −15.7693, −70.27050 | 3860 | WGS (Illumina) | SRR9640748 |
| U7 | USDA | PI 510525 | −16.3628, −69.2765 | NA | WGS (Illumina) | SRR9640742 |
| U8 | USDA | PI 510526 | −16.2833, −69.2833 | NA | WGS (Illumina) | SRR9640741 |
| U9 | USDA | PI 510527 | −16.0000, −69.7833 | 3810 | WGS (Illumina) | SRR9640740 |
| U12 | USDA | PI 510530 | −16.4500, −70.2333 | NA | WGS (Illumina) | SRR9640747 |
| U13 | USDA | PI 665279 | −17.2333, −67.9166 | 3700 | WGS (Illumina) | SRR9640746 |
| U14 | USDA | PI 665280 | −17.2333, −67.9166 | 3700 | WGS (Illumina) | SRR9640745 |
| U15 | USDA | PI 665281 | −17.2333, −67.9166 | 3700 | WGS (Illumina) | SRR9640744 |
| U16 | USDA | PI 665282 | −17.2333, −67.9166 | 3700 | WGS (Illumina) | SRR9640743 |
| B17 | UMSA | Bol‐1.1 | −15.7472, −68.8091 | 3845 | WGS (Illumina) | SRR9640755 |
| B18 | UMSA | Bol‐3.1 | −16.5344, −68.0622 | 3445 | WGS (Illumina) | SRR9640754 |
| B20 | UMSA | Bol‐19.1 | −17.8241, −67.7702 | 3721 | WGS (Illumina) | SRR9640757 |
| B21 | UMSA | Bol‐20.123 | −17.7850, −68.1447 | 4025 | WGS (Illumina) | SRR9640756 |
| B22 | UMSA | Bol‐21.123 | −17.6483, −67.2072 | 3777 | WGS (Illumina) | SRR9640751 |
| B23 | UMSA | Bol‐22.123 | −18.2166, −67.0333 | 3707 | WGS (Illumina) | SRR9640750 |
| B24 | UMSA | Bol‐23.123 | −16.5344, −68.0622 | 3445 | WGS (Illumina) | SRR9640753 |
| B25 | UMSA | Bol‐24.123 | −16.6740, −68.3183 | 3900 | WGS (Illumina) | SRR9640752 |
| B26 | UMSA | Bol‐25.123 | −16.5344, −68.0622 | 3445 | WGS (Illumina) | SRR9640759 |
| B27 | UMSA | Bol‐26.123 | −16.5344, −68.0622 | 3445 | WGS (Illumina) | SRR9640758 |
| B28 | UMSA | Bol‐28.123 | −16.6740, −68.3183 | 3900 | WGS (Illumina) | SRR9640732 |
| B29 | UMSA | Bol‐29.123 | −16.5344, −68.0622 | 3445 | WGS (Illumina) | SRR9640733 |
| B30 | UMSA | Bol‐30.123 | −17.2500, −67.9166 | 3800 | WGS (Illumina) | SRR9640734 |
| B31 | UMSA | Bol‐4.3 | −16.6740, −68.3183 | 3900 | WGS (Illumina) | SRR9640735 |
| B32 | UMSA | Bol‐6.2 | −16.6740, −68.3183 | 3900 | WGS (Illumina) | SRR9640736 |
| B33 | UMSA | Bol‐7.1 | −16.6740, −68.3183 | 3900 | WGS (Illumina) | SRR9640737 |
| B34 | UMSA | Bol‐8.1 | −16.6740, −68.3183 | 3900 | WGS (Illumina) | SRR9640738 |
| B35 | UMSA | Bol‐13.3 | −16.6740, −68.3183 | 3900 | WGS (Illumina) | SRR9640739 |
| B36 | UMSA | Bol‐27.123 | −16.6740, −68.3183 | 3900 | WGS (Illumina) | SRR9640731 |
m a.s.l. = meters above sea level; NA = not available.
Collection = germplasm collection center: USDA = United States Department of Agriculture, Ames, Iowa, USA; UNALM = Universidad Nacional Agraria La Molina, Lima, Peru; UMSA = Universidad Mayor de San Andrés, La Paz, Bolivia; BYU = Brigham Young University, Provo, Utah, USA.
SRA ID = Sequence Read Archive (SRA) identifier.
Deposited in BioProject ID PRJNA326220. All other sequences are deposited in BioProject ID PRJNA552289.
Assembly statistics for the ASRA, PGA1, PGA1.5, and PGA2 assemblies.
| Assembly statistic | ASRA | PGA1 | PGA1.5 | PGA2 |
|---|---|---|---|---|
| Assembly size (Mbp) | 337 | 337 | 363 | 363 |
| No. of scaffolds | 3015 | 623 | 591 | 4633 |
| Scaffold N50 size (Mbp) | 0.357 | 35.6 | 37.8 | 38.1 |
| Scaffold L50 count | 243 | 5 | 5 | 5 |
| Longest scaffold (Mbp) | 2.9 | 40.4 | 43.2 | 45.5 |
| No. of contigs | 8984 | 8984 | 2580 | 8210 |
| Contig N50 size (Mbp) | 0.083 | 0.083 | 0.516 | 0.236 |
| Contig L50 count | 1096 | 1096 | 168 | 401 |
| % missing bases | 2.5 | 2.6 | 0.2 | 0.1 |
| Assembly size (Mbp) in top 9 scaffolds | 20 | 321 | 344 | 350 |
| Assembly % in top 9 scaffolds | 5.8 | 95.4 | 94.8 | 96.5 |
ASRA = ALLPATHS‐LG Short‐Read Assembly; PGA1 = Proximity‐Guided Assembly 1; PGA1.5 = Proximity‐Guided Assembly 1.5; PGA2 = Proximity‐Guided Assembly 2.
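The N50 and L50 rows in the table above follow the usual definitions: sort the sequences from longest to shortest, accumulate their lengths, and stop once half of the total assembly size is covered. The length at which that happens is the N50, and the number of sequences needed is the L50. A minimal sketch, assuming these standard definitions (the function name and the toy lengths are illustrative and not taken from the assemblies):

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of scaffold or contig lengths."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    # Longest sequences first, then accumulate until half the total is covered.
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            # 'length' is the N50; 'count' sequences were needed, which is the L50.
            return length, count


# Hypothetical toy lengths in Mbp, chosen only to illustrate the calculation.
toy_lengths = [40, 30, 20, 5, 3, 2]
n50, l50 = n50_l50(toy_lengths)
print(f"N50 = {n50} Mbp, L50 = {l50}")
```

Read this way, a scaffold L50 of 5 means that the five longest scaffolds already contain at least half of the assembled bases.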
Figure 1. Genome annotation overview. An overview of gene and repetitive element annotations in the Chenopodium pallidicaule genome. Track 1: chromosome names and sizes; Track 2: frequency of pericentromeric 12‐13P repetitive elements (purple); Track 3: frequency of 18‐24J repetitive element (blue) and the 5S rRNA locus (red); Track 4: frequency of canonical telomeric repeat; Track 5: gene density.
Figure 2. Chloroplast annotation overview. The outside track shows genes transcribed in a clockwise direction, the second track shows genes transcribed in a counterclockwise direction, and the inside track shows G/C content levels. Annotation reveals a quadripartite structure, including two copies of the inverted repeat (bolded line) dividing large and small single‐copy regions.
Figure 3. Diversity panel. (A) The unrooted tree was developed using 16,194 single‐nucleotide polymorphisms (SNPs) filtered to remove SNPs with >10% missing data, minor allele frequency <5%, and linkage disequilibrium <40%. Colors represent the collection source (purple = United States Department of Agriculture, green = Universidad Nacional Agraria La Molina, blue = Universidad Mayor de San Andrés, La Paz), and bolded lines indicate wild accessions. (B) Geographic location (see Table 1 for passport information) combined with population structure information developed by Structure with K = 4. There is no significant correlation between collection site and genetic distance (P = 0.837). The wild Chenopodium pallidicaule accessions are identified with arrows. (C) Population structure and admixture in the diversity panel.
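The missing-data and minor allele frequency thresholds named in the Figure 3 caption can be illustrated with a small per-SNP check. This is a minimal sketch, not the authors' pipeline: it assumes diploid genotypes coded as alternate-allele counts (0, 1, 2) with `None` for missing calls, the function name and defaults are hypothetical, and the linkage disequilibrium filter is left out because it needs pairwise comparisons between SNPs.

```python
def passes_filters(genotypes, max_missing=0.10, min_maf=0.05):
    """Return True if one SNP passes the missing-data and MAF thresholds.

    genotypes: per-sample alternate-allele counts (0, 1, or 2), None if missing.
    """
    if not genotypes:
        return False
    called = [g for g in genotypes if g is not None]
    # Remove SNPs with more than max_missing missing calls.
    missing_fraction = (len(genotypes) - len(called)) / len(genotypes)
    if missing_fraction > max_missing or not called:
        return False
    # Each diploid sample contributes two alleles.
    alt_freq = sum(called) / (2 * len(called))
    maf = min(alt_freq, 1 - alt_freq)
    # Remove SNPs whose minor allele frequency falls below min_maf.
    return maf >= min_maf


# Hypothetical genotype vectors for ten samples, for illustration only.
print(passes_filters([0, 1, 2, 0, 0, 0, 0, 0, 0, 0]))        # kept
print(passes_filters([None, None, 1, 1, 1, 1, 1, 1, 1, 1]))  # too much missing data
```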
Figure 4. Genomic comparison of cañahua with beet, amaranth, and quinoa. Synteny dot plots (left) and dual synteny plots (right) show syntenic regions between cañahua and beet (A), amaranth (B), and quinoa (C) coding sequences. The dual synteny plot of the quinoa genome is divided into A‐ and B‐subgenomes with cañahua in the center. Increasing color intensity is associated with increasing homology in the dot plots. The arrows identify the chromosomal fusion (red) and loss (blue) in amaranth.
Comparison of gene synteny, synonymous substitution rate, and divergence since the last common ancestor relative to cañahua.
| Metric | Amaranth | Beet | Quinoa A‐subgenome | Quinoa B‐subgenome |
|---|---|---|---|---|
| Total no. of genes | 45,947 | 45,334 | 43,663 | 44,638 |
| Unique syntenic genes | 23,878 | 23,075 | 26,230 | 25,327 |
| % of syntenic genes | 52.0 | 50.9 | 60.1 | 56.7 |
| Syntenic genes/block | 46.3 | 71.9 | 71.1 | 46.1 |
| Average syntenic block size (Mbp) | 1.3 | 3.1 | 4.7 | 4.9 |
| Ks | 0.64 | 0.48 | 0.025 | 0.05 |
| Last common ancestor (mya) | 21.33–39.51 | 16–29.63 | 0.830–1.54 | 1.67–3.09 |
Ks = synonymous substitutions per synonymous site.
Total no. of genes = total number of annotated genes in cañahua and the comparison species.
Unique syntenic genes = total number of unique syntenic genes in cañahua and the comparison species.
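The divergence ranges in the last row of the table are consistent with the standard conversion T = Ks / (2r), where r is the synonymous substitution rate per site per year. The sketch below is an illustration, not the authors' stated method: the two rates (1.5e-8 and 8.1e-9) do not appear in this text and are an assumption, back-calculated because they reproduce the tabulated ranges from the tabulated Ks values.

```python
def divergence_time_mya(ks, rate_per_site_per_year):
    """Divergence time in millions of years from T = Ks / (2 * r)."""
    return ks / (2 * rate_per_site_per_year) / 1e6


# Assumed substitution rates (not stated in the text; inferred from the table).
FAST_RATE = 1.5e-8  # gives the younger bound of each range
SLOW_RATE = 8.1e-9  # gives the older bound of each range

# Ks values as reported in the table.
ks_values = {
    "Amaranth": 0.64,
    "Beet": 0.48,
    "Quinoa A-subgenome": 0.025,
    "Quinoa B-subgenome": 0.05,
}

for species, ks in ks_values.items():
    younger = divergence_time_mya(ks, FAST_RATE)
    older = divergence_time_mya(ks, SLOW_RATE)
    print(f"{species}: {younger:.2f} to {older:.2f} mya")
```

Under these assumed rates, the smaller Ks against the quinoa A-subgenome than against the B-subgenome translates directly into a more recent common ancestor, in line with cañahua being an A-genome diploid.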