| Literature DB >> 31010823 |
Hui Zhang1, Nathan Hall1, Leslie R Goertzen2, Charles Y Chen1, Eric Peatman3, Jinesh Patel1, J Scott McElroy4.
Abstract
Relationships in the genus Eleusine were obtained through transcriptome analysis. Eleusine coracana (E. coracana ssp. coracana), also known as finger millet, is an allotetraploid minor crop primarily grown in East Africa and India. Domesticated E. coracana evolved from wild E. africana (E. coracana ssp. africana) with the maternal genome donor largely supported to be E. indica; however, the paternal genome donor remains elusive. We developed transcriptomes for six Eleusine species from fully developed seedlings using Illumina technology and three de novo assemblers (Trinity, Velvet, and SOAPdenovo2) with the redundancy-reducing EvidentialGene pipeline. Mapping E. coracana reads to the chloroplast genes of all Eleusine species detected fewer variants between E. coracana and E. indica compared to all other species. Phylogenetic analysis further supports E. indica as the maternal parent of E. coracana and E. africana, in addition to a close relationship between E. indica and E. tristachya, and between E. floccifolia and E. multiflora, and E. intermedia as a separate group. A close relationship between E. floccifolia and E. multiflora was unexpected considering they are reported to have distinct nuclear genomes, BB and CC, respectively. Further, it was expected that E. intermedia and E. floccifolia would have a closer relationship considering they have similar nuclear genomes, AB and BB, respectively. A rethinking of the labeling of ancestral genomes of E. floccifolia, E. multiflora, and E. intermedia is maybe needed based on this data.Entities:
Keywords: Eleusine africana; Eleusine coracana; relationships; transcriptome
Mesh:
Year: 2019 PMID: 31010823 PMCID: PMC6553535 DOI: 10.1534/g3.119.400214
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Biological, genomic, and GRIN Accession Number for seven Eleusine species utilized. Genomic and biological acquired from the following sources
| Species | 2n chromosome numbers, genome, ploidy | Life cycle | Type | GRIN Accession Number |
|---|---|---|---|---|
| 16, CC, diploid | Annual | Wild | 226067 | |
| 18, BB or other, diploid | Perennial | Wild | 196853 | |
| 18 AA, diploid | Annual | Wild | 331791 | |
| 18 AB, diploid | Perennial | Wild | 273888 | |
| 36 AABB, allotetraploid | Annual | Wild | 226270 | |
| 36 AABB, allotetraploid | Annual | Cultivated | 462949 | |
| 18 AA, diploid | Annual or Perennial | Wild | Collect | |
| 20 DD, diploid | Perennial | Wild | Unavailable | |
| 38 AADD, allotetraploid | Perennial | Wild | Unavailable |
GRIN, Germplasm Resources Information Network.
E. indica was collected locally from a crop field in Tallassee, Alabama. In other published work by J.S. McElroy it is referred to by the acronym PBU referring to its origin at the Alabama Agricultural Experiment Station Plant Breeding Unit. E. indica is known to exist as a weedy perennial in managed ecosystems of southern Florida and Hawai’i.
Figure 1Workflow of transcriptome sequencing data analysis and assembly. Three de novo assemblers (Trinity, Velvet, and SOAPdenovo2) and a redundancy-reducing EvidentialGene tr2aacds pipeline were used for constructing optimized transcriptome references.
The number and average length of Eleusine transcriptome sequencing reads before and after trimming
| Species | Number of reads | Average length | Number of reads after trim | % reads removed | Average length after trim |
|---|---|---|---|---|---|
| 61,348,758 | 100 | 52,236,532 | 15% | 99.4 | |
| 59,140,884 | 100 | 50,053,954 | 15% | 99.4 | |
| 53,661,434 | 100 | 45,004,810 | 16% | 99.4 | |
| 106,867,304 | 100 | 84,798,308 | 21% | 99.4 | |
| 197,003,984 | 100 | 156,392,016 | 21% | 99.3 | |
| 139,928,698 | 100 | 111,917,028 | 20% | 99.3 | |
| 230,466,942 | 100 | 183,323,866 | 17% | 99.4 |
Summary statistics of transcriptome assemblies following implementation of N50, sequences number, and total length in EvidentialGene tr2aacids pipeline
| Species | Unique CDSs | Unique CDSs + Possible isoforms | ||||
|---|---|---|---|---|---|---|
| N50 (bp) | Sequences number | Total length (bp) | N50 (bp) | Sequences number | Total length (bp) | |
| 1567 | 30,394 | 32,083,609 | 1357 | 52,610 | 50,466,628 | |
| 1585 | 36,364 | 37,932,847 | 1361 | 72,602 | 69,442,718 | |
| 1549 | 35,856 | 37,243,265 | 1353 | 72,764 | 69,722,866 | |
| 1693 | 39,540 | 43,739,409 | 1451 | 87,270 | 87,954,199 | |
| 1516 | 56,375 | 54,910,276 | 1236 | 144,921 | 129,354,728 | |
| 1471 | 59,223 | 561,062,47 | 1232 | 144,460 | 128,133,958 | |
| 1562 | 25,878 | 28,239,951 | 1408 | 36,959 | 37,055,659 | |
The mapped reads, covered references, mapped percentage and the length of SNVs, MNVs, replacements, insertions, and deletions detected from the E. coracana reads mapped to the chloroplast genes of all Eleusine species
| Assembled species | Mapped reads | Covered reference | Mapped percentage | SNVs | MNVs | Replacements | Insertions | Deletions |
|---|---|---|---|---|---|---|---|---|
| 225,962 | 37,969 | 0.2% | 15 | 0 | 0 | 0 | 0 | |
| 238,136 | 37,056 | 0.2% | 106 | 0 | 0 | 0 | 0 | |
| 246,733 | 38,012 | 0.2% | 80 | 0 | 0 | 0 | 0 | |
| 234,583 | 34,937 | 0.2% | 41 | 0 | 0 | 2 | 0 | |
| 226,923 | 36,287 | 0.2% | 364 | 0 | 1 | 1 | 1 | |
| 248,315 | 40,171 | 0.2% | 14 | 1 | 0 | 2 | 2 | |
| 249,884 | 42,162 | 0.2% | 33 | 0 | 0 | 0 | 3 |
The length of covered reference is similar but not same, because some chloroplast gene sequences are not exactly same.
The summary of chloroplast genes used for determination of maternal genome donor of E. coracana
| Category | Group | Gene name |
|---|---|---|
| Photosynthesis | Subunits of NADH-dehydrogenase | |
| Subunits of photosystem I | ||
| Subunits of photosystem II | ||
| Subunits of cytochrome b/f complex | ||
| Subunits of ATP synthase | ||
| Large subunit of rubisco | ||
| Replication | Small subunit of ribosome | |
| Large subunit of ribosome | ||
| DNA dependent RNA polymerase | ||
| Other | Maturase | |
| Protease | ||
| c-type cytochrome synthesis gene |
Figure 2(A) Phylogenetic tree made using concatenated chloroplast genes in RAxML. Chloroplast genes of E. Indica_cp means these genes downloaded from NCBI (KU833246), which were accurate assembled and uploaded before. However, genes of E. indica_trans were got using same blast method with other species and we can also use this method to verify our result. (B) Phylogenetic tree constructed based on orthologous genes.