| Literature DB >> 25368628 |
Arwa Shahin1, Marinus J M Smulders1, Jaap M van Tuyl1, Paul Arens1, Freek T Bakker2.
Abstract
Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from transcriptome sequences using three approaches: POFAD (Phylogeny of Organisms from Allelic Data, uses allelic information of sequence data), RAxML (Randomized Accelerated Maximum Likelihood, tree building based on concatenated consensus sequences) and Consensus Network (constructing a network summarizing among gene tree conflicts). Twenty six gene contigs were chosen based on the presence of orthologous sequences in all cultivars, seven of which also had an orthologous sequence in Tulipa, used as out-group. The three approaches generated the same topology. Although the resolution offered by these approaches is high, in this case there was no extra benefit in using allelic information. We conclude that these 26 genes can be widely applied to construct a species tree for the genus Lilium.Entities:
Keywords: Consensus Network; Lilium; POFAD; RAxML; Tulipa; allelic variation; genetic divergence
Year: 2014 PMID: 25368628 PMCID: PMC4202788 DOI: 10.3389/fpls.2014.00567
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Floral morphology of the four .
Description of the 27 .
| Contig_19510 | 372 | 26 | CBI16691.3 | unnamed protein product [ | 7.00E-15 | |
| Contig_36292 | 378 | 6 | XP_002489102.1 | hypothetical protein SORBIDRAFT_1962s002010 [ | 9.00E-20 | |
| Contig_34203 | 408 | 15 | XP_002271397.1 | unknown [ | 1.00E-65 | |
| Contig_36290 | 423 | 6 | XP_002488914.1 | hypothetical protein SORBIDRAFT_0070s002020 [ | 3.00E-25 | |
| Contig_21012 | 490 | 43 | CBI28652.3 | unnamed protein product [ | 1.00E-66 | |
| Contig_30305 | 500 | 12 | XP_002322328.1 | predicted protein [ | 1.00E-66 | |
| Contig_22926 | 510 | 1 | CAN66875.1 | hypothetical protein VITISV_009275 [ | 6.00E-14 | |
| Contig_35696 | 551 | 32 | CBI27136.3 | unnamed protein product [ | 3.00E-43 | |
| Contig_25751 | 588 | 11 | ACI31551.1 | heat shock protein 90-2 [ | 7.00E-91 | |
| Contig_48560 | 615 | 11 | XP_002280853.1 | hypothetical protein [ | 8.00E-50 | |
| Contig_19882 | 630 | 40 | NP_001060290.1 | hypothetical protein OsJ_25146 [ | 7.00E-61 | |
| Contig_34918 | 634 | 41 | XP_002460541.1 | hypothetical protein SORBIDRAFT_02g030210 [ | 1.00E-33 | |
| Contig_36700 | 639 | 71 | AAZ57445.1 | lipoxygenase LOX2 [ | 5.00E-40 | |
| Contig_34983 | 660 | 18 | NP_001183774.1 | hypothetical protein LOC100502367 [ | 2.00E-113 | |
| Contig_34202 | 666 | 29 | ACU18883.1 | PREDICTED: hypothetical protein [ | 6.00E-100 | |
| Contig_6165 | 714 | 3 | YP_003587262.1 | ATPase subunit 4 [ | 1.00E-71 | |
| Contig_21042 | 717 | 39 | XP_002284696.1 | PREDICTED: hypothetical protein [ | 1.00E-62 | |
| Contig_5703 | 720 | 20 | EEE57528.1 | hypothetical protein OsJ_07840 [ | 6.00E-101 | |
| Contig_30546 | 729 | 49 | CAN72815.1 | hypothetical protein VITISV_004099 [ | 6.00E-44 | |
| Contig_36051 | 736 | 12 | EEC79215.1 | hypothetical protein OsI_19939 [ | 2.00E-125 | |
| Contig_72799 | 747 | 20 | ACZ82298.1 | cellulose synthase [ | 2.00E-115 | |
| Contig_34429 | 817 | 24 | AAY43222.1 | cellulose synthase BoCesA5 [ | 5.00E-121 | |
| Contig_20744 | 840 | 18 | ABB46861.2 | Enolase, putative, expressed [ | 2.00E-133 | |
| Contig_31438 | 957 | 10 | ACG36494.1 | histone mRNA exonuclease 1 [ | 2.00E-94 | |
| Contig_6081 | 987 | 2 | AAV44205.1 | unknow protein [ | 3.00E-62 | |
| Contig_6523 | 1017 | 29 | AAW78691.1 | peroxisomal acyl-CoA oxidase 1A [ | 3.00E-166 | |
| Contig_10364 | 1230 | 35 | NP_001151315.1 | transmembrane 9 superfamily protein member 1 [ | 0 | |
| Total | 18,275 | 623 |
As Table 1 but for the seven orthologous contigs between .
| Contig_36290 | Contig_47963 | 423 | 17 | hypothetical protein SORBIDRAFT_0070s002020 [ | 3.00E-25 |
| Contig_34202 | Contig_49866 | 666 | 86 | PREDICTED: hypothetical protein [ | 6.00E-100 |
| Contig_5307 | Contig_49304 | 720 | 87 | hypothetical protein OsJ_07840 [ | 6.00E-101 |
| Contig_72799 | Contig_34429 | 747 | 69 | cellulose synthase [ | 2.00E-115 |
| Contig_6081 | Contig_29742 | 987 | 8 | unknown protein [ | 3.00E-62 |
| Contig_6523 | Contig_48627 | 1017 | 120 | peroxisomal acyl-CoA oxidase 1A [Solanum cheesmaniae] | 3.00E-166 |
| Contig_10364 | Contig_55032 | 1230 | 200 | transmembrane 9 superfamily protein member 1 [ | 0 |
| Total | 5790 | 587 |
Figure 2Representation of the relationships of the four Neighbor Network based on 26 Lilium contigs, using the POFAD approach. (B) RAxML tree (with 100 rep. bootstrap support values) of the 26 Lilium concatenated consensus, (C) Consensus Network based on parsimony trees (see text) using a threshold of 0.33 split conflict, and (D) Consensus Network for the same trees using a threshold of 10%. Branch length is proportional to the genetic divergence among genotypes (A,B, with the scale bar indicating numbers of substitutions per site), and proportional to the occurrence of splits in the consensus network analysis (D,C).
Figure 3As Figure .