| Literature DB >> 27346943 |
Abstract
The tree of life is currently an active object of research, though next to vertical gene transmission non vertical gene transfers proved to play a significant role in the evolutionary process. To overcome this difficulty, trees of life are now constructed from genes hypothesized vital, on the assumption that these are all transmitted vertically. This view has been challenged. As a frame for this discussion, we developed a partitional taxonomical system clustering taxa at a high taxonomical rank. Our analysis (1) selects RNase P RNA sequences of bacterial, archaeal, and eucaryal genera from genetic databases, (2) submits the sequences, aligned, to k-medoid analysis to obtain clusters, (3) establishes the correspondence between clusters and taxa, (4) constructs from the taxa a new type of taxon, the genetic community (GC), and (5) classifies the GCs: Archaea-Eukaryotes contrastingly different from the six others, all bacterial. The GCs would be the broadest frame to carry out the phylogenies.Entities:
Keywords: RNase P RNA; bioinformatics; classification; cluster analysis; evolution; k-medoid analysis
Year: 2016 PMID: 27346943 PMCID: PMC4912232 DOI: 10.4137/EBO.S38288
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
The OTUs crossed with the 17 clusters.
| CLUSTERS | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OTUs | C1 | C2 | C3 | C4 | C5 | C6 | C7 | C8 | C9 | C10 | C11 | C12 | C13 | C14 | C15 | C16 | C17 | ||
| A | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 40 | 95 | |
| At | 0 | 1 | 0 | 0 | 1 | 5 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 43 | 77 | |
| Ba1 | 0 | 1 | 0 | 2 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 | 73 | |
| FL | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 | 91 | |
| Cy | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 19 | 95 | |
| Co1 | 5 | 3 | 0 | 3 | 0 | 0 | 2 | 0 | 2 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 23 | 26 | |
| Ng | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 | 100 | |
| Al1 | 0 | 1 | 3 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 11 | 36 | |
| Al2 | 0 | 5 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 33 | 79 | |
| Al3 | 0 | 0 | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 | 75 | |
| Bu | 0 | 0 | 7 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 18 | 50 | |
| Ga1 | 0 | 4 | 0 | 1 | 0 | 0 | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 14 | 0 | 0 | 43 | 44 | |
| Ga2 | 0 | 0 | 0 | 7 | 0 | 0 | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 30 | 57 | |
| E1 | 9 | 0 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 57 | 65 | |
| E2 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 27 | 82 | |
| E3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 5 | 40 | |
| 55 | 48 | 23 | 35 | 37 | 33 | 5 | 5 | 13 | 10 | 10 | 19 | 41 | 20 | 16 | 10 | 26 | |||
| τ | 69 | 35 | 30 | 74 | 51 | 100 | 100 | 40 | 46 | 70 | 100 | 95 | 90 | 90 | 88 | 90 | 85 | ||
Note: The boldfaced numbers correspond to the intersection of each OTU with its Cmax and represent the number of genera within the OTU and Cmax in question.
Comparison of statistic descriptors of variable O for the three OIs (analysis on the LFBUs).
| OI TYPE | TABLE | ō (MOI) |
| JB | ωinf | NUMBER OF TAXA WITH ω |
|---|---|---|---|---|---|---|
| Dice index | S3 | 0.54 | 0.28 | ns | 0.46 | 11 |
| Jaccard index | S4 | 0.42 | 0.26 | ns | 0.34 | 10 |
| Cosine index | S5 | 0.56 | 0.26 | ns | 0.49 | 11 |
Abbreviations: JB, Jarque–Bera normality test statistic49; ō, MOIs; ns, nonsignificant.
Figure 1Plot diagrams inferred by CA. Inertia rates in brackets next to the factorial axes (FAs). Squares with C in gray are clusters. Dots with abbreviated names in black are taxa. Factorial planes generated by two factorial axes: (A) F1 and F2; (B) F1 and F3; (C) F3 and F4; (D) F4 and F5; (E) F5 and F6; (F) F5 and F7.
Comparative results between the OIA and CA.
| OIA | CA | |||
|---|---|---|---|---|
| OTUs | Cmax | MOIs | ASSOCIATED CLUSTERS | FACTORIAL PLANES |
| FL | C11 | 0.95 | C11 | F1 × F2 |
| Cy | C12 | 0.95 | C12 | F1 × F2 |
| E2 | C17 | 0.83 | C17 | F1 × F3 |
| Al3 | C14 | 0.82 | C14 | F3 × F4 |
| At | C6 (C7) | 0.78 | C6, C7 | F3 × F4 |
| E1 | C13 | 0.78 | C13 | F1 × F3 |
| A | C1 | 0.76 | C1 | F1 × F3 |
| Co1 | C8 | 0.73 | C8 | F5 × F7 |
| Al2 | C4 | 0.72 | C4 | F3 × F4 |
| Bu | C16 (C3) | 0.67 | C16 | F4 × F5 |
| Ga2 | C15 (C9) | 0.53 | C15 | F4 × F5 |
| Ng | C10 | 0.45 | C10 | F5 × F7 |
| Ga1 | C2 (C9) | 0.34 | C2 | F4 × F5 |
| Ba1 | C2 | 0.32 | C2 | F1 × F12 |
Notes: OTUs sorted in decreasing order of MOI. Clusters in parentheses, in OIA, cluster with the second largest number of genera for a given taxon.
Abbreviations: OIA, overlap index analysis; CA, correspondence analysis.
Figure 2HCA dendrogram. Distance = DI = Manhattan; aggregation method = Ward. Cut at distance ca. 3200. = cluster families: = {C1, C13, C17}; = {C2, C5, C10}; = {C4, C6, C14}; = {C11}; = {C12}; = {C15}; and = {C16}.
Cluster families inferred by the HCA of the TSCs.
| CLUSTER FAMILIES
| |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Ti |
|
|
|
|
|
|
| ω | |
| 115 (0.99) | 2 (0.02) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 117 | 0.99 | |
| 0 (0) | 47 (0.68) | 1 (0.01) | 0 (0) | 0 (0) | 14 (0.46) | 0 (0) | 62 | 0.68 | |
| 0 (0) | 13 (0.41) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 13 | 0.41 | |
| 0 (0) | 12 (0.17) | 54 (0.71) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 66 | 0.71 | |
| 0 (0) | 2 (0.04) | 33 (0.59) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 35 | 0.59 | |
| 0 (0) | 0 (0) | 0 (0) | 10 (1) | 0 (0) | 0 (0) | 0 (0) | 10 | 1 | |
| 0 (0) | 1 (0.03) | 0 (0) | 0 (0) | 18 (0.97) | 0 (0) | 0 (0) | 19 | 0.97 | |
| 0 (0) | 1 (0.03) | 0 (0) | 0 (0) | 0 (0) | 1 (0.08) | 9 (0.90) | 11 | 0.90 | |
| 116 | 12 | 22 | 19 | 68 | 57 | 61 | 355 | ||
Notes: T, PGCs. At the intersection of T and A: n = number of genera in taxon T and cluster family A; n. number of genera in taxon in T, and n number of genera in taxon in A. In brackets, OI between PGCs and CFs. ω. = MOI of T. Mean of . From calculation, ωinf = 0.69.