Pawan Kumar Jayaswal1,2, Vivek Dogra1, Asheesh Shanker3, Tilak Raj Sharma1, Nagendra Kumar Singh1.
Abstract
Rapid advances in DNA sequencing technologies have resulted in the accumulation of large data sets in the public domain, facilitating comparative studies that provide novel insights into the evolution of life. Phylogenetic studies across the eukaryotic taxa have been reported, but only on the basis of a limited number of genes. Here we present a genome-wide analysis across different plant, fungal, protist, and animal species, with reference to the 36,002 expressed genes of the rice genome. Our analysis revealed 9831 genes unique to rice and 98 genes conserved across all 49 eukaryotic species analysed. The 98 genes conserved across diverse eukaryotes mostly exhibited binding and catalytic activities and shared common sequence motifs, and hence appeared to have a common origin. The 98 conserved genes belonged to 22 functional gene families including 26S protease, actin, ADP-ribosylation factor, ATP synthase, casein kinase, DEAD-box protein, DnaK, elongation factor 2, glyceraldehyde 3-phosphate, phosphatase 2A, ras-related protein, Ser/Thr protein phosphatase family protein, tubulin, ubiquitin and others. The consensus Bayesian eukaryotic tree of life developed in this study demonstrated widely separated clades of plants, fungi, and animals. Musa acuminata provided an evolutionary link between monocotyledons and dicotyledons, and Salpingoeca rosetta provided an evolutionary link between fungi and animals, indicating that protozoan species are close relatives of fungi and animals. The divergence times for 1176 species pairs were estimated accurately by integrating fossil information with synonymous substitution rates in the comprehensive set of 98 genes. The present study provides valuable insight into the evolution of eukaryotes.
Year: 2017 PMID: 28922368 PMCID: PMC5603157 DOI: 10.1371/journal.pone.0184276
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Information on the 49 model organisms selected for the comparative genomic analysis.
| Kingdom/Phylum | Sub Category (Scientific Name) | Genome Size (Mb) | EST Unigene / CDS / cDNA |
|---|---|---|---|
| Kingdom Plantae | | | |
| Eudicotyledons | | 125 | 30633 |
| | | 858 | 59,515 |
| | | 1,115 | 35982 |
| | | 390 | 18045 |
| | | 485 | 15056 |
| | | 900 | 18071 |
| | | 487 | 22501 |
| Liliopsida | | 272 | 10698 |
| | | 389 | 44235 |
| | | 2,500 | 92266 |
| | | 17,000 | 56955 |
| | | 5,100 | 26945 |
| | | 772 | 13736 |
| Zingiberales | | 523 | 36549 |
| Chlorophyta | | 121 | 7579 |
| Streptophyta | | 487 | 17573 |
| Gymnosperm | | 20,100 | 17390 |
| | | 20,000 | 27848 |
| Kingdom Fungi & Protista | | | |
| Ascomycota | | 37 | 12,063 |
| | | 59.9 | 17,708 |
| | | 39.9 | 17,073 |
| Basidiomycota | | 89 | 15,979 |
| Mucormycotina | | 45.3 | 17459 |
| | | 40.3 | 11054 |
| | | 18.4 | 6,210 |
| Oomycetes | | 240 | 8920 |
| Apicomplexa | | 63 | 6237 |
| Dictyosteliida | | 34 | 6187 |
| Chytridiomycota | | 55 | 11736 |
| Kingdom Animalia | | | |
| Mammalia | | 2,860 | 45364 |
| | | 2,910 | 130055 |
| | | 2,500 | 30386 |
| | | 2,400 | 20479 |
| | | 3035 | 29026 |
| | | 3,080 | 22451 |
| Reptiles | | 2,200 | 20765 |
| | | 1,780 | 25137 |
| Actinopterygii | | 1,412 | 53559 |
| Amphibia | | 3,000 | 31434 |
| Ascidiacea | | 160 | 28121 |
| Aves | | 1,050 | 34025 |
| Echinodermata | | 814 | 14718 |
| Insecta | | 1,800 | 24392 |
| | | 180 | 17132 |
| | | 530 | 13952 |
| | | 278 | 14672 |
| Nematoda | | 97 | 23151 |
| Cnidaria-Anthozoa | | 450 | 14574 |
| Cnidaria-Hydrozoa | | 1,000 | 11072 |
* Unigene: https://www.ncbi.nlm.nih.gov/unigene
◊ CDS: http://banana-genome.cirad.fr/download
†: https://www.broadinstitute.org/fungal-genome-initiative
● cDNA: http://asia.ensembl.org/index.html
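The table above lists 18 plant, 11 fungal and protist, and 20 animal species, and the abstract reports divergence times for 1176 species pairs. A minimal sketch of the arithmetic linking the two figures follows; the group counts are tallied from the table rows, and the variable names are illustrative only.

```python
from math import comb

# Species counts per group, tallied from the rows of the table above.
species_per_group = {
    "Plantae": 18,
    "Fungi & Protista": 11,
    "Animalia": 20,
}

total_species = sum(species_per_group.values())

# Number of unordered species pairs: n choose 2 = n * (n - 1) / 2.
species_pairs = comb(total_species, 2)

print(total_species, species_pairs)
```

With 49 species this gives 49 × 48 / 2 = 1176 pairs, matching the number of species pairs analysed.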
Fig 1. Pipeline to identify and analyse the conserved genes among 49 species and to develop the phylogenetic tree.
Flow diagram showing the scheme of the genome-wide comparative analysis of rice genes against 48 other eukaryotic species.
Fig 2. Annotation of uniquely expressed genes in rice.
Functional annotations of 9,831 genes uniquely expressed in rice in comparison to 48 other eukaryotic species. Only annotations with more than ten genes per family are shown.
Fig 3. Clustering of homologous genes between rice and other plant species.
Chromosome-wise distribution of expressed rice gene homologs in 16 other plant species. Values in parentheses show the number of homologous genes in each species.
Frequency distribution of expressed rice gene homologs in seven fungal and four protist species.
The numbers in each column represent the distribution of expressed homologous rice gene sequences among the total unigenes of the respective organism. Columns Chr1 to Chr12 give the number of conserved genes on each rice chromosome.
| Fungal and protist species | Total no. of genes | Chr1 | Chr2 | Chr3 | Chr4 | Chr5 | Chr6 | Chr7 | Chr8 | Chr9 | Chr10 | Chr11 | Chr12 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | 17459 | 142 | 116 | 152 | 95 | 116 | 93 | 99 | 81 | 68 | 72 | 67 | 78 | 1179 |
| | 11054 | 96 | 92 | 114 | 58 | 87 | 72 | 52 | 44 | 34 | 34 | 34 | 35 | 752 |
| | 17708 | 108 | 83 | 111 | 46 | 84 | 69 | 58 | 43 | 36 | 32 | 36 | 40 | 746 |
| | 12063 | 106 | 83 | 113 | 39 | 85 | 59 | 56 | 42 | 33 | 31 | 30 | 37 | 714 |
| | 17073 | 103 | 73 | 108 | 45 | 80 | 62 | 48 | 37 | 34 | 28 | 29 | 34 | 681 |
| | 6210 | 101 | 78 | 105 | 36 | 76 | 56 | 53 | 33 | 32 | 30 | 29 | 34 | 663 |
| | 15979 | 88 | 74 | 98 | 31 | 63 | 54 | 47 | 29 | 24 | 31 | 25 | 29 | 593 |
| | 8920 | 194 | 165 | 187 | 91 | 131 | 102 | 92 | 74 | 61 | 59 | 52 | 58 | 1266 |
| | 11736 | 116 | 109 | 127 | 46 | 86 | 75 | 58 | 47 | 47 | 33 | 41 | 44 | 829 |
| | 6237 | 83 | 65 | 87 | 29 | 66 | 51 | 40 | 27 | 32 | 29 | 28 | 28 | 565 |
| | 6187 | 73 | 60 | 82 | 32 | 48 | 44 | 32 | 25 | 20 | 24 | 23 | 24 | 487 |
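In each row of the table above, the twelve per-chromosome counts add up to the final Total value. The sketch below checks this for three rows copied from the table; the function names and the tuple layout are illustrative assumptions, not part of the original analysis.

```python
# Three rows copied from the table above:
# (total genes in the organism, counts for rice Chr1..Chr12, reported Total).
rows = [
    (17459, [142, 116, 152, 95, 116, 93, 99, 81, 68, 72, 67, 78], 1179),
    (11054, [96, 92, 114, 58, 87, 72, 52, 44, 34, 34, 34, 35], 752),
    (8920, [194, 165, 187, 91, 131, 102, 92, 74, 61, 59, 52, 58], 1266),
]


def row_is_consistent(per_chromosome, reported_total):
    """Return True when the per-chromosome counts sum to the reported total."""
    return sum(per_chromosome) == reported_total


def chromosome_shares(per_chromosome):
    """Return each chromosome's share of the row total as a fraction."""
    total = sum(per_chromosome)
    return [count / total for count in per_chromosome]


for total_genes, per_chromosome, reported_total in rows:
    print(total_genes, row_is_consistent(per_chromosome, reported_total))
```

The same check can be extended to the remaining rows to catch transcription errors in the table.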
Fig 4. Clustering of homologous genes between rice and animal species.
Chromosome-wise distribution of rice gene homologs expressed in 20 different animal species. Figures in parentheses indicate the total number of EST-unigenes in the respective animal species.
Fig 5. Gene annotation.
Gene ontology (GO) based annotation of 98 rice genes conserved across 49 eukaryotic species using the BLAST2GO programme. The genes were classified based on three different criteria: (a) Biological process, (b) Cellular localization and (c) Molecular function.
Fig 6. Phylogenetic tree of conserved gene sequences of Oryza sativa.
(a) Phylogenetic tree of 98 expressed rice gene homologs conserved across 49 eukaryotic species. An unrooted Bayesian tree was constructed after alignment of the 98 rice CDS sequences. The posterior probability of each clade is shown at the respective node. (b) Multiple sequence alignment of 22 of the 98 rice genes, taking one representative from each functional category. Nucleotide bases are color coded to facilitate visualization of the homology. The Jalview alignment picture was cropped to show the conserved parts of the genes. Black bars at the bottom show the level of sequence conservation.
Fig 7. Eukaryotic tree of life.
A rooted eukaryotic phylogenetic tree based on concatenated sequences of 98 rice gene homologs conserved across 49 eukaryotic species, built using a Bayesian approach (MrBayes v3.2). The Bayesian posterior probability for each node is 1. The tree was rooted using the Chlamydomonas reinhardtii (green alga) sequence.
Divergence times of 50 sampled pairs of species out of the total of 1,176 species pairs analysed (Table L in S2 File).
| Organism Combination | Calibration Time (Ma) | Reference | Synonymous substitution rate based on mean Ks value | Estimated date (Ma) |
|---|---|---|---|---|
| | 110 | [ | 1.59E-09 | 53.46 |
| | 110 | [ | 1.04545E-09 | 31.73 |
| | 110 | [ | 5.91E-10 | 42.37 |
| | 110 | [ | 2.27273E-09 | 22.03 |
| | 110 | [ | 1.54545E-09 | 61.69 |
| | 54 | [ | 3.24074E-09 | 24.55 |
| | 54 | [ | 3.88889E-09 | 36.84 |
| | 54 | [ | 3.42593E-09 | 7.35 |
| | 100 | [ | 3.4875E-09 | 29.89 |
| | 200 | [ | 1.40E-09 | 46.65 |
| | 270 | [ | 5.92E-10 | 49.92 |
| | 350 | [ | 8.30E-10 | 35.84 |
| | 968 | [ | 2.22E-10 | 180.18 |
| | 1500 | [ | 1.68E-10 | 515.57 |
| | 1547 | [ | 7.02E-11 | 209.76 |
| | 1642 | [ | 1.38E-10 | 1117.52 |
| | 400 | [ | 6.3441E-10 | 91.56 |
| | 400 | [ | 4.95979E-10 | 93.95 |
| | 400 | [ | 4.81E-10 | 112.92 |
| | 400 | [ | 5.87432E-10 | 96.42 |
| | 1538 | [ | 1.76E-10 | 495.43 |
| | 1642 | [ | 1.80E-10 | 586.59 |
| | 1642 | [ | 1.81E-10 | 439.64 |
| | 315 | [ | 7.62E-10 | 235.26 |
| | 670 | [ | 3.99E-10 | 143.26 |
| | 700 | [ | 3.51E-10 | 404.14 |
| | 964 | [ | 2.82E-10 | 343.79 |
| | 1547 | [ | 1.87E-10 | 337.97 |
| | 670 | [ | 4.89E-10 | 182.72 |
| | 741 | [ | 1.60E-10 | 102.83 |
| | 1298 | [ | 1.89E-10 | 174.34 |
| | 600 | [ | 3.98E-10 | 124.23 |
| | 400 | [ | 8.22E-10 | 103.89 |
| | 445 | [ | 6.42E-10 | 80.41 |
| | 450 | [ | 6.29E-10 | 105.92 |
| | 964 | [ | 2.01E-10 | 308.96 |
| | 445 | [ | 5.40E-10 | 60.81 |
| | 315 | [ | 6.91E-10 | 101.48 |
| | 350 | [ | 7.52E-10 | 65.71 |
| | 340 | [ | 1.16E-09 | 68.65 |
| | 340 | [ | 8.96E-10 | 95.81 |
| | 340 | [ | 7.54E-10 | 84.48 |
| | 1547 | [ | 1.85E-10 | 345.90 |
| | 340 | [ | 7.54E-10 | 84.48 |
| | 300 | [ | 7.75E-10 | 136.14 |
| | 964 | [ | 2.74E-10 | 159.12 |
| | 66 | [ | 8.49E-10 | 13.44 |
| | 1547 | [ | 1.62E-10 | 480.40 |
| | 1547 | [ | 2.29E-10 | 545.85 |
| | 1547 | [ | 1.51E-10 | 820.67 |
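The table above combines a fossil calibration time with a synonymous substitution rate derived from mean Ks. A minimal sketch of how such values are commonly related is given below, assuming the standard molecular-clock relation r = Ks / (2T) and its inverse T = Ks / (2r). This record does not confirm that the authors used exactly this relation, although several tabulated rates are numerically consistent with it. The function names and the Ks value of 0.23 are hypothetical, chosen only so that the example reproduces the tabulated rate of 1.04545E-09 at a 110 Ma calibration.

```python
YEARS_PER_MA = 1e6  # one million years


def substitution_rate(mean_ks, calibration_ma):
    """Synonymous substitutions per site per year, assuming r = Ks / (2T)."""
    return mean_ks / (2.0 * calibration_ma * YEARS_PER_MA)


def divergence_time_ma(ks, rate):
    """Divergence time in Ma from a pairwise Ks, assuming T = Ks / (2r)."""
    return ks / (2.0 * rate) / YEARS_PER_MA


# Hypothetical mean Ks (not given in the table) with a 110 Ma calibration.
rate = substitution_rate(0.23, 110)
print(f"{rate:.5e}")

# Applying the rate back to the same Ks recovers the calibration time.
print(divergence_time_ma(0.23, rate))
```

The factor of 2 reflects that substitutions accumulate independently along both lineages after the split.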