| Literature DB >> 23574621 |
Sree V Chintapalli1, Gaurav Bhardwaj, Jagadish Babu, Loukia Hadjiyianni, Yoojin Hong, George K Todd, Casey A Boosalis, Zhenhai Zhang, Xiaofan Zhou, Hong Ma, Andriy Anishkin, Damian B van Rossum, Randen L Patterson.
Abstract
BACKGROUND: The recA/RAD51 gene family encodes a diverse set of recombinase proteins that affect homologous recombination, DNA-repair, and genome stability. The recA gene family is expressed across all three domains of life - Eubacteria, Archaea, and Eukaryotes - and even in some viruses. To date, efforts to resolve the deep evolutionary origins of this ancient protein family have been hindered by the high sequence divergence between paralogous groups (i.e. ~30% average pairwise identity).Entities:
Mesh:
Substances:
Year: 2013 PMID: 23574621 PMCID: PMC3637515 DOI: 10.1186/1471-2164-14-240
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Qualitative and quantitative analysis of 17 sub-groups within the Reca/RAD51 superfamily
| | | ||||||
| | | | | ||||
| | | | | ||||
| | | | | ||||
| | | | | ||||
| | | | | ||||
| | | | | ||||
| | | | | ||||
| | | | | ||||
| | | | | ||||
| | | | | ||||
| | | | | ||||
| | | | |||||
| | | | |||||
| | | | |||||
| | | | | ||||
Abbreviations are as follows: Protists (Pr), Insects (In), Nematodes (Nm), Fungi (Fu), Plants (Pl), and Chordate (Ch). Parentheses in RAD51B, D and XRCC2, XRCC3 groups denote species which are putative members of the respective group but were not included in the phylogenetic inference because they disrupt the overall topology and cannot be unambiguously assigned. These 14 sequences are listed below along with their GI numbers and species names.
XRCC2_303290256_Micromonas_pusilla_Plants, XRCC2_332024988_Acromyrmex_echinatior_Insecta, XRCC2_255074101_Micromonas_Plants, XRCC2_66803939_Dictyostelium_discoideum_Protists, XRCC2_281210087_Polysphondylium_pallidum_Protists, RAD51D_170071670_Culex_quinquefasciatus_Insecta, RAD51D_321474080_Daphnia_pulex_Animal, RAD51D_111226459_Dictyostelium_discoideum_Protist, XRCC3_307191609_Harpegnathos_saltator_Insecta, XRCC3_281201100_Polysphondylium_pallidum_Protist, XRCC3_170044836_Culex_quinquefasciatus_Insecta, XRCC3_307171500_Camponotus_floridanus_Insecta, RAD51B_45685353_Chlamydomonas_reinhardtii_Protists, ID9_Unknown2_118195642_Cenarchaeum_symbiosum_Protists.
Figure 2Phylogenetic Inference of the recA/RAD51 Superfamily using PHYRN-NJ. (A) Unrooted phylogram of recA/RAD51 clades of Set-1 of 545 sequences. (B) Unrooted phylogram of recA/RAD51 clades of Set-2 of 633 sequences (comprising of Set-1 + 88 metagenomic sequences). Confidence values are calculated by Jackknife resampling for 5000 replicates for both the sets. Scale bar is proportional to PHYRN-derived Euclidean distance scaled between 0-1.
Figure 3Evolution of recA sequences. (A) A phylogenetic tree of 242 recA sequences inferred using PHYRN-NJ and rooted with Spirochaetes. Branch statistics are derived from Jacknife resampling tests. The notation (-) is indicative of no support for the given branching pattern. Scale bar is proportional to PHYRN-derived Euclidean distance scaled between 0–1.
Figure 1Distribution and Characterization of PHYRN-Derived Phylogenetic Signal in recA/RAD51 Superfamily. (A) Distribution of PHYRN Phylogenetic signal (%identity x %coverage) for recA/RAD51 superfamily. PHYRN score is calculated from alignments between full length query sequences and the respective recA/RAD51-specific PSSM library. PHYRN scores are represented as log-scaled values ranging from 0 (blue) to 4 (red). (B) Graphical representation of PHYRN phylogenetic signal of recA/RAD51 sequences (signal) as compared to their randomized versions (i.e. noise, 100 replicates). Comparative analysis is represented as Difference Ratio (DR).
Figure 4Model of the Evolutionary History of the recA/RAD51 Superfamily. Graphical representation of a model for evolution of recA/RAD51 family based on the phylogenetic trees obtained using PHYRN methodology. Endosymbiotic gene transfer events from cyanobacteria to protists and algae to plants are labeled. (*) represents Meiosis specific gene.