| Literature DB >> 34254817 |
Liangzhi Li1,2, Zhenghua Liu1,2, Zhicheng Zhou3, Min Zhang1,2, Delong Meng1,2, Xueduan Liu1,2, Ye Huang4,5, Xiutong Li4,5, Zhen Jiang4,5, Shuiping Zhong6,7, Lukasz Drewniak8, Zhendong Yang8, Qian Li1, Yongjun Liu3,9, Xiaolong Nan10, Biguang Jiang10, Chengying Jiang4,5, Huaqun Yin1,2.
Abstract
DPANN is known as highly diverse, globally widespread, and mostly ectosymbiotic archaeal superphylum. However, this group of archaea was overlooked for a long time, and there were limited in-depth studies reported. In this investigation, 41 metagenome-assembled genomes (MAGs) belonging to the DPANN superphylum were recovered (18 MAGs had average nucleotide identity [ANI] values of <95% and a percentage of conserved proteins [POCP] of >50%, while 14 MAGs showed a POCP of <50%), which were analyzed comparatively with 515 other published DPANN genomes. Mismatches to known 16S rRNA gene primers were identified among 16S rRNA genes of DPANN archaea. Numbers of gene families lost (mostly related to energy and amino acid metabolism) were over three times greater than those gained in the evolution of DPANN archaea. Lateral gene transfer (LGT; ∼45.5% was cross-domain) had facilitated niche adaption of the DPANN archaea, ensuring a delicate equilibrium of streamlined genomes with efficient niche-adaptive strategies. For instance, LGT-derived cytochrome bd ubiquinol oxidase and arginine deiminase in the genomes of "Candidatus Micrarchaeota" could help them better adapt to aerobic acidic mine drainage habitats. In addition, most DPANN archaea acquired enzymes for biosynthesis of extracellular polymeric substances (EPS) and transketolase/transaldolase for the pentose phosphate pathway from Bacteria. IMPORTANCE The domain Archaea is a key research model for gaining insights into the origin and evolution of life, as well as the relevant biogeochemical processes. The discovery of nanosized DPANN archaea has overthrown many aspects of microbiology. However, the DPANN superphylum still contains a vast genetic novelty and diversity that need to be explored. Comprehensively comparative genomic analysis on the DPANN superphylum was performed in this study, with an attempt to illuminate its metabolic potential, ecological distribution and evolutionary history. Many interphylum differences within the DPANN superphylum were found. For example, Altiarchaeota had the biggest genome among DPANN phyla, possessing many pathways missing in other phyla, such as formaldehyde assimilation and the Wood-Ljungdahl pathway. In addition, LGT acted as an important force to provide DPANN archaeal genetic flexibility that permitted the occupation of diverse niches. This study has advanced our understanding of the diversity and genome evolution of archaea.Entities:
Keywords: DPANN superphylum; comparative genomics; evolution; genome reduction; lateral gene transfer
Year: 2021 PMID: 34254817 PMCID: PMC8407392 DOI: 10.1128/mSystems.00602-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1Spearman rank correlation between the genome size and number of CDS, G+C content, and average protein length and coding density, colored according to habitat (top) and taxonomy (bottom) and calculated with OriginPro 2020b. Spearman ρ and the associated P value are shown for each scatterplot (a P value of ≤0.05 was considered significant).
FIG 2Profile of presence or absence of metabolic or biosynthetic capacities in DPANN archaea and Euryarchaeota (outgroup) based on annotation done with eggNOG-mapper v. 2.0 (default parameters: E value < 10−3, bit score > 60). The phylogenomic tree shown in the left was constructed based on whole-genome sequences with CVTree3 (k-mer = 4; Euryarchaeota was used as outgroup), and the phylogenetic groups were colored according to the original taxonomic assignment in the genome database. The types of habitats from which each genome was recovered are shown in the first bar (on the left) by different colors. The solid and open cells represent the presence and absence of the enzymes, respectively. The cells involved in different pathways are distinguished with different colors. Detailed description for abbreviations is provided in Data S1 at https://doi.org/10.6084/m9.figshare.14806173.v1.
FIG 3(a) Evolutionary timeline of the DPANN archaea (left) predicted with the RelTime method in MEGA X. Data of asteroid impacts, solar luminosity, and fluctuations of atmospheric oxygen and carbon dioxide amount are displayed synchronously with divergence times in the form of time panels. The estimated occurrence time of the Great Oxidation Event (GOE) (∼2,400 Mya) is marked with a red dotted line. Nodes applying corrections provided by Timetree (http://www.timetree.org) are indicated with a red star. Ancestral genome content reconstruction of DPANN archaea (right) was performed with Dollo parsimony algorithms implemented in the COUNT program. The numbers of gene families of each genome are shown before the names of organisms. The numbers of gene families of the reconstructed respective most recent common ancestor (MRCA) are shown on the nodes. The numbers of gain and loss events are marked on each lineage of the tree. Plus signs indicate gain events, and minus signs indicate loss events. The stacked-bar diagram (right) shows sizes of genes shared by partial genomes (i.e., the accessory genome) and numbers of strain-specific genes (i.e., unique genes). (b) Functional proportions of DPANN gene families undergoing gain and loss events based on COG categories. Detailed description for the COG categories is provided in Data S1 at https://doi.org/10.6084/m9.figshare.14806173.v1.
FIG 4Distributions and relations of COG categories, with the predicted laterally transferred genes (LTGs) annotated and taxonomy of the donors shown. Identification of LTGs was performed through the Integrated Microbial Genomes (IMG) system based on the principles described in Materials and Methods. FCB group = Fibrobacteres-Chlorobi-Bacteroidetes superphylum; PVC group = Planctomycetes-Verrucomicrobia-Chlamydia superphylum. Descriptions for the COG categories and each LTGs are provided in Data S1 at https://doi.org/10.6084/m9.figshare.14806173.v1 and Table S5 at https://doi.org/10.6084/m9.figshare.14806140.v1.
FIG 5Metabolic reconstruction of major metabolic pathways in DPANN archaea. Annotation was performed with eggNOG-mapper v. 2.0 (default parameters: E value < 10−3, bit score > 60). Each phylum of DPANN is depicted as a colored circle. Black arrows indicate that the corresponding proteins were detected for the pathways, whereas gray arrows indicate that the corresponding proteins were not detected. Full circles represent over 50% presence, while half circles represent 15 to 50% presence. Detailed distribution data are provided in Table S6 at https://doi.org/10.6084/m9.figshare.14806212.v1. Detailed description for abbreviations is provided in Data S1 at https://doi.org/10.6084/m9.figshare.14806173.v1.