| Literature DB >> 34849853 |
José Cerca1,2,3, Ellie E Armstrong1,4, Joel Vizueta5,6, Rosa Fernández7, Dimitar Dimitrov8, Bent Petersen9,10, Stefan Prost11,12,13, Julio Rozas5, Dmitri Petrov4, Rosemary G Gillespie1.
Abstract
Spiders (Araneae) have a diverse spectrum of morphologies, behaviors, and physiologies. Attempts to understand the genomic-basis of this diversity are often hindered by their large, heterozygous, and AT-rich genomes with high repeat content resulting in highly fragmented, poor-quality assemblies. As a result, the key attributes of spider genomes, including gene family evolution, repeat content, and gene function, remain poorly understood. Here, we used Illumina and Dovetail Chicago technologies to sequence the genome of the long-jawed spider Tetragnatha kauaiensis, producing an assembly distributed along 3,925 scaffolds with an N50 of ∼2 Mb. Using comparative genomics tools, we explore genome evolution across available spider assemblies. Our findings suggest that the previously reported and vast genome size variation in spiders is linked to the different representation and number of transposable elements. Using statistical tools to uncover gene-family level evolution, we find expansions associated with the sensory perception of taste, immunity, and metabolism. In addition, we report strikingly different histories of chemosensory, venom, and silk gene families, with the first two evolving much earlier, affected by the ancestral whole genome duplication in Arachnopulmonata (∼450 Ma) and exhibiting higher numbers. Together, our findings reveal that spider genomes are highly variable and that genomic novelty may have been driven by the burst of an ancient whole genome duplication, followed by gene family and transposable element expansion.Entities:
Keywords: Araneae; arthropod; gene family; hawai'i; repeatome; transposable element
Mesh:
Year: 2021 PMID: 34849853 PMCID: PMC8693713 DOI: 10.1093/gbe/evab262
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Transposable element and repeat characterization (A) Web diagram showing the representation of TE and repeats in the assemblies. Assemblies and correspondent assembly sizes are represented on the edges of the web diagram. Different transposable element families or repeats are presented in different colors on the plot, and the total content masked by RepeatMasker is shown in blue. The numbers for each web-line indicate the percent of the genome occupied by each transposable element, or the percent masked. (B) Repeat/transposable element landscape plots for the various assemblies. The three most represented transposable element categories are present for every genome (e.g. DNA/TcMar, DNA/hAT, and unknown for T. kauaiensis). Each plot shows the Kimura substitution level (x axis) and percent of genome covered by repeats (y axis).
Gene family expansion (A) Tree topology obtained for single-copy orthologs. Numbers in blue indicate significantly expanded gene families as determined by CAFE. (B) Treemap representation of Gene Ontology Biological Function Annotation of the significantly expanded gene families as retrieved by REVIGO. Branches/Nodes with significant expansions, including Araneidae, Theridiidae, and Node 1 are represented together with the different genomes.
Venom gene phylogenies. Phylogenies for the ten largest orthogroups of identified venom genes. For each tree, we indicate the Orthogroup ID and tree scale. Different colors correspond to different species, as displayed in the legend. Arrows highlight scorpion toxin genes and show that most orthogroups were already present in before the split between scorpions and spiders.
Gustatory and ionotropic reception phylogenies (A) Gustatory receptor phylogeny. The phylogeny has 5,595 genes and includes every GR identified in the assemblies herein studied. (B) Ionotropic receptor phylogeny. The phylogeny has 1,932 genes and includes every IR identified in the assemblies herein studied. Arrows indicate major duplications private to specific genomes, whereas squared arrows highlight potentially conserved IR genes (small branch length and small duplicates).
Phylogeny of other chemosensory genes. (A) CCP phylogeny; (B) CD36-SNMP phylogeny; (C) NPC2, phylogeny; (D) OBP-like phylogeny.
Silk genes (spidroins) phylogeny. These include Major and Minor Ampullate spidroins (Ma/MiSp), Aciniform spidroins (AcSp), Aggregate spidroins (AgSp), Flagelliform spridroins (Flag), Pyriform spidroins (PySp), an unidentified spidroins group present in the Trichonephila clavipes genome and the Tubuliform spidroins (TuSp).