| Literature DB >> 32011707 |
Greer A Dolby1,2, Matheo Morales1, Timothy H Webster1,3, Dale F DeNardo1, Melissa A Wilson1,4, Kenro Kusumi1.
Abstract
Toll-like receptors (TLRs) are a complex family of innate immune genes that are well characterized in mammals and birds but less well understood in nonavian sauropsids (reptiles). The advent of highly contiguous draft genomes of nonmodel organisms enables study of such gene families through analysis of synteny and sequence identity. Here, we analyze TLR genes from the genomes of 22 tetrapod species. Findings reveal a TLR8 gene expansion in crocodilians and turtles (TLR8B), and a second duplication (TLR8C) specifically within turtles, followed by pseudogenization of that gene in the nonfreshwater species (desert tortoise and green sea turtle). Additionally, the Mojave desert tortoise (Gopherus agassizii) has a stop codon in TLR8B (TLR8-1) that is polymorphic among conspecifics. Revised orthology further reveals a new TLR homolog, TLR21-like, which is exclusive to lizards, snakes, turtles, and crocodilians. These analyses were made possible by a new draft genome assembly of the desert tortoise (gopAga2.0), which used chromatin-based assembly to yield draft chromosomal scaffolds (L50 = 26 scaffolds, N50 = 28.36 Mb, longest scaffold = 107 Mb) and an enhanced de novo genome annotation with 25,469 genes. Our three-step approach to orthology curation and comparative analysis of TLR genes shows what new insights are possible using genome assemblies with chromosome-scale scaffolds that permit integration of synteny conservation data.Entities:
Keywords: immune system; orthology; reference genome; reptile; synteny; testudines
Year: 2020 PMID: 32011707 PMCID: PMC7058155 DOI: 10.1093/gbe/evaa016
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Assembly and Gene Statistics across Genome Builds
| Assembly Statistics | |||||
|---|---|---|---|---|---|
| Total Length (Mb) | L50 (No. Scaffolds) | N50 (Mb) | Longest Scaffold (Mb) | ||
| gopAga1.0 | 2,399 | 2,592 | 0.25 | 2 | |
| gopAga2.0 | 2,338 | 26 | 28.36 | 107 | |
| Assembly BUSCO Statistics | Annotation Statistics | ||||
| Complete | Fragmented | Missing | Total Genes | Mean Gene Length (bp) | |
| gopAga1.0 | 92.7% | 4.6% | 2.7% | 20,172 | 19,062 |
| gopAga2.0 | 93.7% | 3.7% | 2.6% | 25,469 | 21,364 |
note.—Assembly BUSCO statistics are from the Tetrapoda gene set in the genome assemblies. Total genes and mean gene length are calculated based on the annotations for gopAga1.0 and gopAga2.0 (see supplementary table S2, Supplementary Material online).
. 3.—TLR21-like gene discovered. (A) Syntenically conserved regions of TLR21-like (gold, middle) for 22 tetrapods. TLR21-like does not colocalize near TLR21 within the genome. Anolis carolinensis has LINGO1 (a leucine-rich repeat gene) instead of TLR21-like (yellow outline). Synteny information was not available for Celonoides abingdonii; therefore, only the present/absence of genes are shown (i.e., no black line). (B) Phylogenetic reconstruction of TLR11 gene subfamily including the TLR21-like gene discovered here (gold highlight). Posterior support values for nodes are 1 unless labeled. (C) Cartoon showing how TLR21 protein initiates the innate immune response by recognizing the CpG nucleotide sequence.
. 2.—TLR8 gene expansion in turtles. (A) Syntenically conserved region for TLR7 and TLR8 (gold, middle) for 21 tetrapods (Chelonoidis abingdonii is excluded because its TLR8 orthology cannot be determined from available data). Genes are labeled at the top and cladogram is shown on left. Black horizontal lines represent contiguous sequence. TLR8 is expanded once in testudines and crocodilian archosaurs (TLR8B) and a second time in testudines (TLR8C) followed by pseudogenization (gray genes). (B) Phylogenetic reconstruction of TLR7 gene subfamily; reptiles are bolded, gene expansions are highlighted in gold. Posterior support values for nodes are 1 unless labeled. (C) Cartoon summarizing how TLR7 and TLR8 proteins initiate the innate immune response by recognizing single-stranded RNA (ssRNA) viruses. (D) A summary of inferred gene duplication based on results in this article.
. 1.—Genome annotation and assembly. (A) Summary of methodology. (B) Results from chromosome painting of conserved synteny blocks shown between the Mojave Desert tortoise genome (gopAga2.0) and chicken (top right), and green anole lizard (bottom right). The following pairs of scaffolds (left) and chromosomes (right) between Mojave Desert tortoise and chicken are particularly conserved: B-27/28, D-15, E-14, G-11, I-17, O-21, W-22. Black bars show 50-Mb scale. Microchromosomes are labeled “LG”; there are no conserved synteny blocks for LGa, LGd, and LGg. Only homologous linkage blocks are colored.