| Literature DB >> 26938477 |
Michael P Denyer1,2, Dammy Y Pinheiro1, Oliver A Garden1, Adrian J Shepherd2.
Abstract
The Forkhead box transcription factor FoxP3 is pivotal to the development and function of regulatory T cells (Tregs), which make a major contribution to peripheral tolerance. FoxP3 is believed to perform a regulatory role in all the vertebrate species in which it has been detected. The prevailing view is that FoxP3 is absent in birds and that avian Tregs rely on alternative developmental and suppressive pathways. Prompted by the automated annotation of foxp3 in the ground tit (Parus humilis) genome, we have questioned this assumption. Our analysis of all available avian genomes has revealed that the foxp3 locus is missing, incomplete or of poor quality in the relevant genomic assemblies for nearly all avian species. Nevertheless, in two species, the peregrine falcon (Falco peregrinus) and the saker falcon (F. cherrug), there is compelling evidence for the existence of exons showing synteny with foxp3 in the ground tit. A broader phylogenomic analysis has shown that FoxP3 sequences from these three species are similar to crocodilian sequences, the closest living relatives of birds. In both birds and crocodilians, we have also identified a highly proline-enriched region at the N terminus of FoxP3, a region previously identified only in mammals.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26938477 PMCID: PMC4777427 DOI: 10.1371/journal.pone.0150988
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The genomic conservation of foxp3 with its downstream flanking genes, cacna1f and ccdc22.
| Class | No. of genomes | |||||||
|---|---|---|---|---|---|---|---|---|
| Mammals | 105 | 87 | 86 | 88 | 1 | 2 | 0 | 0 |
| Amphibians | 2 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
| Reptiles | 13 | 5 | 6 | 4 | 0 | 0 | 1 | 2 |
Summary data showing the consistent association between foxp3 and its two downstream flanking genes, ccdc22 (or ccdc22-like) on the opposing strand and cacna1f (or cacna1f-like) on the same strand. Summary data was compiled from all NCBI scaffold and chromosome assembled genomes (S1 Table).
aFive NCBI genomes had multiple scaffolds in this region: the Weddell seal (Leptonychotes weddellii) had an annotation of cacna1f with a downstream gap before gene ppp1r3f, which in mammals is typically found downstream of foxp3, and had foxp3 and ccdc22 annotated on a separate scaffold (and terminating upstream of ccdc22); there were two cases where ccdc22 and cacna1f are separated; there was one case where ccdc22 and foxp3 are separated; and one case where there are partial ccdc22 annotations on two scaffolds. Approximately a quarter of mammalian genomes also had another gene or pseudogene (e.g. heat shock protein family B (small) member 1 pseudogene 2 in humans) annotated between ccdc22 and cacna1f.
bThree missing gene annotations (one cacna1f and two ccdc22) are attributed to incomplete assemblies in the foxp3 gene region.
cIn the gray short-tailed opossum (Monodelphis domestica) NCBI genome ccdc22 is not annotated, but it is annotated in Ensembl.
dThe western painted turtle (Chrysemys picta bellii) was the only identified reptilian genome without the conserved location of both flanking genes, as cacna1f and ccdc22 were both annotated on the same scaffold, but remote from each other. foxp3 was not annotated. Both alligators also had an inversion of ccdc22, but this did not alter the conserved location of this gene with respect to foxp3 and cacna1f.
eThe garter snake (Thamnophis sirtalis) genome was missing foxp3 and cacna1f annotations. However, downstream of ccdc22, there is an annotation of cacna1d-like with numerous masked repeat regions, hence a possible mis-annotation for cacna1f. Upstream the scaffold terminates after ccdc22 but before the predicted foxp3 locus.
Fig 1The genomic context of avian foxp3.
(A) The gene neighbourhood of foxp3 in a number of archosaurs showing the significant masked repeats of the American crow between hsd17b10 and cacna1f (gaps in the line indicate intergenic repeat regions), the approximate location of the foxp3 locus in the saker and peregrine falcon (marked by a pink arrow with a dashed line), and the low nucleotide sequence identity of foxp3 between the American alligator and the ground tit. In the displayed region the overall masked repeats are as follows: 54.7% in the American crow, 28.0% in the saker falcon, 31.4% in the peregrine falcon, 6.8% in the ground tit and 0.02% in the American alligator. (B) The foxp3 gene neighbourhood of the mouse, American alligator and ground tit, showing the gene variation upstream of foxp3 in mammals, birds and crocodilians and the low nucleotide sequence identity of murine foxp3 versus the American alligator and ground tit. (C) A comparison of the approximate foxp3 loci in both falcons with the ground tit foxp3 locus, showing masked repeats (gaps in the line) and predicted exons (rectangles). All subfigures were produced using Easyfig [47] with BLASTN identity comparisons indicated by the scales on each subfigure.
Avian genome quality and its relationship to the annotation and assembly of the foxp3 gene neighbourhood.
| Zero annotated scaffolds | Single gene scaffold(s) only | One or more multi-gene scaffold(s) | |
|---|---|---|---|
| No. of genomes | 31 | 15 | 14 |
| Average coverage | 46x | 59x | 92x |
| Average scaffold N50 | 931,132 | 4,481,081 | 7,011,777 |
| Average no. of scaffolds | 67,526 | 47,461 | 47,280 |
The table shows the association between the quality of avian genomes and the assembly and annotation of the avian foxp3 gene neighbourhood (hsd17b10, naa10, tspyl2, ppp1r3f, ccdc22 and cacna1f). The better the genome quality in the foxp3 region the easier it is to annotate genes and assemble multi-gene scaffolds (in the absence of a reference sequence). Summary data was compiled from all NCBI scaffold and chromosome assembled genomes (S2 Table).
Fig 2Ground tit foxp3 transcriptome coverage.
The 11 numbered exons of the putative ground tit foxp3 sequence annotated with the sequence regions that encode ground tit FoxP3 functional regions. A collection of reads with 100% identity (shown in green) were found with a BLASTN search of ground tit RNA-Seq data from the muscle in the SRA (SRX246872; S2 File).
Fig 3A summary of the structural domains and regions of FoxP3 in different clades of the Animal Kingdom.
Fig 4Alignment of the forkhead domain of FoxP3 and FoxP3-like proteins, highlighting mammalian and avian signature residues.
The consistent exon boundaries in the ground tit and peregrine falcon were used to curate the gap within the sequence of the saker falcon. The mouse sequence is used as the baseline sequence and only amino acids in other sequences that differ are shown.
Fig 5The distinct forkhead domain of FoxP3.
The FoxP3 forkhead domain of the mouse, alligator and ground tit, aligned to FoxP1, FoxP2 and FoxP4 paralogues in the same species, demonstrating its divergent evolution and unique signature residues. The mouse FoxP3 sequence is used as the baseline sequence and only amino acids in other sequences that differ are shown.