| Literature DB >> 28077068 |
Vanessa Romero1,2, Kazuyoshi Hosomichi2,3, Hirofumi Nakaoka1,2, Hiroki Shibata4, Ituro Inoue5,6.
Abstract
BACKGROUND: The evolutionary dynamics of repeat sequences is quite complex, with some duplicates never having differentiated from each other. Two models can explain the complex evolutionary process for repeated genes-concerted and birth-and-death, of which the latter is driven by duplications maintained by selection. Copy number variations caused by random duplications and losses in repeat regions may modulate molecular pathways and therefore affect phenotypic characteristics in a population, resulting in individuals that are able to adapt to new environments. In this study, we investigated the filaggrin gene (FLG), which codes for filaggrin-an important component of the outer layers of mammalian skin-and contains tandem repeats that exhibit copy number variation between and within species. To examine which model best fits the evolutionary pathway for the complete tandem repeats within a single exon of FLG, we determined the repeat sequences in crab-eating macaque (Macaca fascicularis), orangutan (Pongo abelii), gorilla (Gorilla gorilla), and chimpanzee (Pan troglodytes) and compared these with the sequence in human (Homo sapiens).Entities:
Keywords: Birth-and-death evolution; Copy number variation; Duplication/loss; Filaggrin
Mesh:
Substances:
Year: 2017 PMID: 28077068 PMCID: PMC5225520 DOI: 10.1186/s12862-016-0851-5
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Partial and complete repeat sequences of FLG in human, chimpanzee, gorilla, orangutan, and crab-eating macaque. Left shows the repeat order of FLG in these primates based on the sequences from the National Center for Biotechnology Information database; the gorilla sequence had an unframed repeat at the end of the sequence, and the orangutan sequence had unframed repeats in the middle. Right shows the repeat order of FLG sequences acquired through the use of both PacBio RSII and MiSeq in this study. Black = partial repeats, white = complete repeats, and gray = regions with gaps. The nucleotide length of each repeat is shown beneath each repeat. The number of repeats varies between species
Fig. 2Maximum likelihood tree reconstruction using all complete repeats in these primates and the following parameters: partial deletion option, Tamura-Nei model with gamma distribution and invariable sites, nearest-neighbor-interchange heuristic method, 1000 bootstrap resampling, and a cutoff value of 50%. Bootstrap values are shown at the beginning of each branch. Crab-eating macaque was used as an outgroup. Crab-eating macaque repeats and orangutan repeats grouped into their own clusters (“crab-eating macaque cluster” and “orangutan cluster,” respectively). By contrast, the gorilla repeats, chimpanzee repeats, and human repeats are scattered across the “gorilla/chimpanzee/human cluster.” We divided the “gorilla/chimpanzee/human cluster” into subclusters in accordance with the reconciled tree reconstruction shown in Fig. 3
Fig. 3“Reconciled” trees using complete repeats of FLG in human, chimpanzee, gorilla, orangutan, and crab-eating macaque. a “Reconciled” gene tree indicating duplication (red squares) and loss events (light gray italics) from the most common ancestor of these primates: the crab-eating macaque repeats duplicated 11 times, orangutan repeats duplicated 8 times, and gorilla/chimpanzee/human repeats duplicated 18 times, and in human, the counterpart to chimpanzee-repeat 9 was lost, while in chimpanzee, the counterparts to human-repeats 10 and 8 were lost, and in gorilla, the counterpart to human-repeat 2, 9 and 8 and chimpanzee-repeat 8 was lost. b “Reconciled” species tree indicating duplication and loss events in each species from their most common ancestor: crab-eating macaque repeats duplicated 11 times, orangutan repeats duplicated 8 times, gorilla repeats duplicated 6 times, and chimpanzee repeats duplicated 3 times, while the most common ancestor between human and chimpanzee duplicated 6 times, and the most common ancestor between human, chimpanzee, and gorilla duplicated 4 times; and 1 repeat was lost in gorilla, 2 repeats were lost in chimpanzee, and 1 repeat was lost in human. The number of repeats found in each species is provided in parentheses
Comparison of all repeat sequences of the filaggrin gene within five primate species
| Species | No. of repeats | Total no. variation sites | Average (1 - Ka/Ks) × 100a | Nucleotide variation |
|---|---|---|---|---|
| Human | 10 | 195 | 31.49 | 0.077 |
| Chimpanzee | 10 | 189 | 38.91 | 0.075 |
| Gorilla | 10 | 140 | 29.90 | 0.061 |
| Orangutan | 9 | 198 | 33.19 | 0.077 |
| Crab-eating macaque | 12 | 69 | 70.96 | 0.032 |
a(1 - Ka/Ks) × 100 measures the percentage of purifying selection
Positively selected codons and branches of the filaggrin gene in five primate species
| Site-based test | ||
| Models compared |
| Positively selected codons |
| M1a vs M2aa | 1.40 E-42 | 21, 24, 26, 75, 99, 110, 135, 144, 150, 157, 178, 187, 190, 191, 224, 226, 228, 231, 252, 268, 269, 305, 323 |
| M7 vs M8a | 1.05 E-50 | 4, 21, 24, 26, 75, 99, 110, 127, 135, 144, 150, 152, 157, 178, 187, 190, 191, 205, 224, 226, 228, 231, 245, 252, 268, 269, 305, 309, 320, 323 |
| Branch-based test | ||
| Models compared |
| |
| M0 vs free ratiob | 0.13 | – |
| Branch-site test | ||
| M0N0 vs M2N2c |
| Positively selected codons |
| Crab-eating macaque cluster | 1.27 E-60 | No sites |
| Orangutan cluster | 2.54 | 26, 119, 160, 177, 179, 207, 250, 284 |
| Gorilla/chimpanzee/human cluster | 1.80 E-93 | 24, 26, 78, 99, 110, 114, 134, 135, 144, 150, 152, 157, 169, 178, 187, 191, 224, 228, 231, 245, 268, 320, 323 |
aThe site-based test compared the M1a (nearly neutral) and M2a (positive selection) models and the M7 (Beta) and M8 (Beta and ω) models
bThe branch-based test compared the free ratio (or independent ω per branch) model with the one-ratio null (or one ω for all branches) model (M0)
cThe branch-site test was used to detect positively selected codons on a specific branch
Fig. 4“Divergence” tree reconstruction using all complete repeats of FLG in five primate species. The following parameters were used: the site model TN93, a substitution rate with gamma distribution = 4, a log-normal relaxed clock, and the birth-and-death model for all three nucleotide positions. The x-axis scale is time in Mya. The crab-eating macaque ancestor repeat diverged around 28 Mya, while the orangutan ancestor repeat diverged around 21 Mya. The gorilla/chimpanzee/human repeats duplicated during the last 9 Mya, while the crab-eating macaque repeats duplicated in the last 5.5 Mya
Fig. 5Repeat order of gorilla, chimpanzee, and human FLG repeats, as inferred by phylogenetic and “reconciled” tree. Gorilla duplicated repeats “C,” “D,” and “G,” chimpanzee and human duplicated repeat “A,” and human duplicated repeat “B” are shown. In chimpanzee, the counterpart to human-repeats B-2 has been lost, while in human, the counterpart to chimpanzee-repeat “I” has been lost, and in chimpanzee, the counterparts to human and gorilla-repeat H have been lost. In gorilla, the counterpart to human and chimpanzee repeats B has been lost