| Literature DB >> 31426485 |
Yi Sun1, Chao Liu2,3, Moli Huang2, Jian Huang2, Changhong Liu2, Jiguang Zhang2,3, John H Postlethwait4, Han Wang5,6.
Abstract
Circadian rhythms are biological rhythms with a period of approximately 24 h. While canonical circadian clock genes and their regulatory mechanisms appear highly conserved, the evolution of clock gene families is still unclear due to several rounds of whole genome duplication in vertebrates. The spotted gar (Lepisosteus oculatus), as a non-teleost ray-finned fish, represents a fish lineage that diverged before the teleost genome duplication (TGD), providing an outgroup for exploring the evolutionary mechanisms of circadian clocks after whole-genome duplication. In this study, we interrogated the spotted gar draft genome sequences and found that spotted gar contains 26 circadian clock genes from 11 families. Phylogenetic analysis showed that 9 of these 11 spotted gar circadian clock gene families have the same number of genes as humans, while the members of the nfil3 and cry families are different between spotted gar and humans. Using phylogenetic and syntenic analyses, we found that nfil3-1 is conserved in vertebrates, while nfil3-2 and nfil3-3 are maintained in spotted gar, teleost fish, amphibians, and reptiles, but not in mammals. Following the two-round vertebrate genome duplication (VGD), spotted gar retained cry1a, cry1b, and cry2, and cry3 is retained in spotted gar, teleost fish, turtles, and birds, but not in mammals. We hypothesize that duplication of core clock genes, such as (nfil3 and cry), likely facilitated diversification of circadian regulatory mechanisms in teleost fish. We also found that the transcription factor binding element (Ahr::Arnt) is retained only in one of the per1 or per2 duplicated paralogs derived from the TGD in the teleost fish, implicating possible subfuctionalization cases. Together, these findings help decipher the repertoires of the spotted gar's circadian system and shed light on how the vertebrate circadian clock systems have evolved.Entities:
Keywords: circadian clocks; conserved synteny; functional divergence; genome duplication; spotted gar
Mesh:
Substances:
Year: 2019 PMID: 31426485 PMCID: PMC6723592 DOI: 10.3390/genes10080622
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Circadian clock genes in spotted gar [16].
| Gene Names | Ensembl Gene ID | Protein Length (aa) | Genome Location |
|---|---|---|---|
|
| ENSLOCG00000003999 | 678 | Chromosome LG27: 8,317,763–8,344,549 |
|
| ENSLOCG00000015224 | 639 | Chromosome LG8: 3,189,570–3,226,867 |
|
| ENSLOCG00000014043 | 744 | Chromosome LG4: 72,323,329–72,339,605 |
|
| ENSLOCG00000014750 | 886 | Chromosome LG7: 42,295,248–42,323,335 |
|
| ENSLOCG00000015272 | 647 | Chromosome LG8: 4,053,475–4,074,670 |
|
| ENSLOCG00000011417 | 675 | Chromosome LG3: 32,901,867–32,938,020 |
|
| ENSLOCG00000014655 | 569 | Scaffold JH591436.1: 96,765–111,323 |
|
| ENSLOCG00000011465 | 586 | Chromosome LG3: 33,014,383–33,053,389 |
|
| ENSLOCG00000013344 | 1445 | Chromosome LG2: 58,185,728–58,201,428 |
|
| ENSLOCG00000004441 | 1385 | Chromosome LG14: 7,862,125–7,881,568 |
|
| ENSLOCG00000002607 | 1165 | Chromosome LG25: 4,785,705–4,800,981 |
|
| ENSLOCG00000011701 | 273 | Chromosome LG12: 35,099,178-35,103,163 |
|
| ENSLOCG00000010962 | 409 | Chromosome LG5: 27,670,723–27,673,540 |
|
| ENSLOCG00000015327 | 422 | Chromosome LG8: 4,744,466–4,746,682 |
|
| ENSLOCG00000008217 | 443 | Chromosome LG2: 25,281,929–25,283,356 |
|
| ENSLOCG00000018299 | 544 | Chromosome LG6: 17,036,778–17,038,412 |
|
| ENSLOCG00000018298 | 394 | Chromosome LG6: 17,009,772–17,010,956 |
|
| ENSLOCG00000006223 | 362 | Chromosome LG4: 15,608,213–15,662,480 |
|
| ENSLOCG00000006818 | 604 | Chromosome LG11: 20,445,844–20,461,290 |
|
| ENSLOCG00000011595 | 323 | Chromosome LG12: 34,848,929–34,860,886 |
|
| ENSLOCG00000012233 | 298 | Chromosome LG10: 33,240,269–33,260,238 |
|
| 69 | Scaffold JH591448.1:146984–147190 | |
|
| ENSLOCG00000014779 | 519 | Chromosome LG3: 53,154,612–53,409,505 |
|
| ENSLOCG00000009712 | 462 | Chromosome LG2: 34,273,732–34,319,408 |
|
| ENSLOCG00000006502 | 467 | Chromosome LG19: 9,503,786–9,519,701 |
|
| ENSLOCG00000004180 | 1225 | Chromosome LG4: 11,892,308–11,918,271 |
Figure 1The phylogenetic trees of circadian clock genes. (A) bmal genes; (B) clock genes; (C) per genes; (D) csnk1e genes; (E) tim genes; (F) dec genes; (G) nr1d genes; (H) ror genes. The trees were constructed by the neighbor-joining method with MEGA6 [33], and the numbers on the nodes are percent bootstrap values based on 1000 pseudoreplications. The outgroups are ortholog genes in fly or lamprey. Red dots indicate spotted gar genes, blue dots indicate zebrafish genes, and yellow dots indicate human genes. The sequences used are listed in Supplementary Table S1.
Figure 2The phylogenetic and conserved syntenic analyses of the nfil3 family. (A) Phylogenetic trees based on sequences of nfil3 genes. The tree was constructed by the neighbor-joining method with MEGA6 [33], and the numbers on the nodes are the percent bootstrap values based on 1000 pseudoreplications. The outgroup is fly vri. Red dots indicate spotted gar genes, blue dots indicate zebrafish genes, and yellow dots indicate human genes. (B) The conserved synteny surrounding nfil3-1 genes in the chromosomes of lampreys, coelacanths, spotted gar, zebrafish, medaka, fugu, frogs, lizards, chickens, ducks, mice and humans was displayed with Genomicus [35]. (C) The conserved synteny of nfil3-1b.1 and nfil3-1b.2 genes in the chromosomes of zebrafish, cave fish, tilapia, platyfish, medaka, tilapia, and fugu. (D) Conserved synteny of nfil3-2 and nfil3-3 genes in chromosomes of coelacanth, spotted gar, zebrafish, medaka, teraodon, fugu, tilapia, platyfish, frogs, lizards, mice, and humans. The chromosome number or linkage group number is shown to the right of the chromosomes. The same colored rectangles represent the orthologous genes. Red and blue boxes indicate the duplicated genes.
Figure 3The phylogenetic, conserved syntenic, and exon structural analyses of the cry family. (A) Phylogenetic trees based on sequences of cry genes. The tree was constructed by the neighbor-joining method with MEGA6, and the numbers on the nodes are the percent bootstrap values based on 1000 pseudoreplications. The outgroup is Drosophila melanogaster cry. Red dots indicate spotted gar genes, blue dots indicate zebrafish genes, and yellow dots indicate human genes. (B) The conserved synteny surrounding the cry1a, cry1b/cry3, and cry2 genes in chromosomes of coelacanth, spotted gar, zebrafish, medaka, teraodon, fugu, chickens, mice, and humans. The chromosome number or linkage group number is shown to the right of the chromosomes. The same colored rectangles represent the orthologous genes. Red and blue boxes indicate the duplicated genes. (C) The exonic structures of cry genes in spotted gar, medaka, fugu, chickens and humans. The boxes represent the exons. The size of each exon is drawn to scale.
Figure 4The phylogenetic and exon structural analyses of the par family. (A) Phylogenetic trees based on sequences of PAR family genes. The tree was constructed by the neighbor-joining method with MEGA6, and the numbers on the nodes are the percent bootstrap values based on 1000 pseudoreplications. The outgroup is Drosophila melanogaster pdp1. Red dots indicate spotted gar genes, blue dots indicate zebrafish genes, and yellow dots indicate human genes. (B) The exonic structures of par family genes in spotted gar, zebrafish, and humans. The boxes represent the exons. The size of each exon is drawn to scale.
Figure 5The transcription factor binding element (Ahr::Arnt) in upstream regions of duplicated Per genes from spotted gar to mammals. The complementary presence of the Ahr::Arnt motif in duplicated Per1 (A) or Per2 (B) paralog genes provides possible cases predicted by the duplication-degeneration-complementation (DDC) model. The upstream region of each gene is 3000 bp in length, with the exception of that of human PER1, which is 2642 bp only. The transcription factor binding elements were predicted using the JASPAR dataset, with a profile score threshold of 99%. The bisque line represents duplication in the lineage, and the dotted line separates the duplicated genes and unduplicated genes.
Figure 6A hypothetical model for the evolution of nfil3 genes in vertebrates. The teleost fish and tetrapod nfil3 had a common ancestor. The ancestral nfil3 gave rise to nfil3-1/2 and nfil3-3/4 by tandem duplication. During the second round of vertebrate genome duplication, the cluster was duplicated to nfil3-1, nfil3-2, nfil3-3, and nfil3-4 genes, with the quick loss of nfil3-4. The nfil3-1 is conserved in all vertebrates, with the exception of losses in a few teleost fish. The nfil3-1b.1 and nfil3-1b.2 linkage has been preserved only in teleost fish. The nfil3-2 and nfil3-3 linkage was conserved in the teleost fish and tetrapod lineages. The teleost-specific genome duplication produced the nfil3-2a/nfil3-2b duplicate, which was observed in zebrafish, medaka, tetrodon, tilapia, platyfish, and cave fish; and the nfil3-3a/nfil3-3b duplicate, which was observed in zebrafish, tilapia, and platyfish. However, amphibians retained only nfil3-3 and reptiles retained only nfil3-2. Then, Nfil3-2 and Nfil3-3 were completely lost in the lineages of mammals and birds during evolution. The grey box means gene loss.