| Literature DB >> 25015101 |
Naoual Azzouzi, Frederique Barloy-Hubler, Francis Galibert1.
Abstract
BACKGROUND: To help understand the molecular mechanisms underlying the remarkable phenotypic diversity displayed by cichlids, the genome sequences of O. niloticus, P. nyererei, H. burtoni, N. brichardi and M. zebra were recently determined. Here, we present the contents of the olfactory receptor (OR) repertoires in the genomes of these five fishes.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25015101 PMCID: PMC4122780 DOI: 10.1186/1471-2164-15-586
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
OR genes identified in the five cichlid and five fish model genomes
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 coding exon | 146 | 78 | 94 | 62 | 81 | 143 [ | 78(a) | 73(a) | 40 [ | 42(a) |
| >1 coding exon | 12 | 12 | 8 | 7 | 7 | |||||
| Pseudo | 6 | 6 | 11 | 12 | 8 | 10 | 46 | 28 | 54 | |
| +1f | +3e | +2e | +1e | +3e | ||||||
| +1 s | ||||||||||
| Edge | 100 | 50 | 28 | 36 | 32 | |||||
| +1 s | +1 s | |||||||||
| Fragment | 0 | 1 | 0 | 3 | 0 |
(a)From a larger set of OR sequences retrieved from ENSEMBL and GENBANK, we characterized a subset of true OR genes by multiple alignment of AA sequences, phylogenic tree construction and BLAST analysis. DNA samples used by the BROAD institute to determine the genomic sequences were for each species extracted from a single fish with 2 N chromosomes.
Distribution of pseudogenes in the five cichlids
|
|
|
|
|
| |
|---|---|---|---|---|---|
| Frameshift | 4 | 8 | 9 | 6 | 6 |
| In frame stop codon | 3 | 5 | 3 | 7 | 3 |
Distribution of OR into families and subfamilies
| Cichlids | Fish models | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
| A1 | 3(e2,p1) | 6(e2,p2,f1) | 6(e2,p1) | 14(e8,p1) | 8(e1,p2) | 2 | ||||
| A2 | 3(e1,p1) | 3(e2,p1) | 5(e2) | 4(e2) | 6(e2) | |||||
| A3 | 1(p1) | 2(e1) | 3 | 2(e1) | 2(e1) | |||||
| A4 | 1(p1) | 2 | 2 | 2(e2) | 2 | 1 | 8 | |||
| A5 | 3(e2,p3) | 4(e1) | 4(p1) | 4(e1) | 5(pe1) | 10 | 2 | |||
| A6 | (e1) | 1(p1) | 1 | 1 | 4 | |||||
| A7 | 3 | 3 | ||||||||
| A8 | 1 | |||||||||
| B1 | 1 | 1 | 1(e1) | 1 | ||||||
| C1 | 1 | 1 | 1 | (e1) | 1 | 1 | 1 | |||
| D1 | 3(e2) | 4(e6) | 7(e3,p3) | 11(e14) | 7(p2) | |||||
| D2 | 6 | 2 | 1 | |||||||
| E1 | 7(e3) | 12(e1,pe1) | 8(e2,pe1) | 13(e5,p2) | 7(e7,pe1) | 5 | 3 | 1 | ||
| E2 | (e1) | 2 | 2 | 3 | 2 | 2 | 1 | 3 | ||
| F1 | 12 | |||||||||
| F2 | 1 | |||||||||
| F3 | 1 | |||||||||
| F4 | (pe1) | 1 | 1 | 1 | 1 | 1 | 1 | 3 | ||
| F5 | 1 | 1 | (p1) | 1 | 1 | 3 | 1 | 4 | ||
| G1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| H1 | 6 | |||||||||
| H2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | ||
| H3 | 2(e2) | 5 | 6 | 6(e3) | 3(e1,p1) | 1 | 2 | |||
| H4 | 5 | 3 | ||||||||
| H5 | 2(e1) | (e3) | 4(e4) | 4 | 4 | 2 | ||||
| H6 | 2(e2,p1,f1) | 3(e2,p1) | 4(e1,p1) | 10(e1,p2) | 4(e1) | 7 | ||||
| H7 | 1 | 2 | ||||||||
| I1 | 1 | 1 | 3 | 1 | 2 | 2 | 1 | |||
| J1 | 3 | |||||||||
| J2 | 1 | (e1) | 1 | 1 | 1 | 1 | 2 | 1 | ||
| J3 | 1(e1) | (e2) | 1(e1) | (e1) | 1 | 3 | 2 | 1 | 1 | |
| J4 | 1 | |||||||||
| J5 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | |
| K1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| K2 | 6 | |||||||||
| K3 | 6 | |||||||||
| K4 | 1 | 1(e1) | 2 | 2 | 1(e1) | 2 | 2 | 1 | ||
| K5 | (e1) | 1(e2) | 2(e1,ps1) | 4 | 2(p1) | 1 | 4 | 3 | ||
| K6 | 3 | |||||||||
| K7 | 1(e2) | |||||||||
| K8 | (e1) | 1 | ||||||||
| L1 | 12 | |||||||||
| L2 | 5(e2,f1) | 4(e6,p1) | 5(e4,pe1) | 16(e15,p1) | 6(e2) | |||||
| M1 | 3 | |||||||||
| M2 | (e1) | (e3) | 2 | (e2) | 1 | 5 | 2 | 2 | 1 | |
| N1 | 2 | |||||||||
| N2 | 2(e2) | 2(e1) | 2(e1) | 1(e3) | 2(p1) | 11 | 9 | 1 | 1 | |
| N3 | 1(e3,p2) | 1(e5,p1) | 2(p1) | 3(e4) | 6(p1) | |||||
| N4 | 11 | |||||||||
| N5 | 12 | 1 | ||||||||
| N6 | 3 | 2(pe1) | 2(e1) | 7(e2) | 1(e3) | 1 | ||||
| O1 | 1 | |||||||||
| O2 | 1 | |||||||||
| O3 | 2 | |||||||||
| O4 | 5 | |||||||||
| O5 | 2 | 1(e1) | 2(e1) | 2(e4) | 1(e4) | 3 | ||||
| O6 | 5 | |||||||||
| O7 | 1(e1) | (e1) | (e4) | 1 | 1 | 2 | 2 | 1 | ||
| O8 | 1(e1) | (e2) | 1 | 1(e1) | 1 | 1 | 1 | |||
| O9 | (e1) | (e1) | ||||||||
| P1 | 1 | |||||||||
| P2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| P3 | 2 | 2 | (e2) | 4(e1) | 2 | 2 | ||||
| P4 | 3 | |||||||||
| Q1 | 1 | |||||||||
| Q2 | 1 | 1 | 1 | (p1) | 1 | 1 | ||||
| R1 | 1 | |||||||||
| R2 | 1(e1) | 1 | 1 | |||||||
| R3 | 2 | 4 | ||||||||
| R4 | (e2,p1) | 4(e1) | (e2) | 9(e8) | 3(p1) | 3 | ||||
| R5 | 1 | (e3) | ||||||||
| S1 | 2 | |||||||||
| S2 | 1(e3,p1) | 3(e1) | 1(e2) | 11(e2) | 3(e2,p1) | 2 | ||||
| S3 | 2 | 2(e1) | 1(e1) | 4(e1) | 1(e2) | 3 | 1 | |||
| S4 | 1 | 1 | ||||||||
| T1 | 1 | |||||||||
| T2 | 1 | |||||||||
| T3 | 6 | |||||||||
| T4 | 1 | (pe1) | ||||||||
| U1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| V1 | 1 | 1 | (e2) | 1 | 1 | |||||
| W1 | 1 | |||||||||
| W2 | 2 | |||||||||
| W3 | 1 | 1 | 1 | 1 | ||||||
| W4 | 1 | |||||||||
| W5 | 1 | 1(e1) | ||||||||
| W6 | 1 | |||||||||
| W7 | 2 | 2 | 2 | 4(e2) | 1 | 1 | 1 | |||
| W8 | 1 | 1 | 1 | 1 | 1 | |||||
| W9 | 1(e1) | 1 | (e1) | 1 | 1 | |||||
| X1 | 1(e1) | (e1) | (e2) | (e1) | ||||||
| Y1 | 1 | 1 | ||||||||
| Z1 | 1 | |||||||||
| Z2 | 1 | |||||||||
| AB1 | 1 | |||||||||
| AB2 | (f1) | 1 | 1 | (p1) | 4 | |||||
| AC1 | (e3) | |||||||||
| AD1 | 1 | |||||||||
| AD2 | 1 | |||||||||
| AE1 | 1 | |||||||||
| AE2 | 2 | |||||||||
| AF1 | 5 | |||||||||
| AG1 | 5 | |||||||||
| AG2 | 3 | |||||||||
| AH1 | 1 | |||||||||
| AI1 | 1 | |||||||||
| AJ1 | 5 | |||||||||
| AK | 9 | |||||||||
| AL | 1 | |||||||||
| AM | 1 | |||||||||
| AN | 2 | |||||||||
|
| 69 | 90 | 88 | 158 | 102 |
|
|
|
|
|
|
|
|
|
|
| ||||||
| S/F |
|
|
|
|
|
|
|
|
|
|
| F |
|
|
|
|
|
|
|
|
|
|
ORs were classified into families and subfamilies according to the phylogenetic tree and the percentage of AA identity calculated by MAFFT alignments. Families (F) were named by letters and subfamilies (S/F) by Arabic numbers (left column). For example, A1 (family A subfamily 1) contains 3 complete genes, 2 edge genes (e2) and 1 pseudogene (p1) from N. brichardi. Of the 376 model fish OR (143 zebra fish/D. rerio, 78 stickleback/G. aculeatus, 73 medaka/O. latipes, 40 fugu/T. rubripes and 42 tetraodon/T. nigroviridis – for more details see Additional file 4) 182 were in subfamilies also containing one of more cichlid ORs.
As shown in this Table the 143 zebrafish ORs are distributed into 37 sub-families and 22 families. A similar number of sub-families was reported by Alioto and Ngai [18] analyzing the same set of ORs, however they described height families only, four of them corresponding to several families in our study. Correspondences between the families in [18] and the families described in this work are as follow: Families A, B, C and G described in [18] correspond to families P, AB1, O and L respectively (this work); Families D [18] correspond to AH, M and N; Family E [18] corresponds to families F, H, AD, AE, AF, AG; Family F [18] corresponds to families K and J; Family H [18] corresponds to families S, T, U, AJ, AK, AL and A.
Figure 1Phylogenetic tree for the cichlid and fish model ORs (see also Table 1 and Additional file 2 ). Family names A to AN are alternately coloured in red and blue and similarly sub-families designated by Arabic numbers are coloured in green and purple.
Distribution of OR gene pairs, triplets and quadruplets sharing a strong percentage level of nucleotide and AA identities
| Pairs | Bur | Zeb | Bri | Nye | Til | |
|---|---|---|---|---|---|---|
| Bur | 2 | 9 | 0 | 11 | 0 | |
| Zeb | 0 | 0 | 10 | 0 | ||
| Bri | 0 | 1 | 0 | |||
| Nye | 0 | 0 | ||||
| Til | 7 | |||||
|
| Bur | Zeb | Nye | 37 | ||
| Bur | Bur | Nye | 1 | |||
| Til | Til | Til | 1 | |||
|
| Bur | Zeb | Nye | Bri | 2 | |
| Bur | Zeb | Nye | Til | 1 | ||
| Til | Til | Til | Til | 1 |
Olfactory receptors sharing 99% of AA identity were identified from the phylogenetic tree. The greatest numbers of pairs or triplets were found between H. burtoni, M. zebra and P. nyererei, in agreement with their closer phylogenetic relatedness. In O. niloticus 7 pairs, 1 triplet and 2 quadruplets of paralogous genes were identified consistent with this repertoire having undergone a higher level of duplication. The list of genes is shown in Additional file 5.
dN/dS ratios for the various OR gene pairs identified in 14 families
| Panel a | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Family names | Number of sub-families | Number of genes | Means | Min. | Max. | |||||||||
| Fam A | 6 | 100 | 0.40±0.09 | 0.00 | >10 | |||||||||
| Fam D | 1 | 32 | 0.44±0.10 | 0.15 | 1.30 | |||||||||
| Fam E | 2 | 56 | 0.40±0.11 | 0.10 | 2.27 | |||||||||
| Fam G | 1 | 5 | 0.28±0.10 | 0.18 | 0.41 | |||||||||
| Fam H | 4 | 50 | 0.41±0.14 | 0.00 | 1.14 | |||||||||
| Fam I | 1 | 6 | 0.43±0.29 | 0.00 | >10 | |||||||||
| Fam K | 4 | 22 | 0.29±0.18 | 0.12 | >10 | |||||||||
| Fam L | 2 | 36 | 0.50±0.12 | 0.04 | 1.20 | |||||||||
| Fam N | 5 | 37 | 0.39±0.14 | 0.18 | 1.79 | |||||||||
| Fam O | 3 | 14 | 0.44±0.10 | 0.12 | 0.83 | |||||||||
| Fam P | 3 | 15 | 0.37±0.10 | 0.07 | 0.86 | |||||||||
| Fam R | 4 | 20 | 0.43±0.08 | 0.19 | 0.88 | |||||||||
| Fam S | 2 | 26 | 0.39±0.09 | 0.14 | 1.19 | |||||||||
| Fam W | 5 | 24 | 0.32±0.13 | 0.00 | 1.48 | |||||||||
|
| ||||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| dN/dS | 11 | 1 | 4 | 0 | 9 | 2 | 2 | 4 | 5 | 5 | 0 | 0 | 1 | 2 |
| dN/dS | 2 | 0 | 0 | 0 | 5 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| ||||||||||||||
|
| ||||||||||||||
| bri/bri | 0.436 | bri/cich | 0.446 | |||||||||||
| bur/bur | 0.385 | bur/cich | 0.411 | |||||||||||
| zeb/zeb | 0.439 | zeb/cich | 0.432 | |||||||||||
| nye/nye | 0.422 | nye/cich | 0.447 | |||||||||||
| til/til | 0.451 | til/cich | 0.432 | |||||||||||
|
| ||||||||||||||
| bri/bri | 0.380 | bri/cich | 0.374 | |||||||||||
| bur/bur | 4.414 | bur/cich | 0.408 | |||||||||||
| zeb/zeb | 0.378 | zeb/cich | 0.382 | |||||||||||
| nye/nye | 0.440 | nye/cich | 0.416 | |||||||||||
| til/til | 0.396 | til/cich | 0.382 | |||||||||||
|
| ||||||||||||||
| bri/bri | 0.399 | bri/cich | 0.407 | |||||||||||
| bur/bur | 0.448 | bur/cich | 0.367 | |||||||||||
| zeb/zeb | 0.407 | zeb/cich | 0.399 | |||||||||||
| nye/nye | 0.431 | nye/cich | 0.398 | |||||||||||
| til/til | 0.414 | til/cich | 0.399 | |||||||||||
|
| ||||||||||||||
| bur/bur | 0.503 | bur/cich | 0.527 | |||||||||||
| bri/bri | 0.507 | bri/cich | 0.523 | |||||||||||
| til/til | 0.494 | til/cich | 0.480 | |||||||||||
| zeb/zeb | 0.490 | zeb/cich | 0.478 | |||||||||||
| nye/nye | 0.464 | nye/cich | 0.496 | |||||||||||
|
| ||||||||||||||
| bur/bur | 0.328 | bur/cich | 0.363 | |||||||||||
| bri/bri | 0.359 | bri/cich | 0.375 | |||||||||||
| til/til | 0.425 | til/cich | 0.401 | |||||||||||
| zeb/zeb | 0.446 | zeb/cich | 0.426 | |||||||||||
| nye/nye | 0.360 | nye/cich | 0.426 | |||||||||||
dN/dS ratios were calculated for each pair of OR genes identified in the 14 families with 4 or more genes (panel a). The numbers of OR pairs per family with a dN/dS ratio above 1 are indicated in panel b. For those in which dS was 0, the dN/dS was arbitrarily given the value >10. In panel c, dN/dS ratios of pairs of paralogous genes (columns 2 and 6) were compared with the ratios of pairs of orthologous genes (columns 4 and 8).
dN/dS ratios for various OR protein domains
| Family D | Entire molecule | ||||
|---|---|---|---|---|---|
|
|
| TM1 | 0.28±0.18 | IN1 | 0.36±0.33 |
| TM2 | 0.24±0.23 | IN2 | 0.69±0.72 | ||
| TM3 | 0.64±0.46 | IN3 | 0.26±0.26 | ||
| TM4 | 0.83±1.01 | OUT1 | 0.22±0.22 | ||
| TM5 | 0.58±0.39 | OUT2 | 1.14±1.06 | ||
| TM6 | 0.42±0.41 | OUT3 | 0.80±0.56 | ||
| TM7 | 0.36±0.35 | ||||
|
|
| ||||
|
|
| TM1 | 0.57±0.27 | IN1 | 0.24±0.01 |
| TM2 | 0.52±0.48 | IN2 | 0.42±0.37 | ||
| TM3 | 0.73±0.79 | IN3 | 0.32±0.37 | ||
| TM4 | 0.69±0.39 | OUT1 | 0.43±0.24 | ||
| TM5 | 0.70±0.44 | OUT2 | 0.30±0.23 | ||
| TM6 | 0.23±0.21 | OUT3 | 0.41±0.38 | ||
| TM7 | 0.21±0.12 | ||||
|
|
| ||||
|
|
| TM1 | 0.56±0.47 | IN1 | 0.35±0.24 |
| TM2 | 0.24±0.22 | IN2 | 0.23±0.15 | ||
| TM3 | 0.48±0.43 | IN3 | 0.34±0.24 | ||
| TM4 | 0.53±0.39 | OUT1 | 0.39±0.51 | ||
| TM5 | 0.76±0.68 | OUT2 | 0.64±0.36 | ||
| TM6 | 0.84±0.51 | OUT3 | 0.67±0.72 | ||
| TM7 | 0.27±0.29 | ||||
|
|
| ||||
|
|
| TM1 | 0.56±0.42 | IN1 | 0.23±0.11 |
| TM2 | 0.59±0.58 | IN2 | 0.41±0.38 | ||
| TM3 | 0.76±0.43 | IN3 | 0.59±0.33 | ||
| TM4 | 1.48±1.54 | OUT1 | 0.29±0.34 | ||
| TM5 | 0.63±0.36 | OUT2 | 0.35±0.22 | ||
| TM6 | 0.72±0.42 | OUT3 | 0.40±0.34 | ||
| TM7 | 0.56±0.58 | ||||
|
|
| ||||
|
|
| TM1 | 0.45±0.35 | IN1 | 0.57±0.35 |
| TM2 | 0.79±0.37 | IN2 | 0.60±0.55 | ||
| TM3 | 0.68±0.57 | IN3 | 0.41±0.33 | ||
| TM4 | 0.95±0.54 | OUT1 | 0.35±0.27 | ||
| TM5 | 0.56±0.41 | OUT2 | 0.24±0.11 | ||
| TM6 | 0.60±0.41 | OUT3 | 0.40±0.13 | ||
| TM7 | 1.20±1.07 |
dN/dS ratios for the 7 TM regions, and the 3 external and 3 internal loops for the 4 largest families were calculated. TM regions and loops were identified with PolyPhobius.
Figure 2WebLogo graphical representation of the 5 most significant motifs identified by MEME in cichlid and OR repertoires and located at particular positions: motif 1: internal loop 2, motif 2: internal loop 1, motif 3: TM7-intracellular extension and motif 4: internal loop 3. Motif 5 is not well conserved and its position differs between fish species.
Figure 3WebLogo representation of the AA conservation around the MAYDRY motif. Multiple alignment with MAFFT followed by PHYML clustering revealed two subgroups of cichlid OR: one with a classical MAYDRY motif followed by 3 cysteine residues indicated by a blue star; and a second with an altered MAYDRY motif in which the aspartate residue (D) is replaced by a glutamate residue (E). See Additional file 9 for the genes of each of these two groups and a complete alignment of their sequences.
Figure 4Sequence logo representation of donor and acceptor splice sites identified in cichlid OR genes aligned onto their cognate contigs and manually corrected using both MAFFT multiple alignment and the FSPLICE tool (Softberry, Fish model).
Intron positions within OR genes
| OR names | Last codon | Intron phase | Codon position | Intron position | |
|---|---|---|---|---|---|
|
| contig034988-NyeORs.A033 | GTC.AG | 2 | 159 | TM4 |
| contig050024-NyeORs.W129 | CAA.CAG | 0 | 50 | IN 1 | |
| contig050025-NyeORs.W131 | AAC.AAG | 0 | 50 | IN 1 | |
| contig050025-NyeORs.W130 | CAC.CAG | 0 | 50 | IN 1/TM2 | |
| contig050026-NyeORs.W132 | CAC.CAG | 0 | 52 | IN 1 | |
| contig090286-BriORs.W112 | CAA.CAG | 0 | 50 | IN 1 | |
| contig090288-BriORs.W113 | CAA.CAG | 0 | 50 | IN 1 | |
| contig090291-BriORs.W114 | AAC.AA | 2 | 41 | TM1 | |
| contig090292-BriORs.W115 | CAG.CAG | 0 | 52 | IN 1 | |
| contig090301-BriORs.W116 | TAT.CAG | 0 | 49 | IN 1 | |
| contig046002-ZebORs.K090 | AAG.TAT | 0 | 24 | N ter | |
| contig067811-ZebORs.W140 | CAA.CAG | 0 | 50 | IN 1 | |
| contig062664-ZebORs.W141 | AAA.CA | 2 | 43 | IN 1 | |
| contig025842-ZebORs.W142 | AAA.CAC | 0 | 51 | IN 1 | |
| contig025841-ZebORs.W139 | AGT.ATC | 0 | 52 | IN 1 | |
| contig045454-BurORs.W131 | CAA.CAG | 0 | 50 | IN 1 | |
| contig066785-BurORs.W148 | TAT.CAG | 0 | 49 | IN 1 | |
| contig045453-BurORs.W132 | CAC.CAG | 0 | 50 | IN 1 | |
| contig045452-BurORs.W133 | CAC.CAG | 0 | 52 | IN 1 | |
| contig045453-BurORs.W134 | AAC.AAG | 0 | 50 | IN 1 | |
| contig041638-BurORs.W135 | AAA.CA | 2 | 43 | IN 1 | |
| contig041640-BurORs.V144 | CGA.CAC | 0 | 59 | IN 1 | |
| contig049605-BurORs.AB153 | AAC.AGT | 0 | 77 | IN 1 | |
| contig046708-TilORs.K143 | AAG.TAT | 0 | 24 | N ter | |
| contig027203-TilORs.W238 | CAC.CAG | 0 | 50 | IN 1 | |
| contig027204-TilORs.W239 | AAC.CGG | 0 | 50 | IN 1 | |
| contig027206-TilORs.W240 | CAC.CAG | 0 | 50 | IN 1 | |
| contig027209-TilORs.W241 | CAC.CAG | 0 | 50 | IN 1 | |
| contig027202-TilORs.W243 | AAA.CAC | 0 | 51 | IN 1 | |
| contig046717-TilORs.AB275 | TAT.GTG | 0 | 72 | TM1 | |
| contig027194-TilORs.V262 | CGA.CAC | 0 | 59 | IN 1 | |
|
| contig046495-NyeORs.I079 | GAG.AGG | 0 | 121 | IN 2 |
| ACA.ATC | 0 | 232 | OUT3 | ||
| contig051999-NyeORs.U128 | TAT.CA | 2 | 15 | N ter | |
| CAC.CAG | 0 | 54 | IN 1 | ||
| contig090301-BriORs.U109 | TAT.CAG | 0 | 16 | N ter | |
| CAG.GAT | 0 | 56 | OUT1 | ||
| contig025847-ZebORs.U137 | TAT.CAG | 0 | 16 | N ter | |
| CAC.CAG | 0 | 54 | IN 1 | ||
| contig026932-ZebORs.I082 | GAC.AG | 2 | 125 | IN 2 | |
| GAC.ATC | 0 | 200 | OUT2-TM5 | ||
| contig048321-BurORs.I076 | GAC.AG | 2 | 125 | IN 2 | |
| ATC.TAT | 0 | 201 | OUT2 | ||
| contig041640-BurORs.U130 | TAT.CA | 2 | 15 | N ter | |
| CAC.CAG | 0 | 54 | IN 1 | ||
| contig046690-TilORs.I128 | GAC.AG | 2 | 120 | IN 2 | |
| AAC.AT | 2 | 194 | OUT2-TM5 | ||
| contig046694-TilORs.I129 | GAC.AG | 2 | 125 | IN 2 | |
| ATC.TAT | 0 | 201 | OUT2-TM5 | ||
| contig046695-TilORs.I130 | GAC.AG | 2 | 120 | IN 2 | |
| ATC.TAT | 0 | 196 | OUT2-TM5 | ||
| contig027194-TilORs.U236 | TAT.CA | 2 | 15 | N ter | |
| CAC.CAG | 0 | 54 | IN 1 | ||
|
| contig090302-BriORs.V122 | CAC.AG | 2 | 80 | IN 1 |
| CTT.CTG | 0 | 127 | OUT1 | ||
| GTG.CAG | 0 | 269 | TM6 | ||
| contig025847-ZebORs.V149 | CAC.AG | 2 | 62 | IN 1 | |
| CTT.CTG | 0 | 109 | OUT1 | ||
| GTG.CAG | 0 | 251 | TM6 | ||
| contig041641-BurORs.T129 | CCC.AG | 2 | 48 | IN 1 | |
| AAC.AAG | 0 | 96 | OUT1 | ||
| GTC.CAG | 0 | 184 | TM5 |
N ter: Extracellular end, IN: Internal loops, TM: Transmembrane region, OUT: External loops. OR belonging to the different cichlids are alternatively colored.
Figure 5Phylogenetic tree constructed with the cichlid OR repertoires (in blue) and 247 non-OR class A GPCRs (Additional file 1 ) (in red). This tree clearly shows that the cichlid ORs are clearly distinct from the non-OR class A GPCRs.
Distribution of OR genes with more than 1 coding exon among the families of OR
| Families | Sub-families | Genes numbers | Genes with >1 coding exon | Number of exons |
|---|---|---|---|---|
| Fam A | 6 | 100 | 1 | 2 |
| Fam K | 4 | 22 | 2 | 2 |
| Fam T | 1 | 2 | 1 | 4 |
| Fam U | 1 | 5 | 5 | 3 |
| Fam V | 1 | 4 | 4 | 2 genes/2 exons |
| 2 genes/4 exons | ||||
| Fam W | 5 | 24 | 24 | 2 |
| Fam AB | 1 | 4 | 2 | 2 |