| Literature DB >> 24571567 |
Heather E Machado, Ginger Jui, Domino A Joyce, Christian R L Reilly, David H Lunt, Suzy C P Renn1.
Abstract
BACKGROUND: Gene duplication is a source of evolutionary innovation and can contribute to the divergence of lineages; however, the relative importance of this process remains to be determined. The explosive divergence of the African cichlid adaptive radiations provides both a model for studying the general role of gene duplication in the divergence of lineages and also an exciting foray into the identification of genomic features that underlie the dramatic phenotypic and ecological diversification in this particular lineage. We present the first genome-wide study of gene duplication in African cichlid fishes, identifying gene duplicates in three species belonging to the Lake Malawi adaptive radiation (Metriaclima estherae, Protomelas similis, Rhamphochromis "chilingali") and one closely related species from a non-radiated riverine lineage (Astatotilapia tweddlei).Entities:
Mesh:
Year: 2014 PMID: 24571567 PMCID: PMC3944005 DOI: 10.1186/1471-2164-15-161
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Maximum likelihood phylogeny illustrating the positions of experimental (stars) and reference (circle) taxa. The maximum likelihood tree is based on 1785 bp mitochondrial ND2. Nodes not supported by 50% maximum likelihood SH values collapsed and Lake Victoria, Lake Makgadikgadi, and Lake Tanganyika radiations are represented by triangles. The tree is rooted with Oreochromis and the scale bar indicates the mean number of nucleotide substitutions per site (DRYAD doi:10.5061/dryad.7vs2c).
Genes duplicated relative to with informative BLAST hits
| CN4688282/2 | Adenine nucleotide translocator s598 | ns | ns | ns | 0.60 | 567 |
| 0.0019 | ||||||
| DY630000 | Alcohol dehydrogenase Class VI | ns | ns | 0.73 | ns | 379 |
| 0.0015 | ||||||
| DY630424 | Alkylated DNA repair protein alkB homolog 7 | ns | 0.43 | ns | ns | 304 |
| 0.0002 | ||||||
| DY629046 | Arsenic (+3 oxidation state) methyltransferase | ns | ns | ns | 1.06 | 150 |
| 0.0031 | ||||||
| DY626788 | ATPase, H + transporting, lysosomal V0 subunit E | ns | 0.76 | ns | ns | 87.8 |
| 0.0028 | ||||||
| DY628437 | Claudin 29a (cldn29a) gene | ns | 0.60 | ns | ns | 526 |
| 0.005 | ||||||
| DY632040 | Coiled-coil domain containing protein 80 | ns | ns | 1.19 | 2.13 | 434 |
| 3E-05 | 5E-07 | |||||
| DY629141 | Crystallin gamma M2b | ns | ns | ns | 0.43 | 829 |
| 0.0024 | ||||||
| DY6262041/2 | C-type lectin domain family 4 member C | ns | 0.38 | ns | ns | 246 |
| 0.0039 | ||||||
| DY631088 | Cystatin-B | 0.45 | ns | ns | ns | 150 |
| 0.0016 | ||||||
| DY630353 | Cytosolic sulfotransferase 3 | ns | ns | 0.62 | 0.64 | 713 |
| 0.0015 | 0.0013 | |||||
| CN470675 | Dazl gene | ns | ns | ns | 0.57 | 89.7 |
| 0.0040 | ||||||
| DY6299674/8 | Ferritin heavy subunit | ns | ns | ns | 0.82 | 1160 |
| 8E-05 | ||||||
| DY631817 | Fish virus induced TRIM protein | ns | ns | 0.590 | ns | 170 |
| 0.0005 | ||||||
| DY626596 | Fish virus induced TRIM protein | ns | ns | 0.41 | 0.44 | 145 |
| 0.0045 | 0.0030 | |||||
| DY628624 | Gamma M7 crystallin | ns | ns | ns | 0.42 | 169 |
| 0.0054 | ||||||
| DY630388 | Glutamyl-tRNA(Gln) amidotransferase | 0.48 | ns | ns | ns | 347 |
| 0.0016 | ||||||
| DY6261151/2 | GTPase IMAP family member 7 | ns | ns | 1.14 | ns | 370 |
| 0.0016 | ||||||
| CN471284 | High-mobility group 20B | 0.60 | ns | ns | ns | 163 |
| 0.0004 | ||||||
| CN469367 | Hox gene cluster | 1.34 | 1.16 | 0.86 | 1.11 | 183 |
| 3E-05 | 9E-05 | 0.0006 | 0.0002 | |||
| DY627986 | Hox gene cluster | 1.81 | 1.12 | 0.80 | 1.22 | 95.1 |
| 3E-07 | 1E-05 | 0.0001 | 5E-06 | |||
| DY629113 | Immunoglobulin light chain | ns | ns | 0.65 | ns | 482 |
| 0.0025 | ||||||
| CN468953 | Iron-sulfur cluster assembly enzyme ISCU | ns | ns | ns | 0.86 | 610 |
| 0.0022 | ||||||
| DY628151 | Kallikrein-8 precursor | 1.02 | ns | ns | ns | 102 |
| 0.0015 | ||||||
| DY627800 | Kinesin-like protein 2 (knsl2) | ns | 0.86 | 1.84 | 1.14 | 398 |
| 0.0013 | 9E-06 | 0.0003 | ||||
| CN469578 | KLR1 gene | 1.04 | ns | ns | ns | 154 |
| 4E-05 | ||||||
| DY629760 | LOC100150543, polyprotein | 1.35 | ns | 0.65 | 0.79 | 141 |
| 8E-06 | 0.001 | 0.0003 | ||||
| CN468718 | LOC100151545, similar to Protein KIAA0284 | 0.72 0.0004 | ns | ns | ns | 145 |
| DY629780 | MHC class I | ns | 0.84 | 1.26 | 1.05 | 161 |
| 0.0053 | 0.0005 | 0.0016 | ||||
| DY630620 | MHC class IA antigen | ns | ns | 0.42 | ns | 120 |
| 0.0026 | ||||||
| DY630701 | MHC class II alpha subunit | ns | ns | 0.49 | ns | 764 |
| 0.0053 | ||||||
| DY631898 | MHC class II antigen alpha chain | ns | ns | 0.94 | ns | 87.8 |
| 0.0004 | ||||||
| DY631847 | Mitotic spindle assembly checkpoint protein MAD2A | 0.60 | ns | ns | ns | 374 |
| 0.0008 | ||||||
| DY627079 | Muscle-type creatine kinase CKM2 | ns | 0.41 | ns | ns | 787 |
| 0.0049 | ||||||
| CN4693752/3 | Peptidyl-prolyl cis-trans isomerase NIMA-interacting 1 | ns | 0.69 | ns | ns | 663 |
| 0.0003 | ||||||
| DY628779 | Post-GPI attachment to proteins factor 2 | ns | 0.87 | ns | ns | 123 |
| 0.0002 | ||||||
| DY6261142/2 | Ras association domain-containing protein 4 | ns | 0.82 | ns | ns | 1086 |
| 0.0001 | ||||||
| DY630104 | Ras-related C3 botulinum toxin substrate 2 | 1.44 | 0.83 | 1.47 | 1.90 | 331 |
| 8E-06 | 0.0003 | 7E-06 | 1E-06 | |||
| DY6305081/3 | Replication factor C subunit 5 | 1.04 | ns | ns | ns | 1234 |
| 2E-05 | ||||||
| DY628495 | Ribosomal protein, large P2 (60S) | ns | ns | ns | 1.01 | 161 |
| 0.0001 | ||||||
| DY6308322/3 | Ribosomal protein S20 (40S) | ns | 0.65 | ns | ns | 663 |
| 0.001 | ||||||
| DY626643 | Serine/threonine phosphatase gene | ns | 0.57 | 0.57 | 0.54 | 87.8 |
| 0.0004 | 0.0004 | 0.0063 | ||||
| CN470072 | Sestrin 3 | ns | 1.30 | 1.61 | 1.70 | 116 |
| 0.0007 | 0.0002 | 0.0002 | ||||
| DY629126 | Short coiled-coil protein | ns | ns | ns | 0.59 | 242 |
| 0.0025 | ||||||
| DY630540 | Small inducible cytokine SCYA102 | ns | 0.64 | ns | ns | 1204 |
| 0.0019 | ||||||
| CN471492 | Solute carrier family 9 (sodium/hydrogen exchanger) | ns | ns | 0.63 | ns | 197 |
| 0.0021 | ||||||
| CN4711031/3 | Ubiquitin | ns | ns | 1.27 | ns | 985 |
| 0.0042 | ||||||
| DY629776 | UDP glycosyltransferase 2 family, polypeptide A1 | ns | 0.92 | ns | ns | 304 |
| 0.0006 | ||||||
| CN469822 | Vacuolar ATP synthase subunit G 1 | 0.79 | ns | ns | ns | 277 |
| 0.0007 | ||||||
| DY632057 | Pituitary adenylate cyclase activating polypeptide receptor 1A | ns | 1.73 | 1.98 | 1.79 | 170 |
| DY626009 | Non-LTR retrotransposon Rex1a | 0.70 | ns | ns | ns | 82.4 |
| 0.002 | ||||||
| DY629391 | Non-LTR retrotransposon Rex3_Tet | 0.94 | ns | ns | ns | 122 |
| 0.0028 | ||||||
| DY631649 | SINE sequence | ns | 0.78 | ns | ns | 138 |
| 0.0002 |
Genomic Content Value: Log2 Hybridization coefficients for each species relative to A. burtoni as estimated by the linear model following background correction and normalization followed by the uncorrected p-value for those significantly different (P < FDR 0.1). BitScore: the quality of the alignment for the annotated homology. A.twe: A. tweddlei; M.est: M. estherae; P.sim: P. similis; R.chi: R. “chilingali”; “ns”: not significant; superscript numbers: when multiple array features represent a gene, these numbers indicate the number of significant array features and the total number of array features for that gene represented in the table by a single GenBank number and data row.
Genes duplicated relative to with no informative BLAST hit
| CN4691252/2 | 1.32 | ns | ns | ns |
| 2E-5 | ||||
| CN469431 | ns | ns | ns | 0.50 |
| 0.0017 | ||||
| CN469460 | ns | ns | 0.64 | ns |
| 0.0007 | ||||
| CN469913 | ns | ns | ns | 0.63 |
| 0.0008 | ||||
| CN470216 | ns | 0.39 | ns | ns |
| 0.0039 | ||||
| CN470402 | ns | ns | 0.48 | 0.45 |
| 0.0034 | 0.0051 | |||
| CN470540 | ns | ns | ns | 0.60 |
| 0.0051 | ||||
| CN470597 | 0.65 | ns | ns | ns |
| 0.0013 | ||||
| CN470646 | 0.73 | ns | ns | ns |
| 0.0003 | ||||
| CN470701 | ns | ns | ns | 0.65 |
| 0.0052 | ||||
| CN470713 | ns | ns | 0.68 | ns |
| 0.0009 | ||||
| CN4707241/2 | 1.02 | ns | ns | ns |
| 0.0027 | ||||
| CN470781 | 0.58 | ns | ns | ns |
| 0.0025 | ||||
| CN470835 | ns | 1.55 | ns | ns |
| 0.0002 | ||||
| CN470857 | ns | 0.67 | ns | 0.96 |
| 0.005 | 0.0006 | |||
| CN470988 | ns | 1.28 | ns | 1.34 |
| 0.0007 | 0.0005 | |||
| CN471261 | ns | ns | 0.93 | ns |
| 0.001 | ||||
| CN471811 | ns | 1.35 | 1.22 | ns |
| 6E-6 | 1E-5 | |||
| CN471851 | ns | ns | 0.47 | ns |
| 0.0018 | ||||
| CN472050 | ns | ns | ns | 0.70 |
| 0.0002 | ||||
| DY625804 | ns | 1.23 | ns | ns |
| 0.0001 | ||||
| DY6258451/2 | 1.16 | ns | ns | ns |
| 6E-6 | ||||
| DY625884 | 0.49 | ns | ns | ns |
| 0.0024 | ||||
| DY625919 | 1.18 | ns | ns | ns |
| 0.0001 | ||||
| DY626122 | 1.50 | ns | ns | ns |
| 2E-5 | ||||
| DY626140 | ns | 1.05 | ns | ns |
| 0.0036 | ||||
| DY626192 | ns | 0.75 | ns | ns |
| 0.0002 | ||||
| DY626304 | ns | ns | 0.99 | 1.50 |
| 4E-5 | 2.2E-6 | |||
| DY626389 | ns | ns | 0.57 | ns |
| 0.0022 | ||||
| DY626428 | ns | ns | ns | 0.39 |
| 0.0053 | ||||
| DY626737 | ns | ns | 0.79 | ns |
| 0.0001 | ||||
| DY626766 | ns | 0.81 | 0.98 | 0.68 |
| 0.0003 | 0.0001 | 0.0009 | ||
| DY627085 | ns | 0.44 | ns | ns |
| 0.0043 | ||||
| DY627087 | 1.15 | ns | ns | ns |
| 3E-5 | ||||
| DY627338 | 0.72 | ns | ns | ns |
| 0.0006 | ||||
| DY627361 | ns | ns | ns | 0.74 |
| 0.0001 | ||||
| DY627641 | ns | 0.76 | 0.85 | ns |
| 0.0012 | 0.0006 | |||
| DY627780 | ns | 1.51 | ns | ns |
| 3E-6 | ||||
| DY627911 | 1.04 | 0.87 | 0.49 | ns |
| 5E-5 | 0.0002 | 0.0046 | ||
| DY628148 | 0.50 | ns | ns | ns |
| 0.0012 | ||||
| DY628172 | 1.38 | ns | ns | ns |
| 4E-6 | ||||
| DY628268 | ns | ns | 0.46 | ns |
| 0.0017 | ||||
| DY628316 | ns | 0.64 | ns | ns |
| 0.0026 | ||||
| DY628477 | ns | ns | ns | 0.58 |
| 0.0052 | ||||
| DY628517 | 0.76 | ns | ns | ns |
| 0.0002 | ||||
| DY628561 | ns | ns | ns | 0.42 |
| 0.0052 | ||||
| DY628642 | ns | ns | 1.62 | 1.13 |
| 9E-5 | 0.0087 | |||
| DY6287021/2 | ns | ns | ns | 0.67 |
| 0.0004 | ||||
| DY628714 | ns | ns | ns | 0.48 |
| 0.0027 | ||||
| DY629058 | ns | ns | ns | 2.41 |
| 3.7E-5 | ||||
| DY629123 | ns | 0.87 | 0.83 | 1.41 |
| 0.0001 | 0.0001 | 3.8E-6 | ||
| DY629387 | ns | ns | 1.18 | ns |
| 4E-5 | ||||
| DY629482 | 1.39 | 0.71 | ns | 1.12 |
| 1E-5 | 0.0008 | 4.3E-5 | ||
| DY629717 | ns | ns | 1.60 | 1.28 |
| 0.0002 | 0.0007 | |||
| DY6298821/2 | ns | ns | ns | 0.77 |
| 0.0032 | ||||
| DY629912 | 1.41 | 2.21 | 1.06 | 1.16 |
| 8E-6 | 0.0003 | 7E-6 | 1.1E-6 | |
| DY630229 | ns | 0.88 | 0.83 | ns |
| 0.0015 | 0.0021 | |||
| DY630284 | ns | 0.54 | ns | ns |
| 0.0032 | ||||
| DY6303731/2 | ns | 0.89 | 1.00 | 1.23 |
| 0.0001 | 6E-5 | 1.5E-5 | ||
| DY630867 | 0.97 | 0.64 | ns | ns |
| 0.0002 | 0.0024 | |||
| DY630964 | ns | ns | ns | 0.95 |
| 0.0037 | ||||
| DY630993 | ns | 0.67 | ns | ns |
| 0.002 | ||||
| DY631067 | ns | 0.78 | 0.80 | 1.02 |
| 0.0008 | 0.0007 | 0.0002 | ||
| DY631315 | ns | ns | 1.57 | 1.16 |
| 0.003 | 0.0019 | |||
| DY631408 | ns | ns | 0.50 | ns |
| 0.0033 | ||||
| DY631442 | ns | 1.40 | 1.16 | ns |
| 0.0014 | 0.004 | |||
| DY6315051/2 | ns | 0.72 | ns | ns |
| 0.0029 | ||||
| DY631507 | 0.67 | 0.69 | 0.78 | 0.61 |
| 0.0013 | 0.0011 | 0.0005 | 0.0022 | |
| DY631680 | ns | ns | ns | 1.02 |
| 0.0019 | ||||
| DY631698 | ns | ns | 1.03 | ns |
| 0.0006 | ||||
| DY631821 | ns | ns | 0.61 | 0.60 |
| 0.0038 | 0.0039 | |||
| DY631827 | ns | 0.86 | ns | ns |
| 0.0046 | ||||
| DY631850 | 0.85 | ns | ns | ns |
| 0.0009 | ||||
| DY631869 | ns | 0.39 | ns | ns |
| 0.0038 | ||||
| DY632007 | 0.99 | ns | ns | ns |
| 0.0002 | ||||
| DY6320583/3 | ns | 0.90 | 0.72 | 0.71 |
| 0.0007 | 0.0026 | 0.0028 | ||
| DY632092 | ns | 1.09 | ns | ns |
| 0.0007 | ||||
| DY632097 | ns | 0.79 | ns | 0.82 |
| 0.0031 | 0.0002 | |||
| DY6321342/2 | 0.94 | 0.86 | ns | 0.84 |
| 0.0002 | 0.0003 | 0.0003 | ||
| DY632256 | ns | ns | 1.56 | ns |
| 0.0033 | ||||
| DY632294 | ns | 0.41 | ns | ns |
| 0.0035 |
Genomic Content Value: Log2 Hybridization coefficients for each species relative to A. burtoni as estimated by the linear model following background correction and normalization followed by the uncorrected p-value for those significantly different. A.twe: A. tweddlei; M.est: M. estherae; P.sim: P. similis; R.chi: R. chilingali; “ns”: not significant; “*”: the GenBank number is a representative for multiple array features for that gene.
Figure 2Genes identified as duplicated among test species (P < 0.1 FDR). A. twe: A. tweddlei; M. est: M. estherae; P. sim: P. similis; R. chi: R. “chilingali”. Shared: genes found duplicated in multiple species; Specific: genes found duplicated in only one species; lake: species belonging to the Lake Malawi radiation (M. estherae, P. similis, R. “chilingali”); river: the river species A. tweddlei.
Figure 3qPCR validates gene copy number determined by aCGH. Abbreviations are genus and species initials. Primer loci are named for the Genbank number of the A. burtoni array feature sequence. **P <0.1 FDR, *P <0.2 FDR found by array analysis.
Oligonucleotide primers used for qPCR designed against GenBank sequence available for microarray features
| | |||||
|---|---|---|---|---|---|
| DY626766 | F: TCGGTCTCCTTAACCGGATG | No Hit | 193 | 86 | 74 |
| | R: CTGAGTTTGGCTGCCCGTAA | | | | |
| DY627986 | F: ACGAACACCCGAACGGAAAC | Hox gene cluster | 222 | 100 | 104 |
| | R: GGTGCACGCACATGAACTGT | | | | |
| DY631898 | F: CGTCCCAGTGAGGATGAGGA | MHC class II antigen | 161 | 82 | 82 |
| | R: TGATGCTGATCGGTTGATGC | | | | |
| DY632057 | F: ATTACTGCGAGTGCCGTCCA | Pituitary adenylate cyclase activating | 150 | 91 | 78 |
| R: CTGCGCCCTGAAAGAACAGA | polypeptide receptor 1A | ||||
Primer Efficiency: percent is based on 4-fold template dilutions for A. burtoni and one heterologous test species indicated in parentheses.