| Literature DB >> 26446539 |
Christopher M Austin1, Mun Hua Tan1, Larry J Croft2, Michael P Hammer3, Han Ming Gan4.
Abstract
The Asian arowana (Scleropages formosus) is of commercial importance, conservation concern, and is a representative of one of the oldest lineages of ray-finned fish, the Osteoglossomorpha. To add to genomic knowledge of this species and the evolution of teleosts, the genome of a Malaysian specimen of arowana was sequenced. A draft genome is presented consisting of 42,110 scaffolds with a total size of 708 Mb (2.85% gaps) representing 93.95% of core eukaryotic genes. Using a k-mer-based method, a genome size of 900 Mb was also estimated. We present an update on the phylogenomics of fishes based on a total of 27 species (23 fish species and 4 tetrapods) using 177 orthologous proteins (71,360 amino acid sites), which supports established relationships except that arowana is placed as the sister lineage to all teleost clades (Bayesian posterior probability 1.00, bootstrap replicate 93%), that evolved after the teleost genome duplication event rather than the eels (Elopomorpha). Evolutionary rates are highly heterogeneous across the tree with fishes represented by both slowly and rapidly evolving lineages. A total of 94 putative pigment genes were identified, providing the impetus for development of molecular markers associated with the spectacular colored phenotypes found within this species.Entities:
Keywords: evolutionary rate; fish; genome; phylogenomics; pigmentation genes
Mesh:
Substances:
Year: 2015 PMID: 26446539 PMCID: PMC4684697 DOI: 10.1093/gbe/evv186
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
List of Species Included in the Phylogenetic Analyses
| Organismsource | Scientific Name | Class | Order | Reference |
|---|---|---|---|---|
| Asian arowana* | Actinopterygii | Osteoglossiformes | This study | |
| European eelZ | Actinopterygii | Anguilliformes | ||
| MedakaE | Actinopterygii | Beloniformes | ||
| Blind cave fishE | Actinopterygii | Characiformes | ||
| Common carpC | Actinopterygii | Cypriniformes | ||
| ZebrafishE | Actinopterygii | Cypriniformes | ||
| Amazon mollyE | Actinopterygii | Cyprinodontiformes | Unpublished | |
| Southern platyfishE | Actinopterygii | Cyprinodontiformes | ||
| Northern pikeV | Actinopterygii | Esociformes | ||
| Atlantic codE | Actinopterygii | Gadiformes | ||
| Three-spined sticklebackE | Actinopterygii | Gasterosteiformes | ||
| Electric eelF | Actinopterygii | Gymnotiformes | ||
| Spotted garE | Actinopterygii | Lepisosteiformes | Unpublished | |
| Nile tilapiaE | Actinopterygii | Perciformes | ||
| Atlantic salmonS | Actinopterygii | Salmoniformes | ||
| Rainbow troutG | Actinopterygii | Salmoniformes | ||
| Japanese pufferE | Actinopterygii | Tetraodontiformes | ||
| Green spotted pufferE | Actinopterygii | Tetraodontiformes | ||
| African coelacanthE | Sarcopterygii | Coelacanthiformes | ||
| | Sarcopterygii | Lepidosireniformes | ||
| Elephant shark | Chondrichthyes | Chimaeriformes | ||
| | Chondrichthyes | Carchariniformes | ||
| | Chondrichthyes | Rajiformes | ||
| Western clawed frogE | Amphibia | Anura | ||
| ChickenE | Aves | Galliformes | ||
| HumanE | Mammalia | Primates | ||
| LizardE | Reptilia | Squamata | ||
Note.—Codes for source: A*STAR (A), CarpBase (C), Ensembl (E), efish genomics (F), Genoscope (G), SalmonDB (SA), SkateBase (SK), SRA (SR), UVic (V), ZF Genomics (Z), this study (*).
aRaw transcriptome reads were used.
bAssembled transcripts were used.
FPhylogenetic relationships among fish species. The phylogenetic tree was inferred from a supermatrix containing the alignment of sequences from 27 species (177 orthologous proteins, 71,360 aligned amino acid positions, 7.07% gaps) and was rooted with the Chondrichthyes. Black circles indicate maximum nodal support with bootstrap values of 100% and Bayesian posterior probabilities of 1.00. The yellow and green circles represent 93% and 98% bootstrap support values, respectively, both with maximal Bayesian posterior probability values of 1.00. Branch length information is included and the rate of molecular evolution (number of amino acid substitutions per site) for each fish lineage is placed beside each taxa label. These values were calculated from the split of all ray-finned fish from lobe-finned fish and tetrapod lineages (node indicated with the orange star). A (T) is placed next to the species for which transcriptome data were utilized.
Putative Arowana Pigmentation Genes
| Gene | Accession ( | Locus ID (arowana) | PID | Accession (annotation) | Species | |
|---|---|---|---|---|---|---|
| adam17 | NP_003174.3 | Z043_115716 | 68.98 | 0.00 | XP_010733184.1 | |
| adamts20 | NP_079279.3 | Z043_106475 | 71.36 | 0.00 | XP_008274326.1 | |
| creb1 | NP_004370.1 | Z043_122987 | 95.37 | 0.00 | XP_005167757.1 | |
| ece1 | NP_001106819.1 | Z043_112628 | 80.03 | 0.00 | CDQ77702.1 | |
| Ednrb | NP_001116131.1 | Z043_105076 | 81.50 | 0.00 | XP_007254865.1 | |
| Egfr | NP_958439.1 | Z043_114891 | — | — | — | |
| fgfr2 | NP_000132.3 | Z043_104866 | 84.50 | 0.00 | KKF10433.1 | |
| frem2 | NP_997244.4 | Z043_101382 | 70.22 | 0.00 | XP_012683949.1 | |
| fzd4 | NP_036325.2 | Z043_108755 | 89.76 | 0.00 | XP_012693402.1 | |
| gna11 | NP_002058.2 | Z043_106310 | 96.02 | 0.00 | XP_010750457.1 | |
| gnaq | NP_002063.2 | Z043_114081 | 86.57 | 0.00 | XP_010735114.1 | |
| gpc3 | NP_001158091.1 | Z043_101235 | 52.03 | 3 × 10−175 | XP_006639062.1 | |
| gpr161 | NP_722561.1 | Z043_116750 | 73.06 | 0.00 | XP_007227875.1 | |
| hdac1 | NP_004955.2 | Z043_108210 | 96.71 | 0.00 | XP_006631299.1 | |
| ikbkg | NP_003630.1 | Z043_105761 | 64.16 | 2 × 10−170 | XP_010903123.1 | |
| itgb1 | NP_596867.1 | Z043_116749 | 71.96 | 0.00 | NP_001030143.1 | |
| Kit | NP_001087241.1 | Z043_118854 | 71.89 | 0.00 | XP_008297546.1 | |
| lef1 | NP_057353.1 | Z043_100731 | — | — | — | |
| lmx1a | NP_001167540.1 | Z043_108871 | 91.03 | 9 × 10−180 | XP_008417499.1 | |
| mbtps1 | NP_003782.1 | Z043_104391 | 86.31 | 0.00 | XP_009291810.1 | |
| mcoln3 | NP_060768.8 | Z043_110213 | 69.96 | 0.00 | XP_006634884.1 | |
| mitf | NP_937801.1 | Z043_105357 | 83.91 | 0.00 | XP_006630679.1 | |
| pax3 | NP_039230.1 | Z043_107599 | — | — | — | |
| rab32 | NP_006825.1 | Z043_104281 | 78.47 | 6 × 10−118 | XP_012671987.1 | |
| scarb2 | NP_005497.1 | Z043_105397 | 78.22 | 0.00 | NP_001117983.1 | |
| sfxn1 | NP_073591.2 | Z043_121119 | 89.10 | 0.00 | XP_010895582.1 | |
| snai2 | NP_003059.1 | Z043_117231 | 85.88 | 5 × 10−164 | XP_003759837.1 | |
| sox10 | NP_008872.1 | Z043_106242 | 77.78 | 0.00 | XP_008294581.1 | |
| sox18 | NP_060889.1 | Z043_107469 | 61.33 | 3 × 10−161 | XP_001337702.1 | |
| sox9 | NP_000337.1 | Z043_118917 | 79.08 | 0.00 | XP_006635207.1 | |
| tfap2a | NP_001027451.1 | Z043_119933 | 86.12 | 0.00 | XP_006634534.1 | |
| trpm1 | NP_001238949.1 | Z043_111666 | 71.06 | 0.00 | XP_006629107.1 | |
| trpm7 | NP_060142.3 | Z043_100441 | 82.16 | 0.00 | XP_006628750.1 | |
| wnt1 | NP_005421.1 | Z043_120129 | 93.51 | 0.00 | XP_010873444.1 | |
| wnt3a | NP_149122.1 | Z043_118184 | 96.12 | 0.00 | XP_008312650.1 | |
| zic2 | NP_009060.2 | Z043_101779 | 88.54 | 0.00 | XP_006638968.1 | |
| dct | NP_001913.2 | Z043_108526 | 73.9 | 0.00 | XP_008326759.1 | |
| rab32 | NP_006825.1 | Z043_116536 | 67.76 | 1 × 10−88 | XP_003224067.2 | |
| rab38 | NP_071732.1 | Z043_122112 | 90.05 | 1 × 10−126 | AAI50366.1 | |
| slc24a4 | NP_705934.1 | Z043_114251 | 81.84 | 0.00 | XP_005803162.1 | |
| slc24a5 | NP_995322.1 | Z043_103396 | 82.06 | 0.00 | XP_005814818.1 | |
| tyrp1 | NP_000541.1 | Z043_107956 | 74.52 | 0.00 | XP_005743086.1 | |
| ap3d1 | NP_003929.4 | Z043_120762 | 73.21 | 0.00 | XP_011472829.1 | |
| fig4 | NP_055660.1 | Z043_103115 | 86.55 | 0.00 | XP_006626354.1 | |
| gpr143 | NP_000264.2 | Z043_102175 | 78.42 | 0.00 | XP_012680526.1 | |
| hps3 | NP_115759.2 | Z043_100370 | 70.79 | 0.00 | XP_012680760.1 | |
| lyst | NP_001288294.1 | Z043_100757 | 69.99 | 0.00 | XP_008300589.1 | |
| nsf | NP_006169.2 | Z043_108447 | 93.61 | 0.00 | XP_005164054.1 | |
| pldn | NP_036520.1 | Z043_109414 | 78.42 | 4 × 10−73 | XP_008274283.1 | |
| rabggta | NP_004572.3 | Z043_121567 | — | — | — | |
| txndc5 | NP_110437.2 | Z043_116626 | 77.02 | 0.00 | CDQ77189.1 | |
| vps11 | NP_068375.3 | Z043_121081 | 90.41 | 0.00 | XP_010863485.1 | |
| vps18 | NP_065908.1 | Z043_111267 | 85.09 | 0.00 | XP_010892538.1 | |
| vps33a | NP_075067.2 | Z043_116542 | 94.66 | 0.00 | CDQ76904.1 | |
| vps39 | NP_056104.2 | Z043_117047 | 89.05 | 0.00 | XP_010749485.1 | |
| mlph | NP_077006.1 | Z043_101687 | 62.90 | 0.00 | XP_005168768.1 | |
| myo5a | NP_000250.3 | Z043_102448 | 86.24 | 0.00 | XP_006628770.1 | |
| myo7a | NP_001120652.1 | Z043_100931 | 78.91 | 0.00 | AAI63570.1 | |
| rab27a | NP_899059.1 | Z043_111973 | 87.89 | 2 × 10−148 | XP_006628775.1 | |
| creb1 | NP_004370.1 | Z043_122987 | 95.37 | 0.00 | XP_005167757.1 | |
| drd2 | NP_000786.1 | Z043_112980 | 83.67 | 0.00 | XP_006642348.1 | |
| mc1r | NP_002377.4 | Z043_121636 | 76.15 | 4 × 10−167 | AGC50885.1 | |
| mgrn1 | NP_001135763.2 | Z043_111249 | 85.27 | 0.00 | XP_006637253.1 | |
| pomc | NP_001030333.1 | Z043_103340 | 51.72 | 7 × 10−66 | AAO17793.1 | |
| atp6ap1 | NP_001174.2 | Z043_108102 | 66.24 | 0.00 | XP_012682891.1 | |
| atp6ap2 | NP_005756.2 | Z043_100882 | 75.14 | 0.00 | XP_012675204.1 | |
| atp6v0c | NP_001185498.1 | Z043_125122 | 95.36 | 3 × 10−90 | XP_008434615.1 | |
| atp6v0d1 | NP_004682.2 | Z043_121933 | 94.48 | 0.00 | NP_955914.1 | |
| atp6v1e1 | NP_001687.1 | Z043_104549 | 92.09 | 2 × 10−143 | XP_007579195.1 | |
| atp6v1f | NP_004222.2 | Z043_100808 | 100.00 | 4 × 10−81 | XP_006633325.1 | |
| atp6v1h | NP_998784.1 | Z043_113483 | 90.61 | 0.00 | XP_007260238.1 | |
| atp7b | NP_000044.2 | Z043_122088 | 54.41 | 0.00 | XP_010017200.1 | |
| rps19 | NP_001013.1 | Z043_118939 | 91.67 | 7 × 10−95 | XP_008329573.1 | |
| rps20 | NP_001014.1 | Z043_107890 | 100.00 | 4 × 10−80 | NP_001117836.1 | |
| atp6v1e1 | NP_001687.1 | Z043_104549 | 92.09 | 2 × 10−143 | XP_007579195.1 | |
| atp6v1h | NP_998784.1 | Z043_113483 | 90.61 | 0.00 | XP_007260238.1 | |
| csf1r | NP_001275634.1 | Z043_118854 | 71.89 | 0.00 | XP_008297546.1 | |
| ednrb | NP_001116131.1 | Z043_105076 | 81.50 | 0.00 | XP_007254865.1 | |
| ghr | NP_001229389.1 | Z043_101160 | 57.24 | 0.00 | BAD20706.1 | |
| pax3 | NP_039230.1 | Z043_107599 | — | — | — | |
| sox10 | NP_008872.1 | Z043_106242 | 77.78 | 0.00 | XP_008294581.1 | |
| gchi | NP_001019195.1 | Z043_110449 | 81.94 | 1 × 10−125 | XP_007231033.1 | |
| mycbp2 | NP_055872.4 | Z043_104473 | 91.14 | 0.00 | XP_007251746.1 | |
| paics | NP_001072992.1 | Z043_121868 | 87.94 | 0.00 | XP_010870568.1 | |
| pcbd1 | NP_000272.1 | Z043_105842 | 95.05 | 1 × 10−66 | XP_012672435.1 | |
| Pts | NP_000308.1 | Z043_103015 | 81.21 | 2 × 10−84 | XP_012670027.1 | |
| qdpr | NP_000311.2 | Z043_109962 | 86.83 | 5 × 10−129 | XP_006137052.1 | |
| Spr | NP_003115.1 | Z043_114288 | 63.64 | 6 × 10−126 | NP_001133746.1 | |
| xdh | NP_000370.2 | Z043_115384 | 69.12 | 0.00 | XP_006636840.1 | |
| atp6v1h | NP_998784.1 | Z043_113483 | 90.61 | 0.00 | XP_007260238.1 | |
| dac | NP_001077.2 | Z043_123292 | 73.28 | 0.00 | ACN11084.1 | |
| ednrb | NP_001116131.1 | Z043_105076 | 81.50 | 0.00 | XP_007254865.1 | |
| Ltk | NP_002335.2 | Z043_118424 | 68.81 | 0.00 | XP_010877407.1 | |
| sox10 | NP_008872.1 | Z043_106242 | 77.78 | 0.00 | XP_008294581.1 | |
| sox9 | NP_000337.1 | Z043_118917 | 79.08 | 0.00 | XP_006635207.1 | |
| trim33 | NP_056990.3 | Z043_115609 | 66.93 | 0.00 | NP_001002871.2 | |
| vps18 | NP_065908.1 | Z043_111267 | 85.09 | 0.00 | XP_010892538.1 | |
| vps39 | NP_056104.2 | Z043_117047 | 89.05 | 0.00 | XP_010749485.1 | |
| abhd11 | NP_683711.1 | Z043_117262 | 79.64 | 9 × 10−155 | XP_010893523.1 | |
| ebna1bp2 | NP_006815.2 | Z043_123300 | 77.78 | 7 × 10−146 | XP_006634973.1 | |
| gfpt1 | NP_002047.2 | Z043_101574 | 95.16 | 0.00 | XP_006625541.1 | |
| gja5 | NP_859054.1 | Z043_107343 | 71.02 | 0.00 | XP_008273833.1 | |
| irf4 | NP_002451.2 | Z043_102759 | 75.71 | 0.00 | XP_006634623.1 | |
| kcnj13 | NP_002233.2 | Z043_119194 | 71.76 | 7 × 10−173 | XP_010768290.1 | |
| pabpc1 | NP_002559.2 | Z043_109572 | 96.20 | 0.00 | XP_007230879.1 | |
| skiv2l2 | NP_056175.3 | Z043_112154 | 91.68 | 0.00 | XP_006627067.1 | |
| tpcn2 | NP_620714.2 | Z043_115041 | 62.50 | 0.00 | CDQ78014.1 | |