Literature DB >> 24291769

Advances in genomics of bony fish.

Herman P Spaink1, Hans J Jansen, Ron P Dirks.   

Abstract

In this review, we present an overview of the recent advances of genomic technologies applied to studies of fish species belonging to the superclass of Osteichthyes (bony fish) with a major emphasis on the infraclass of Teleostei, also called teleosts. This superclass that represents more than 50% of all known vertebrate species has gained considerable attention from genome researchers in the last decade. We discuss many examples that demonstrate that this highly deserved attention is currently leading to new opportunities for answering important biological questions on gene function and evolutionary processes. In addition to giving an overview of the technologies that have been applied for studying various fish species we put the recent advances in genome research on the model species zebrafish and medaka in the context of its impact for studies of all fish of the superclass of Osteichthyes. We thereby want to illustrate how the combined value of research on model species together with a broad angle perspective on all bony fish species will have a huge impact on research in all fields of fundamental science and will speed up applications in many societally important areas such as the development of new medicines, toxicology test systems, environmental sensing systems and sustainable aquaculture strategies.

Entities:  

Keywords:  aquaculture; fish models; genomics; medaka; next-generation sequencing; teleosts; zebrafish

Mesh:

Year:  2013        PMID: 24291769      PMCID: PMC3954040          DOI: 10.1093/bfgp/elt046

Source DB:  PubMed          Journal:  Brief Funct Genomics        ISSN: 2041-2649            Impact factor:   4.241


INTRODUCTION

In the recent years there have been tremendous advances in genomic studies of many vertebrate species. In these studies the attention to various representatives of the bony fish species (the superclass of Osteichthyes) has been increasing enormously, especially focussing on the infraclass of Teleostei that represent approximately 96% of the species of this superclass. This increase in attention is partly the result of the fact that this superclass with about 27 000 living species represents more than 50% of all known vertebrate species [1-4]. In our opinion, it also reflects the trend that fundamental and applied scientific interests in the genomics of bony fish are now converging. On the one hand, fish species such as zebrafish and medaka have clearly shown their broad applicability for studies of fundamental processes underlying development and disease. The tremendous attention these fish species have obtained for an extensive range of fundamental and applied research purposes have earned them the qualification of model fish species. On the other hand, the economical value of the bony fish for food resources coincides with their applicability for biomedical applications and toxicology studies. Together, these fundamental and applied scientific purposes have made it possible that the most advanced genomics technologies have been used for studies of many bony fish species, ranging from the model fish species zebrafish and medaka to ‘living fossils’ such as the coelacanths and the fresh water eels [5-11]. The fresh water eels have only recently been termed living fossils since apparently they have retained most of the genome duplication that occurred after the radiation of the bony fish from the common ancestor with the mammals. This is an example that these studies already are giving an unprecedented insight into the evolution of all bony fish species. The teleost species are extremely interesting for evolutionary studies because they are widespread in an incredible range of microenvironments containing water, ranging from the deepest levels of the oceans, to caves completely devoid of any light or even in environments which most of a year do not contain any water. This has led to remarkable adaptations to life at extreme conditions as exemplified by the tilapia species that can survive at 44°C at very high salinity, Antarctic toothfish that can thrive at temperatures below 0°C and deep sea fish such as from the genus Coryphaenoides that can stand pressures of more than 60 MPa [2, 12]. This has made bony fish species very attractive for studies on the effects of adverse conditions such as high gravity that are applicable to space travel research [13-15], or the absence of light that has important implications for studies of circadian rhythm in adults and embryonic stages [16-20]. On the other hand, the response of many bony fish species such as trouts and minnows to toxic compounds is very similar to that in humans. Therefore, these fish have been extensively used for toxicology research already for many decades [21-24] and recently this attention has been extended to the model fish species zebrafish and medaka [25-32]. In this review, we will give an overview of genome sequencing and assembly technologies that have been most popular to study the bony fish and the near future possibilities that will still have to gain in importance. Secondly, we will discuss the impact of fundamental and applied research on model fish species with special attention to the current status of genome sequencing and the impact for further genomic studies. Thirdly, we will give an overview of the advances in genomics of non-model bony fish species. Finally, we will discuss the predicted impact of bony fish genomics on biomedical and aquacultural applications and their importance for future evolutionary studies in a broader perspective than the bony fish.

COMPARISON OF SEQUENCING PLATFORMS

Over the past 8 years a number of so-called next-generation sequencing platforms have hit the market. They are all based on parallel sequencing of immobilized targets and have revolutionized the genomics field by generating an abundance of sequencing data. Several different sequencing strategies are employed by these platforms. Each of them has their own characteristics. Here we will briefly discuss some of the more popular platforms which are widely used in fish genomics today. An overview of several characteristics of these platforms is shown in Table 1.
Table 1:

Overview of high-throughput sequencing platforms

PlatformRoche 454 FLX +Life Technologies SOLiD 5500XLIllumina HiSeq High OutputIllumina HiSeq Rapid RunIllumina MiSeqPacific Biosciences PacBio RS IILife Technologies Ion Torrent PGMLife Technologies Ion Torrent Proton
Mean read length (bp)7002 × 602 × 1002 × 1502 × 2504500400170
Reads/run∼1 M1.4 G6 G1.2 G30 M40–60 K∼5 M60–80 M
Yield/run0.7 Gb155 Gb600 Gb120 Gb8 Gb230 Mb1 Gb8–10 Gb
Raw error rate<1%∼5%∼0.1%∼0.1%∼0.1%∼15%0.5–2%<1%
Run time23 h8 days11 days27 h39 h120 min7 h4 h
TechnologyPyrosequencing with luciferase detection.Ligation system with fluorescent dibase tags.Single nucleotides are incorporated into the synthesized strand, imaged. The terminator is removed after imaging allowing incorporation of the next nucleotide.Live imaging of fluorescent strand synthesis.Pyrosequencing with pH detection.
RemarksShort runtime.Short read length.Lower coverage on AT- and GC-rich sequences.Long read length.Short runtime.
Errors accumulate at end of read.No sequence bias.Homopolymers cannot be properly resolved.
Homopolymers cannot be properly resolved.Low coverage on GC-rich sequences.Short run time in Rapid run and on MiSeq.High raw error rate.Low coverage on AT-rich sequences.
Cost/Mb$10.00$0.07$0.05$0.05$0.14$3.00$1.00$0.10
Overview of high-throughput sequencing platforms There are now four companies who together dominate the market. Roche (454 GS FLX) and Life Technologies (Ion Torrent machines) both developed systems that use pyrosequencing to read the DNA sequence. Although this technique is fast it has problems reading through homopolymers. The read length on the Ion Torrent machine does not match these from the 454 GS FLX but is likely to increase as new chips and chemistry become available. Next to their Ion Torrent machines Life Technologies also has the SOLiD platform in its portfolio. This platform is more comparable in terms of throughput and costs per base to the Illumina platform. Whereas SOLiD employs a ligation system with dibase tags, Illumina’s HiSeq and MiSeq use a process called sequencing by synthesis (SBS). This SBS technology has already been on the market for a few years now and lately the development of this technology has mainly resulted in longer read length and not so much in more reads per flow-cell. All these machines need clonal copies of the DNA molecule to obtain enough signal for reliable base calling. The amplification step needed to obtain these copies can be a source of bias in the sequence data and information about DNA modifications is lost. An altogether different system is used by the PacBio RS II from Pacific Biosciences. In this machine strand synthesis is followed on single DNA molecules. Although this produces reads spanning several kilobases the raw error rate is high due to the nature of imaging single molecules. Since no amplification is needed it has the benefit that DNA modifications can also be detected and there is no bias in the sequence data. When using different applications like de novo genome sequencing, resequencing and transcriptome sequencing different parameters are important that influence the choice of the sequencing platform. For de novo genome sequencing it is important to have even coverage in all regions and to have a low error rate. To facilitate assembly the read length should be as long as possible. The combined use of Illumina HiSeq and PacBio RS platforms are best suited for this type of applications. When sequencing a transcriptome a high throughput is desirable but read length is a less important factor. In the coming years we can expect a further drop in cost/Mb driven by ongoing development of the current technologies and the introduction of new sequencing technologies like sequencing using nanopores. This will result in tools that will make de novo genome sequencing and resequencing even more efficient and easier. The sequencing endeavours of non-model fish species are increasingly based on whole genome shotgun sequencing (WGS). This kind of sequence data is still inferior in coverage to map-based sequence data, for instance based on BAC sequencing. This is notwithstanding the fact that even in the absence of large scaffolded WGS data sets it is still possible to obtain highly valuable complete exome predictions that also make use of transcriptome data sets and improved gene prediction models. However, especially chromosomal areas with many repetitive sequences will be poorly covered by WGS assemblies. Furthermore, for polyploid species it will be very difficult to obtain a reliable estimate of the coverage of the entire genome. The bioinformatics needed for scaffolding of WGS is still in the development stage. In Table 2, we present an overview of the software that has been used for de novo assembly and scaffolding of WGS data. It can be argued that in the future the technologies mentioned above will further improve to such extent that the disadvantages of WGS will become less pronounced. For instance, when PacBio sequencing length runs and coverage will further increase it could be used to obtain larger scaffolds even for difficult areas of a WGS assembly. This was recently demonstrated by sequencing the genome of the Arabidopsis Ler-0 mutant solely using the PacBio RS II platform (data available from github.com/PacificBiosciences/DevNet/wiki/Datasets).
Table 2:

An overview of software packages that have been used for the de novo assembly and scaffolding of fish genomes

Software packageSoftware ReferenceWeb resourceGenome assemblies using this softwareGenome Reference
ArachneJaffe et al., 2003 [33]ftp://ftp.broadinstitute.org/pub/crd/ARACHNE/Petromyzon marinus (sea lamprey)Smith et al., 2013 [34]
BowtieLangmead et al., 2009 [35]http://bowtie-bio.sourceforge.net/index.shtmlThunnus orientalis (Pacific bluefin tuna)Nakamura et al., 2013 [36]
Celera AssemblerMyers et al., 2000 [37]http://wgs-assembler.sourceforge.netCallorhinchus milii (elephant shark)Venkatesh et al., 2007 [38]
Gadus morhua (Atlantic cod)Star et al., 2011 [39]
Takifugu rubripes (pufferfish)Aparicio et al., 2002 [40]
CLCBio Assembly CellN.A.http://www.clcbio.com/products/clc-assembly-cell/Anguilla japonica (Japanese eel)Henkel et al., 2012a [6]
Cyprinus carpio (common carp)Henkel et al., 2012 [41]
CLCBio Genome WorkbenchN.A.http://www.clcbio.com/products/clc-genomics-workbench/Leucoraja erinacea (little skate),King et al., 2011 [42]
Xiphophorus maculatus (platyfish)Schartl et al., 2013 [43]
Anguilla anguilla (European eel)Henkel et al., 2012b [7]
FuzzypathSudbery et al., 2009 [44]ftp://ftp.sanger.ac.uk/pub/zn1/fuzzypath/Danio rerio Zv9 (zebrafish)Howe et al., 2013 [9]
Newbler/GS De Novo AssemblerMargulies et al., 2005 [45]http://www.454.com/products/analysis-software/Gadus morhua (Atlantic cod)Star et al., 2011 [39]
Xiphophorus maculatus (platyfish)Schartl et al., 2013 [43]
Thunnus orientalis (Pacific bluefin tuna)Nakamura et al., 2013 [36]
PCAPHuang et al., 2003 [46]http://seq.cs.iastate.edu/pcap.htmlXiphophorus maculatus (platyfish)Schartl et al., 2013 [43]
PhrapDe la Bastide and McCombie, 2007 [47]http://www.phrap.org/phredphrapconsed.htmlLabeotropheus fuelleborni (blue mbuna)Loh et al., 2008 [48]
Maylandia zebra (Zebra Mbuna)
Mchenga conophoros
Melanochromis auratus (golden Mbuna)
Rhamphochromis esox
PhusionMullikin and Ning, 2003 [49]http://www.sanger.ac.uk/resources/software/phusion/Danio rerio Zv9 (zebrafish)Howe et al., 2013 [9]
RAMEN AssemblerN.A.N.A.Oryzias latipes (Medaka)Kasahara et al., 2007 [50]
Ahsan et al., 2008 [51]
SSPACE ScaffolderBoetzer et al., 2011 [52]http://www.baseclear.com/landingpages/basetools-a-wide- range-of-bioinformatics-solutions/sspace-premium/Anguilla anguilla (European eel)Henkel et al., 2012 [41]
Anguilla japonica (Japanese eel)Henkel et al., 2012a [6]
Cyprinus carpio (common carp)Henkel et al., 2012b [7]
An overview of software packages that have been used for the de novo assembly and scaffolding of fish genomes It should also be mentioned that alternative methods to BAC sequencing have been developed that are highly applicable to obtaining genetic maps of fish species. To obtain a genetic map of an organism restriction associated DNA (RAD) tag sequencing can be employed as demonstrated for the spotted gar [53], the threespine stickleback [54] and the Xiphophorus sequencing projects [43]. This method uses next-generation sequencing to map sequence variants in the neighbourhood of restriction sites in the offspring from a cross. From the inheritance of the variants a high-density genetic linkage map can be constructed. This map can then be used to align scaffolds in higher order structures. More recently optical mapping of nicking sites on the genome in nanochannel arrays has also been employed to create a high-density genome map that can be used to order contigs and scaffolds [55].

GENOMICS IN MODEL FISH SPECIES

The most frequently studied fish species are zebrafish (Danio rerio) and medaka (Oryzias latipes). Although statistically the zebrafish is currently used most often as a research model, the use of medaka has particular advantages and the importance of the availability of two genomically well-characterized models for comparative purposes and tool development should not be underestimated [5, 56, 57]. For instance, the use of the Tol2 transposon from medaka in the zebrafish, where this transposon does not occur, is the basis for the most successful transgenesis protocols in zebrafish [58]. As a result of the combined efforts of a very large number of research groups these fish species have now established themselves in every field of biology, and also have propagated the use of fish species for chemical, physical and mathematical studies [59-61] and therefore have earned the name model fish species. Although historically these models have earned their fame by their contribution to large forward genetic screens linked to vertebrate developmental studies [62], in recent years these model species have also been extensively used for biomedical applications, and there are already several examples of medicines in clinical trials that were originally developed in zebrafish models. These studies have shown that research in model fish species can greatly speed up the discovery of new medicines [63-66]. Model fish species are also increasingly used for comparative studies in experiments with other fish species that are of importance for aquaculture, e.g. as a model for the effects of swimming exercise on muscle development [67]. Reversely, species that are very important in aquaculture, such as rainbow trout and common carp (Cyprinus carpio), have shown to have benefits for fundamental research. Research with the latter species is especially relevant to biomedical studies in the very closely related zebrafish owing to its large body size, the availability of highly inbred lines and a very large spawn size that offers possibilities for high-throughput screening [41, 68]. From a genomics perspective the zebrafish genome is now the most advanced model in that the sequencing efforts have reached the stage in which the completed genome will be further perfected by the Genome Reference Consortium (http://genomereference.org) [9]. The recently published zebrafish reference genome will undoubtedly have a major impact on future genomics studies, for instance by its major role in aiding the identification of protein functions, as shown recently by Kettleborough et al. [69] and Varshney et al. [70], and by supporting the identification of mutations in forward genetic screens [71]. Howe et al. [9] have shown examples of how the available genomic sequence data can lead to new insights into the evolution of genome architecture and can identify new biological functions for instance involved in sex determination. The results obtained from the zebrafish models can now be compared with other fish species such as medaka that has been extensively used for studies of sex determinants and is thereby the basis to obtain a better understanding of the evolution of sex determination in all bony fish with implications for mammalian research on sex chromosome evolution [72-75]. Due to the rapid evolutionary turn-over of sex chromosomes in fish, sex-linked markers found in medaka and zebrafish will not be directly translatable to results in other fish species. However, by comparative genomic studies with the data obtained in species such as medaka and rainbow trout [76] the resulting knowledge on sex determination mechanisms in several bony fish might also lead to predicted gender markers for other fish species. This will have applications for aquaculture, since methods for determining the sex ratios of offspring of cultured fish species is of economical value. The genome sequence of the zebrafish demonstrates that even between closely related fish species there can be large differences in repetitive DNA content. For instance, in zebrafish the type II DNA transposable elements cover 39% of the entire genome sequence [9], whereas in common carp there is a very low number of repetitive elements, as low as in fugu [41]. This, together with smaller intron and intergenic region sizes, explains why common carp as a pseudo-tetraploid species has a similar DNA content as zebrafish. We recently have obtained a shotgun sequence of the giant Danio (genus Devario) showing that it has a diploid genome that resembles the zebrafish rather than common carp in its richness of repeat sequences (Spaink and Dirks, unpublished data). In addition to these comparative studies, the available model fish genome sequences are an essential basis for the successful interpretation of the extensive transcriptome, proteome and metabolome data sets that are now rapidly accumulating, also for non-model fish species, as illustrated by a small representation of the many recent publications that have stimulated our research in this area [41, 77–93]. The limited annotation of particular classes of genes, such as non-coding RNAs and genes that are only expressed during disease, are bottlenecks that still need to be addressed. Furthermore, there is still a lack of information on orthology relationships between genes from different fish species and mammalian genes. This is a pity since the application in model fish of many new genomics technologies, for instance in epigenetic analysis [94-98], will be more difficult to translate to comparative epigenetic studies in other fish species and mammals.

NEW INSIGHTS FROM NON-MODEL TELEOST FISH GENOMES

Commercial availability of massive parallel sequencing or next-generation sequencing technologies in 2005 triggered an exponential growth of the number of species for which draft assemblies of complete genome sequences were released. The genome sequence of the giant panda was the first sequence of a vertebrate species that was de novo assembled based on next-generation technology alone [99]. As of 2 July 2013 a total of 3263 eukaryotic genomes were registered at NCBI's genome database (http://www.ncbi.nlm.nih.gov/genome/). Animal genomes accounted for 977 entries and the majority of these belong to the groups of mammals (378) and insects (285). Teleost fish, although the largest known group of vertebrates (∼ 27 000 species), are only poorly represented in this database, namely by 93 species and including 42 entries with the status ‘no data’ and 17 entries with the status ‘SRA/traces’. A combined search for whole genome sequencing projects of ray-finned fish (Actinopterygii) and lobe-finned fish (Sarcopterygii) in three commonly used databases, namely NCBI, ENSEMBL (http://www.ensembl.org/index.html) and GOLD (www.genomesonline.org/), resulted in a list of 61 registered fish genomics projects (Table 3), some of which have the status ‘Scaffolds or contigs’ (27), or ‘Chromosomes’ (6), and more than half of which are still incomplete. Clearly, the orders of the Cypriniformes (6 projects), Cyprinodontiformes (11 projects) and Perciformes (18 projects) are currently the most popular for genomics projects.
Table 3:

Whole genome sequencing projects (as of 2 July 2013) of ray-finned fish (Actinopterygii) and alobe-finned fish (Sarcopterygii)

Organism/nameOrder; familyCommon nameNCBI Project IDGoldCARD IDSize (Mb)GC%ChrsWGSScaffoldsStatusRef.
Acanthemblemaria mariaPerciformes; ChaenopsidaeSecretary blennyPRJNA175737Gi0044402
Anguilla anguillaAnguilliformes; AnguillidaeEuropean eelPRJNA73577Gi0045243[7]
Anguilla japonicaAnguilliformes; AnguillidaeJapanese eelPRJNA158309Gi0053798[6]
Anoplopoma fimbriaScorpaeniformes; AnoplopomatidaeSablefishPRJNA202249Gi0048049
Aphanius shiriniCyprinodontiformes; CyprinodontidaePRJNA203365Gi0049840
Astronotus crassipinnisPerciformes; Cichlidae‘Fat Oscarfish’PRJNA167777Gi0044952
Astyanax mexicanusCharaciformes; CharacidaeMexican tetraPRJNA89115Gi0044658964.3137.9APWO0110735Scaffolds or contigs
Carassius auratus red var.Cypriniformes; CyprinidaeRed crucian carpPRJNA80997Gi0045250
Chaenocephalus aceratusPerciformes; ChannichthyidaeBlackfin icefishPRJNA89117Gi0044639
Clarias fuscus CLFUWH01Siluriformes; ClariidaeWhitespotted clariasPRJNA38195Gi06053
Coilia nasus COECWH01Clupeiformes; EngraulidaeJapanese grenadier anchovyPRJNA38187Gi06054
Ctenopharyngodon idellaCypriniformes; CyprinidaeGrass carpPRJNA30857Gi06056
PRJNA39737Gi07179
Cynoglossus semilaevisPleuronectiformes; CynoglossidaeTongue solePRJNA73987Gi0043417
Cyprinodon variegatusCyprinodontiformes; CyprinodontidaeSheepshead minnowPRJNA89149Gi0044689
Cyprinus carpio carpioCypriniformes; CyprinidaeCommon carpPRJNA73579Gi0045244[41]
Danio rerioCypriniformes; CyprinidaeZebrafishPRJNA139221412.4736.725CABZ014560Chromosomes[9]
PRJNA11776Gc00272
Dicentrarchus labraxPerciformes; MoronidaeEuropean seabassPRJEA39865Gi0718198.2540.3CABK01Scaffolds or contigs
Engraulis encrasicolusClupeiformes; EngraulidaeEuropean anchovyPRJNA202430Gi0048051
Gadus morhuaGadiformes; GadidaeAtlantic codPRJNA41391Gi05656608.2945.6CAEA01427427Scaffolds or contigs[39]
Gasterosteus aculeatusGasterosteiformes; GasterosteidaeThree-spined sticklebackPRJNA13579Gi00269446.6244.6AANH01Scaffolds or contigs
Haplochromis burtoni (Astatotilapia burtoni)Perciformes; CichlidaePRJNA60363Gi03070698.9840.5AFNZ018001Scaffolds or contigs
Labeotropheus fuelleborniPerciformes; CichlidaeBlue mbunaPRJNA29479Gi0337169.3542.2ABPK0158245Scaffolds or contigs[48]
Lateolabrax japonicusPerciformes; LateolabracidaeJapanese sea bassPRJNA38197Gi07170
Latimeria chalumnaeaCoelacanthiformes; LatimeriidaeAfrican coelacanthPRJNA56111Gi083502183.7241.2 42.0AFYH0122818Scaffolds or contigs[100]
PRJDB5002612.11BAHO01Scaffolds or contigs
Latimeria menadoensisaCoelacanthiformes; LatimeriidaeIndonesian coelacanthPRJNA38001Gi04473[101]
Leiocassis longirostrisSiluriformes; BagridaeChinese longsnout catfishPRJNA38185Gi07164
Lepisosteus oculatusLepisosteiformes; LepisosteidaeSpotted garPRJNA68247Gi0043560945.8640.429AHAT012105Chromosomes
Leptobotia elongataCypriniformes; CobitidaeRoyal clown loachPRJNA205477Gi0049849
Maylandia zebraPerciformes; CichlidaeZebra mbunaPRJNA29483Gi0307277.0342.5 28.0ABPM0165094Scaffolds or contigs[48]
PRJNA198780713.57AGTA023725Scaffolds or contigs
Mchenga conophoros (Copadichromis conophorus)Perciformes; CichlidaePRJNA29477Gi0337071.4341.9ABPJ0161923Scaffolds or contigs[48]
Gi18480
Melanochromis auratusPerciformes; CichlidaeGolden mbunaPRJNA29481Gi0336966.5541.6ABPL0163297Scaffolds or contigs[48]
Mulloidichthys flavolineatusPerciformes; MullidaeYellowstripe goatfishPRJNA184890Gi0045086
Neolamprologus brichardiPerciformes; CichlidaePrincess of BurundiPRJNA60365Gi08440685.9640.4AFNY019098Scaffolds or contigs
Nothobranchius furzeriCyprinodontiformes; NothobranchiidaeTurquoise killifishPRJNA29535Gi044605.3244.9 44.3ABLO015299Scaffolds or contigs[102]
PRJNA33315Gi044615.25ACCZ015617Scaffolds or contigs[102]
Nothobranchius kuhntaeCyprinodontiformes; NothobranchiidaeBeira killifishPRJNA33401Gi044625.2444.8ACDA015934Scaffolds or contigs[102]
Notothenia coriicepsPerciformes; NototheniidaeBlack rockcodPRJNA66471Gi0044648
Oncorhynchus mykissSalmoniformes; SalmonidaeRainbow troutPRJNA172149Gi0044272
Opsanus betaBatrachoidiformes; BatrachoididaeGulf toadfishPRJNA196921Gi0048061
Oreochromis niloticusPerciformes; CichlidaeNile tilapiaPRJNA72943816.1240.4AERX015901Scaffolds or contigs
PRJNA59571Gi08705927.6839.122AERX015909Chromosomes
Oryzias latipesBeloniformes; AdrianichthyidaeJapanese rice fishPRJNA19569Gi01531585.3340.4BAAE0182496Scaffolds or contigs[50]
PRJNA183868869.8224BAAF047307Chromosomes[50]
PRJNA16702Gi02165
Parabramis pekinensisCypriniformes; CyprinidaeWhite Amur breamPRJNA38199Gi07171
Paralichthys olivaceusPleuronectiformes; ParalichthyidaeOlive flounderPRJNA73673Gi0045242
Pelteobagrus fulvidracoSiluriformes; BagridaeYellowhead catfishPRJNA38193Gi07169
(Tachysurus fulvidraco)
Poecilia formosaCyprinodontiformes; PoeciliidaeAmazon mollyPRJNA89109Gi0044650
Poecilia latipinnaCyprinodontiformes; PoeciliidaeSailfin mollyPRJNA196862Gi0048062
Poecilia mexicanaCyprinodontiformes; PoeciliidaeAtlantic mollyPRJNA196869Gi0048063
Psetta maximaPleuronectiformes; ScophthalmidaeTurbotPRJNA38189Gi07165
Pundamilia nyerereiPerciformes; CichlidaePython islandPRJNA60367Gi08441698.8040.6AFNX017236Scaffolds or contigs
Rhamphochromis esoxPerciformes; CichlidaePRJNA29485Gi0336769.8742.4ABPN0155751Scaffolds or contigs[48]
Salmo salarSalmoniformes; SalmonidaeAtlantic salmonPRJNA72713Gi00445192435.3142.6AGKD01Scaffolds or contigs[103]
Sebastes nigrocinctusScorpaeniformes; SebastidaeTiger rockfishPRJNA171384Gi0045199
Sebastes rubrivinctusScorpaeniformes; SebastidaeFlag rockfishPRJNA62009Gi08706
Sparus aurataPerciformes; SparidaeGilt-head seabreamPRJEA49009Gi0044643
Stegastes partitusPerciformes; PomacentridaeBicolour damselfishPRJNA89147Gi0044663
Takifugu flavidusTetraodontiformes; TetraodontidaeSansaifuguPRJNA168966Gi0044522314.9545.2AOOT0134332Scaffolds or contigs
Takifugu rubripesTetraodontiformes; TetraodontidaePufferfishPRJNA1434281.5745.522CAAB027091Chromosomes[40]
PRJNA166939391.49
Gi18232
Tetraodon nigroviridisTetraodontiformes; TetraodontidaeGreen spotted pufferPRJNA12350Gc00229308.4546.6CAAE01Scaffolds or contigs
Xiphophorus birchmanniCyprinodontiformes; PoeciliidaeSheepshead swordtailPRJNA172015Gi0044901
Xiphophorus clemenciaeCyprinodontiformes; PoeciliidaeYellow swordtailPRJNA178205Gi0044902
Xiphophorus helleriiCyprinodontiformes; PoeciliidaeGreen swordtailPRJNA178402Gi0044903
Xiphophorus maculatusCyprinodontiformes; PoeciliidaeSouthern platyfishPRJNA72525Gi0045000652.8438.8AGAJ0120640Scaffolds or contigs[43]

Adapted from NCBI (http://www.ncbi.nlm.nih.gov/genome/), ENSEMBL (http://www.ensembl.org/index.html) and GOLD (http://www.genomesonline.org/).

Whole genome sequencing projects (as of 2 July 2013) of ray-finned fish (Actinopterygii) and alobe-finned fish (Sarcopterygii) Adapted from NCBI (http://www.ncbi.nlm.nih.gov/genome/), ENSEMBL (http://www.ensembl.org/index.html) and GOLD (http://www.genomesonline.org/). Another important resource of fish genomics data is NCBI's Bioproject database (http://www.ncbi.nlm.nih.gov/bioproject), which partially overlaps with the genome database. The Bioprojects database contained almost 900 registered teleost projects (2 July 2013) divided over 12 Project Data Type categories (Table 4). The majority of bioprojects are ‘Transcriptome or gene expression’ projects (84%) and most of the remaining projects are ‘Genome sequencing’ projects (9%). Although the Bioprojects comprise over 168 individual teleost species, only 12 species already account for ∼70% of all projects. Most of the Bioprojects are based on the popular zebrafish model D. rerio (37.3%) and other laboratory models, such as fathead minnow (Pimephales promelas, 3.4%), mummichog (Fundulus heteroclitus, 2.4%), goldfish (Carassius auratus, 2.4%), Japanese rice fish/Medaka (O. latipes, 1.6%) and three-spined stickleback (Gasterosteus aculeatus, 1.3 %). In addition, species that are important for fisheries and aquaculture are well represented, such as rainbow trout (Oncorhynchus mykiss, 10%), Atlantic salmon (Salmo salar, 5.1%), gilt-head (sea) bream (Sparus aurata, 2.2%), Sockeye (red) salmon (Oncorhynchus nerka, 1.3%), largemouth bass (Micropterus salmoides, 1.3%) and channel catfish (Ictalurus punctatus, 1.1%). Also worth mentioning is a set of 30 Bioprojects that include nearly all 28 known species of the genus Xiphophorus (swordtails and platyfish), divided over 5 genome and 25 transcriptome projects.
Table 4:

Teleost Bioprojects registered at NCBI (2 July 2013) according to ‘Project Data Type’

Project Data TypeNumber of projects
Transcriptome or gene expression758
Genome sequencing80
Epigenomics21
Refseq genome12
Variation8
Map8
RAD tag4
Random survey3
Phenotype or genotype2
Targeted locus1
Clone ends1
Microsatellite1
Teleost Bioprojects registered at NCBI (2 July 2013) according to ‘Project Data Type’ Additional draft assemblies of complete teleost genomes have been published, but are not yet available from the NCBI database. For example, genomic scaffolds of the European eel (Anguilla anguilla) [7], Japanese eel (Anguilla japonica) [6], and the common carp (C. carpio) [41] are all accessible via the website www.zfgenomics.com. Recently, a draft assembly of the complete genome of Pacific bluefin tuna (Thunnus orientalis) was published [36], which is accessible via GenBank (accession nos. BADN01000001–BADN01133062). Availability of the complete genome sequence of model and non-model fish species has a strong catalytic effect on a broad range of scientific disciplines and on applied science, as indicated by the following examples. Sequence analysis of the complete genome of the atlantic cod (Gadus morhua) uncovered that these cold-adapted teleosts lack a functional major histocompatibility complex (MHC) II pathway. Apparently, this is compensated for by expansion of the number of MHCI genes and by specific adaptations in the Toll-like receptor (TLR) families, thereby providing new fundamental insight into the evolution of the adaptive immune system in vertebrates [39]. The draft genome sequences of the European eel (A. anguilla) and Japanese eel (A. japonica) showed that these fish species, in contrast to most other teleosts, retained fully populated Hox gene clusters, which may be correlated with their peculiarly complex life cycle that includes two larval stages [6, 7]. In contrast, elasmobranch fishes, such as the cat shark (Scyliorhinus canicula) and the little skate (Leucoraja erinacea), seem to have lost all HoxC cluster genes [42]. This sheds a completely new light on the relative importance of this family of genes for body plan formation in the fish embryo. Detailed analysis of the genome sequence of the Pacific bluefin tuna (T. orientalis) revealed remarkable adaptations in multiple visual pigment genes, which may not only explain their specific predatory behaviour in the blue-pelagic ocean but may also contribute to improved aquaculture conditions [36]. The recent publication of the genome sequence of the platyfish (Xiphophorus maculatus) has already significantly broadened our understanding of a wide variety of phenomena, such as live-bearing fish reproduction, pigmentation patterns and melanoma tumorigenesis, and even complex behavioural traits [43].

CONCLUSIONS AND FUTURE OUTLOOK

The state-of-the-art in genomics of the bony fish has advanced so enormously in the last few years that even in the context of the recent large human sequencing projects, for example in the Encode projects [104], it is no longer possible to catch phrase the recent advances under the term of ‘fishy genomics’ or ‘fish and chips’. The latter catch phrase anyway will have to suffer increasing unpopularity with the prediction that RNA and DNA microarray technologies will soon lose most of their importance, as they will be gradually replaced by methods based on sequencing technologies in the coming years. As explained above, teleost fish species have much to offer for research that is dependent on whole organism test models and for biomedical applications they have in many aspects advantages even over the use of mammalian test systems as recently discussed by Spaink et al. [68]. Independently of its applied values, genome-wide studies of the bony fish have great impact for comparative genomics: it will provide a deep understanding of the recent half billion years of evolution in vertebrates and of more recent era that led to an extreme diversification of particular subgroups of the Teleostei, such as the cichlids that have been intensively studied from an evolutionary perspective [105]. It will also provide enormous opportunities for data mining and will provide the possibility to trace back the origins of genes from the organisms closest to the earliest evolutionary branches to its origins within invertebrates. For this purpose it is fortunate that many invertebrate species such as the tunicates are also increasingly being analysed with genomics technologies (http://www.tunicate-portal.org/wordpress/). That this can lead to unexpected findings is nicely illustrated by the recent discovery of a completely novel fluorescent protein in the Japanese eel [106]. Furthermore, it can lead to new insights into the origin of individual genes, for instance the interesting example of horizontal gene transfer of a transposon between lamprey species and their hosts indicate that transfer of genetic material between species mediated by parasite–host interactions could be very frequent [107]. In addition to fundamental evolutionary research there will also be important applied aspects, for instance in nature conservation biology and the impact of ancient climate changes on species diversification or extinction processes. This could lead to better prediction models for the effects of current estimated climate changes on biodiversity of the teleost fish species and thereby could provide better guidelines for knowledge-based fishery regulations. Sequence technology has reached the stage that the capacity of instrumentation is not limiting anymore for sequencing a large number of vertebrates, in contrast to the period at the end of the 20th century when, as an illustration, one of the reasons for sequencing the genome of the Fugu (Fugu rubripes) was its small size genome. With the super high capacity of shotgun sequencing facilities it might already now be possible to obtain WGS data for all teleost fish species. Although this would still be extremely costly and no plans have yet been proposed for this, there are bigger problems than cost involved: the bioinformatics and curation facilities that are still not adapted to handle the next-generation sequencing data flow coming from many independent sequencing projects, at least not in a user friendly way. Especially since the quality of WGS shotgun sequences does not make the data highly suitable yet to be integrated in a bioinformatic setting such as ENSEMBL it is needed that complementary bioinformatics and data curation solutions become available at low thresholds to analyse and compare the early versions of WGS assemblies [108]. In addition, it would be desirable to strive to common genome data curation and annotation facilities that cover all fish species as now is offered for zebrafish within VEGA [109] (vega.sanger.ac.uk) and to obtain a comprehensive web site that links all bony fish gene annotations and functional studies following the example presented by ZFIN for zebrafish (zfin.org). In the context of genome evolution, we can see the great progress in the last years in answering several old questions that have been extensively debated for over decades such as the origin of the Teleostei gene duplication. Since it is likely that a majority of all vertebrates will be sequenced within the coming decades, we can get new insights in many fish species into the correlation between genome duplications and repeat content of genomes, on the one hand, with environmental selection pressures and particular adaptations of body architecture. We can also predict that we can soon obtain new insights into the mechanisms that were the cause of gene losses resulting in the trimmed genomes of the modern fishes that we are now studying. This will certainly give an amazing view of the genome dynamics that took place during a period of natural selection that lasted for many hundreds of millions of years. This knowledge can form a bridge between molecular biological studies carried out at the very basic molecular levels in microbes and lower vertebrates and studies in mammalian systems. We have therefore no doubts that genomic studies in the bony fish species will remain to play an important role in uniting the levels of molecular and evolutionary studies, e.g. by being perfect models for system biology studies [60, 61, 110, 111]. Next-generation sequencing has revolutionized de novo assembly of fish genomes sequences. Fish models are rapidly gaining importance at all levels of fundamental and applied science. We predict that advances will further accelerate and that the resulting genomic data sets will lead to unprecedented new insights in to vertebrate gene functions and evolutionary mechanisms. The application for nucleotide sequencing in transcriptomics technologies will further increase and will gradually replace expression microarray technologies. There is an increased need for better and more user-friendly bioinformatic tools and curated database storage of data might become a bottleneck.
  108 in total

Review 1.  Life-cycle experiments of medaka fish aboard the international space station.

Authors:  Kenichi Ijiri
Journal:  Adv Space Biol Med       Date:  2003

2.  Zebrafish embryos as an alternative model for screening of drug-induced organ toxicity.

Authors:  S Scholz
Journal:  Arch Toxicol       Date:  2013-03-30       Impact factor: 5.153

Review 3.  Light reaches the very heart of the zebrafish clock.

Authors:  Amanda-Jayne F Carr; T Katherine Tamai; Lucy C Young; Veronica Ferrer; Marcus P Dekens; David Whitmore
Journal:  Chronobiol Int       Date:  2006       Impact factor: 2.877

4.  Developmental energetics of zebrafish, Danio rerio.

Authors:  S Augustine; B Gagnaire; M Floriani; C Adam-Guillermin; S A L M Kooijman
Journal:  Comp Biochem Physiol A Mol Integr Physiol       Date:  2011-04-05       Impact factor: 2.320

Review 5.  Genomics of adaptation and speciation in cichlid fishes: recent advances and analyses in African and Neotropical lineages.

Authors:  Shaohua Fan; Kathryn R Elmer; Axel Meyer
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2012-02-05       Impact factor: 6.237

Review 6.  The National BioResource Project Medaka (NBRP Medaka): an integrated bioresource for biological and biomedical sciences.

Authors:  Takao Sasado; Minoru Tanaka; Kayo Kobayashi; Tadashi Sato; Mitsuru Sakaizumi; Kiyoshi Naruse
Journal:  Exp Anim       Date:  2010

7.  Robotic injection of zebrafish embryos for high-throughput screening in disease models.

Authors:  Herman P Spaink; Chao Cui; Malgorzata I Wiweger; Hans J Jansen; Wouter J Veneman; Rubén Marín-Juez; Jan de Sonneville; Anita Ordas; Vincenzo Torraca; Wietske van der Ent; William P Leenders; Annemarie H Meijer; B Ewa Snaar-Jagalska; Ron P Dirks
Journal:  Methods       Date:  2013-06-11       Impact factor: 3.608

8.  Deep short-read sequencing of chromosome 17 from the mouse strains A/J and CAST/Ei identifies significant germline variation and candidate genes that regulate liver triglyceride levels.

Authors:  Ian Sudbery; Jim Stalker; Jared T Simpson; Thomas Keane; Alistair G Rust; Matthew E Hurles; Klaudia Walter; Dee Lynch; Lydia Teboul; Steve D Brown; Heng Li; Zemin Ning; Joseph H Nadeau; Colleen M Croniger; Richard Durbin; David J Adams
Journal:  Genome Biol       Date:  2009-10-13       Impact factor: 13.583

Review 9.  Small molecule screening in zebrafish: swimming in potential drug therapies.

Authors:  Owen J Tamplin; Richard M White; Lili Jing; Charles K Kaufman; Scott A Lacadie; Pulin Li; Alison M Taylor; Leonard I Zon
Journal:  Wiley Interdiscip Rev Dev Biol       Date:  2012-02-28       Impact factor: 5.814

10.  Reprogramming the maternal zebrafish genome after fertilization to match the paternal methylation pattern.

Authors:  Magdalena E Potok; David A Nix; Timothy J Parnell; Bradley R Cairns
Journal:  Cell       Date:  2013-05-09       Impact factor: 41.582

View more
  9 in total

Review 1.  The Genome 10K Project: a way forward.

Authors:  Klaus-Peter Koepfli; Benedict Paten; Stephen J O'Brien
Journal:  Annu Rev Anim Biosci       Date:  2015       Impact factor: 8.923

2.  Next-generation sequencing detects repetitive elements expansion in giant genomes of annual killifish genus Austrolebias (Cyprinodontiformes, Rivulidae).

Authors:  G García; N Ríos; V Gutiérrez
Journal:  Genetica       Date:  2015-03-20       Impact factor: 1.082

Review 3.  A new model army: Emerging fish models to study the genomics of vertebrate Evo-Devo.

Authors:  Ingo Braasch; Samuel M Peterson; Thomas Desvignes; Braedan M McCluskey; Peter Batzel; John H Postlethwait
Journal:  J Exp Zool B Mol Dev Evol       Date:  2014-08-11       Impact factor: 2.656

4.  Predicting Metabolism from Gene Expression in an Improved Whole-Genome Metabolic Network Model of Danio rerio.

Authors:  Leonie van Steijn; Fons J Verbeek; Herman P Spaink; Roeland M H Merks
Journal:  Zebrafish       Date:  2019-06-19       Impact factor: 1.985

Review 5.  Applications in the search for genomic selection signatures in fish.

Authors:  María E López; Roberto Neira; José M Yáñez
Journal:  Front Genet       Date:  2015-01-14       Impact factor: 4.599

6.  A full-body transcriptome and proteome resource for the European common carp.

Authors:  I C R M Kolder; S J van der Plas-Duivesteijn; G Tan; G F Wiegertjes; M Forlenza; A T Guler; D Y Travin; M Nakao; T Moritomo; I Irnazarow; J T den Dunnen; S Y Anvar; H J Jansen; R P Dirks; M Palmblad; B Lenhard; C V Henkel; H P Spaink
Journal:  BMC Genomics       Date:  2016-09-02       Impact factor: 3.969

Review 7.  Fish Immunoglobulins.

Authors:  Sara Mashoof; Michael F Criscitiello
Journal:  Biology (Basel)       Date:  2016-11-21

8.  Biomonitoring of marine vertebrates in Monterey Bay using eDNA metabarcoding.

Authors:  Elizabeth A Andruszkiewicz; Hilary A Starks; Francisco P Chavez; Lauren M Sassoubre; Barbara A Block; Alexandria B Boehm
Journal:  PLoS One       Date:  2017-04-25       Impact factor: 3.240

9.  Large-scale SNP screenings identify markers linked with GCRV resistant traits through transcriptomes of individuals and cell lines in Ctenopharyngodon idella.

Authors:  Zhiwei Liao; Quanyuan Wan; Xueying Shang; Jianguo Su
Journal:  Sci Rep       Date:  2017-04-26       Impact factor: 4.379

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.