Literature DB >> 25671091

Comparative genomic data of the Avian Phylogenomics Project.

Guojie Zhang1, Bo Li2, Cai Li3, M Thomas P Gilbert4, Erich D Jarvis5, Jun Wang6.   

Abstract

BACKGROUND: The evolutionary relationships of modern birds are among the most challenging to understand in systematic biology and have been debated for centuries. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders, and used the genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomics analyses (Jarvis et al. in press; Zhang et al. in press). Here we release assemblies and datasets associated with the comparative genome analyses, which include 38 newly sequenced avian genomes plus previously released or simultaneously released genomes of Chicken, Zebra finch, Turkey, Pigeon, Peregrine falcon, Duck, Budgerigar, Adelie penguin, Emperor penguin and the Medium Ground Finch. We hope that this resource will serve future efforts in phylogenomics and comparative genomics.
FINDINGS: The 38 bird genomes were sequenced using the Illumina HiSeq 2000 platform and assembled using a whole genome shotgun strategy. The 48 genomes were categorized into two groups according to the N50 scaffold size of the assemblies: a high depth group comprising 23 species sequenced at high coverage (>50X) with multiple insert size libraries resulting in N50 scaffold sizes greater than 1 Mb (except the White-throated Tinamou and Bald Eagle); and a low depth group comprising 25 species sequenced at a low coverage (~30X) with two insert size libraries resulting in an average N50 scaffold size of about 50 kb. Repetitive elements comprised 4%-22% of the bird genomes. The assembled scaffolds allowed the homology-based annotation of 13,000 ~ 17000 protein coding genes in each avian genome relative to chicken, zebra finch and human, as well as comparative and sequence conservation analyses.
CONCLUSIONS: Here we release full genome assemblies of 38 newly sequenced avian species, link genome assembly downloads for the 7 of the remaining 10 species, and provide a guideline of genomic data that has been generated and used in our Avian Phylogenomics Project. To the best of our knowledge, the Avian Phylogenomics Project is the biggest vertebrate comparative genomics project to date. The genomic data presented here is expected to accelerate further analyses in many fields, including phylogenetics, comparative genomics, evolution, neurobiology, development biology, and other related areas.

Entities:  

Keywords:  Avian genomes; Phylogenomics; Whole genome sequencing

Year:  2014        PMID: 25671091      PMCID: PMC4322804          DOI: 10.1186/2047-217X-3-26

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


Data description

Here we presented the genomes of 48 bird species, representing 36 orders of birds, including all Neognathae and two of the five Palaeognathae orders, collected by the Avian Genome Consortium ( [1], full author list of the Consortium provided in Additional file 1 and data in GigaDB [2]). The Chicken, Zebra finch, and Turkey genomes (sequenced using the Sanger method) were collected from the public domain. Another three genomes, the Pigeon, Peregrine Falcon and Duck, have been published during the development of this project [3-5], and five genomes, the Budgerigar, Crested Ibis, Little Egret, Emperor and Adele penguins, are reported in companion studies of this project [6, 7]. The data downloads for the remaining 38 genomes are released here.

Genome sequencing

Tissue samples were collected from multiple sources, with the largest contributions from the Copenhagen Zoo (Denmark) and the Louisiana State University (USA). Most DNA samples were processed and quality control performed at the University of Copenhagen (Dr. Gilbert’s lab, Denmark) and Duke University (Dr. Jarvis’ lab, USA). The collected samples were then used for constructing pair-end libraries and sequenced using Illumina HiSeq 2000 platforms at the BGI (China). For the high-coverage birds, multiple pair-end libraries with a series of up to 9 insert sizes (170 bp, 500 bp, 800 bp, 2 kb, 5 kb, 10 kb and 20 kb) were constructed for each species, as part the first 100 species of the G10K project. For four birds (Anas platyrhynchos, Picoides pubescens, Ophisthocomus hoazin and Tinamus guttatus), libraries of some insert sizes were not constructed due to limited sample amounts or the sequencing strategies applied to those species. In addition, for the budgerigar genome, Roche 454 longer reads of multiple insert sizes were used [6]. For the low-coverage genomes, libraries of two insert sizes (500 bp and 800 bp) were constructed. The sequencing depths for high-coverage genomes were 50X to 160X, whereas the sequencing depths for low-coverage genomes were 24X to 39X. An effort was made to obtain DNA samples from tissues with associated museum voucher specimens with high quality metadata.

Genome assembly

Before assembly, several quality control steps were performed to filter the low-quality raw reads. The clean reads of each bird were then passed to SOAPdenovo v1.05 [8] for de novo genome assembly. We tried different k-mers (from 23-mer to 33-mer) to construct contigs and chose the k-mer with the largest N50 contig length. In addition, we also tried different cut-offs of read pairs for different libraries to link contigs into scaffolds. The assembly with the largest N50 length was finally used. All the assemblies have similar genome sizes, ranging from 1.04-1.26Gb (Table  1). The high-coverage genomes have a N50 scaffold length of >1 Mb, except for the White-throated Tinamou (Tinamous guttatus) with a scaffold N50 of 242 Kb and Bald Eagle (Haliaeetus leucocephalus) with a scaffold N50 of 670 Kb, due to no 10 kb and 20 kb libraries for these two genomes. For low-coverage genomes, the scaffold N50 lengths ranged from 30 kb to 64 kb. The N50 contig lengths for high-coverage genomes were from 19 kb to 55 kb, and the low coverage genomes were from 12 kb to 20 kb. The Parrot and Ostrich genomes were further assembled with the aid of optical mapping data, thus achieving much larger scaffold N50 sizes.
Table 1

Basic statistics for the assemblies of avian species

SpeciesCommon nameSequencing depthLibraryAssembly (contig/scaffold N50;total length)
Published (Sanger sequencing)
Gallus gallus Chicken7X-36 K/7.07 M;1.05G
Taeniopygia guttata Zebra finch6X-39 K/10 M;1.2G
Meleagris gallopavo Turkey17X-12.6 K/1.5 M;1.04G
High - coverage genomes
Anas platyrhynchos domestica Peking duck50X200,500,2 k,5 k,10 k26 K/1.2 M;1.1G
Columba livia Pigeon63X200,500,800,2 k,5 k,10 k,20 k22 K/3.2 M;1.11G
Falco peregrinus Peregrine falcon105X200,500,800,2 k,5 k,10 k,20 k28 K/3.9 M;1.18G
Pygoscelis adeliae Adelie penguin60X200,500,800,2 k,5 k,10 k,20 k19 K/5.0 M;1.23G
Aptenodytes forsteri Emperor penguin60X200,500,2 k,5 k,10 k,20 k30 K/5.1 M;1.26G
Nipponia nippon Crested ibis105X200,500,800,2 k,5 k,10 k,20 k22 K/5.4 M;1.17G
Egretta garzetta Little egret74X200,500,800,2 k,5 k,10 k,20 k24 K/3.1 M;1.2G
Calypte anna Anna's hummingbird110X200,500,800,2 k,5 k,10 k,20 k23 K/4 M;1.1G
Chaetura pelagica Chimney swift103X200,500,800,2 k,5 k,10 k,20 k27 K/3.8 M;1.1G
Charadrius vociferus Killdeer100X200,500,800,2 k,5 k,10 k,20 k32 K/3.6 M;1.2G
Cuculus canorus Common cuckoo100X200,500,800,2 k,5 k,10 k,20 k31 K/3 M;1.15G
Ophisthocomus hoazin Hoatzin100X200,500,800,2 k,5 k,10 k24 K/2.9 M;1.14G
Geospiza fortis Medium ground finch115X200,500,800,2 k,5 k,10 k,20 k30 K/5.2 M;1.07G
Manacus vitellinus Golden-collared manakin110X200,500,800,2 k,5 k,10 k,20 k34 K/2.5 M;1.12G
Melopsittacus undulatus Budgerigar160X200, 500, 800, 2 k, 5 k, 10 k55 K/10.6 M;1.1G
Picoides pubescens Downy woodpecker105X200,500,800,2 k,5 k,10 k20 K/2 M;1.17G
Struthio camelus Ostrich85X200,500,800,2 k,5 k,10 k,20 k29 K/3.5 M;1.23G
Tinamus guttatus White-throated tinamou100X200,500,800,2 k,5 k24 K/242 K;1.05G
Corvus brachyrhynchos American crow80X200,500,800,2 k,5 k,10 k,20 k24 K/6.9 M;1.1G
Haliaeetus leucocephalus Bald eagle88X300,400,3 k,8 k10 K/670 K;1.26G
Low - coverage genomes
Antrostomus carolinensis Chuck-will's-widow30X500, 80017 K/45 K;1.15G
Cariama cristata Red-legged seriema24X500, 80017 K/54 K;1.15G
Colius striatus Speckled mousebird27X500, 80018 K/45 k;1.08G
Merops nubicus Carmine bee-eater37X500, 80020 K/47 K;1.06G
Gavia stellata Red-throated loon33X500, 80016 K/45 K;1.15G
Balearica regulorum Grey-crowned crane33X500, 80018 K/51 K;1.14G
Apaloderma vittatum Bar-tailed trogon28X500, 80019 K/56 K;1.08G
Phalacrocorax carbo Great cormorant24X500, 80015 K/48 K;1.15G
Phaethon lepturus White-tailed tropicbird39X500, 80018 K/47 K;1.16G
Phoenicopterus ruber ruber American flamingo33X500, 80016 K/37 K;1.14G
Podiceps cristatus Great-crested grebe30X500, 80013 K/30 K;1.15G
Fulmarus glacialis Northern fulmar33X500, 80017 K/46 K;1.14G
Tyto alba Barn owl27X500, 80013 K/51 K;1.14G
Tauraco erythrolophus Red-crested turaco30X500, 80018 K/55 K;1.17G
Cathartes aura Turkey vulture25X500, 80012 K/35 K;1.17G
Eurypyga helias Sunbittern33X500, 80016 K/46 K;1.1G
Mesitornis unicolor Brown mesite29X500, 80018 K/46 K;1.1G
Leptosomus discolor Cuckoo-roller32X200, 500, 80019 K/61 K;1.15G
Chlamydotis macqueenii MacQueen's Bustard27X500, 80018 K/45 K;1.09G
Pelecanus crispus Dalmatian pelican34X500, 80018 K/43 K;1.17G
Pterocles gutturalis Yellow-thoated sandgrouse25X500, 80017 K/49 K;1.07G
Acanthisitta chloris Rifleman29X500, 80018 K/64 K;1.05G
Buceros rhinoceros Rhinoceros hornbill35X500, 80014 K/51 K;1.08G
Nestor notabilis Kea32X500, 80016 K/37 K;1.14G
Haliaeetus albicilla White-tailed eagle26X500, 80020 K/56 K;1.14G
Basic statistics for the assemblies of avian species

Repeat annotation

RepeatMasker [9] and RepeatModeler [10] were used to perform repeat annotations for the bird genomes. The overall annotated content of transposable elements (TE) range from within 2-9% of all bird genomes except Woodpecker (Table  2). These TEs include long interspersed nuclear elements [LINEs], short interspersed nuclear elements [SINEs], long-terminal repeat [LTR] elements and DNA transposons). The exception Woodpecker genome has a TE content of 22%, which reflects a larger number of LINE CR1 elements (18% of the genome).
Table 2

Percentages of genome annotated as transposable elements (TEs)

SpeciesLINESINELTRDNARCUnknownTotal
Merops nubicus 5.010.071.300.140.011.267.78
Picoides pubescens 18.200.050.890.170.002.8422.15
Buceros rhinoceros 3.620.081.050.160.011.096.00
Apaloderma vittatum 5.970.121.310.230.010.828.44
Leptosomus discolor 2.930.121.320.190.011.886.45
Colius striatus 6.540.102.190.190.000.399.42
Haliaeetus albicilla 2.550.141.710.190.010.775.37
Haliaeetus leucocephalus 2.010.171.890.220.002.596.89
Cathartes aura 2.210.171.050.190.000.924.54
Tyto alba 2.640.131.790.190.010.745.49
Geospiza fortis 3.650.063.370.310.040.808.23
Taeniopygia guttata 3.790.064.110.320.021.399.68
Corvus brachyrhynchos 3.730.072.430.220.020.907.37
Manacus vitellinus 4.430.081.080.250.010.726.58
Acanthisitta chloris 6.380.101.460.210.010.568.72
Melopsittacus undulatus 6.490.081.970.200.010.459.19
Nestor notabilis 4.600.101.320.180.000.376.57
Falco peregrinus 3.090.151.270.280.000.715.50
Cariama cristata 3.510.180.910.200.000.695.49
Egretta garzetta 3.920.121.420.240.011.226.93
Pelecanus crispus 3.940.151.870.210.011.277.45
Nipponia nippon 3.690.131.220.290.010.836.16
Phalacrocorax carbo 3.950.161.290.210.000.626.23
Aptenodytes forsteri 2.410.201.170.260.001.465.50
Pygoscelis adeliae 3.310.201.320.260.000.956.04
Fulmarus glacialis 2.860.181.190.220.010.875.32
Gavia stellata 3.170.140.710.220.010.855.09
Eurypyga helias 4.610.101.600.150.000.466.92
Phaethon lepturus 3.910.121.710.220.001.487.44
Ophisthocomus hoazin 4.690.111.300.160.011.637.90
Balearica regulorum 3.350.141.510.240.010.836.08
Charadrius vociferus 4.530.131.120.200.011.057.03
Calypte anna 5.620.071.230.210.010.918.05
Chaetura pelagica 5.280.110.900.190.002.579.05
Antrostomus carolinensis 5.400.121.840.330.020.538.24
Chlamydotis macqueenii 3.970.171.400.230.000.576.35
Tauraco erythrolophus 2.760.091.800.160.013.838.64
Cuculus canorus 7.840.080.670.270.010.589.45
Mesitornis unicolor 4.620.091.380.380.011.037.51
Pterocles gutturalis 3.460.091.360.170.010.675.75
Columba livia 4.180.090.760.350.011.877.25
Phoenicopterus ruber 2.690.151.040.230.011.495.60
Podiceps cristatus 4.800.101.600.200.010.607.31
Gallus gallus 6.010.081.651.010.011.079.82
Meleagris gallopavo 5.400.051.110.820.000.527.90
Anas platyrhynchos 4.050.101.100.200.010.395.85
Struthio camelus 2.880.180.170.360.010.904.49
Tinamus guttatus 2.730.090.300.330.010.654.11
Percentages of genome annotated as transposable elements (TEs)

Protein-coding gene annotation

We used the homology-based method to annotate genes, with gene sets of chicken, zebra finch and human in Ensembl release 60 [11]. Because the quality of homology-based prediction strongly depends on the quality of the reference gene sets, we carefully chose the reference genes for the annotation pipeline. The protein sequences of these three species were compiled and used as a reference gene set template for homology-based gene predictions for the newly assembled bird genomes. We aligned protein sequences of the reference gene set to each genome by TBLASTN and used Genewise [12] to predict gene models in the genomes. A full description of the homology-based annotations is in our comparative genomics paper [1]. All the avian genomes have similar coding DNA sequence (CDS), exon, and intron lengths (Table  3).
Table 3

Statistics of protein-coding gene annotations of all the birds

SpeciesGene numberMean gene length (kb)Mean CDS length (bp)Mean exon length (bp)Mean intron length (bp)Mean intergenic length (kb)
Acanthisitta chloris 1459613.51242158.6180012
Anas platyrhynchos domestica 1652117.81317160.7229842
Antrostomus carolinensis 1467612.01177164.1174712
Apaloderma vittatum 1361513.51247160.8180612
Aptenodytes forsteri 1607020.91397161.6254656
Balearica regulorum 1417313.81276162.7182811
Buceros rhinoceros 1387313.51267160.4176711
Calypte anna 1600018.51386161.7226447
Cariama cristata 1421613.71249161.8184911
Cathartes aura 1353410.81109166.4171610
Chaetura pelagica 1537319.81411161.0236451
Charadrius vociferus 1686019.11324161.8248252
Chlamydotis macqueenii 1358212.91257162.9173410
Colius striatus 1353812.41190161.1175411
Columba livia 1665218.31363161.0227746
Corvus brachyrhynchos 1656217.91363161.1222048
Cuculus canorus 1588920.01400160.7241348
Egretta garzetta 1658518.61274160.7249652
Eurypyga helias 1397412.31193163.9176311
Falco peregrinus 1624219.91403160.7238949
Fulmarus glacialis 1430612.81230163.0176511
Gallus gallus 1651621.11433158.1243748
Gavia stellata 1345413.21250162.1177611
Geospiza fortis 1628617.91362160.1219846
Haliaeetus albicilla 1383114.21258161.1190312
Haliaeetus leucocephalus 1652619.01359160.7237036
Leptosomus discolor 1483113.91236163.2192614
Manacus vitellinus 1528518.81392159.7226246
Meleagris gallopavo 1605117.41305158.0221552
Melopsittacus undulatus 1547019.81395162.2241552
Merops nubicus 1346713.01224162.1179811
Mesitornis unicolor 1537111.41169163.6166611
Nestor notabilis 1407414.41307160.1182212
Nipponia nippon 1675619.41358161.2243451
Ophisthocomus hoazin 1570220.01336162.1258255
Pelecanus crispus 1481311.91183164.8174011
Phaethon lepturus 1497012.71220163.9178111
Phalacrocorax carbo 1347913.51258162.0181011
Phoenicopterus ruber 1402411.71179165.3171610
Picoides pubescens 1557620.01390161.7245047
Podiceps cristatus 1391310.41137165.815838
Pterocles gutturalis 1386712.81235162.5175711
Pygoscelis adeliae 1527021.31392160.3258958
Struthio camelus 1617819.51289161.0260154
Taeniopygia guttata 1747121.41383153.5249353
Tauraco erythrolophus 1543513.21200164.0189412
Tinamus guttatus 1578814.71288162.0193425
Tyto alba 1361313.81240160.8187112
Statistics of protein-coding gene annotations of all the birds

Syntenic-based orthlogous annotation

To obtain more accurate orthology annotations for phylogenetic analyses in [13], we re-annotated some genes of the Chicken and Zebra Finch based on synteny, thereby correcting errors in the annotations due to being annotated independently with different methods. We first ran bi-directional BLAST to recognize the reciprocal best hits (considered as pairwise orthologs) between our re-annotated chicken genome and each of the other genomes. Then we identified syntenic blocks by using pairwise orthologs as anchors. We only kept the pairwise orthologs with syntenic support. In addition, we also considered the genomic syntenic information inferred from the LASTZ genome alignments, and removed pairwise orthologs without genomic syntenic support. After the above filtering, all the remaining pairwise orthologs were combined into a merged list by using a chicken gene set as a reference. We also required each orthologous group to have members in at least 42 out of 48 avian species. Ultimately, we obtained a list of 8295 syntenic-based orthologs. We used the same methods to generate 12815 syntenic-based orthologs of 24 mammalian species. A full description of the synteny-based annotations is found in our phylogenomics paper [13].

Sequence alignments

Protein coding gene alignment

CDS alignments for all orthologous genes were obtained by two rounds of alignments. In order to preserve the reading frames of CDS, we aligned the amino acid sequences and then back translated them into DNA alignments. In the first round of alignment, SATé-Prank [14] was employed to obtain the initial alignments, which were used to identify the aberrant over-aligned and under-aligned sequences. The aberrant sequences were then removed, and the second round of alignment were performed by SATé-MAFFT [14] for the filtered sequences to create the final multiple sequence alignments. The default JTT model inside SATé [14] was used as we found it to fit the data best for most genes. We also used the same method to generate the alignments of mammalian orthologs. More details of the alignment are presented in Jarvis et al. [13].

Whole genome alignment

Whole genome alignments are very useful for comparative analyses, so we generated a multiple genome alignment of all 48 bird species. Firstly, pairwise alignments for each two genomes (with repeats masked) were produced by LASTZ [15], using chicken as the reference genome. Next chainNet [16] was introduced to obtain improved pairwise alignments. Finally, we used MULTIZ [17] to merge the pairwise alignments into multiple genome alignments. Approximately 400 Mb of each avian genome made it into the final alignment result. Thereafter, the alignment was filtered for over- and under-aligned errors, and for presence in 42 of 48 avian species. The resultant alignment was about 322 Mb, representing about one third of each genome, suggesting a large portion of the genome has been under strong constraints after different bird species diverged from their common ancestor. More details of the alignment are presented in Jarvis et al. [13].

dN/dS estimates

We deposit dN/dS estimates (ratio of non-synonymous versus synonymous substitution rates) of the protein coding genes from Zhang et al. [1]. The dN/dS ratios were estimated by PAML [18] program for the orthologs. Based on the CDS alignment of either protein coding data set, we used the one-ratio branch model to estimate the overall dN/dS ratios for each avian orthologous group and each mammalian orthologous group. In addition, to investigate the evolutionary rates in three major avian clades (Palaeognathae, Galloanserae and Neoaves), we used the three-ratio branch model, which estimated one identical dN/dS ratio for each clade. More details about dN/dS analyses are presented in Zhang et al. [1].

DNA sequence conservation

The overall level of conservation at the single nucleotide level could be estimated by PhastCons [19] based on multiple sequence alignments (MSA). First, the four-fold degenerate sites were extracted from 48-avian MSA and were used to estimate a neutral phylogenetic model by phyloFit [20], which is considered as the non-conserved model in PhastCons; we then ran PhastCons to estimate the conserved model. The conservation scores were predicted based on non-conserved and conserved models. We also used this method to estimate the sequence conservation for the 18-way mammalian genome alignments from the University of California at Santa Cruz (UCSC). Additional details of genome conservation are presented in the comparative genomics paper [1].

List of scripts used in avian comparative genome project

We also deposit the key scripts used in the avian comparative genome project in GigaDB [2], which include: 1) scripts for cleaning raw reads and assembling the genome using SOAPdenovo; 2) scripts for RepeatMasker and RepeatModeler repeat annotation; 3) scripts for homology-based protein-coding gene annotation and combining the gene annotation evidences into final gene sets; 4) scripts for generating whole genome alignment of multiple genomes; 5) scripts for running PAML to estimate branch model dN/dS ratios; 6) scripts for calculating conservation scores based on whole genome alignments and predicting highly conserved elements; 7) scripts for quantifying gene synteny percentages in birds and mammals; 8) scripts for identifying large segmental deletions from list of orthologous genes; 9) scripts for detecting gene loss in 48 avian genomes. We provide readme files in the script directories describing the usage of the scripts.

Availability and requirements

Download page for scripts: https://github.com/gigascience/paper-zhang2014 Operating system: Linux Programming language: Perl, R, Python Other requirements: Some pipelines need external bioinformatics software, for which we provided executable files in the directories. License: GNU General Public License version 3.0 (GPLv3) Any restrictions to use by non-academics: No

Availability of supporting data

The NCBI BioProject/SRA/Study IDs for are listed in Additional file 2. Other data files presented in this data note are available in the GigaScience repository, GigaDB [2].

Authors’ information

The full author list of Avian Genome Consortium is provided in Additional file 1. Additional file 1: Author list of the Avian Genome Consortium and contribution information of each author. (XLS 85 KB) Additional file 2: NCBI accession numbers and GigaDB DOI for each bird. (XLS 46 KB)
  16 in total

1.  GeneWise and Genomewise.

Authors:  Ewan Birney; Michele Clamp; Richard Durbin
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

2.  Aligning multiple genomic sequences with the threaded blockset aligner.

Authors:  Mathieu Blanchette; W James Kent; Cathy Riemer; Laura Elnitski; Arian F A Smit; Krishna M Roskin; Robert Baertsch; Kate Rosenbloom; Hiram Clawson; Eric D Green; David Haussler; Webb Miller
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

3.  SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees.

Authors:  Kevin Liu; Tandy J Warnow; Mark T Holder; Serita M Nelesen; Jiaye Yu; Alexandros P Stamatakis; C Randal Linder
Journal:  Syst Biol       Date:  2011-12-01       Impact factor: 15.683

4.  PAML 4: phylogenetic analysis by maximum likelihood.

Authors:  Ziheng Yang
Journal:  Mol Biol Evol       Date:  2007-05-04       Impact factor: 16.240

5.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes.

Authors:  Adam Siepel; Gill Bejerano; Jakob S Pedersen; Angie S Hinrichs; Minmei Hou; Kate Rosenbloom; Hiram Clawson; John Spieth; Ladeana W Hillier; Stephen Richards; George M Weinstock; Richard K Wilson; Richard A Gibbs; W James Kent; Webb Miller; David Haussler
Journal:  Genome Res       Date:  2005-07-15       Impact factor: 9.043

6.  Comparative genomics reveals insights into avian genome evolution and adaptation.

Authors:  Guojie Zhang; Cai Li; Qiye Li; Bo Li; Denis M Larkin; Chul Lee; Jay F Storz; Agostinho Antunes; Matthew J Greenwold; Robert W Meredith; Anders Ödeen; Jie Cui; Qi Zhou; Luohao Xu; Hailin Pan; Zongji Wang; Lijun Jin; Pei Zhang; Haofu Hu; Wei Yang; Jiang Hu; Jin Xiao; Zhikai Yang; Yang Liu; Qiaolin Xie; Hao Yu; Jinmin Lian; Ping Wen; Fang Zhang; Hui Li; Yongli Zeng; Zijun Xiong; Shiping Liu; Long Zhou; Zhiyong Huang; Na An; Jie Wang; Qiumei Zheng; Yingqi Xiong; Guangbiao Wang; Bo Wang; Jingjing Wang; Yu Fan; Rute R da Fonseca; Alonzo Alfaro-Núñez; Mikkel Schubert; Ludovic Orlando; Tobias Mourier; Jason T Howard; Ganeshkumar Ganapathy; Andreas Pfenning; Osceola Whitney; Miriam V Rivas; Erina Hara; Julia Smith; Marta Farré; Jitendra Narayan; Gancho Slavov; Michael N Romanov; Rui Borges; João Paulo Machado; Imran Khan; Mark S Springer; John Gatesy; Federico G Hoffmann; Juan C Opazo; Olle Håstad; Roger H Sawyer; Heebal Kim; Kyu-Won Kim; Hyeon Jeong Kim; Seoae Cho; Ning Li; Yinhua Huang; Michael W Bruford; Xiangjiang Zhan; Andrew Dixon; Mads F Bertelsen; Elizabeth Derryberry; Wesley Warren; Richard K Wilson; Shengbin Li; David A Ray; Richard E Green; Stephen J O'Brien; Darren Griffin; Warren E Johnson; David Haussler; Oliver A Ryder; Eske Willerslev; Gary R Graves; Per Alström; Jon Fjeldså; David P Mindell; Scott V Edwards; Edward L Braun; Carsten Rahbek; David W Burt; Peter Houde; Yong Zhang; Huanming Yang; Jian Wang; Erich D Jarvis; M Thomas P Gilbert; Jun Wang
Journal:  Science       Date:  2014-12-11       Impact factor: 47.728

7.  Ensembl 2012.

Authors:  Paul Flicek; M Ridwan Amode; Daniel Barrell; Kathryn Beal; Simon Brent; Denise Carvalho-Silva; Peter Clapham; Guy Coates; Susan Fairley; Stephen Fitzgerald; Laurent Gil; Leo Gordon; Maurice Hendrix; Thibaut Hourlier; Nathan Johnson; Andreas K Kähäri; Damian Keefe; Stephen Keenan; Rhoda Kinsella; Monika Komorowska; Gautier Koscielny; Eugene Kulesha; Pontus Larsson; Ian Longden; William McLaren; Matthieu Muffato; Bert Overduin; Miguel Pignatelli; Bethan Pritchard; Harpreet Singh Riat; Graham R S Ritchie; Magali Ruffier; Michael Schuster; Daniel Sobral; Y Amy Tang; Kieron Taylor; Stephen Trevanion; Jana Vandrovcova; Simon White; Mark Wilson; Steven P Wilder; Bronwen L Aken; Ewan Birney; Fiona Cunningham; Ian Dunham; Richard Durbin; Xosé M Fernández-Suarez; Jennifer Harrow; Javier Herrero; Tim J P Hubbard; Anne Parker; Glenn Proctor; Giulietta Spudich; Jan Vogel; Andy Yates; Amonida Zadissa; Stephen M J Searle
Journal:  Nucleic Acids Res       Date:  2011-11-15       Impact factor: 16.971

8.  Two Antarctic penguin genomes reveal insights into their evolutionary history and molecular changes related to the Antarctic environment.

Authors:  Cai Li; Yong Zhang; Jianwen Li; Lesheng Kong; Haofu Hu; Hailin Pan; Luohao Xu; Yuan Deng; Qiye Li; Lijun Jin; Hao Yu; Yan Chen; Binghang Liu; Linfeng Yang; Shiping Liu; Yan Zhang; Yongshan Lang; Jinquan Xia; Weiming He; Qiong Shi; Sankar Subramanian; Craig D Millar; Stephen Meader; Chris M Rands; Matthew K Fujita; Matthew J Greenwold; Todd A Castoe; David D Pollock; Wanjun Gu; Kiwoong Nam; Hans Ellegren; Simon Yw Ho; David W Burt; Chris P Ponting; Erich D Jarvis; M Thomas P Gilbert; Huanming Yang; Jian Wang; David M Lambert; Jun Wang; Guojie Zhang
Journal:  Gigascience       Date:  2014-12-12       Impact factor: 6.524

9.  The duck genome and transcriptome provide insight into an avian influenza virus reservoir species.

Authors:  Yinhua Huang; Yingrui Li; David W Burt; Hualan Chen; Yong Zhang; Wubin Qian; Heebal Kim; Shangquan Gan; Yiqiang Zhao; Jianwen Li; Kang Yi; Huapeng Feng; Pengyang Zhu; Bo Li; Qiuyue Liu; Suan Fairley; Katharine E Magor; Zhenlin Du; Xiaoxiang Hu; Laurie Goodman; Hakim Tafer; Alain Vignal; Taeheon Lee; Kyu-Won Kim; Zheya Sheng; Yang An; Steve Searle; Javier Herrero; Martien A M Groenen; Richard P M A Crooijmans; Thomas Faraut; Qingle Cai; Robert G Webster; Jerry R Aldridge; Wesley C Warren; Sebastian Bartschat; Stephanie Kehr; Manja Marz; Peter F Stadler; Jacqueline Smith; Robert H S Kraus; Yaofeng Zhao; Liming Ren; Jing Fei; Mireille Morisson; Pete Kaiser; Darren K Griffin; Man Rao; Frederique Pitel; Jun Wang; Ning Li
Journal:  Nat Genet       Date:  2013-06-09       Impact factor: 38.330

10.  High-coverage sequencing and annotated assemblies of the budgerigar genome.

Authors:  Ganeshkumar Ganapathy; Jason T Howard; James M Ward; Jianwen Li; Bo Li; Yingrui Li; Yingqi Xiong; Yong Zhang; Shiguo Zhou; David C Schwartz; Michael Schatz; Robert Aboukhalil; Olivier Fedrigo; Lisa Bukovnik; Ty Wang; Greg Wray; Isabelle Rasolonjatovo; Roger Winer; James R Knight; Sergey Koren; Wesley C Warren; Guojie Zhang; Adam M Phillippy; Erich D Jarvis
Journal:  Gigascience       Date:  2014-07-08       Impact factor: 6.524

View more
  50 in total

1.  Whole-Genome Analyses Resolve the Phylogeny of Flightless Birds (Palaeognathae) in the Presence of an Empirical Anomaly Zone.

Authors:  Alison Cloutier; Timothy B Sackton; Phil Grayson; Michele Clamp; Allan J Baker; Scott V Edwards
Journal:  Syst Biol       Date:  2019-11-01       Impact factor: 15.683

2.  Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome.

Authors:  Derek M Bickhart; Benjamin D Rosen; Sergey Koren; Brian L Sayre; Alex R Hastie; Saki Chan; Joyce Lee; Ernest T Lam; Ivan Liachko; Shawn T Sullivan; Joshua N Burton; Heather J Huson; John C Nystrom; Christy M Kelley; Jana L Hutchison; Yang Zhou; Jiajie Sun; Alessandra Crisà; F Abel Ponce de León; John C Schwartz; John A Hammond; Geoffrey C Waldbieser; Steven G Schroeder; George E Liu; Maitreya J Dunham; Jay Shendure; Tad S Sonstegard; Adam M Phillippy; Curtis P Van Tassell; Timothy P L Smith
Journal:  Nat Genet       Date:  2017-03-06       Impact factor: 38.330

Review 3.  Dissecting evolution and disease using comparative vertebrate genomics.

Authors:  Jennifer R S Meadows; Kerstin Lindblad-Toh
Journal:  Nat Rev Genet       Date:  2017-07-24       Impact factor: 53.242

4.  Comparative genomics reveals insights into avian genome evolution and adaptation.

Authors:  Guojie Zhang; Cai Li; Qiye Li; Bo Li; Denis M Larkin; Chul Lee; Jay F Storz; Agostinho Antunes; Matthew J Greenwold; Robert W Meredith; Anders Ödeen; Jie Cui; Qi Zhou; Luohao Xu; Hailin Pan; Zongji Wang; Lijun Jin; Pei Zhang; Haofu Hu; Wei Yang; Jiang Hu; Jin Xiao; Zhikai Yang; Yang Liu; Qiaolin Xie; Hao Yu; Jinmin Lian; Ping Wen; Fang Zhang; Hui Li; Yongli Zeng; Zijun Xiong; Shiping Liu; Long Zhou; Zhiyong Huang; Na An; Jie Wang; Qiumei Zheng; Yingqi Xiong; Guangbiao Wang; Bo Wang; Jingjing Wang; Yu Fan; Rute R da Fonseca; Alonzo Alfaro-Núñez; Mikkel Schubert; Ludovic Orlando; Tobias Mourier; Jason T Howard; Ganeshkumar Ganapathy; Andreas Pfenning; Osceola Whitney; Miriam V Rivas; Erina Hara; Julia Smith; Marta Farré; Jitendra Narayan; Gancho Slavov; Michael N Romanov; Rui Borges; João Paulo Machado; Imran Khan; Mark S Springer; John Gatesy; Federico G Hoffmann; Juan C Opazo; Olle Håstad; Roger H Sawyer; Heebal Kim; Kyu-Won Kim; Hyeon Jeong Kim; Seoae Cho; Ning Li; Yinhua Huang; Michael W Bruford; Xiangjiang Zhan; Andrew Dixon; Mads F Bertelsen; Elizabeth Derryberry; Wesley Warren; Richard K Wilson; Shengbin Li; David A Ray; Richard E Green; Stephen J O'Brien; Darren Griffin; Warren E Johnson; David Haussler; Oliver A Ryder; Eske Willerslev; Gary R Graves; Per Alström; Jon Fjeldså; David P Mindell; Scott V Edwards; Edward L Braun; Carsten Rahbek; David W Burt; Peter Houde; Yong Zhang; Huanming Yang; Jian Wang; Erich D Jarvis; M Thomas P Gilbert; Jun Wang
Journal:  Science       Date:  2014-12-11       Impact factor: 47.728

Review 5.  Divergence and gene flow among Darwin's finches: A genome-wide view of adaptive radiation driven by interspecies allele sharing.

Authors:  Daniela H Palmer; Marcus R Kronforst
Journal:  Bioessays       Date:  2015-07-22       Impact factor: 4.345

6.  Linked-Read Sequencing of Eight Falcons Reveals a Unique Genomic Architecture in Flux.

Authors:  Justin J S Wilcox; Barbara Arca-Ruibal; Jaime Samour; Victor Mateuta; Youssef Idaghdour; Stéphane Boissinot
Journal:  Genome Biol Evol       Date:  2022-06-14       Impact factor: 4.065

Review 7.  Epithelial sodium channel (ENaC) family: Phylogeny, structure-function, tissue distribution, and associated inherited diseases.

Authors:  Israel Hanukoglu; Aaron Hanukoglu
Journal:  Gene       Date:  2016-01-07       Impact factor: 3.688

8.  G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences.

Authors:  Vasileios Panagiotis E Lenis; Martin Swain; Denis M Larkin
Journal:  Gigascience       Date:  2018-05-01       Impact factor: 6.524

9.  Phylogenomic analyses of the genus Drosophila reveals genomic signals of climate adaptation.

Authors:  Fang Li; Rahul V Rane; Victor Luria; Zijun Xiong; Jiawei Chen; Zimai Li; Renee A Catullo; Philippa C Griffin; Michele Schiffer; Stephen Pearce; Siu Fai Lee; Kerensa McElroy; Ann Stocker; Jennifer Shirriffs; Fiona Cockerell; Chris Coppin; Carla M Sgrò; Amir Karger; John W Cain; Jessica A Weber; Gabriel Santpere; Marc W Kirschner; Ary A Hoffmann; John G Oakeshott; Guojie Zhang
Journal:  Mol Ecol Resour       Date:  2021-12-08       Impact factor: 8.678

Review 10.  Alternative Animal Models of Aging Research.

Authors:  Susanne Holtze; Ekaterina Gorshkova; Stan Braude; Alessandro Cellerino; Philip Dammann; Thomas B Hildebrandt; Andreas Hoeflich; Steve Hoffmann; Philipp Koch; Eva Terzibasi Tozzini; Maxim Skulachev; Vladimir P Skulachev; Arne Sahm
Journal:  Front Mol Biosci       Date:  2021-05-17
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.