| Literature DB >> 22776250 |
Mark P Peterson1, Danielle J Whittaker, Shruthi Ambreth, Suhas Sureshchandra, Aaron Buechlein, Ram Podicheti, Jeong-Hyeon Choi, Zhao Lai, Keithanne Mockatis, John Colbourne, Haixu Tang, Ellen D Ketterson.
Abstract
BACKGROUND: Though genomic-level data are becoming widely available, many of the metazoan species sequenced are laboratory systems whose natural history is not well documented. In contrast, the wide array of species with very well-characterized natural history have, until recently, lacked genomics tools. It is now possible to address significant evolutionary genomics questions by applying high-throughput sequencing to discover the majority of genes for ecologically tractable species, and by subsequently developing microarray platforms from which to investigate gene regulatory networks that function in natural systems. We used GS-FLX Titanium Sequencing (Roche/454-Sequencing) of two normalized libraries of pooled RNA samples to characterize a transcriptome of the dark-eyed junco (Junco hyemalis), a North American sparrow that is a classically studied species in the fields of photoperiodism, speciation, and hormone-mediated behavior.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22776250 PMCID: PMC3476391 DOI: 10.1186/1471-2164-13-305
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Sequencing and assembly statistics
| Reads | 1.17 million | 434 million | 372 |
| Contigs | 40,564 | 35.8 million | 884 |
| Singletons | 166,177 | 57.1 million | 344 |
| Isotigs | 31,739 | NA | 1,248 |
| Isogroups | 22,765 | NA | NA |
Sequencing reads were assembled into contigs which were further grouped into isotigs and isogroups (see text for details).
Figure 1Distribution of isotig lengths. The frequency of isotigs of various lengths. Note that isotigs longer than 6,000 bases are omitted from the figure for clarity. The frequency continues to decline for greater lengths.
Number of isogroups and singletons annotated by different approaches, confirmed expressed, and containing SNPs
| Total Assembly | 22,765 | 166,177 |
| NCBI nr-protein | 10,276 | 15,871 |
| Ensembl - ZF | 9,863 | 14,019 |
| Gene Ontology | 9,120 | 737 |
| KEGG | 3,827 | 3,984 |
| OrthoDB | 10,062 | 14,019 |
| Expressed | 16,781 | 16,096 |
| SNP present | 6,992 | NA |
Sequences were annotated by sequence similarity to multiple databases, yielding the above number of annotations.
Additional annotation approach details
| Functional Annotation | NCBI- NR | Blast2GO | 10-15 | [ | |
| Pathway Annotation | KEGG | tBlastX | 10-5 | [ | Limited to zebra finch, chicken, and mouse. |
| Orthologous Group | OrthoDB, Ensembl | BlastX | 10-10 | [ | Limited to zebra finch. Matched Ensembl hits to OrthoDB assignments |
To obtain more detailed information about the function of the sequenced genes, searches beyond simple non-redundant proteins were conducted.