| Literature DB >> 25407802 |
Daniel I Speiser1,2, M Sabrina Pankey3, Alexander K Zaharoff4, Barbara A Battelle5, Heather D Bracken-Grissom6, Jesse W Breinholt7, Seth M Bybee8, Thomas W Cronin9, Anders Garm10, Annie R Lindgren11, Nipam H Patel12, Megan L Porter13, Meredith E Protas14, Ajna S Rivera15, Jeanne M Serb16, Kirk S Zigler17, Keith A Crandall18,19, Todd H Oakley20.
Abstract
BACKGROUND: Tools for high throughput sequencing and de novo assembly make the analysis of transcriptomes (i.e. the suite of genes expressed in a tissue) feasible for almost any organism. Yet a challenge for biologists is that it can be difficult to assign identities to gene sequences, especially from non-model organisms. Phylogenetic analyses are one useful method for assigning identities to these sequences, but such methods tend to be time-consuming because of the need to re-calculate trees for every gene of interest and each time a new data set is analyzed. In response, we employed existing tools for phylogenetic analysis to produce a computationally efficient, tree-based approach for annotating transcriptomes or new genomes that we term Phylogenetically-Informed Annotation (PIA), which places uncharacterized genes into pre-calculated phylogenies of gene families.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25407802 PMCID: PMC4255452 DOI: 10.1186/s12859-014-0350-x
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Collection data for the samples from which we generated 28 new transcriptomes
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| 1 |
| Cephalopod (squid) | Adult eye tissue | ROV (MBARI) | CA, USA | 36°69'N | 122°05'W | 446 |
| 2 |
| Cephalopod (squid) | Adult eye tissue | Hand net | Honolulu, HI, USA | 21°27'N | 157°77'W | 0.5 |
| 3 |
| Cephalopod (squid) | Adult eye tissue | ROV (MBARI) | CA, USA | 36°69'N | 122°05'W | 556 |
| 4 |
| Cephalopod (octopus) | Adult dermal tissue | SCUBA | Santa Barbara, CA, USA | 34°43'N | 119°71'W | ? |
| 5 |
| Cephalopod (squid) | Adult eye tissue | Fishmarket | Numazu, Japan | 35°08'N | 138°86'E | ? |
| 6 |
| Cephalopod (vampire squid) | Adult eye tissue | ROV (MBARI) | CA, USA | 36°08'N | 122°30'W | 1096 |
| 7 |
| Arthropod (cave isopod) | Adult head | Hand net | Planina cave, Slovenia | 45°82'N | 14°25'E | 65 |
| 8 |
| Arthropod (surface isopod) | Embryos and hatchlings | Lab colony | Planina cave, Slovenia | 45°82'N | 14°25'E | N/A |
| 9 |
| Arthropod (hybrid isopod) | Adult head | Lab colony | Planina cave, Slovenia | 45°82'N | 14°25'E | N/A |
| 10 |
| Arthropod (surface isopod) | Adult head | Lab colony | Planina cave, Slovenia | 45°82'N | 14°25'E | N/A |
| 11 |
| Arthropod (shrimp) | Adult eye tissue | Benthic skimmer | Northern Gulf of Mexico | 28°48’N | 88°12’W | 1350 |
| 12 |
| Arthropod (cave isopod) | Adult head | Hand net | Franklin County, TN, USA | 35°15'N | 86°10'W | 0.1 |
| 13 |
| Arthropod (cave isopod) | Whole embryos | Hand net | Franklin County, TN, USA | 35°15'N | 86°10'W | 0.1 |
| 14 |
| Arthropod (surface isopod) | Adult head | Hand net | Sewanee, TN, USA | 35°22'N | 85°97'W | 0.5 |
| 15 |
| Arthropod (surface isopod) | Whole embryos | Hand net | Sewanee, TN, USA | 35°22'N | 85°97'W | 0.5 |
| 16 |
| Arthropod (ostracod) | Whole embryos | Hand net | Half Moon Bay, CA, USA | 37°29'N | 122°29'W | 1 |
| 17 |
| Arthropod (stomatopod, male) | Adult eye tissue | Dredge | Orange County, CA, USA | 33°67'N | 117°78'W | ? |
| 18 |
| Arthropod (damselfly, female) | Adult head | Hand net | Austin, TX, USA | 30°28'N | 97°78'W | N/A |
| 19 |
| Arthropod (horseshoe crab) | Adult lateral eye | Hand | Indian River near Titusville, FL, USA | 28°74'N | 80°75'W | Surface |
| 20 |
| Arthropod (horseshoe crab) | Adult median eye | Hand | Indian River near Titusville, FL, USA | 28°74'N | 80°75'W | Surface |
| 21 |
| Arthropod (horseshoe crab) | Adult ventral eye | Hand | Indian River near Titusville, FL, USA | 28°74'N | 80°75'W | Surface |
| 22 |
| Arthropod (crayfish) | Adult eye tissue | Hand net | Fisheating Creek, Glades County, FL, USA | 26°90'N | 81°24'W | Surface |
| 23 |
| Arthropod (crayfish) | Adult eye tissue | Hand net | Orange Lake Cave, Marion County, FL, USA | Contact Authors | Contact Authors | Surface |
| 24 |
| Arthropod (stomatopod) | Adult eye tissue | Dredge | Isla Magueyes, Puerto Rico | 17°97'N | 67°05'W | ? |
| 25 |
| Arthropod (shrimp) | Adult eye tissue | Benthic skimmer | Nothern Gulf of Mexico | 28°48’N | 88°12’W | 1350 |
| 26 |
| Arthropod (damselfly) | Juvenile head | Hand net | Austin, TX, USA | 30°28'N | 97°78'W | N/A |
| 27 |
| Cnidarian (cubozoan) | Adult rhopalia | Snorkel | La Parguera, Puerto Rico | 17°58'N | 67°04'W | Surface |
| 28 |
| Cnidarian (cubozoan) | Whole planula larvae | Snorkel | La Parguera, Puerto Rico | 17°58'N | 67°04'W | Surface |
Figure 1An example of results from phylogenetically-informed annotation (PIA). Here, we search a transcriptome generated for an eye from the squid Chiroteuthis calyx for relatives of the gene G protein-coupled receptor kinase 1 (Gprk1), a component of the rhabdomeric phototransduction pathway. Sequences marked with red squares and labeled "LANDMARK1" are homologs of Gprk1 that have been well-characterized functionally and are thought to share similar functions. In contrast, sequences labeled "LANDMARK2" are well-characterized genes that are more distantly related to Gprk1. Sequences marked with yellow circles and labeled "QUERY" are protein sequences predicted from our transcriptome for C. calyx. Based on their phylogenetic positions and branch lengths, it is likely that one hit (UN0029) represents an ortholog of Gprk1 and that two hits (UN1121 and UN2338) represent genes that are distant paralogs of Gprk1. We conclude that the eyes of C. calyx express an ortholog of Gprk1, a component of the rhabdomeric phototransduction pathway.
Figure 2The results of using phylogenetically-informed annotation (PIA) to search 28 new transcriptomes for light-interacting genes. Here, cells shaded in black represent orthologs of LIT genes that are present in our transcriptomes. We have named genes based on conventions for Drosophila melanogaster whenever possible. Otherwise, gene names are given for Mus musculus. Cells marked "C" are hits from our transcriptomes that may represent contamination or assembly errors. The columns represent the following transcriptomes (where A = Cephalopods; B = Arthropods; and C = Cnidarians): 1 = Chiroteuthis calyx eye; 2 = Euprymna scolopes eye; 3 = Galiteuthis armata eye; 4 = Octopus bimaculoides skin; 5 = Uroteuthis edulis eye; 6 = Vampyroteuthis infernalis eye; 7 = Asellus aquaticus cave head; 8 = Asellus aquaticus embryos and hatchlings; 9 = Asellus aquaticus hybrid head; 10 = Asellus aquaticus surface head; 11 = Benthesicymus bartletti eye; 12 = Caecidotea bicrenata adult head; 13 = Caecidotea bicrenata embryos; 14 = Caecidotea forbesi adult head; 15 = Caecidotea forbesi embryos; 16 = Euphilomedes carcharodonta embryos; 17 = Hemisquilla californiensis eyes; 18 = Ischnura ramburii head; 19 = Limulus polyphemus lateral eye; 20 = Limulus polyphemus median eye; 21 = Limulus polyphemus ventral eye; 22 = Procambarus alleni eye; 23 = Procambarus franzi eye; 24 = Pseudosquilla ciliata eye; 25 = Systellaspis debilis eye; 26 = Telebasis salva head; 27 = Tripedalia cystophora eyes; 28 = Tripedalia cystophora planula larvae.