| Literature DB >> 25632945 |
Philippe Rocca-Serra1, Ramona Walls2, Jacob Parnell2, Rachel Gallery2, Jie Zheng2, Susanna-Assunta Sansone2, Alejandra Gonzalez-Beltran2.
Abstract
The advent of affordable sequencing technology provides for a new generation of explorers who probe the world's microbial diversity. Projects such as Tara Oceans, Moorea Biocode Project and Gut Microbiome rely on sequencing technologies to probe community diversity. Either targeted gene surveys (also known as community surveys) or complete metagenomes are evaluated. The former, being the less costly of the two methods, relies on the identification of specific genomic regions, which can be used as a proxy to estimate genetic distance between related species in a Phylum. For instance, 16 S ribosomal RNA gene surveys are used to probe bacterial communities while internal transcribed spacer surveys, for example, can be used for probing fungal communities. With the explosion of projects and frenzy to explore new domains of life, scientists in the field have issued guidelines to report minimal information (following a checklist), ensuring that information is contextualized in a meaningful way. Yet the semantics of a checklist are not explicit. We demonstrate here how a tabular template can be used to collect information on microbial diversity using an explicit representation in the Resource Description Framework that is consistent with community agreed-upon knowledge representation patterns found in the Ontology for Biomedical Investigations.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25632945 PMCID: PMC4309925 DOI: 10.1093/database/bau132
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.A screenshot of ISAcreator showing the transposed assay table indicating the overall sample processing workflow and relevant MiXs labels
Figure 2.A graphical overview representing the various steps of an environmental gene survey, from the sampling step down to the data analysis
Current term requests logged in the respective term tracker (OBI terms are available since OBI release-2014-12-03, subversion revision 3955).
| Class label | Class definition | Target ontology |
|---|---|---|
| targeted gene survey | is an assay which aims to provide information about taxonomic information and community diversity by mean of sequencing specimen genomic regions used as marker of identity or diversity | OBI |
| multiplexing | a planned process which consists in running a set of samples as a pool in one single instrument run of a process while retaining the ability to associate individual results to each of the samples | OBI |
| library sequence deconvolution | is a data transformation which uses sequence alignment and ‘multiplex identifier sequence’ information to pull together all reads belonging to a given single sample following the sequencing of a multiplexed library which combining several samples in one sequencing event | OBI |
| PCR program | is a plan specification which is executed during a PCR by a thermal cycler instrument that will iterate through the changes in temperature and duration of each of the annealing, denaturation, elongation steps | OBI |
| multiplex identifier sequence | is a nucleic acid sequence which is used in a ligation step of library preparation process to allow pooling of samples while maintaining ability to identify individual source material | OBI |
| OTU matrix | OTU matrix is a data item, organized as a table, where organismal taxonomic units, computed by sequence analysis and genetic distance calculation, are counted in a set of biological or environmental samples. The table is used to appraise biodiversity of a population or community of living organism | PCO |
| target gene | is a data item about a coding genomic region which is the focus of a planned process such as an assay. | OBI |
| target subfragment | is a data item about a genomic region which is the focus of a planned process such as an assay. | OBI |