| Literature DB >> 25745418 |
Claire Bertelli1, Sébastien Aeby2, Bérénice Chassot3, James Clulow3, Olivier Hilfiker3, Samuel Rappo3, Sébastien Ritzmann3, Paolo Schumacher3, Céline Terrettaz3, Paola Benaglio4, Laurent Falquet5, Laurent Farinelli6, Walid H Gharib7, Alexander Goesmann8, Keith Harshman9, Burkhard Linke8, Ryo Miyazaki10, Carlo Rivolta4, Marc Robinson-Rechavi7, Jan Roelof van der Meer10, Gilbert Greub2.
Abstract
With the widespread availability of high-throughput sequencing technologies, sequencing projects have become pervasive in the molecular life sciences. The huge bulk of data generated daily must be analyzed further by biologists with skills in bioinformatics and by "embedded bioinformaticians," i.e., bioinformaticians integrated in wet lab research groups. Thus, students interested in molecular life sciences must be trained in the main steps of genomics: sequencing, assembly, annotation and analysis. To reach that goal, a practical course has been set up for master students at the University of Lausanne: the "Sequence a genome" class. At the beginning of the academic year, a few bacterial species whose genome is unknown are provided to the students, who sequence and assemble the genome(s) and perform manual annotation. Here, we report the progress of the first class from September 2010 to June 2011 and the results obtained by seven master students who specifically assembled and annotated the genome of Estrella lausannensis, an obligate intracellular bacterium related to Chlamydia. The draft genome of Estrella is composed of 29 scaffolds encompassing 2,819,825 bp that encode for 2233 putative proteins. Estrella also possesses a 9136 bp plasmid that encodes for 14 genes, among which we found an integrase and a toxin/antitoxin module. Like all other members of the Chlamydiales order, Estrella possesses a highly conserved type III secretion system, considered as a key virulence factor. The annotation of the Estrella genome also allowed the characterization of the metabolic abilities of this strictly intracellular bacterium. Altogether, the students provided the scientific community with the Estrella genome sequence and a preliminary understanding of the biology of this recently-discovered bacterial genus, while learning to use cutting-edge technologies for sequencing and to perform bioinformatics analyses.Entities:
Keywords: annotation; biocuration; chlamydia; genome sequencing; genomics; metabolic pathways; teaching
Year: 2015 PMID: 25745418 PMCID: PMC4333871 DOI: 10.3389/fmicb.2015.00101
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Organization of the course “Sequence a genome” in 2010–2011. Topics of corresponding lectures are detailed on the left. The main steps of genome sequencing performed by the students as well as the main scientific skills acquired by the students are indicated on the right. The central arrow indicates the time (in hours) dedicated to lectures (left) and practical work (right) during the autumn and spring semester of the year 2010–2011. Lectures were immediately followed by the practical work in a 4 h class every 2 weeks. “//” represents the semester break.
Assembly statistics of .
| CLCbio | 613 | 350 | 72 | 1044 | 11402 | 74998 | 2762344 |
| ABySS | 102 | 60 | 7 | 1063 | 137730 | 303994 | 2966869 |
| Velvet | 109 | 39 | 5 | 1052 | 234440 | 464604 | 2827833 |
| SOAPdenovo | 152 | 52 | 6 | 1017 | 139628 | 438768 | 2815006 |
| Final assembly | 29 | 29 | 4 | 1052 | 268998 | 464198 | 2819825 |
n, number of scaffolds; n:1000, number of scaffolds > 1000 bp long; n:N50, number of scaffolds > N50; min, size of the smallest contig; max, size of the largest scaffold; sum, sum of all nucleotides in the assembly without Ns.
Genomic characteristics of members of the .
| Chlamydiaceae | 60/28 | D/UW-3/CX | 1′042′519 | 41.3 | 895 | 37 | 6 | 7′493 | |
| 5/8 | Nigg | 1′072′950 | 40.3 | 903 | 37 | 6 | 7′501 | ||
| 5/1 | CWL029 | 1′230′230 | 40.6 | 1′122 | 38 | 3 | – | ||
| 4/4 | E58 | 1′106′197 | 41.1 | 988 | 38 | 3 | – | ||
| 1/− | Fe/C-56 | 1′166′239 | 39.4 | 1′005 | 38 | 3 | 7′552 | ||
| 1/− | GPIC | 1′173′390 | 39.2 | 998 | 38 | 3 | 7′966 | ||
| 1/1 | S26/3 | 1′144′377 | 39.9 | 932 | 38 | 3 | – | ||
| 16/33 | 6BC | 1′171′660 | 39.1 | 967 | 38 | 3 | 7′553 | ||
| −/1 | MD56 | 1′079′683 | 42.0 | 937 | 37 | 3 | 5′976 | ||
| −/1 | 08-1274/3 | 1′050′923 | 37.9 | 907 | 39 | 4 | – | ||
| −/1 | 10DC88 | 1′041′170 | 36.9 | 940 | 39 | 3 | 7′099 | ||
| −/1 | 10-1398/6 | 1′146′066 | 38.3 | 1′018 | 38 | 3 | – | ||
| Waddliaceae | 1/1 | WSU86-1044 | 2′116′312 | 43.8 | 1′934 | 37 | 6 | 15′593 | |
| Criblamydiaceae | −/1 | CRIB-18 | 2′969′604 | 38.2 | 2′418 | 40 | 12 | 89′525 | |
| −/1 | CRIB-30 | 2′820′195 | 48.2 | 2′213 | 40 | 9 | 9′136 | ||
| Parachlamydia ceae | −/1 | UV-7 | 3′072′383 | 39.0 | 2′789 | 40 | 10 | – | |
| 1/1 | UWE25 | 2′414′465 | 34.7 | 2′031 | 35 | 9 | – | ||
| −/3 | S13 | 3′187′070 | 38.0 | NA | NA | NA | NA | ||
| Simkaniaceae | 1/− | Z | 2′496′337 | 41.8 | 2′519 | 35 | 3 | 132′038 |
Number of rRNA was estimated as the ratio between the coverage of contigs encoding for 16S-23S-5S rRNA and the average contig coverage.
Figure 2Conservation of the type III secretion system (adapted from Bertelli et al., . Comparison of the genetic clusters encoding for T3SS genes between E. lausannensis (E. la), Parachlamydia acanthamoebae (Pa. ac), Protochlamydia amoebophila (Pr. am), Waddlia chondrophila (W. ch) and Chlamydia trachomatis (C. tr) that belong to 4 different families within the Chlamydiales order. Gray shading indicates the conservation of the genes. Gene names and ORF numbers are respectively indicated above and below each gene. Genes are colored according to their specific functions. Capital letters refer to sct gene names according to the unified nomenclature proposed by Hueck (1998). sycE and sycD are genes encoding for SycE-like and SycD/LcrH-like T3SS chaperones.
Figure 3Purine biosynthesis pathways predicted in . The pathway for the biosynthesis of IMP from ribose or amino acids appears to be absent in E. lausannensis. Green arrows indicate enzyme reactions predicted to be catalyzed by proteins encoded in the genome of E. lausannensis. Red arrows indicate reactions catalyzed by enzymes that could not be detected in the genome. Colored boxes indicate the presence of an enzyme reaction in four other members of the Chlamydiales order.
Figure 4Pyrimidine biosynthesis pathways predicted in . The pathway for the biosynthesis of UMP from PRPP and amino-acids appears to be absent in E. lausannensis. Green arrows indicate enzyme reactions predicted to be catalyzed by proteins encoded in the genome of E. lausannensis. Red arrows indicate reactions catalyzed by enzymes that could be discovered in the genome. Colored boxes indicate the presence of an enzyme catalyzing the reaction in four other members of the Chlamydiales order.
Figure 5Amino acid metabolism. E. lausannensis is able to synthesize (A) cysteine, glycine, serine, threonine, and alanine from pyruvate as well as (B) aspartate, glutamate and their amidated forms asparagine and glutamine from oxaloacetate. Green arrows indicate enzyme reactions predicted to be catalyzed by proteins encoded in the genome of E. lausannensis. Red arrows indicate reactions catalyzed by enzymes that could not be discovered in the genome. Colored boxes indicate the presence of an enzyme catalyzing the reaction in four other members of the Chlamydiales order.
Figure 6Co-factor metabolism in . Predicted intermediate metabolism for biotin (A) and menaquinone (B) biosynthesis. Green arrows indicate enzyme reactions predicted to be catalyzed by proteins encoded in the genome of E. lausannensis. Red arrows indicate reactions catalyzed by enzymes that could not be discovered in the genome. Colored boxes indicate the presence of an enzyme catalyzing the reaction in four other members of the Chlamydiales order.
Figure 7Map of the 9.1-kb . The map shows the predicted location of the 14 open reading frames on the plasmid. Hypothetical genes with no known functions are depicted in orange boxes. Blue, hypothetical genes conserved in related species. Green, the predicted RelE toxin-antitoxin module, which is also found in Protochlamydia amoebophila. Yellow, genes with clear functional assignment. Image created using GenDB 2.4 Circular Plot tool.
Comparison of genomics courses and teaching initiatives.
| Academic Institute | UCLA | JGI | University of Florida | UNIL | San Diego State University | UC Davis |
| Available in city, country | Los Angeles, USA | Web-based, international | Gainesville, USA | Lausanne, Switzerland | San Diego, USA | Davis, USA |
| Title of the course or project | The Undergraduate Genomics Research Initiative | Undergraduate Research in Microbial Genome Analysis | Bacterial genome sequencing | Sequence a genome | Ecological metagenomics | Undergraduate Research: Built Environment Genomes |
| Start year | janv.04 | janv.08 | janv.08 | sept.10 | 2010 | déc.11 |
| Web based | no | yes | no | no | no | no |
| Publication | Kerfeld and Simons, | Drew and Triplett, | Edwards et al., | |||
| Web page | ||||||
| Study design | Teachers | X | Teachers | Teachers | Teachers | Teachers |
| Bacterium choice | Teachers | X° | Teachers | Teachers | Teachers | Students |
| Bacterial culture | Teachers | X | NA | Students | Students | Students |
| DNA extraction | Students | X | Students | Students | Students | Students |
| DNA sequencing | Students | X | NA | Demo | Students | Students |
| Sequencing technology | Sanger | X | GS20 | Illumina | GS FLX | Illumina |
| Genome assembly | Students | X | Students | Students | Students | Students |
| Gap closure | Students | X | Students | Students | X | X |
| Annotation | Students | Students | Students | Students | Students | Students |
| Biocuration | Students | Students | NA | Students | NA | X |
| Paper writing | NA | X | Students | Students | Students | Students |
| Oral communication | NA | X | Students | Students | NA | NA |
X, according to available information not performed in the framework of the course; NA, information not found in publication or website information; ° provided by the GEBA project.