| Literature DB >> 23671698 |
Juan Ning1, Minxiao Wang, Chaolun Li, Song Sun.
Abstract
BACKGROUND: Despite their species abundance and primary economic importance, genomic information about copepods is still limited. In particular, genomic resources are lacking for the copepod Calanus sinicus, which is a dominant species in the coastal waters of East Asia. In this study, we performed de novo transcriptome sequencing to produce a large number of expressed sequence tags for the copepod C. sinicus.Entities:
Mesh:
Year: 2013 PMID: 23671698 PMCID: PMC3646036 DOI: 10.1371/journal.pone.0063741
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of the sequencing and assembly of the C. sinicus transcriptome.
| Raw reads (bade pairs) | 1,470,799(523,341,132) |
| Clean reads | 1,368,381 |
| Isogroup | 19,149 |
| Isotigs | 31,591 |
| Isotig N50 | 873 |
| Mean # isotigs per isogroup | 1.6 |
| Contigs | 56,809 |
| Mean # contigs per isotig | 2.7 |
| Singletons | 244,869 |
Figure 1Overview of C. sinicus transcriptome sequencing and assembly.
(A) Size distribution of 454 sequencing after removal of adapter and short reads (<40 bases). (B) Sequence coverage. (C) Size distribution of isotigs.
Figure 2GO term distribution of BLAST hits from the C. sinicus transcriptome.
Selected GO categories are shown within the top-level divisions of Biological Process, Molecular Function, and Cellular Component. The relative percentages of genes falling into GO categories are comparable between our C. sinicus transcriptome (white) and the D. pulex genome (black).
KEGG biochemical mappings for C. sinicus.
| KEGG pathway | Number of genes | Number of sequences (Isogroup + singleton) |
|
|
|
|
| Carbohydrate Metabolism | 182 | 570 |
| Energy Metabolism | 142 | 435 |
| Lipid Metabolism | 131 | 324 |
| Nucleotide Metabolism | 82 | 302 |
| Amino Acid Metabolism | 154 | 389 |
| Metabolism of Other Amino Acids | 58 | 135 |
| Glycan Biosynthesis and Metabolism | 95 | 201 |
| Metabolism of Cofactors and Vitamins | 80 | 163 |
| Metabolism of Terpenoids and Polyketides | 28 | 56 |
| Biosynthesis of Other Secondary Metabolites | 27 | 81 |
| Xenobiotics Biodegradation and Metabolism | 55 | 210 |
|
|
|
|
| Transcription | 93 | 142 |
| Translation | 199 | 409 |
| Folding, Sorting and Degradation | 183 | 299 |
| Replication and Repair | 52 | 144 |
|
|
|
|
| Membrane Transport | 24 | 39 |
| Signal Transduction | 178 | 482 |
| Signaling Molecules and Interaction | 62 | 91 |
|
|
|
|
| Transport and Catabolism | 167 | 348 |
| Cell Motility | 47 | 84 |
| Cell Growth and Death | 81 | 211 |
| Cell Communication | 77 | 148 |
|
|
|
|
| Immune System | 102 | 339 |
| Endocrine System | 99 | 254 |
| Circulatory System | 41 | 91 |
| Digestive System | 92 | 234 |
| Excretory System | 50 | 81 |
| Nervous System | 114 | 346 |
| Sensory System | 15 | 31 |
| Development | 51 | 94 |
| Environmental Adaptation | 17 | 29 |
|
|
|
|
Selected development process genes identified in the C. sinicus transcriptome.
| Process | NumIsotig | Length(range) |
|
| ||
| ATP-binding cassette | 7 | 415–1579 |
| prosaposin | 1 | 465 |
| saposin | 10 | 388–1856 |
| vitellogenin receptor | 2 | 488–2772 |
| histone-lysine N-methyltransferase | 7 | 588–1583 |
| serine/threonine-protein kinase PAK 2 | 16 | 357–2002 |
| pre-mRNA-processing factor | 3 | 743–1029 |
| calsyntenin | 1 | 1206 |
| 60S ribosomal protein L12 | 1 | 494 |
| DNA methyltransferase 1-associated protein 1 | 1 | 602 |
| chromodomain-helicase-DNA-binding protein | 1 | 895 |
| transcription factor family member | 92 | 403–1285 |
| insulin receptor | 1 | 729 |
| DNA-binding protein A | 1 | 895 |
| elongation factor | 33 | 356–1885 |
| actin | 196 | 331–1342 |
| ubiquitin-conjugating enzyme E2 E3 | 14 | 378–1810 |
| adenosylhomocysteinase | 6 | 200–1665 |
| menin | 2 | 1084–2603 |
| histone acetyltransferase | 3 | 663–1486 |
|
| ||
| CYP315A1; ecdysteroid 2-hydroxylase | 1 | 1390 |
| CYP306A1 | 3 | 485–507 |
| CYP302A1 | 5 | 406–490 |
| CYP305A1 | 2 | 138–151 |
| ecdysone-induced protein 75(E75) | 1 | 747 |
| ecdysone receptor | 4 | 305–2056 |
| ecdysoneless | 1 | 2056 |
| CYP15A1 | 10 | 390–1610 |
| farnesoic acid O-methyltransferase(FAMeT) | 6 | 451–1175 |
| juvenile hormone-inducible protein | 2 | 492–1349 |
| juvenile hormone esterase | 6 | 652–1924 |
| juvenile hormone epoxide hydrolase | 8 | 414–513 |
| ferritin | 56 | 247–773 |
| elongation of very long chain fatty acids protein(ELOV) | 10 | 365–1208 |
| fatty acid binding protein(FABP) | 17 | 467–747 |
| short-chain dehydrogenase/reductase (SDR) | 6 | 449–961 |
Expression levels of genes that may be invovled with the diapause regulation of C. sinicus.
| Gene | Gene length/bp | RPKM | p-value | Regulation | |
| Copepodite | Adult | ||||
| Cuticle protein | 1568 | 42 | 0 | 8.10E–11 | up |
| Ferritin | 754 | 70 | 505 | 7.35E–38 | down |
| Heat shock protein | 1003 | 15 | 63 | 1.54E–05 | down |
| ELOV | 1006 | 301 | 6 | 1.38E–44 | up |
| Cytochrome P450 | 1763 | 25 | 7 | 0.00399 | up/down |
| Juvenile hormone esterase | 1924 | 0 | 31 | 6.64E–10 | up/down |
| FAMeT | 1175 | 47 | 77 | 0.02038 | up/down |
| Ecdysteroid receptor | 2056 | 13 | 2 | 0.00343 | up/down |
| Retinoid X receptor | 747 | 25 | 27 | 0.85347 | up/down |
up/down: Both up and down regulated sequences were found in C. sinicus copepodite or adults. ELOV: elongation of very long chain fatty acids protein. FAMeT: farnesoic acid O-methyltransferase. Gene expression level is calculated by using reads per kilobase of the transcript per million mapped reads (RPKM).
Oligonucleotide primer sequences for real time PCR.
| Gene | Primer sequence | Product size(bp) |
| ELOV | F: 5′ | 149 |
| R: 5′ | ||
| Actin | F: 5′ | 109 |
| R: 5′ | ||
| Cuticle protein 7 | F: 5′ | 118 |
| R: 5′ | ||
| Cuticle protein 19 | F: 5′ | 63 |
| R: 5′ | ||
| Ferritin | F: 5′ | 104 |
| R: 5′ | ||
| HSP 70 | F: 5′ | 116 |
| R: 5′ | ||
| HSP 90 | F: 5′ | 78 |
| R: 5′ |
Figure 3Real-time PCR validation of differentially expressed genes that may be involved in diapause.
The values were normalized against actin. Significant levels were *P<0.05, **P<0.01, and values indicate the mean ± S.E.
Figure 4Classification of single nucleotide polymorphisms (SNPs) indentified in the sinicus transcriptome.
The overall frequency of these SNPs is one per 179 bp.