| Literature DB >> 22577894 |
Daniel Garcia de la Serrana1, Alicia Estévez, Karl Andree, Ian A Johnston.
Abstract
BACKGROUND: The gilthead sea bream (Sparus aurata L.) occurs around the Mediterranean and along Eastern Atlantic coasts from Great Britain to Senegal. It is tolerant of a wide range of temperatures and salinities and is often found in brackish coastal lagoons and estuarine areas, particularly early in its life cycle. Gilthead sea bream are extensively cultivated in the Mediterranean with an annual production of 125,000 metric tonnes. Here we present a de novo assembly of the fast skeletal muscle transcriptome of gilthead sea bream using 454 reads and identify gene paralogues, splice variants and microsatellite repeats. An annotated transcriptome of the skeletal muscle will facilitate understanding of the genetic and molecular basis of traits linked to production in this economically important species.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22577894 PMCID: PMC3418159 DOI: 10.1186/1471-2164-13-181
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Number of reads obtained per experimental condition and their respective Newbler assembly results
| 351895 | 486993 | 439734 | 459853 | 433264 | 2711149 | ||
| | 166460 | 239792 | 189314 | 258331 | 200313 | 1157833 | |
| | 19809 | 24231 | 19140 | 21125 | 24896 | 96351 | |
| | 6502 | 9152 | 8407 | 9922 | 9524 | 50515 | |
| | 6254 | 8595 | 8242 | 9554 | 9267 | 43461 | |
| | 875 | 589 | 936 | 551 | 550 | 454 | |
| | 1092 | 732 | 1269 | 672 | 642 | 679 | |
| 2634 | 2791 | 3031 | 3211 | 3029 | 10465 |
Singletons: reads not contained in the final assembly.
Isotig: contigs consistently connected by a set of reads.
N50: The value was computed by sorting all contigs from largest to smallest and by determining the minimum set of contigs whose sizes total 50% of the entire transcriptome.
Figure 1(A) Barr chart summarizing the percentage of isotigs showing any change in their sequence compared with 80 NCBI sequences (B) Distribution of the differences between NCBI and transcriptome sequences in categories of mismatch, insertion and deletion. Classification of the discrepancies between sequences was carried out by analysis of the ClustalW alignment.
Figure 2Myofibrillar genes represented in the transcriptome mapped onto a reconstruction of a half sarcomere based on published models for filaments and M-line[25]and z-disc structure[26]. Numbers on the right side of the gene name represents isotig length (bp), isotig mean coverage and percentage of identity with the zebrafish orthologue.
Transcripts with functional domains deleted that were experimentally confirmed by PCR
| Aspartate beta hydroxylase | 37 | 4 | ENSTNIT000000003370 | 3 | 2 | IPR018939 | Authophagic related protein 27 (ATG27) |
| Coagulation factor x | 75 | 4 | ENSGACT00000011445 | 13 | 9 and 10 | IPR000294 | GLA domain |
| | | | | | | IPR000742 | EGF3 domain |
| Nucleotide binding | 88 | 4 | ENSGACT00000002245 | 9 | 1 and 2 | IPR000808 | Mrp site |
| | | | | | | IPR019591 | ATPase like ParA |
| Bridging integrator 1 | 100 | 50 | ENSGACT00000020571 | 18 | 1 to 10 | IPR004148 | PAR domain |
| | | | | | | IPR003005 | Amphyphysin |
| Paraxonase 2 | 87 | 5 | ENSTNIG00000018456 | 14 | 12 | No-IPR | Signal Peptide |
| | | | | | | IPR013838 | autoregulation binding site |
| Cathepsin H | 75 | 4 | ENSTNIG00000022446 | 7 | 1 | IPR013201 | Proteinase inhibitor, cathepsin propeptide |
| Polyadenylate-binding protein- interacting protein 2 | 100 | 4 | ENSTNIG00000005741 | 8 | 6 | IPR009818 | Ataxin 2 |
| Transitional endoplasmic reticulum atpase (cdc48) | 100 | 40 | ENSGACG00000018832 | 9 | 8 | IPR003338 | AAA + Atpase domain |
| | | | | | | IPR009010 | Aspartate descarboxylase fold |
| S-adenosylmethionine decarboxylase | 50 | 5 | ENSTNIG00000002751 | 6 | 5 | IPR018166 | adenosylmethionine descarboxylase |
| O-sialoglycoprotein endopeptidase | 62 | 4 | ENSGACG00000019496 | 6 | 5 | IPR017860 | Peptidase M22, Glycopeptidase |
| Zinc finger x-chromosomal protein | 30 | 4 | ENSGACG00000020679 | 2 | 1 | IPR007087 | Zinc finger C2H2 |
| Dead (asp-glu-ala-asp) box polypeptide 1 | 57 | 6 | ENSGACG00000011162 | 13 | 8 | IPR000504 | RNA recognition motif domain |
List of paralogues identified in the gilthead sea bream skeletal muscle transcriptome
| Acethylcoline receptor subunit alpha 1 | Ion-conducting channel | 50/50 | 8/12 | 85 | Alpha 1.a/1.b |
| Adp/atp translocase (Solute carrier family 25, SLC25) | Catalyzes the exchange of ADP and ATP across the mitochondrial inner membrane | 100/100 | 43/87 | 92 | SLC25 member 5 and 6 |
| Calpain small subunit 1 | Calcium-regulated thiol-protease involved in cytoskeletal remodeling | 100/100 | 10/11 | 79 | Calpain subunit 1a/b |
| Carnitine O- acetyltransferase | Carnitine acetylase is specific for short chain fatty acids | 100/67 | 23/7 | 67 | Carnitine O-acetyltransferase a1/a2 |
| Dehydrogenase reductase member 7c | Putative oxidoreductase | 100/99 | 59/32 | 60 | DHR7SC-A/DHR7SC-B |
| Dysferlin interacting protein 1 | Sarcolemma repair mechanism of both skeletal muscle and cardiomyocytes | 90/90 | 39/11 | 71 | Dysferlin1a/b |
| Epithelial membranse protein 3 | Probably involved in cell proliferation and cell-cell interactions | 100/100 | 4/14 | 68 | EMP3a/b |
| Glioblastoma amplified sequence | Widely expressed. Most abundant in heart and skeletal muscle | 100/100 | 39/10 | 80 | Nipsnap2a/b |
| High mobility group box 1 | DNA binding proteins that associates with chromatin | 88/60/87 | 439/7/9 | 60 | HMG1a/b HMG2 |
| Microglobulin beta-2 | Component of the class I major histocompatibility complex (MHC) | 100/100 | 57/90 | 60 | B2ma/b |
| Myomesin 185 kDa | Major component of the vertebrate myofibrillar M band | 98/90 | 170/54 | 65 | Myomesin1a/b |
| Serine threonine-protein phosphatase | Essential for cell division, and participates in muscle contractility and protein synthesis | 50/50 | 3/4 | 89 | Subunit alpha/gamma |
| Solute carrier family 38 member 5 | Sodium-dependent, pyrimidine- and purine-selective. Involved in the homeostasis of endogenous nucleosides | 100/90 | 14/5 | 73 | Member 5a/b |
| Tyrosine 3 monooxigenase | Enzyme function | 90/100 | 5/5 | 74 | Ywhab/ywhag2 |
| Set and mynd domain- containing protein 1 | Acts as a transcriptional repressor. Essential for cardiomyocyte differentiation | 95/97 | 154/127 | 76 | Smyd1a/b |
| Dual specificity phosphatase and pro Isomerase domain containing 1 | Catalyse reaction: Protein tyrosine phosphate + H2O = protein tyrosine + phosphate. | 95/100 | 7/7 | 51 | DUPD1a/b |
| Metalloproteinase inhibitor 2 precursor | Complexes with metalloproteinases and irreversibly inactivates them | 80/60 | 4/4 | 62 | TIMP2b/a |
| Retinoid x gamma | Rreceptor for retinoic acid | 100/100 | 21/10 | 73 | Gamma/beta |
| Junctophilin 1 | Contributes to the stabilization of the junctional membrane complexes | 85/87 | 7/10 | 65 | Junctophilin 1a/b |
| 60s ribosomal protein l5 | Required for rRNA maturation and formation of the 60 S ribosomal subunits | 100/100 | 58/186 | 91 | Rpl5a/b |
| Trans-2,3-enoyl-CoA reductase | Reduces trans-2,3-stearoyl-CoA to stearoyl-CoA of long and very long chain fatty acids | 98/98 | 11/61 | 75 | Trec.a/b |
| Methylmalonate- semialdehyde Dehydrogenase | Plays a role in valine and pyrimidine metabolism. Binds fatty acyl-CoA | 100/100 | 26/12 | 86 | Aldha1.a/a1.b |
| Eukaryotic translation initiation factor 4e type 3 | Its translation stimulation activity is repressed by binding to the complex CYFIP1-FMR1 | 100/100 | 28/15 | 73 | EIF4E3a/b |
| Fk506-binding protein 1a | May play a role in modulation of ryanodine receptor isoform-1 (RYR-1) | 100/100 | 52/12 | 82 | FKBP1A.1/A.2 |
| Splicing arginine serine-rich 11 | May function in pre-mRNA splicing. | 95/95 | 11/25 | 82 | Srf11a/b |
| Kelch repeat and btb domain containing 10 | Substrate-specific adapter of an E3 ubiquitin-protein ligase complex | 95/100 | 38/15 | 53 | Kbtb5/kbtb10 |
Transcription factor families present in the gilthead sea bream transcriptome
| ZnF-C2H2 | interleukin enhancer-binding factor 3 | 54 | 16.8 |
| Chromatin-associated | yy1 transcription represor factor | 32 | 10.0 |
| Cofactor | e1a binding protein p300 | 25 | 10.9 |
| Beta-scaffold | signal transducer and activator of transcription 3 | 24 | 7.5 |
| bZIP | transcription factor jun-d | 25 | 7.8 |
| bHLH | hypoxia-inducible factor 3 alpha | 21 | 6.5 |
| General transcription factor | transcription factor 20 | 20 | 6.2 |
| Protein-protein interaction | zinc finger and btb domain containing 33 | 16 | 5.0 |
| Homeobox | six homeobox 1 | 14 | 4.4 |
| Others | bromodomain adjacent to zinc finger 2b | 19 | 5.9 |
| Nuclear hormone receptor | Peroxisome proliferator-activated receptor alpha | 15 | 4.7 |
| ZnF-Others | glucocorticoid receptor dna-binding factor 1 | 16 | 5.0 |
| High mobility group box | transcription factor sox-6 isoform 2 | 8 | 2.5 |
| Trp-clusters | interferon regulatory factor 2 | 9 | 2.8 |
| Forkhead | forkhead box o3 | 4 | 1.2 |
| TEA | tea domain family member 3 | 3 | 0.9 |
| Dwarfin | smad family member 2 | 3 | 0.9 |
| E2F | e2f transcription factor 6 | 2 | 0.6 |
Genes were categorized according to the description in Uniprot [27] and TFCONES, Institute of Molecular and Cell Biology [28].
Znf-C2H2: Zinc finger domain with two conserved cysteines and two histidines co-ordinate a zinc ion.
bZIP basic leucine zipper.
bHLH basic helix loop helix transcription domain.
Trp-clusters: include Interferon regulatory transcription factors and E-twenty six transcription factors.
TEA transcriptional enhancer factor.
Figure 3Dot plot pairwise comparison of reads contribution to the isotigs formation from each experimental group. Each dot represents a contig with reads from one or both treatments. X vs Y graph illustrate the relation of the number of reads between treatments in function of the region where the dots are placed.
Expression analysis of libraries showing isotigs where reads from each experimental condition significantly contributed to the assembly
| Isotig06049 | Slow myosin heavy chain 2 | CBN81811.1 | 0.0 | 0 | 0 | |
| | Isotig05152 | Similar to ankyrin 2 | CAM15089.1 | 2e-24 | 0 | 0 |
| | Isotig01065 | Calcium binding and coiled coil domain | AAI17592.1 | 1e-57 | 0 | 0 |
| | Isotig30481 | myotubularin-related protein 5 | NP_001038623 | 3e-11 | 0 | 0 |
| | Contig01939 | GTPase, IMAP family member 7 | ACO08772.1 | 2e-53 | 0 | 0 |
| | Isotig31931 | Myosin, heavy polypeptide 6 | CAX12653.1 | 2e-06 | 0 | 0 |
| | Isotig06490 | Jeltraxin | ACN11240.1 | 6e-14 | 1,00e-05 | 0.004 |
| | Isotig30481 | Adenylate kinase 1-2 | ACM41863.1 | 5e-33 | 0 | 0 |
| | Isotig07982 | Aurora kinase A-interactinng protein | ACQ58398 | 4e-92 | 0 | 0 |
| | Isotig06835 | Slow Troponin T2 | AAV80376.1 | 1e-68 | 0 | 0 |
| Contig02082 | Parvalbumin | ACM41857.1 | 0.72 | 0 | 0 | |
| | Isotig05339 | VHSV-induced protein | AEG78384 | 1e-15 | 1,00E-05 | 0.004 |
| | Isotig17563 | Putative nuclease HARBI1 | XP_003200346.1 | 1e-16 | 0 | 0 |
| | Isotig18811 | Notch 2 | BAA20535.1 | 9e-61 | 0 | 0 |
| | Isotig22470 | Phosphatidylinositol N-acetylglucosaminyltransferase | NP_955461.1 | 7e-15 | 0 | 0 |
| | Isotig20332 | Glyceraldehyde 3-dehydrogenase | XP_9741181.1 | 5e-14 | 0 | 0 |
| | Contig01939 | GTPase, IMAP family member 7 | ACO08772.1 | 2e-53 | 0 | 0 |
| | Isotig07008 | Ribosomal protein L28 | ACQ58416.1 | 5e-59 | 2,00E-05 | 0.009 |
| | Isotig38944 | AMP deaminase-1-like | XP_003212994.1 | 3e-04 | 0 | 0 |
| | Isotig28974 | Myosin light chain 2 | AAX34414.1 | 2e-10 | 8,00E-05 | 0.03 |
| Isotig05428 | Xin actin-binding repeat containing protein 1 | NP_001012377.1 | 0.0 | 0 | 0 | |
| | Isotig02673 | Clusterin-1 | NP_001117890.1 | 3e-139 | 0 | 0 |
| | Isotig02981 | Myosin-6-like isoform 1 | XP_001923213.1 | 0.0 | 0 | 0 |
| | Isotig08301 | Heat shock protein 30 | NP_001134440.1 | 6e-62 | 0 | 0 |
| | Isotig05469 | Activator of 90kda heat shock protein ATPase homolog 1 | NP_997767.1 | 2e-145 | 5,00E-05 | 0.02 |
| | Isotig01745 | Selenoprotein L | NP_001180385.1 | 3e-63 | 8.00E-05 | 0.03 |
| | Isotig07184 | eEF1A2 binding protein | NP_001133224.1 | 1e-104 | 0 | 0 |
| | Isotig03041 | Heat shock protein 4a | NP_999881.1 | 0.0 | 5,00E-05 | 0.02 |
| | Isotig18085 | Srfs18 | CAG06353.1 | 1e-45 | 1,00E-05 | 0.004 |
| | Isotig02073 | Heat shock protein 90 | AAQ95586.1 | 0.0 | 0 | 0 |
| Isotig05303 | Zinc binding protein 33A | XP_694642.3 | 5e-61 | 0 | 0 | |
| | Isotig01520 | Interferon stimulated gene 15 | BAJ16365.1 | 4e-46 | 0 | 0 |
| | Isotig03692 | Receptor transporting protein 3 | ACQ57966.1 | 6e-66 | 0 | 0 |
| | Isotig03214 | Nicotinic acetylcholine receptor alpha 1b | CAG09972.1 | 0.0 | 0 | 0 |
| | Isotig05152 | Similar to ankyrin 2 | CAM15089.1 | 2e-24 | 0 | 0 |
| | Isotig06087 | G-rich sequence factor 1 | NP_001135339.1 | 1e-107 | 0 | 0 |
| | Isotig07416 | Presenilin associated | ABG81447.1 | 6e-45 | 1,00E-05 | 0.004 |
| | Isotig05735 | Ubtf protein | AAI15119.1 | 5e-169 | 0 | 0 |
| | Isotig19061 | rRNA promoter binding protein | NP_671477 | 7e-10 | 0 | 0 |
| Isotig06639 | C6orf64 | ACO14504.1 | 2e-26 | 0 | 0 |
Gene description, orthologue accession number and e-values were obtained by blastx against the NCBI nr database. Both p-values and FDR p-value were calculated by chi-square and FDR statistic using R statistical package [70].
Figure 4Barr charts summarizing transcripts with significant differences between groups in the number of reads mapped. The groups were as follows: 21°C fed (J), fasted 21°C (F), acutely transferred to 33°C fed (H) and acutely transferred to 11°C (L). All genes represented have been selected from Table 5 and have a FDR ≤ 0.01.