| Literature DB >> 18817567 |
Belén G Pardo1, Carlos Fernández, Adrián Millán, Carmen Bouza, Araceli Vázquez-López, Manuel Vera, José A Alvarez-Dios, Manuel Calaza, Antonio Gómez-Tato, María Vázquez, Santiago Cabaleiro, Beatriz Magariños, Manuel L Lemos, José M Leiro, Paulino Martínez.
Abstract
BACKGROUND: The turbot (Scophthalmus maximus; Scophthalmidae; Pleuronectiformes) is a flatfish species of great relevance for marine aquaculture in Europe. In contrast to other cultured flatfish, very few genomic resources are available in this species. Aeromonas salmonicida and Philasterides dicentrarchi are two pathogens that affect turbot culture causing serious economic losses to the turbot industry. Little is known about the molecular mechanisms for disease resistance and host-pathogen interactions in this species. In this work, thousands of ESTs for functional genomic studies and potential markers linked to ESTs for mapping (microsatellites and single nucleotide polymorphisms (SNPs)) are provided. This information enabled us to obtain a preliminary view of regulated genes in response to these pathogens and it constitutes the basis for subsequent and more accurate microarray analysis.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18817567 PMCID: PMC2569028 DOI: 10.1186/1746-6148-4-37
Source DB: PubMed Journal: BMC Vet Res ISSN: 1746-6148 Impact factor: 2.741
Summary statistics of ESTs from turbot libraries
| Number | % | |
| Good-quality ESTs | 9256 | |
| Redundant sequences | 6847 | 74.0 |
| Unique sequences | 3482 | 36.0 |
| Contigs | 1073 | 30.8 |
| Singletons | 2409 | 69.2 |
| Unique sequences with no BLAST hits | 1716 | 49.3 |
| Unique sequences with BLAST hits | 1766 | 50.7 |
| BLASTN | 1091 | 61.8 |
| BLASTX | 675 | 38.2 |
| Unique sequences with functional annotation | 816 | 23.4 |
| Contigs | 489 | 59.9 |
| Singletons | 327 | 40.1 |
Figure 1Sequence prevalence distribution of the identified contigs from turbot libraries. (A) Absolute frequency histogram showing contig size (number of sequences) distribution. (B) Functional and BLAST hits confidence characteristics of the ten largest contigs. Only biological function according to GO terms has been included.
Figure 2Classification of turbot unique sequences in biological processes categories following Gene Ontology (GO).
Figure 3Classification of turbot unique sequences in molecular function categories following Gene Ontology (GO).
Figure 4Classification of turbot unique sequences in cellular component categories following Gene Ontology (GO).
Defence and immune-related annotated ESTs from turbot libraries
| Genes | No. unique sequences | % |
| Complement related | 16 | 7.9 |
| Apoptosis related | 10 | 4.9 |
| Immunoglobulin related | 8 | 3.9 |
| Glutathione S-transferase | 7 | 3.5 |
| Elastase | 6 | 2.9 |
| Cytochrome P450 | 6 | 2.9 |
| Major histocompatibility complex | 5 | 2.5 |
| Coagulation factor | 5 | 2.5 |
| Interferon related | 3 | 1.5 |
| Perforin | 3 | 1.5 |
| Hepcidin precursor | 3 | 1.5 |
| Nephrosin | 3 | 1.5 |
| Alpha-2-macroglobulin | 3 | 1.5 |
| Other genes | 119 | 58.6 |
| Total | 203 | 24.9 |
"No. unique sequences" refers to the total amount of the different annotated contigs and singletons for each gene class listed in the Table. "Total percentage" of defence/immune-related genes is referred to the number of unique annotated sequences from turbot libraries (816).
Summary statistics of SNP identification from turbot EST resources
| Real SNPs | True SNPs andSNPs | |
| Total sequences analysed | 12584 | |
| Number of contigs | 257 | 255 |
| Total SNPs detected | 2197 | 1158 |
| SNP frequency | 1.39/100 bp | 0.74/100 bp |
| Total number of transitions | 749 | 453 |
| C/T | 556 | 344 |
| A/G | 193 | 109 |
| Total number of transversions | 974 | 558 |
| A/T | 161 | 87 |
| A/C | 352 | 214 |
| T/G | 251 | 130 |
| C/G | 210 | 127 |
| Total number of indels | 366 | 125 |
| Tri-allelic polymorphisms | 99 | 21 |
| Tetra-allelic polymorphisms | 9 | 1 |
Real SNPs are those which passed quality filters 1 and 2 using the pipeline QualitySNP and true SNPs are the highest quality SNPs passing the three filters (see Methods).
Real and true quality SNP distribution in contigs with 4 or more ESTs
| Number of contigs | Real SNPs | True SNPs | |
| with 4 sequences | 22 | 58 | 58 |
| with 5–10 sequences | 94 | 295 | 235 |
| with 11–20 sequences | 75 | 437 | 266 |
| with 21–30 sequences | 28 | 322 | 215 |
| with 31–50 sequences | 21 | 289 | 110 |
| with > 50 sequences | 17 | 796 | 274 |
| Total | 257 | 2197 | 1158 |
Real SNPs are those which passed quality filters 1 and 2 using the pipeline QualitySNP and true SNPs are the highest quality SNPs passing the three filters (see Methods).