| Literature DB >> 21672188 |
Carine Genet1, Patrice Dehais, Yniv Palti, Guangtu Gao, Frederick Gavory, Patrick Wincker, Edwige Quillet, Mekki Boussaha.
Abstract
BACKGROUND: Rainbow trout (Oncorhynchus mykiss) are cultivated worldwide for aquaculture production and are widely used as a model species to gain knowledge of many aspects of fish biology. The common ancestor of the salmonids experienced a whole genome duplication event, making extant salmonids such as the rainbow trout an excellent model for studying the evolution of tetraploidization and re-diploidization in vertebrates. However, the lack of a reference genome sequence hampers research progress for both academic and applied purposes. In order to enrich the genomic tools already available in this species and provide further insight on the complexity of its genome, we sequenced a large number of rainbow trout BAC-end sequences (BES) and characterized their contents.Entities:
Mesh:
Year: 2011 PMID: 21672188 PMCID: PMC3125269 DOI: 10.1186/1471-2164-12-314
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of the INRA RT rep1.0 database.
| Number of elements | Percentage of sequence | Number of elements | Percentage of sequence | ||
|---|---|---|---|---|---|
| DNA | 1 | 0,05 | |||
| LTR/DIRS1 | 2 | 0,02 | DNA/Harbinger | 2 | 0,02 |
| LTR/Gypsy | 21 | 1,68 | DNA/hAT | 9 | 0,27 |
| DNA/hAT-Charlie | 5 | 0,26 | |||
| LINE | 1 | 0,29 | DNA/hAT-Tag1 | 1 | 0,05 |
| LINE/I | 1 | 0,01 | DNA/hAT-Tip100 | 1 | 0,01 |
| LINE/Jockey | 3 | 0,05 | DNA/Helitron | 1 | 0,01 |
| LINE/L1 | 6 | 0,14 | DNA/Maverick | 1 | 0,01 |
| LINE/L2 | 19 | 2,18 | DNA/Tc1 | 11 | 4,26 |
| LINE/Penelope | 1 | 0,05 | DNA/TcMar-Tc1 | 46 | 24,51 |
| LINE/Rex1 | 6 | 3,03 | DNA/TcMar-Tc2 | 1 | 0,01 |
| SINE | 5 | 1,09 | |||
| SINE/Deu | 5 | 0,23 | Simple_repeat | 22 | 1,19 |
| SINE/tRNA-Lys | 1 | 0,01 | rRNA | 3 | 0,11 |
| SINE/5S | 1 | 0,01 | Satellite | 2 | 0,06 |
| SINE? | 8 | 0,37 | Unknown elements | 549 | 19,19 |
Columns 1 and 4 show the name of the repeat elements. Columns 2 and 5 provide the number of repeats elements in the INRA RT rep1.0 generated in this work. Columns 3 and 6 correspond to the percentage of total base pairs masked by the corresponding repeat element.
Figure 1Low complexity sequences identified through BLAST search of masked BES against themselves. × axis represents the distribution of BLAST hits. Y axis represents the number of BES.
Distribution of Simple Sequence Repeats in trout BAC end sequences.
| Repeat | Type | Number | Repeat | Type | Number |
|---|---|---|---|---|---|
| A/T | 71 | CACA/TGTG | 38 | ||
| C/G | 1 | TCTG/CAGA | 37 | ||
| TG/CA | 1346 | TATG/CATA | 26 | ||
| AC/GT | 1238 | AGAC/GTCT | 38 | ||
| GA/TC | 607 | GAGA/TCTC | 24 | ||
| TA/TA | 505 | ACAT/ATGT | 22 | ||
| AG/CT | 498 | AGAA/TTCT | 17 | ||
| AT/AT | 311 | Other | 370 | ||
| AAC/GTT | 5 | TCAAA/TTTGA | 19 | ||
| CAT/ATG | 13 | ATTTG/CAAAT | 15 | ||
| AAT/ATT | 36 | Other | 92 | ||
| ATA/TAT | 36 | 356 | |||
| TTA/TAA | 24 | 80 | |||
| Other | 55 | 278 | |||
| ACAC/GTGT | 26 | 144 | |||
| TGTA/TACA | 66 | 478 | |||
| ATAC/GTAT | 23 | ||||
| CTCA/TGAG | 25 | Total | 6,920 | ||
Rainbow trout BAC end sequences BlastN statistics
| Medaka | Stickleback | Zebrafish | |
|---|---|---|---|
| 15,896 | 17,499 | 14,716 | |
| | 15,379 | 15,612 | 13,868 |
| | 14,206 | 15,467 | 13,304 |
| | 845 | 1,016 | 706 |
| | 792 | 812 | 632 |
| | 418 | 566 | 360 |
| | 85 | 105 | 105 |
| | 307 | 421 | 176 |
a The bioinformatic filtering criteria used for identifying 'unique' hits are described in the methods section.
Rainbow trout BAC end sequences BlastX statistics
| Medaka | Stickleback | Zebrafish | |
|---|---|---|---|
| 9,312 | 9,308 | 10,487 | |
| | 9,125 | 9,058 | 9,983 |
| | 8,700 | 8,674 | 9,717 |
| | 306 | 317 | 385 |
| | 293 | 305 | 355 |
| | 132 | 148 | 155 |
| | 85 | 105 | 105 |
| | 40 | 45 | 30 |
a The bioinformatic filtering criteria used for identifying 'unique' hits are described in the methods section.