| Literature DB >> 20569428 |
Baocheng Guo1, Ming Zou, Xiaoni Gan, Shunping He.
Abstract
BACKGROUND: Variations in genome size within and between species have been observed since the 1950 s in diverse taxonomic groups. Serving as model organisms, smooth pufferfish possess the smallest vertebrate genomes. Interestingly, spiny pufferfish from its sister family have genome twice as large as smooth pufferfish. Therefore, comparative genomic analysis between smooth pufferfish and spiny pufferfish is useful for our understanding of genome size evolution in pufferfish.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20569428 PMCID: PMC2996927 DOI: 10.1186/1471-2164-11-396
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The evolutionary history of the pufferfish. The tree topology was modified according to Li et al. [52], and the divergent times were adopted from Steinke et al. [53] and Tyler and Santini [54]. The genome sizes of the species used in analysis are shown.
Sequencing information of each BAC clone.
| ctfa | 1020 | 311030 | 87525 | 90 | 3.46 | 78386 |
| ctfb | 1013 | 335538 | 103323 | 110 | 3.05 | 67071 |
| ctfc | 1000 | 324833 | 99742 | 95 | 3.42 | 51904 |
| ctfd | 996 | 303386 | 89282 | 90 | 3.37 | 54060 |
| ctfe | 890 | 279782 | 80143 | 100 | 2.80 | 41374 |
| ctff | 881 | 251821 | 75743 | 80 | 3.15 | 40883 |
| ctfg | 794 | 255225 | 62595 | 90 | 2.84 | 44454 |
| ctfh | 942 | 281266 | 85394 | 80 | 3.52 | 66638 |
| ctfj | 655 | 186358 | 62614 | 70 | 2.66 | 39606 |
| ctfk | 1095 | 315029 | 85969 | 100 | 3.15 | 43834 |
| Total | 9286 | 2844268 | 832330 | 3.14 |
Coverage, regarded as sequencing redundancy was calculated as the ratio of the total read length to the assembled length. N50 length is the size x such that 50% of the assembly is in units of length at least x.
Summary of the predicted genes in each BAC clone.
| ctfa | Scaffold000001 | ring finger protein 17; | |
| Scaffold000002 | hypothetical protein LOC334519; unnamed protein product | ||
| Scaffold000003 | similar to ReO_6; similar to ring finger protein 17 | ||
| ctfb | Scaffold000001 | similar to GF20795; similar to KH domain-containing, RNA-binding, signal transduction-associated protein 2; EGF-like domain-containing protein; | |
| Scaffold000002 | |||
| Scaffold000003 | similar to KH domain-containing, RNA-binding, signal transduction-associated protein 2 | ||
| Scaffold000004 | similar to DNA primase large subunit(58 kDa) | ||
| Scaffold000005 | |||
| ctfc | Scaffold000001 | novel predicted protein; similar to neuron navigator 1; similar to cysteine and glycine-rich protein 2 | |
| Scaffold000002 | |||
| Scaffold000003 | similar to TEA domain family member 3 | ||
| Scaffold000004 | similar to troponin I, slow skeletal muscle | ||
| ctfd | Scaffold000001 | similar to neurocan; novel predicted protein | |
| Scaffold000002 | unnamed protein product; novel predicted protein; similar to hypothetical LOC100002099; similar to deoxyhypusine hydroxylase/monooxygenase; transmembrane 6 superfamily member 1-like | ||
| Scaffold000003 | similar to phosphodiesterase 4D-interacting protein; unnamed protein product | ||
| Scaffold000004 | |||
| Scaffold000005 | unnamed protein product | ||
| ctfe | Scaffold000001 | glutathione S-transferase theta; novel predicted protein | |
| Scaffold000002 | novel predicted protein; d-dopachrome decarboxylase; similar to coiled-coil-helix-coiled-coil-helix domain containing 10; zinc finger 214-contain protein | ||
| Scaffold000003 | |||
| ctff | Scaffold000001 | similar to human adrenergic receptor kinase 2; similar to human adrenergic receptor kinase 2 | |
| Scaffold000002 | similar to beta-adrenergic kinase 2; novel predicted protein | ||
| Scaffold000003 | similar to myosin-like protein | ||
| Scaffold000004 | similar to beta-adrenergic receptor kinase 2 | ||
| Scaffold000005 | similar to beta-adrenergic receptor kinase 2 | ||
| Scaffold000006 | |||
| Scaffold000007 | |||
| Scaffold000008 | |||
| Scaffold000009 | novel predicted protein | ||
| ctfg | Scaffold000001 | novel predicted protein; similar to tetraspanin 18; novel predicted protein | |
| Scaffold000002 | similar to erythrocyte membrane protein 3; novel predicted protein | ||
| Scaffold000003 | aristaless-like homeobox 4 | ||
| ctfh | Scaffold000001 | novel predicted protein; | |
| Scaffold000002 | novel predicted protein; novel predicted protein; novel predicted protein | ||
| Scaffold000003 | |||
| ctfj | Scaffold000001 | novel predicted protein; novel predicted protein; similar to sterile alpha motif domain containing 3; similar to zinc finger protein | |
| Scaffold000002 | novel predicted protein; novel predicted protein | ||
| Scaffold000003 | unnamed protein product | ||
| Scaffold000004 | Dexamethasone-induced Ras-related protein 1 precursor | ||
| Scaffold000005 | |||
| Scaffold000006 | |||
| ctfk | Scaffold000001 | ||
| Scaffold000002 | novel predicted protein; similar to calcium-transporting ATPase 2C1 | ||
| Scaffold000003 | novel predicted protein | ||
| Scaffold000004 | |||
| Scaffold000005 | |||
| Scaffold000006 | |||
| Scaffold000007 | novel predicted protein | ||
| Scaffold000008 |
Novel predicted proteins represent predicted genes that had no significant hits when searched against the non-redundant GenBank protein database using BLASTP. Genes used for the comparisons of intron size variation are underlined.
Locations of each BAC on the genomes of different species
| ctfa | Scaffold000001 | scaffold_38 | Chromosome_2 | scaffold_115 | Chromosome_21 |
| Scaffold000002 | scaffold_38 | groupXIV | |||
| Scaffold000003 | |||||
| ctfb | Scaffold000001 | scaffold_24 | Chromosome_17 | groupVI | Chromosome_15 |
| Scaffold000002 | scaffold_24 | Chromosome_17 | groupXI | ||
| Scaffold000003 | scaffold_24 | Chromosome_17 | groupVI | Chromosome_15 | |
| Scaffold000004 | scaffold_24 | Chromosome_17 | groupVI | Chromosome_15 | |
| Scaffold000005 | scaffold3867 | ||||
| ctfc | Scaffold000001 | scaffold_635 | Un_random | groupXVII | Chromosome_5 |
| Scaffold000002 | scaffold_47 | Un_random | groupXVII | ||
| Scaffold000003 | scaffold_47 | Un_random | groupXVII | Chromosome_5 | |
| Scaffold000004 | groupXVII | Chromosome_5 | |||
| ctfd | Scaffold000001 | scaffold_200 | groupIII | Chromosome_17 | |
| Scaffold000002 | scaffold_200 | groupIII | Chromosome_17 | ||
| Scaffold000003 | scaffold_200 | groupIII | Chromosome_17 | ||
| Scaffold000004 | scaffold_200 | Un_random | groupIII | Chromosome_17 | |
| Scaffold000005 | scaffold_200 | groupIII | |||
| ctfe | Scaffold000001 | ||||
| Scaffold000002 | scaffold_589 | Un_random | |||
| Scaffold000003 | |||||
| ctff | Scaffold000001 | scaffold_27 | Chromosome_12 | ||
| Scaffold000002 | scaffold_27 | ||||
| Scaffold000003 | scaffold_27 | Un_random | Chromosome_12 | ||
| Scaffold000004 | scaffold_27 | ||||
| Scaffold000005 | scaffold_27 | ||||
| Scaffold000006 | scaffold_27 | Chromosome_12 | |||
| Scaffold000007 | |||||
| Scaffold000008 | scaffold_50 | Chromosome_12 | |||
| Scaffold000009 | |||||
| ctfg | Scaffold000001 | scaffold_2 | Chromosome_13 | groupXIX | Chromosome_6 |
| Scaffold000002 | |||||
| Scaffold000003 | scaffold_2 | Chromosome_13 | groupXIX | Chromosome_6 | |
| ctfh | Scaffold000001 | scaffold_225 | Chromosome_13 | groupXIX | Chromosome_6 |
| Scaffold000002 | scaffold_225 | Chromosome_13 | groupXIX | Chromosome_6 | |
| Scaffold000003 | scaffold_225 | Chromosome_13 | groupXIX | Chromosome_6 | |
| ctfj | Scaffold000001 | Un_random | groupXIII | Chromosome_1 | |
| Scaffold000002 | Chromosome_2 | groupXIII | |||
| Scaffold000003 | Un_random | ||||
| Scaffold000004 | scaffold_3 | Chromosome_2 | groupV | scaffold4255 | |
| Scaffold000005 | Chromosome_2 | ||||
| Scaffold000006 | |||||
| ctfk | Scaffold000001 | scaffold_270 | Un_random | groupXX | scaffold1954 |
| Scaffold000002 | scaffold_270 | Un_random | groupXX | scaffold1954 | |
| Scaffold000003 | Chromosome_2 | ||||
| Scaffold000004 | |||||
| Scaffold000005 | |||||
| Scaffold000006 | |||||
| Scaffold000007 | |||||
| Scaffold000008 | scaffold_56 |
Figure 2Microsynteny between the sequences of clone ctfh in . The length of the bar represents the relatively length of the sequence, and the lines between the bars represent the high-scoring segment pair (HSP) of the BLASTN searches between the sequences. Other microsyntenic relationships between the BAC clone sequences of D. holocanthus and the smooth pufferfish genomes were given in additional file 1.
Figure 3Distribution patterns of intron size. Distributions of the intron lengths used for the comparison in D. holocanthus (Diodon), T. rubripes (Fugu), Te. nigroviridis (Tetraodon), and G. aculeatus (Gasterosteus).
Comparison of the average intron size and exon lengths of D. holocanthus, T. rubripes, and Te. nigroviridis.
| 125 | 140 | 111 | 110 | |
| 566 | 435 | 435 | 722 |
Comparison of repetitive DNA sequence contents (%) of D. holocanthus, T. rubripes, and Te. nigroviridis
| WG | HR-fugu | HR-tetra | WG | HR-Dh | WG | HR-Dh | |
|---|---|---|---|---|---|---|---|
| 7.94 | 5.16 | 4.87 | 6.89 | 2.43 | 4.66 | 2.93 | |
| 6.73 | 4.05 | 3.84 | 4.69 | 1.03 | 2.03 | 0.18 | |
| | 6.00 | 3.76 | 3.36 | 3.39 | 0.33 | 1.37 | 0.08 |
| | 2.00 | 1.63 | 1.15 | 0.20 | 0.00 | 0.13 | 0.00 |
| | 3.44 | 2.13 | 2.12 | 2.51 | 0.22 | 1.06 | 0.08 |
| | 0.55 | 0.00 | 0.00 | 0.68 | 0.11 | 0.19 | 0.00 |
| | 0.66 | 0.29 | 0.48 | 1.05 | 0.61 | 0.45 | 0.00 |
| | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| | 0.07 | 0.00 | 0.00 | 0.25 | 0.09 | 0.21 | 0.10 |
| 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | |
| 0.75 | 0.68 | 0.75 | 1.75 | 0.93 | 1.96 | 2.34 | |
| 0.46 | 0.43 | 0.28 | 0.44 | 0.47 | 0.67 | 0.41 | |
WG, sequenced whole genome dataset; HR-fugu, homologous region in the D. holocanthus genome compared with the T. rubripes genome; HR-tetra, homologous region in the D. holocanthus genome compared with the Te. nigroviridis genome; HR-Dh, homologous regions in the smooth pufferfish genomes compared with the D. holocanthus genome.
Transposable elements in D. holocanthus and their classification
| SINEs | |||
| V | Vertebrates | FR2 | 63 (46) |
| TE | 18 (11) | ||
| Mermaid | Vertebrates | FR1c | 16 (5) |
| FR1d | 9 (3) | ||
| Non-LTR Retrotransposons | |||
| L1 | Vertebrates, plants | KibiFr1 | 2 |
| RTE | Nematodes | Expander | 29 |
| Expander2 | 2 | ||
| L2 | Metazoa | Maui | 13 |
| Rex1/Babar | Fish | Rex1_FurC | 7 |
| LTR Retrotransposons | |||
| Gypsy/TY3 | Eukaryotes | Ronin1_I | 4 |
| Ronin2_I | 1 | ||
| Ronin3_I | 1 | ||
| Samurai_I | 1 | ||
| Penelope-like | Insects, fish | Bridge2(Xena) | 17 |
| DNA transposons | |||
| hAT-Charlie | Mammals | Chaplin4_FR | 2 (2) |
| Harbinger | Nematodes, plants | Senkusha1A | 2 (1) |
| Tc1-mariner | Metazoa, plants | TC1_FR1 | 8 (3) |
| TC1_FR3 | 2 (0) | ||
| TC1_FR4 | 5 (3) | ||
| Tc2 | Nematodes | TC2_FR2a | 2 (2) |
| TC2_FR4 | 1 (1) | ||
| Unclassified | 6 (1) |
Numbers in parentheses represent the number of transposable elements distributed within gene.