| Literature DB >> 26742787 |
Yingjia Shen1,2, Domitille Chalopin3, Tzintzuni Garcia4, Mikki Boswell5, William Boswell6, Sergey A Shiryev7, Richa Agarwala8, Jean-Nicolas Volff9, John H Postlethwait10, Manfred Schartl11, Patrick Minx12, Wesley C Warren13, Ronald B Walter14.
Abstract
BACKGROUND: Xiphophorus fishes are represented by 26 live-bearing species of tropical fish that express many attributes (e.g., viviparity, genetic and phenotypic variation, ecological adaptation, varied sexual developmental mechanisms, ability to produce fertile interspecies hybrids) that have made attractive research models for over 85 years. Use of various interspecies hybrids to investigate the genetics underlying spontaneous and induced tumorigenesis has resulted in the development and maintenance of pedigreed Xiphophorus lines specifically bred for research. The recent availability of the X. maculatus reference genome assembly now provides unprecedented opportunities for novel and exciting comparative research studies among Xiphophorus species.Entities:
Mesh:
Year: 2016 PMID: 26742787 PMCID: PMC4705583 DOI: 10.1186/s12864-015-2361-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Fish used in this study. Approximate geographical distributions of three Xiphophorus species. The swordtail, X. hellerii is a male fish showing an extended caudal fin ray, while X. maculatus and X. couchianus are platyfish and do not exhibit this caudal fin extension. The stars are the locations where fishes were collected and the red stars are the location of sequenced fish originally derived
Assembly statistics of genomes of three Xiphophorus species
| Level |
|
|
| |
|---|---|---|---|---|
| Contig level | Number | 67,070 | 34,765 | 70,798 |
| N50 length (Mb) | 0.02 | 0.06 | 0.03 | |
| Shortest contig | 500 | 200 | 200 | |
| GC content (%) | 34.7 | 35.4 | 34.6 | |
| Total size (Mb) | 652 | 648 | 657 | |
| Scaffold level | Number | 20,640 | 12,015 | 23,897 |
| N50 length (Mb) | 1.3 | 1.8 | 1.6 | |
| Total size (Mb) | 730 | 711 | 741 | |
| Chromosome level | Number | 24 | 24 | 24 |
| N50 length (Mb) | 29.4 | 29.3 | 29.4 | |
| Total size (Mb) | 724 | 708 | 734 |
Statistics of transcriptomes of three Xiphophorus species
|
|
|
| |
|---|---|---|---|
| # of gene models | 20,498 | 20,300 | 20,325 |
| N50 length (bp) | 3,615 | 3,609 | 3,625 |
| Average length (bp) | 2,679 | 2,575 | 2,581 |
| Total size (Mb) | 52.9 | 52.3 | 52.5 |
Fig. 2Distribution of polymorphisms in Xiphophorus genome among 24 chromosomes. a The histogram rings in the Circos plot represent the number of SNCs in 100 kb bins normalized by 3000. Tracks from outside circles to inner circles are polymorphisms between X. maculatus and X. couchianus (orange), between X. maculatus and X. hellerii (light green), only in X. maculatus (purple), only in X. couchianus (red) and only in X. hellerii (dark green). The connecting links in the inner circle show the inter-chromosomal rearrangements between X. maculatus and X. couchianus (orange links) and X. maculatus and X. hellerii (green links). b Distribution of genes with high impact polymorphisms in the genome. The black, orange, blue and green bars represent the location of all protein coding genes in X. maculatus genome, genes with high impact variants (see Materials and Methods) in X. couchianus relative to X. maculatus, shared genes with high high impact variants between in X. couchianus and X. hellerii relative to X. maculatus, genes with high impact variants in X. hellerii relative to X. maculatus
Number and percentage of polymorphisms’ effects in X. couchianus and X. hellerii compared with the X. maculatus reference genome
| Type |
|
| ||
|---|---|---|---|---|
| Number | Percentage | Number | Percentage | |
| Downstream | 1,542,508 | 12.06 | 1,942,329 | 11.71 |
| Codon InDel | 2467 | 0.02 | 2,863 | 0.02 |
| Exon | 211 | 0.00 | 230 | 0.00 |
| Intergenic | 5,007,423 | 39.16 | 6,594,737 | 39.79 |
| Intron | 4,103,306 | 32.09 | 5,358,720 | 32.33 |
| Non synonymous coding | 84,752 | 0.66 | 101,178 | 0.61 |
| Splice site | 35,823 | 0.29 | 44,523 | 0.28 |
| Loss of start codon | 116 | 0.00 | 133 | 0.00 |
| Gain of stop codon | 805 | 0.01 | 945 | 0.01 |
| Loss of stop codon | 314 | 0.00 | 445 | 0.00 |
| Synonymous coding | 140,717 | 1.10 | 174,526 | 1.05 |
| Upstream | 1,564,218 | 12.23 | 1,937,840 | 11.91 |
| 3'-UTR | 243,474 | 1.90 | 304,241 | 1.84 |
| 5'-UTR | 51,921 | 0.41 | 64,017 | 0.39 |
| Total | 12,778,055a | 100.00 | 16,526,727a | 100.00 |
aThe number of effect is higher than the number of polymorphisms because one polymorphism could cause multiple effects in neighboring genes
Fig. 3Conserved syntenies between Xiphophorus species. a The X. couchianus orthologs of chromosome 13 tend to lie on X. maculatus chromosome 13 (Y-axis) and show conserved syntenic relationship in two species. Some orthologs are mapped to other chromosomes of X. maculatus mostly notable chromosome 3, suggesting paralogous chromosomes arising from genome duplication. b The X. hellerii orthologs of chromosome 13 behave similarly as X. couchianus orthologs
Statistics of transposable elements in Xiphophorus genomes. Left panels: Genomes without filtration. Right panels: Genomes after removing small (less than 80 bp) and divergent (less than 80 % identify) TE elements
| Class/family | Coverage(%, no filtration) | Coverage(% filtered) | ||||
|---|---|---|---|---|---|---|
| Species |
|
|
|
|
|
|
| DNA Transposons | 12.348 | 12.267 | 12.013 | 6.212 | 6.023 | 6.022 |
| DNA/Academ | 0.019 | 0.206 | 0.021 | 0.016 | 0.016 | 0.017 |
| DNA/Buster | 0 | 0.005 | 0 | 0 | 0 | 0 |
| DNA/CMC-Chapaev-3 | 0.017 | 0.017 | 0.017 | 0.016 | 0.017 | 0.015 |
| DNA/CMC-Enspm | 0.02 | 0.018 | 0.021 | 0.014 | 0.013 | 0.014 |
| DNA/Ginger2 | 0 | 0.001 | 0 | 0 | 0 | 0 |
| DNA/Helitron | 0.256 | 0.252 | 0.256 | 0.157 | 0.153 | 0.162 |
| DNA/IS4EU | 0.053 | 0.052 | 0.05 | 0.045 | 0.046 | 0.046 |
| DNA/Kolobok | 0 | 0 | 0 | 0 | 0 | 0 |
| DNA/P | 0.041 | 0.04 | 0.04 | 0.029 | 0.028 | 0.03 |
| DNA/PIF-Harbinger | 0.634 | 0.619 | 0.614 | 0.568 | 0.557 | 0.555 |
| DNA/PiggyBac | 0.25 | 0.249 | 0.245 | 0.233 | 0.232 | 0.231 |
| DNA/Polinton | 0.024 | 0.025 | 0.029 | 0.014 | 0.015 | 0.017 |
| DNA/Tc-Mariner | 6.631 | 6.495 | 6.494 | 1.778 | 1.71 | 1.721 |
| DNA/hAT | 3.368 | 3.29 | 3.242 | 2.515 | 2.44 | 2.427 |
| DNA/MITE | 0.033 | 0.032 | 0.03 | 0.023 | 0.021 | 0.02 |
| Unclassified | 1.002 | 0.966 | 0.954 | 0.804 | 0.775 | 0.767 |
| LINE Retrotransposons | 2.576 | 2.417 | 2.411 | 1.678 | 1.572 | 1.536 |
| LINE/I-Nimb | 0.057 | 0.055 | 0.055 | 0.025 | 0.024 | 0.023 |
| LINE/Jockev | 0.058 | 0.058 | 0.058 | 0.031 | 0.05 | 0.031 |
| LINE/L1 | 0.125 | 0.106 | 0.124 | 0.064 | 0.06 | 0.061 |
| LINE/L2 | 0.942 | 0.905 | 0.899 | 0.659 | 0.623 | 0.624 |
| LINE/R2 | 0.001 | 0.003 | 0.001 | 0.001 | 0.002 | 0.001 |
| LINE/R4 | 0.016 | 0.013 | 0.012 | 0.007 | 0.003 | 0.002 |
| LINE/RTE | 0.563 | 0.537 | 0.536 | 0.343 | 0.326 | 0.321 |
| LINE/Rex-Babar | 0.756 | 0.687 | 0.671 | 0.526 | 0.464 | 0.453 |
| PLE/Penelope | 0.004 | 0.004 | 0.004 | 0.001 | 0 | 0 |
| Unclassified | 0.054 | 0.049 | 0.051 | 0.021 | 0.02 | 0.02 |
| LTR Retrotransponsons | 0.632 | 0.592 | 0.635 | 0.316 | 0.253 | 0.333 |
| LTR/BEL-Pao | 0.036 | 0.035 | 0.033 | 0.006 | 0.008 | 0.007 |
| LTR/Copia | 0.005 | 0.005 | 0.005 | 0.002 | 0.002 | 0.002 |
| LTR/DIRS1-Ngaro | 0.129 | 0.105 | 0.12 | 0.053 | 0.047 | 0.046 |
| LTR/ERV | 0.113 | 0.118 | 0.14 | 0.08 | 0.083 | 0.109 |
| LTR/ERV1 | 0.01 | 0.01 | 0.01 | 0.008 | 0.008 | 0.009 |
| LTR/Gypsy | 0.234 | 0.219 | 0.228 | 0.09 | 0.077 | 0.086 |
| Unclassified | 0.105 | 0.1 | 0.099 | 0.077 | 0.028 | 0.074 |
| SINE Retrotransposons | 0.611 | 0.524 | 0.567 | 0.395 | 0.315 | 0.347 |
| SINE | 0.188 | 0.144 | 0.181 | 0.135 | 0.093 | 0.117 |
| SINE/Hpa | 0.004 | 0.005 | 0.005 | 0 | 0.001 | 0 |
| SINE/MIR | 0.114 | 0.112 | 0.112 | 0.059 | 0.058 | 0.061 |
| SINE/V | 0.238 | 0.196 | 0.203 | 0.16 | 0.121 | 0.128 |
| SINE/tRNA | 0.067 | 0.067 | 0.066 | 0.041 | 0.042 | 0.041 |
| Unknown | 5.657 | 5.58 | 5.502 | 4.012 | 3.947 | 3.903 |
| Total | 21.824 | 21.38 | 21.128 | 12.613 | 12.11 | 12.141 |
Fig. 4History of transposable element superfamilies in the three Xiphophorus genomes and potential respective current activity. a Kimura distances of the different superfamily copies (recent copies on the left, ancient copies on the right). b Focus on first Kimura distances highlighting recent copies specific to each species. c Spider graph representing the TE superfamilies content in the genome (Log10 [% of the genome], orange) and in the transcriptome (Log10 [% of the transcriptome], blue)
| BioProject | BioSample | Accession | Organism |
| PRJNA290781 | SAMN03922721 | LNCC00000000 |
|
| PRJNA290782 | SAMN03968850 | LNCB00000000 |
|