| Literature DB >> 30372461 |
Luciana Watanabe1, Fátima Gomes1, João Vianez2, Márcio Nunes2, Jedson Cardoso2, Clayton Lima2, Horacio Schneider1, Iracilda Sampaio1.
Abstract
BACKGROUND: The Arapaima (Arapaima gigas) is one of the world's largest freshwater bony fish, and is found in the rivers of the Amazon basin. This species is a potential aquaculture resource, although reproductive management in captivity is limited in particular due to the lack of external sexual dimorphism. In this study, using the 454 Roche platform (pyrosequencing) techniques, we evaluated a major portion of the transcriptome of this important Amazonian species.Entities:
Mesh:
Year: 2018 PMID: 30372461 PMCID: PMC6205615 DOI: 10.1371/journal.pone.0206379
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Workflow of the main steps of bioinformatics analysis.
Summary statistics for the Arapaima gigas transcriptome, from the Roche 454 pyrosequencing libraries and the assemblies generated for each library separately.
| Male | Female | |||
|---|---|---|---|---|
| Skin | Liver | Skin | Liver | |
| Raw Reads | 1,315,106 | 1,898,857 | 580,558 | 1,659,398 |
| Ribosomal RNA | 413,443 | 1,444,941 | 162,945 | 1,119,938 |
| Filtered Reads | 750,016 | 311,348 | 325,268 | 354,008 |
| Average Length | 393.6 | 438.6 | 398.2 | 458.7 |
| Mean Quality | 29.4 | 32.4 | 29 | 32 |
| Reads Assembled | 362,908 | 215,813 | 105,562 | 243,699 |
| Contigs | 108,060 | 21,597 | 44,247 | 25,177 |
| N50 (bps) | 742 | 874 | 627 | 920 |
| Largest Contig | 8,900 | 12,525 | 13,758 | 7,506 |
| Average Length | 710.26 | 833.12 | 593.79 | 876.64 |
| Number of bases | 76,751,253 | 17,994,637 | 26,273,278 | 22,075,645 |
N50: This value was computed by sorting all the contigs from the largest to the smallest, and then determining the minimum set of contigs whose total size is equal to 50% of the entire transcriptome.
Fig 2Top-hit species distribution based on BLASTx results.
Fig 3Heatmap of the genes with differentiated expression profile in the liver and skin of males and females of A. gigas.
The colored bars indicate the relative expression levels in RPKM. Highly expressed genes are shown in red, while low expressed genes are shown in blue. The complete list of genes used to generate the heatmap is in S4 Table.
Genes with a more differentiated expression profile between males and females of A. gigas.
| Gene Symbol | Accession Number | Description | Tissue | Length | RPKM Male | RPKM Female | log2FC |
|---|---|---|---|---|---|---|---|
| XP_018607998 | B-cadherin isoform X1 | Skin | 3,904 | 273.744 | 1.952 | 7.131 | |
| XP_018609834 | Sperm acrosome membrane associated protein | Skin | 1,411 | 472.264 | 5.401 | 6.449 | |
| XP_018606345 | Myelin and lymphocyte protein | Skin | 1,638 | 348.480 | 4.653 | 6.226 | |
| XP_018599340 | Collagen alpha-1 (X) chain | Skin | 2,718 | 201.684 | 2.804 | 6.168 | |
| XP_018584540 | Integrin alpha-6 | Skin | 4,163 | 358.190 | 5.492 | 6.027 | |
| XP_018600166 | Syndecan-4 isoform X2 | Skin | 3,085 | 154.053 | 2.470 | 5.962 | |
| NP_001185503 | Beta-2-microglobulin precursor | Skin | 984 | 886.748 | 15.492 | 5.838 | |
| XP_005986238 | Nascent polypeptide-associated complex subunit alpha isoform X4 | Skin | 1,485 | 62.183 | 10.265 | 5.775 | |
| XP_018592541 | Thymosin beta-a | Skin | 844 | 914.665 | 18.062 | 5.662 | |
| YP_001816867 | NADH dehydrogenase subunit 6 (mitochondrion) | Skin | 1778 | 696.441 | 14.011 | 5.635 | |
| XP_018605820 | Armadillo repeat protein deleted in velo-cardio-facial syndrome | Skin | 724 | 3.473 | 178.973 | -5.867 | |
| XP_018590687 | Metastasis-associated protein MTA1 isoform X3 | Skin | 519 | 4.845 | 220.294 | -5.506 | |
| XP_018600885 | Zinc finger protein GLIS2 | Skin | 765 | 3.287 | 129.527 | -5.300 | |
| XP_018611035 | Alpha-1-macroglobulin | Skin | 4,784 | 6.307 | 183.225 | -4.860 | |
| XP_008509893 | Catenin alpha-2 | Skin | 1,262 | 7.970 | 211.391 | -4.729 | |
| XP_021347235 | Uncharacterized protein LOC110446415 | Skin | 563 | 4.466 | 108.308 | -4.599 | |
| XP_018595532 | UDP-GalNAc:beta-1,3-N-acetylgalactosaminyltransferase 2 | Skin | 484 | 5.195 | 110.238 | -4.407 | |
| XP_015193432 | Zinc finger protein 384 isoform X1 | Skin | 514 | 4.892 | 103.804 | -4.407 | |
| XP_018593664 | Retinol-binding protein 1 isoform X1 | Skin | 940 | 10.700 | 186.500 | -4.123 | |
| XP_016120336 | DENN domain-containing protein 5A | Skin | 696 | 10.838 | 164.271 | -3.921 | |
| XP_018618982 | Serum amyloid A-5 | Liver | 865 | 1198.344 | 27.998 | 5.419 | |
| XP_018613206 | Complex I assembly factor TIMMDC1, mitochondrial | Liver | 1,435 | 103.641 | 19.690 | 2.396 | |
| XP_018615415 | Adenylate kinase 2, mitochondrial isoform X2 | Liver | 939 | 148.787 | 30.091 | 2.305 | |
| XP_018602541 | Sodium- and chloride-dependent taurine transporter isoform X1 | Liver | 3,875 | 394.273 | 81.250 | 2.278 | |
| XP_018589419 | Heat shock 70 kDa protein | Liver | 2,516 | 128.970 | 30.482 | 2.080 | |
| XP_018587675 | Interferon-inducible GTPase 5 | Liver | 1,712 | 118.461 | 28.293 | 2.065 | |
| XP_018597164 | Bile salt export pump isoform X2 | Liver | 1,833 | 186.862 | 52.851 | 1.821 | |
| XP_018580468 | Adipocyte plasma membrane-associated protein | Liver | 1,464 | 132.372 | 41.357 | 1.678 | |
| XP_018620258 | Thioredoxin-interacting protein isoform X2 | Liver | 2,347 | 247.711 | 77.393 | 1.678 | |
| XP_018591791 | Glutathione S-transferase kappa 1 | Liver | 781 | 115.411 | 361.786 | 1.673 | |
| XP_018618234 | Acyl-coenzyme A thioesterase 2, mitochondrial | Liver | 3,719 | 35.143 | 347.319 | -3.304 | |
| XP_016400483 | DNA ligase 1 | Liver | 533 | 169.113 | 106.024 | -2.648 | |
| NP_001187932 | SUMO-conjugating enzyme UBC9 | Liver | 924 | 24.387 | 122.318 | -2.326 | |
| XP_018611927 | Eukaryotic translation initiation factor 4E-binding protein 3 | Liver | 2,058 | 37.228 | 151.025 | -2.020 | |
| XP_018604956 | Angiotensinogen | Liver | 2,000 | 137.457 | 530.800 | -1.949 | |
| XP_016106557 | Fatty acid synthase isoform X1 | Liver | 2,327 | 42.608 | 150.913 | -1.824 | |
| XP_018598262 | Glioma tumor suppressor candidate region gene 2 protein | Liver | 1,484 | 30.369 | 100.640 | -1.728 | |
| XP_018614615 | Cytochrome c oxidase subunit 7C, mitochondrial | Liver | 400 | 56.335 | 181.642 | -1.688 | |
| XP_018586503 | Calreticulin | Liver | 1,717 | 73.495 | 230.388 | -1.648 | |
| XP_018611791 | Fibrinogen alpha chain | Liver | 2,620 | 78.690 | 243.471 | -1.629 |
a) Symbol used to represent the gene
b) Access number in RefSeq
c) Gene Description
d) Tissue where the gene was up-regulated in Arapaima
e) Transcript size in bp
f) RPKM value obtained for the transcript in the A. gigas male
g) RPKM value obtained for the transcript in the A. gigas female
h) The absolute value of log2FC ≥ 1 or ≤-1 means the magnitude of up or down-regulation for each gene, positive values indicate up-regulated genes expression in male and negative in female.