| Literature DB >> 28282934 |
Mi Ae Kim1, Jae-Sung Rhee2, Tae Ha Kim3, Jung Sick Lee4, Ah-Young Choi5, Beom-Soon Choi6, Ik-Young Choi7, Young Chang Sohn8.
Abstract
In order to characterize the female or male transcriptome of the Pacific abalone and further increase genomic resources, we sequenced the mRNA of full-length complementary DNA (cDNA) libraries derived from pooled tissues of female and male Haliotis discus hannai by employing the Iso-Seq protocol of the PacBio RSII platform. We successfully assembled whole full-length cDNA sequences and constructed a transcriptome database that included isoform information. After clustering, a total of 15,110 and 12,145 genes that coded for proteins were identified in female and male abalones, respectively. A total of 13,057 putative orthologs were retained from each transcriptome in abalones. Overall Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways analyzed in each database showed a similar composition between sexes. In addition, a total of 519 and 391 isoforms were genome-widely identified with at least two isoforms from female and male transcriptome databases. We found that the number of isoforms and their alternatively spliced patterns are variable and sex-dependent. This information represents the first significant contribution to sex-preferential genomic resources of the Pacific abalone. The availability of whole female and male transcriptome database and their isoform information will be useful to improve our understanding of molecular responses and also for the analysis of population dynamics in the Pacific abalone.Entities:
Keywords: Haliotis discus hannai; PIS system; abalone; isoform; transcriptome
Year: 2017 PMID: 28282934 PMCID: PMC5368703 DOI: 10.3390/genes8030099
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Schematic diagram of pipeline to isoforms of full-length complementary DNA (cDNA) sequence (PIS system).
Platforms for establishing gene sets of female and male Haliotis discus hannai.
| Step | Data | Platform | Female | Male |
|---|---|---|---|---|
| 1 | High quality consensus sequence | RS_IsoSeq | 22,494 | 18,981 |
| 2 | Non-redundant representative sequence | CD-HIT | 18,692 | 15,271 |
| 3 | Reference isoforms | BLASTCLUST and TransDecoder | 15,363 | 12,409 |
| 4 | Final isoform transcriptome by combine representative sequence | GMAP and ToFU | 15,792 | 12,718 |
| 5 | Final gene set with representative isoforms | TransDecoder | 15,110 | 12,145 |
Figure 2Homology searches of the female and male Haliotis discus hannai transcript contigs. (A) Number of BLAST hits; (B) top-hit phylum distribution; (C) top-hit class distribution.
Figure 3Gene discovery rate of each transcriptome database. Venn diagram to compare ortholog numbers annotated in the female and male Haliotis discus hannai.
Ortholog statistics between female and male Haliotis discus hannai.
| Number of Total Orthologs | 13,057 |
|---|---|
| Average identity (%) | 99.33 |
| Average coverage of female (%) | 85.23 |
| Average coverage of male (%) | 90.63 |
| Number of 100% coverage orthologs | 502 |
| Number of 100% identity orthologs | 597 |
Figure 4Gene Ontology (GO) analysis in terms of (A) molecular function, (B) cellular component, and (C) biological process that are enriched in the female and male Haliotis discus hannai transcript contigs.
Summary of the isoform information for Haliotis discus hannai.
| Contig data | Female | Male |
|---|---|---|
| Total genes | 15,110 | 12,145 |
| Genes with no isoforms | 14,591 | 11,754 |
| Genes with at least two isoforms | 519 | 391 |
| Total length of genes with isoforms (bp) | 1,599,611 | 1,166,159 |
| Average length (bp) | 3082 | 2982 |
| Maximum length (bp) | 8315 | 8058 |
| Minimum length (bp) | 741 | 847 |
List of top-ranked genes containing over five isoforms in the female abalone.
| Cluster ID | Length (bp) | #Isoform | Description | Matched Species | GenBank No. |
|---|---|---|---|---|---|
| F_Cluster00018 | 6565 | 27 | deleted in malignant brain tumors one protein | ZDB-GENE-060228-6 | |
| F_Cluster11205 | 2162 | 11 | - | - | - |
| F_Cluster00024 | 6380 | 9 | PREDICTED: cubilin | ZDB-GENE-060228-6 | |
| F_Cluster00089 | 4372 | 9 | PREDICTED: cubilin-like | ZDB-GENE-060228-6 | |
| F_Cluster13261 | 1675 | 9 | - | - | - |
| F_Cluster00812 | 3837 | 8 | PREDICTED: cyclin-L1-like | H2U6Q2 | |
| F_Cluster00002 | 8315 | 7 | PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, epidermal growth factor (EGF) and pentraxin domain-containing protein 1 | F1MNH3 | |
| F_Cluster00829 | 3833 | 6 | - | - | - |
| F_Cluster03162 | 3343 | 6 | PREDICTED: serine/arginine-rich splicing factor 6-like isoform X1 | A0A0D9SEM4 | |
| F_Cluster05356 | 3088 | 6 | heterogeneous nuclear ribonucleoprotein L, partial | R4GHI6 | |
| F_Cluster11593 | 2114 | 6 | - | - | - |
| F_Cluster00004 | 7437 | 5 | hypothetical protein LOTGIDRAFT_214098 | NP_001116989.1 | |
| F_Cluster00011 | 6791 | 5 | hypothetical protein AC249_AIPGENE2795 | F1NX90 | |
| F_Cluster10316 | 2268 | 5 | PREDICTED: Na(+)/H(+) exchange regulatory cofactor NHE-RF1-like | XP_414851.3 | |
| F_Cluster12757 | 1916 | 5 | - | - | - |
List of top-ranked genes containing over five isoforms in the male abalone.
| Cluster ID | Length (bp) | #Isoform | Description | Matched species | GenBank No. |
|---|---|---|---|---|---|
| M_Cluster00016 | 6579 | 52 | hypothetical protein cypCar_00021969, partial | ZDB-GENE-060228-6 | |
| M_Cluster00705 | 3598 | 11 | PREDICTED: cyclin-L1-like | H2U6Q2 | |
| M_Cluster00017 | 6523 | 9 | PREDICTED: cubilin | ZDB-GENE-060228-6 | |
| M_Cluster01458 | 3359 | 8 | PREDICTED: mesocentin-like | - | |
| M_Cluster09253 | 1770 | 8 | - | - | - |
| M_Cluster09585 | 1697 | 8 | - | - | - |
| M_Cluster00226 | 3901 | 7 | hypothetical protein LOTGIDRAFT_115468 | XP_002415964.1 | |
| M_Cluster00908 | 3513 | 7 | serine-arginine protein 55 | E1C270 | |
| M_Cluster00059 | 4569 | 5 | hypothetical protein LOTGIDRAFT_200884 | NP_001040037.1 | |
| M_Cluster00931 | 3505 | 5 | - | - | - |
| M_Cluster01425 | 3366 | 5 | - | - | - |
| M_Cluster01740 | 3296 | 5 | putative splicing factor, arginine/serine-rich 7 | NP_064477.1 | |
| M_Cluster01748 | 3295 | 5 | - | - | - |
| M_Cluster06621 | 2255 | 5 | PREDICTED: tryptophan 2,3-dioxygenase-like | M3X838 | |
| M_Cluster09868 | 1649 | 5 | - | - | - |
Figure 5Cubilin isoforms in female and male Pacific abalone. Cubilin isoforms were determined by mapping consensus sequences to cubilin reference genes. Nine isoforms were defined in both female and male abalone, but only three isoforms showed the sequence in both sexes.
Figure 6Relative mRNA expression profiles of the six selected genes from sexually mature Haliotis discus hannai by quantitative real-time RT-PCR analysis. The genes overexpressed in females were vitellogenin (Hdh-VTG) (A) and forkhead box protein L2 (Hdh-FOXL2) (B), but not condensin-2 (Hdh-condensin-2) (C). The genes overexpressed in males were sperm-associated antigen 6 (Hdh-SPAG6) (D), protein fem-1 homolog C-like (Hdh-FEM1-like) (E), and tektin-1 (Hdh-tektin-1) (F). Statistical changes were determined by the Student’s t test (two-tailed) and are denoted as follows: ** p < 0.01. M and F indicate female and male, respectively.