Leonard Whye Kit Lim1, Hung Hui Chung1, Han Ming Gan2,3. 1. Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Sarawak, Malaysia. 2. GeneSEQ Sdn Bhd, Bukit Beruntung, 48300 Rawang, Selangor, Malaysia. 3. Centre for Integrative Ecology, School of Life and Environmental Sciences, Deakin University, Geelong, Victoria, Australia.
A total of 52 ABC transporters were discovered from the striped catfish genome.Duplicated genes such as ABCA1, ABCB3, ABCB6, ABCC5, ABCD3, ABCE1, ABCF2 as well as ABCG2 were uncovered within the striped and channel catfish genomes.The motif analysis has revealed several exclusive characteristics of some catfishes which provides unique genomic landscape for future ecotoxicological, biochemical and physiological researches.
INTRODUCTION
The striped catfish (Pangasianodon hypophthalmus), also known as the tra catfish, is a freshwater riverine belonging to the Pangasiidae family. This catfish is a facultative air-breather whereby it utilises its swim bladder as an air-breathing organ, unlike the channel catfish (Ictalurus punctatus) that is unable to breathe in the air (Ma ). This fish was first cultured widely in Vietnam and was later introduced to countries like Malaysia, China, Philippines, Singapore and India (Gao ). The tra catfish is highly appreciated for its unsaturated fatty acids content (Kim ). Around 59% of the striped catfish weight is contributed by the crude fat found within the head and flab parts. Surprisingly, the unsaturated fatty acids in the crude fat encompasses a sum of 60% (Gao ). This economically important fish should be researched on immediately, especially on its detoxification aspect (Nguyen ), apart from its growth and nutrient improvement aspects. Hence, one of the most renowned detoxification efflux pump orchestrators, namely the ATP-binding cassette (ABC) transporters, was selected as the main focus of this work.The ABC transporters are present in all organisms including human and microorganisms, making up one of the largest protein families known to date (Liu ). These transporters translocate substrates such as toxins, xenobiotics, ions and metabollites across membranes via their conserved nucleotide-binding domains (NBDs) and transmembrane domains (TMDs). There are six major subfamilies encompassing protein members from ABCA to ABCH, each of them is associated with various essential biological functions (Andersen ; Guo ); Park ). The ABCH1 gene is exclusively absent in mammals, but present in insects, mold, fish and other vertebrates (Popovic ). The subfamily members of ABCA, ABCB and ABCC are full transporters whereas the subfamily members of ABCD, ABCE/ABCF and ABCG/ABCH contain only half transporters. These proteins have been proven as pivotal agrochemical and ecotoxicological markers in fishes closely related to striped catfish, namely zebrafish, Sarawak rasbora, common carp and channel catfish, in assessing the water quality of a natural habitat (Popovic ; Liu ; 2016; Lim ; Yeaw ; Md Yusni ; Lai ; Lim ). By elucidating the functional roles of these proteins in indicator fish species like the tra catfish, we can finally decipher the underlying detoxification mechanism that is extremely useful in aiding us to further enhance and strengthen the immunity system of this valuable aquaculture fish in order to minimise economical loss due to infections in the fish farming industry (Chung ; Lau ; 2021b; Lim ). Therefore, in this study, we aimed to unearth and characterise the 52 genome-wide ABC proteins of striped catfish in hope to further enrich the detoxification database of this catfish to support future metabolism studies and conservation endeavors.
METHODOLOGY
Transcriptome Sequencing and Genome-wide Identification of ABC Proteins
The juvenile striped catfish (body length: 5 cm) was sampled from an aquaculture centre in Kuching, Sarawak, Malaysia (GPS coordinates: 1.4700676917664783, 110.33115299732843). The fish was deposited as voucher specimen (voucher ID: ASD03019) in the fish museum situated inside the Faculty of Resource Science and Technology, Universiti Malaysia Sarawak. The euthanisation of the fish was conducted in compliance to the guidelines and permissions granted by the Animal Ethics Committee of Universiti Malaysia Sarawak with permit number UNIMAS/TNC(PI)-04.01/06-09(17). The total RNA of striped catfish was extracted from the muscle tissues using Wizol TriZol-like reagent (WizBio, Republic of Korea) according to the manufacturer’s protocol. The purified total RNA was then subjected to mRNA enrichment process employing poly-T magnetic bead (NEB, England). Next, the library was prepared from the enriched mRNAs using NEB Ultra II RNA library (NEB, England). Transcriptome sequencing was then performed on an Illumina NovaSeq6000 (2 x 150 bp). Transcriptome assembly statistics were generated using QUAST (Gurevich ). The genome-wide ABC protein sequences of channel catfish (Liu ) were retrieved from the public GenBank database, to be used as a reference. The genome scaffolds of striped catfish were blasted against the reference ABC protein sequences of channel catfish via DIAMOND (Buchfink ) with the cutoff value set at above 70% similarity. Additional protein BLAST searches (>70% identity cutoff value) were also conducted across ABC sequences of other catfishes (Asian redtail catfish, yellowhead catfish, black bullhead catfish, channel catfish, walking catfish, Chinese large-mouth catfish and giant devil catfish) from the GenBank database to look for ABC proteins not present in channel catfish.
ABC Protein Characterisation and Phylogenetic Analysis
The ABC proteins of striped catfish were first subjected to amino acid length, molecular weight and theoretical isoelectric point (pI) predictions via Sequence Manipulation Suite (Stothard 2000). Next, the subcellular localisation was predicted employing CELLO server version 2.5 (Yu ). The domain structure prediction was conducted utilising SMART tool (Letunic ). A motif analysis was performed for each ABC subfamily using MEME suite tool (Bailey ). Motif identity was determined through NCBI conserved domain search (Yang ) and ScanProsite (De Castro ). The ABC proteins were aligned before undergoing a model test using MEGA X (Kumar ). The selected model is the JTT+I+G (Jones-Taylor-Thornton matrix with consideration of invariant sites [+I] and gamma distribution for modeling rate heterogeneity [+G]) model. Phylogenetic tree was constructed using MEGA X (Kumar ) based on all ABC proteins across different catfishes besides the individual trees plotted representing each ABC subfamily, with 1000 bootstrap replications and maximum likelihood criterion.
RESULTS AND DISCUSSION
Transcriptome Sequencing, Motif and Phylogenetic Analysis of ABC Transporters in Striped Catfish
The transcriptome of the striped catfish was sequenced (GenBank SRA accession number: SRR17621361) and the assembly statistics were enumerated using QUAST (Gurevich ). A total of 154,159 contigs were unearthed from the striped catfish genome (Table 1). About 31.17% of them have length of at least 1000 bp. Approximately 2.36% and 0.08% of them have at least 5000 bp and 10,000 bp, respectively. No contigs were found to have length larger than 25,000 bp and 50,000 bp as the largest length was 17,647 bp. The total length of contigs is 143,584,872 bp with 47.92% of them having more than 500 bp in length. The total GC content recorded was 44.52%. The N50 and N75 of the tra catfish transciptome assembly are 2,729 bp and 1,594 bp, respectively. This indicates that the median value of contig length is around 2,729 bp, a skew towards shorter contigs was observed for this assembly. This value is much lower to that determined by Kim where the N50 calculated was 14.29 Mbp. On the other hand, the L50 and L75 were determined at 17,010 and 34,065, respectively. This means that almost 11% of total contigs (17,010 contigs) encompass half of the total base content of the tra catfish transcriptome assembly. No N nucleotide was detected per 100 kbp.
Table 1
The transcriptome sequencing data.
Number of contigs (>= 0 bp)
154,159
Number of contigs (>= 1000 bp)
48,049
Number of contigs (>= 5000 bp)
3,634
Number of contigs (>= 10,000 bp)
126
Number of contigs (>= 25,000 bp)
0
Number of contigs (>= 50,000 bp)
0
Total length (>= 0 bp)
168,216,156
Total length (>= 1000 bp)
125,526,508
Total length (>= 5000 bp)
23,574,416
Total length (>= 10,000 bp)
1,464,951
Total length (>= 25,000 bp)
0
Total length (>= 50,000 bp)
0
Total number of contigs (>= 500 bp)
73,879
Largest contig
17,647
Total length
143,584,872
GC (%)
44.52
N50
2,729
N75
1,594
L50
17,010
L75
34,065
Number of Ns per 100 kbp
0.00
A sum of 52 ABC transporter genes were discovered in the striped catfish genome. Their amino acid lengths, molecular weights, subcellular localisations, theoretical isoelectric points as well as domain structures were summarised in Table 2. Domain structure was only compared against the channel catfish as this is the only catfish with full genome sequenced apart from the striped catfish while the ABC gene number within the genome was contrasted against several model organisms to provide an interspecies landscape view of ABC proteins evolution (Table 3). The identified 52 ABC transporters were divided into six subfamilies: 9 ABCAs, 13 ABCBs, 12 ABCCs, 5 ABCDs, 2 ABCEs, 4 ABCFs, 6 ABCGs and 1 ABCH. Motif (Figs. S1–S6 [see Appendices]) and phylogenetic analysis (Figs. 1–7) were conducted across all striped catfish ABC subfamilies.
Table 2
The protein profiles of the 52 striped catfish ABC proteins.
Gene/Protein
Amino acid length (aa)
Molecular weight (kDa)
Theoretical isoelectric point (pI)
Subcellular localisation
Domain structure
ABCA1a
2269
254.54
6.24
Plasma membrane
(6/7TMD-NBD)2
ABCA1b
2269
254.27
6.07
Plasma membrane
(6/7TMD-NBD)2
ABCA1-like
2287
255.56
7.52
Plasma membrane
(7TMD-NBD)2
ABCA2
2502
279.74
6.48
Plasma membrane
(8/5TMD-NBD)2
ABCA3
1710
191.69
6.37
Plasma membrane
(7TMD-NBD)2
ABCA4
2341
264.94
6.46
Plasma membrane
(5/6TMD-NBD)2
ABCA5
1653
186.09
6.96
Plasma membrane
(7TMD-NBD)2
ABCA7
2328
260.86
6.38
Plasma membrane
(5/6TMD-NBD)2
ABCA12
3026
338.61
5.25
Plasma membrane
(5/6TMD-NBD)2
ABCB1
1337
147.32
8.92
Plasma membrane
(5/6TMD-NBD)2
ABCB2
726
81.27
7.63
Plasma membrane
7TMD-NBD
ABCB3
717
80.93
9.55
Plasma membrane
7TMD-NBD
ABCB3-like
719
80.86
8.53
Plasma membrane
5TMD-NBD
ABCB4
1277
141.08
8.56
Plasma membrane
(6/5TMD-NBD)2
ABCB5
1653
186.09
6.96
Plasma membrane
(7TMD-NBD)2
ABCB6-1
852
96.25
8.14
Plasma membrane
11TMD-NBD
ABCB6-2
650
73.61
7.28
Plasma membrane
6TMD-NBD
ABCB7
744
81.99
9.47
Plasma membrane
5TMD-NBD
ABCB8
709
76.77
10.15
Plasma membrane
5TMD-NBD
ABCB9
787
87.07
5.55
Plasma membrane
7TMD-NBD
ABCB10
704
77.04
9.34
Plasma membrane
5TMD-NBD
ABCB11
1324
146.18
7.22
Plasma membrane
5TMD-NBD
ABCC1
1515
169.65
7.75
Plasma membrane
5TMD-(6/5TMD-NBD)2
ABCC2
1565
175.49
8.62
Plasma membrane
5TMD-(6/5TMD-NBD)2
ABCC3
1538
173.74
6.25
Plasma membrane
5TMD-(4/5TMD-NBD)2
ABCC4
1324
149.15
8.16
Plasma membrane
(7/5TMD-NBD)2
ABCC5
1423
158.27
8.23
Plasma membrane
(6/5TMD-NBD)2
ABCC5-like
1398
155.51
8.37
Plasma membrane
(5/4TMD-NBD)2
ABCC6
1506
169.54
6.76
Plasma membrane
5TMD-(5/2TMD-NBD)2
ABCC7
1480
168.51
8.72
Plasma membrane
(4/6TMD-NBD)2
ABCC8
1631
182.48
7.11
Plasma membrane
5TMD-(6/4TMD-NBD)2
ABCC9
1595
180.19
6.23
Plasma membrane
5TMD-(6/5TMD-NBD)2
ABCC10
1547
171.98
6.74
Plasma membrane
5TMD-(6/5TMD-NBD)2
ABCC12
1410
158.07
7.86
Plasma membrane
(6/5TMD-NBD)2
ABCD1
780
88.53
8.79
Plasma membrane and mitochondria
NBD
ABCD2
746
83.10
9.17
Mitochondria
NBD
ABCD3a
658
73.58
9.70
Plasma membrane
4TMD-NBD
ABCD3b
665
75.66
9.39
Plasma membrane
4TMD-NBD
ABCD4
603
68.65
8.03
Plasma membrane
5TMD-NBD
ABCE1-1
599
67.35
8.30
Cytoplasm and nucleus
NBD-NBD
ABCE1-2
599
67.60
8.46
Cytoplasm and nucleus
NBD-NBD
ABCF1
864
97.93
5.98
Nucleus
NBD-NBD
ABCF2-1
609
69.85
6.90
Cytoplasm and nucleus
NBD-NBD
ABCF2-2
609
69.66
6.93
Cytoplasm and nucleus
NBD-NBD
ABCF3
710
80.07
5.33
Cytoplasm
NBD-NBD
ABCG1
672
74.96
6.89
Plasma membrane
NBD-7TMD
ABCG2-1
643
72.15
8.76
Plasma membrane
NBD-5TMD
ABCG2-2
656
73.41
8.81
Plasma membrane
NBD-5TMD
ABCG4
643
72.07
7.85
Plasma membrane
NBD-7TMD
ABCG5
655
73.61
8.90
Plasma membrane
NBD-6TMD
ABCG8
679
76.82
6.71
Plasma membrane
NBD-5TMD
ABCH1
720
79.52
8.51
Plasma membrane
NBD-6TMD
Table 3
The ABC gene number comparison across various vertebrates.
Gene
Striped catfish
Channel catfish
Zebrafish
Medaka
Fugu
Tetradon
Stickleback
Tilapia
Cod
Mouse
Coelacanth
Human
ABCA1
3
3
2
2
2
2
2
2
2
1
1
1
ABCA2
1
1
1
0
0
0
0
0
0
1
0
1
ABCA3
1
1
1
1
1
1
1
1
1
1
1
1
ABCA4
1
1
2
2
2
1
2
2
2
1
1
1
ABCA5
1
1
1
0
1
1
1
1
1
1
1
1
ABCA6
0
0
0
0
0
0
0
0
0
1
0
1
ABCA7
1
1
1
1
1
1
1
0
1
1
1
1
ABCA8
0
0
0
0
0
0
0
0
0
2
0
1
ABCA9
0
0
0
0
0
0
0
0
0
1
0
1
ABCA10
0
0
0
0
0
0
0
0
0
0
0
1
ABCA12
1
1
1
1
1
0
1
1
1
1
1
1
ABCA13
0
0
0
0
0
0
0
0
0
1
0
1
ABCA14
0
0
0
0
0
0
0
0
0
1
0
0
ABCA15
0
0
0
0
0
0
0
0
0
1
0
0
ABCA17
0
0
0
0
0
0
0
0
0
1
0
0
ABCB1
1
1
1
1
0
0
1
1
1
2
0
1
ABCB2
1
1
1
1
1
1
1
1
1
1
1
1
ABCB3
2
2
1
0
0
0
0
0
0
1
0
1
ABCB4
1
0
0
0
0
0
0
0
0
1
0
1
ABCB5
1
1
1
0
0
0
0
0
0
1
1
1
ABCB6
2
2
2
1
2
2
2
2
1
1
1
1
ABCB7
1
1
1
1
1
1
1
1
1
1
1
1
ABCB8
1
1
1
1
1
1
1
1
1
1
1
1
ABCB9
1
1
1
1
1
1
1
0
1
1
1
1
ABCB10
1
1
1
1
1
1
1
1
1
1
1
1
ABCB11
1
1
2
1
1
1
1
0
1
1
1
1
ABCC1
1
1
1
1
1
1
1
1
0
1
1
1
ABCC2
1
1
1
1
1
1
1
1
1
1
1
1
ABCC3
1
1
1
1
1
1
1
1
0
1
0
1
ABCC4
1
1
1
2
3
4
4
3
3
1
1
1
ABCC5
2
2
1
2
1
1
1
2
1
1
0
1
ABCC6
1
1
3
1
1
2
2
1
1
1
1
1
ABCC7
1
1
1
0
0
0
0
0
1
1
0
1
ABCC8
1
1
3
1
1
1
1
1
1
1
1
1
ABCC9
1
1
1
0
0
0
0
0
0
1
1
1
ABCC10
1
1
1
0
1
1
1
1
1
1
1
1
ABCC11
0
0
0
0
0
0
0
0
0
0
0
1
ABCC12
1
1
1
0
1
1
1
0
1
1
0
1
ABCC13
0
0
1
0
0
0
0
0
0
0
0
0
ABCD1
1
1
1
1
1
1
1
1
0
1
1
1
ABCD2
1
1
2
1
1
0
1
1
1
1
1
1
ABCD3
2
2
2
1
1
1
1
1
1
1
1
1
ABCD4
1
1
1
1
1
1
1
1
1
1
1
1
ABCE1
2
2
1
1
1
1
1
1
3
1
1
1
ABCF1
1
1
1
1
1
3
1
1
1
1
1
1
ABCF2
2
2
2
1
1
1
1
1
1
1
1
1
ABCF3
1
1
1
1
1
1
1
1
1
1
1
1
ABCG1
1
1
1
1
1
0
0
1
0
1
1
1
ABCG2
2
2
3
2
2
2
2
2
2
1
0
1
ABCG3
0
0
0
0
0
0
0
0
0
1
0
0
ABCG4
1
1
2
2
3
2
2
2
1
1
1
1
ABCG5
1
1
1
1
1
1
1
1
1
1
1
1
ABCG8
1
1
1
1
1
1
1
1
1
1
1
1
ABCH1
1
0
1
0
0
0
0
0
0
0
0
0
Figure 1
The maximum likelihood phylogenetic tree of ABCA subfamily, with 1000 bootstrap replications.
Figure 2
The maximum likelihood phylogenetic tree of ABCB subfamily, with 1000 bootstrap replications.
Figure 3
The maximum likelihood phylogenetic tree of ABCC subfamily, with 1000 bootstrap replications.
Figure 4
The maximum likelihood phylogenetic tree of ABCD subfamily, with 1000 bootstrap replications.
Figure 5
The maximum likelihood phylogenetic tree of ABCE-ABCF subfamily, with 1000 bootstrap replications.
Figure 6
The maximum likelihood phylogenetic tree of ABCG-ABCH subfamily, with 1000 bootstrap replications.
Figure 7
The maximum likelihood phylogenetic tree of all ABC proteins of catfishes, with 1000 bootstrap replications.
High Motif Conservation Observed Across the ABCA Subfamily Members
A total of nine ABCA genes were uncovered from the striped catfish genome, which encompasses ABCA1a, ABCA1b, ABCA-like, ABCA2, ABCA3, ABCA4, ABCA5, ABCA7 and ABCA12. All of them are full transporters and are located within the plasma membrane. The tra catfish ABCA12 is the largest protein of ABCA subfamily (338.61 kDa with 3026 aa) with the lowest pI of 5.25 (Table 2). The smallest P. hypophthalmus ABCA member is the ABCA3 (191.69 kDa with 1710 aa) but the highest pI value belongs to that of the ABCA1-like protein (7.52). The domain structures of tra catfish ABCA subfamily members mirrored that of the channel catfish with some exceptions and incomplete data impeding comparison, this holds true for ABCA1a, ABCA1b, ABCA1-like, ABCA2, ABCA5 and ABCA3. The coding sequences of channel catfish ABCA4, ABCA7 and ABCA12 are unavailable for comparison due to partial sequences deposited (Liu ).Looking at the motif distribution (Fig. S1: Appendix A), the ABCA3 and ABCA7 proteins are greatly conserved across the catfishes in terms of motif position, motif length and motif amount. Relatively high motif conservation pattern was also observed in ABCA1, ABCA2 and ABCA5 proteins across all catfishes. The motif distances of ABCA4 proteins varied significantly, especially that from black bullhead catfish and channel catfish. The spaces between the motifs within the ABCA12 proteins do not differ across all six catfishes. There is one missing motif (Motif 11, the retinal-specific rim ABC transporter [photoreceptor protein]) from the ABCA5 of black bullhead catfish. This motif is responsible in various pathways such as phosphotidylethanolamine flippase activity, phospholipid transporter activity, lipid transporter activity, ATP binding as well as ATPase-coupled intermembrane and transmembrane transporter activity (The UniProt Consortium 2021). However, the functional loss impacted by this motif loss is minor as there is high abundance of this motif within the protein.The ABCA phylogenetic tree (Fig. 1) depicted that all ABCA proteins formed a cluster with their respective counterparts from other catfishes. The ABCA1a of channel catfish is more closely related to the ABCA1 of Asian redtail catfish instead of ABCA1a of the striped catfish. However, the ABCA1b proteins of both channel and striped catfish are grouped together with high bootstrap value of 96%. The ABCA1-like proteins of both striped and channel catfish are located far away from the ABCA1 cluster and are more closely associated with the ABCA7 group. A similar phenomenon has been previously observed by Liu on channel catfish. They postulated that the ABCA7 proteins were possibly derived from ABCA1 by duplication event (Liu ). Only the striped and channel catfishes contain three ABCA1 genes (Table 3) when interspecies ABC gene number were compared across a diverse range of well-documented model organisms. The ABCA6, ABCA8 and ABCA9 genes were found exclusively in mouse and human whereas the ABCA10 is exclusive to human only.
ABCB4 is Present in Striped Catfish, Human, Mouse and Zebrafish, but Not in Channel Catfish
There are 13 ABCB family members identified from the striped catfish genome, namely ABCB1, ABCB2, ABCB3, ABCB3-like, ABCB4, ABCB5, ABCB6-1, ABCB6-2, ABCB7, ABCB8, ABCB9, ABCB10 and ABCB11. All ABCB transporters localise within the plasma membrane. The largest and smallest member of the ABCB subfamily of tra catfish are ABCB5 and ABCB6-2, respectively (Table 2). The highest and lowest recorded pI for the P. hypophthalmus ABCB subfamily are 10.15 (ABCB8) and 5.55 (ABCB9) correspondingly. The three full transporters of the tra catfish ABCB subfamily are the ABCB1, ABCB5 and ABCB11 proteins whereas the others are all half transporters. These transporters are conserved across channel and striped catfishes, except for ABCB6-1 where there are 11 TMDs in the striped catfish in contrast to the nine present in the channel catfish ABCB6-1 (Liu ). The ABCB4 protein is absent in channel catfish (Liu ) but present in mouse, human, zebrafish and tra catfish (Table 3).Based on Fig. S2 (Appendix B), the motif distribution of the catfishes ABCB subfamily displayed a conserved pattern with some minor exceptions. For instance, Motif 11 (ABC Signature, Walker B and D-loop) and Motif 9 (P-loop containing nucleoside triphosphate hydrolases) are missing from the ABCB2 of the walking catfish and ABCB11 of Asian redtail catfish correspondingly. Besides, Motif 15 (Uncharacterised motif) is also absent in ABCB3 of the black bullhead catfish, ABCB5 of channel catfish and ABCB5 of walking catfish. Interestingly, the Motif 9 (P-loop containing nucleoside triphosphate hydrolases) of striped catfish ABCB5 was found at different location as compared to ABCB5 from other catfishes. This Motif 9 is responsible for various biological processes such as ATPase activity, ATP binding, hydrolase activity as well as zinc ion binding (The UniProt Consortium 2021).The maximum likelihood phylogenetic tree of ABCB subfamily (Fig. 2) depicted distinctive clades separating the ABCB members accordingly. The ABCB3-like proteins of channel and tra catfish formed a strong clade of their own with 100% bootstrap value but they are still residing the same clade of ABCB3. Similarly, the ABCB6-1 proteins of striped and channel catfish obtained a bootstrap value of 100% for their cluster but that is not the case for ABCB6-2 proteins where the channel catfish ABCB6-2 is more closely related to the ABCB6 of black bullhead catfish than the ABCB6-2 of striped catfish. Liu had previously reported a close synteny across ABCB1 and ABCB5 genes of channel catfish in comparison to that from Xenopus, chicken, mouse, human, zebrafish and medaka. The phylogenetic tree plotted by Liu showed the ABCB1 and ABCB5 clades in close proximity, however in this study these two clades are sandwiching the ABCB4 clade. It is postulated that the lack of ABCB4 proteins from fish (only human and mouse ABCB4 used) included in the phylogenetic tree by Liu has caused the weak separation between ABCB1 and ABCB5 proteins as the bootstrap values were low across these two clades in Liu .
All Striped Catfish ABCC Subfamily Proteins Mirrored that of the Channel Catfish
The ABCC subfamily of striped catfish encompasses a sum of twelve ABCC members, including ABCC1, ABCC2, ABCC3, ABCC4, ABCC5, ABCC5-like, ABCC6, ABCC7, ABCC8, ABCC9, ABCC10 and ABCC12. All of them are full transporters and they are localised in the plasma membrane of the cell. The 1631 aa ABCC8 is the lengthiest member with a documented molecular weight of 182.48 kDa whereas the 1324 aa ABCC4 is the smallest of all with molecular weight of 149.15 kDa. The ABCC7 has the highest pI of 8.72 while the lowest pI belongs to that of ABCC9. All tra catfish ABCC domain structures echoed that of the channel catfish (Liu ). The ABCC11 gene was only found in human whereas the ABCC13 protein is exclusive to zebrafish only (Table 3).Referring to Fig. S3 (Appendix C), the motif distribution pattern is very conserved across all ABCC protein members of all catfishes, except for the ABCC1 of channel catfish and ABCC2 of giant devil catfish. Both Motif 2 (ABC Signature, Walker B and D-loop) and Motif 9 (H-loop/switch region) are missing from channel catfish ABCC1 as well as giant devil catfish ABCC2. Additionally, Motif 4 (Q-loop/lid) was also not found in the ABCC1 of channel catfish. This Q-loop, or the lid, is an indispensable element in couple drug binding to the ATP catalytic cycle (Zolnerciks ). One notable observation is that the ABCC7 of striped catfish has Motif 9 (H-loop/switch region), which this motif is absent in all ABCC7 of other catfishes. H-loop, or the switch region, is obligated to catalyse ATP hydrolysis (Zhou ).The phylogenetic tree representing the ABCC subfamily (Fig. 3) illustrates the well-supported annotations of ABCC proteins. The multi-drug resistance proteins (ABCC1, ABCC2, ABCC3, ABCC6, ABCC9 and ABCC10) formed a monophyletic cluster whereas the rest formed another. The ABCC5-like proteins of striped catfish and channel catfish formed a strong cluster with bootstrap score of 100%. Yet, they are still attached to the major ABCC5 clade with bootstrap value of 100%. Generally, the ABCC subfamily tree reported in this study mirrored that of the tree constructed by Liu .
Striped Catfish ABCD3a Did Not Form A Clade with ABCD3a of Channel Catfish
The ABCD subfamily of tra catfish consists of five protein members, namely ABCD1, ABCD2, ABCD3a, ABCD3b and ABCD4. All five members are half transporters and majority of them are localised in the plasma membrane, with the exception of ABCD2 where it is localised in the mitochondria. The ABCD1 can be found in both plasma membrane and mitochondria. The ABCD1 is the largest of all ABCDs and the ABCD3a is the smallest. The ABCD3a has the highest pI of 9.70 whereas the ABCD4 has the lowest value of 8.03. The domain structures of tra catfish ABCD proteins resemble closely to that of the channel catfish, with the ABCD4 as an minor exception. The ABCD4 of channel catfish contains three transmembrane domains whereas the striped catfish has two extra. All organisms examined in Table 3 contains at least one copy of each ABCD genes, except for tetradon with the lack of ABCD2 gene (Table 3).At a glance on Fig. S4 (Appendix D), the motif arrangements of all ABCD1 and ABCD2 proteins portrayed high conservation. Only the walking catfish ABCD1 lacks a Motif 2 (Peroxysomal fatty acyl CoA transporter family protein). The ABCD3 proteins also showed relatively high resemblance to that of ABCD1 and ABCD2 proteins, except that they lack the two motifs flanking the both ends of ABCD1 and ABCD2 proteins. The ABCD4 proteins are quite distinctive from the other ABCD members in terms of motif arrangements, motif number and motif availability. Motifs such as Motif 9 (Peroxysomal fatty acyl CoA transporter family protein), Motif 10 (Peroxysomal fatty acyl CoA transporter family protein) and Motif 8 (Peroxysomal fatty acyl CoA transporter family protein) are absent from the ABCD proteins. The peroxysomal fatty acyl CoA transporter family protein functions as an assistant to the ABCD transporters to facilitate the cleaving of acyl CoA substrates before the reactivation by the peroxisomal synthetases (Baker ).The ABCD maximum likelihood phylogenetic tree (Fig. 4) showcases four major clades. The ABCD3b proteins of channel and tra catfish formed a strong clade with bootstrap score of 100%. Conversely, the channel catfish ABCD3a did not form a clade with ABCD3a of tra catfish, instead it formed a strong clade with the black bullhead catfish with 100% bootstrap value. The nomenclature of ABCD3b comes from both zebrafish and channel catfish where this duplicated ABCD3-like was named ABCD3b, however the orthology of this protein necessitates further verification.
All ABCE-ABCF Subfamily Members are Found in All Organisms Examined in This Study
The ABCE-ABCF subfamily contains proteins with NBDs only and without TMDs. These proteins are termed non-functional transporters and they localise in cytoplasm and nucleus, unlike the majority of other ABC proteins which localise in the plasma membrane. Despite having the same amino acid length, both tra catfish ABCE1 have different molecular weights and theoretical isoletric points. The ABCF1 is the largest protein of the group and the ABCE1-2 topped the pI chart. All domain structures of tra catfish ABCE and ABCF proteins is the same as that of the channel catfish (Liu ). All members of this subfamily are present in all organisms investigated in Table 3.High motif conservation was observed across all ABCE and ABCF proteins (Fig. S5: Appendix E), with some exclusions. The Motif 5 (ABC transporter F family) of the striped catfish ABCE1-2 is located somewhere in the middle of protein instead of at the starting position. Both the channel catfish ABCE1-2 and Chinese large-mouth catfish ABCE1 has Motif 1 (ATPase components of ABC transporters with duplicated ATPase domains) located at the middle position instead of towards the end of protein. The ATPase components of ABC transporters with duplicated ATPase domains are involved in the DNA binding, ATP binding as well as ATPase-coupled transmembrane transporter activity (The UniProt Consortium 2021).The phylogenetic tree of the ABCE-ABCF subfamily (Fig. 5) has well supported the annotations of all protein members. The channel catfish ABCE1-2, Chinese large-mouth catfish ABCE1 as well as striped catfish ABCE1-2 reside a slightly further clade from the major ABCE1 clade. The striped and channel catfish ABCF2-2 was grouped into a strong clade with 100% bootstrap score whereas the ABCF1-1 of the aforementioned catfishes are clustered closer with the other ABCF1 from other catfishes as compared to the ABCF2-2 proteins.
ABCG3 is Mouse Exclusive While ABCH1 is Striped Catfish and Zebrafish Exclusive in this Study
Six ABCG proteins and one ABCH protein were identified from the striped catfish genome. All of them are half transporters, localising in the plasma membrane. All members of this subfamily do not differ much in molecular weights and amino acid lengths, with the highest recorded at 79.82 (ABCH1, 720 aa) and lowest recorded at 72.07 kDa (ABCG4, 643 aa). The highest pI was seen in ABCG5 protein (8.90) while the lowest pI was observed in ABCG8 protein. The members of this subfamily are unique from all other family members as their NBDs are located at the N-terminus instead of the C-terminus in other ABC proteins. Majority of the domain structures of tra catfish ABCG proteins closely resembles that of the channel catfish, except for ABCG1 and ABCG2-2 proteins. The reported ABCG1 proteins of the channel catfish was partial, therefore comparison cannot be done. The tra catfish ABCG2-2 protein has two less TMDs as compared to that of the channel catfish. The ABCG3 gene is absent in all organisms examined in Table 3, with the mouse as an exclusion. The ABCH1 gene is only found in striped catfish and zebrafish.Looking at Fig. S6 (Appendix F), the motif distribution pattern is highly identical across ABCG1 and ABCG2 proteins. The ABCG4 proteins have an extra Motif 3 (Eye pigment precursor transporter family protein) at the N-terminus. Interestingly, the Motif 15 (Uncharacterised motif) of the ABCG5 proteins are located at the N-terminus instead of the C-terminus as observed in ABCG1, ABCG2, ABCG8 and ABCG4 proteins. The ABCG8 proteins have Motif 13 (Uncharacterised motif) in place of that N-terminus Motif 15 found in the ABCG5 proteins. The ABCH1 only contains Motif 3 (Eye pigment precursor transporter family protein) and Motif 1 (Walker B and D-loop). The eye pigment precursor transporter family protein serves as an orchestrator for pigment binding, ATP binding as well as ATPase-coupled transmembrane transporter activity (The UniProt Consortium 2021). The maximum likelihood phylogenetic tree of the ABCG-ABCH subfamily (Fig. 6) has also supported the annotations of all protein members of this subfamily. The ABCG2-1 proteins of channel and striped catfish formed a 100% bootstrap scored clade under the major ABCG2 subclade.
Eight Striped Catfish ABC Transporters are Duplicated and Twelve Genes are Lost
Teleost fishes are known to have undergone genome duplication to have two paralogous copies of a myriad of genes in contrast to the only one ortholog as seen in tetrapods (Hoegg ). Gene duplication as a result of evolution has resulted in the emergence of new genes equipped with neofunctions or partitioned functions (Liu ).There are a number of ABC transporters in striped catfish that have been identified to have experienced gene duplications. These ABC transporter genes include ABCA1, ABCB3, ABCB6, ABCC5, ABCD3, ABCE1, ABCF2 as well as ABCG2 (Table 2). These genes are postulated to be closely associated with the fish-specific genome duplication events (Liu ). On the other hand, the single copy genes found within the tra and channel catfish genomes, namely ABCA4, ABCC4, ABCC6 and ABCG4, were believed to have lost subsequent to the whole genome duplication of these teleost fishes (Liu ). Interestingly, some genes such as ABCA6, ABCA8, ABCA9, ABCA10, ABCA12, ABCA13, ABCA14, ABCA15, ABCA17, ABCB4, ABCC11 as well as ABCG3, are all absent from all the fish genomes examined in Table 3. These non-fish ABC transporter genes were deemed to have lost during the genome duplication events that diverge tetrapods from teloest fishes (Dean & Annilo 2005; Annilo ) The ABCA6, ABCA8, ABCA9, ABCA10, ABCA12 as well as ABCA13 are postulated to be mammal-specific.The striped catfish contains all the ABC transporters identified in the channel catfish genome and an additional of two ABC proteins not found in the channel catfish, namely the ABCB4 and ABCH1 proteins. The ABCB4 was found in human, mouse, striped catfish and zebrafish, but absent in channel catfish. This protein is a liver-specific transporter of phosphatidycholine (a mammalian bile compound) (Annilo ). The lack of this protein suggested the elimination of the need for phospholipids in some teloest fishes (Annilo ). These phospholipids are essential parts of the pulmonary surfactant utilised to lower the surface tension of the respiratory organ (Daniels ). Hence, it is postulated that teleost fishes that are capable of air-breathing like the zebrafish and striped catfish may necessitate the ABCB4 protein for oxygen harvest from the air whereby it is needed to lubricate the respiratory tracts subsequent to dry air inhalation. The ABCH1 was only found in zebrafish and striped catfish among all the vertebrate genomes investigated in Table 3. In fact, this protein is also absent in yeast, plants, mammals and worms, but it is present in all insects, non-insect protozoans, Sarawak rasbora fish, non-insect metazoans, green spotted pufferfish, sea urchin as well as non-insect arthropods (Guo ; Jeong ; Lim ). In zebrafish, this protein was expressed in the gills, brain, kidney and intestine (Popovic ). Popovic postulated that the ABCH1 protein may have similar multixenobiotic defence role as the ABCG2 protein. Looking at all the fishes that contain the ABCH1 gene, one major similarity is that all of them are air-breather, hence it is postulated that this ABCH1 protein may be required for the detoxification of the inhaled air. Nonetheless, further investigation is needed to functionally characterise this protein in striped catfish in order to support the aforementioned postulations.
Orthology and Potential Functional Inferences of Striped Catfish ABC Transporters
The ABC transporters have been intensively studied in human genetics where the majority of them are actively involved in pivotal biological processes and the onset of genetic diseases (Dean & Annilo 2005). These orthologs may provide clues towards the duplication of some ABC genes in teleosts in order to assist them to adapt to the aquatic environments.In vertebrates, the ABCA1 proteins are the main cholesterol transporters. They are actively involved in the flopping of the inner leaflet cholesterol to the outer leaflet of the plasma membrane (Ogasawara ). These proteins orchestrates and maintains a low cholesterol level in the inner leaflet of the plasma membrane, this makes the lipid a membranous signal molecule (Ogasawara ). Ogasawara deemed that the ABCA1 is the major accelerator of vertebrate evolution. The ABCA4 proteins are the photoreceptor rim proteins that are expressed exclusively in the retina (Sun & Nathans 2000). This protein plays multiple roles in early vertebrate development and one of the major function is the governance of retina-specific pathways in mammals like rodent, human and mouse (Borst & Elferink 2002). The duplication of ABCA1 and ABCA4 genes in most of the fish genomes in Table 3 signals dire need for further investigation on the association of high metabolic energy consumption of aquatic fishes and cholesterol translocation speed as well as whether retina cell activities are required twice as much in fishes than in mammals.The ABCB3 proteins are antigen peptide transporters present in zebrafish, tra catfish and channel catfish, but absent in species that evolved earlier like the lampreys (Ren ). These proteins are closely associated with the gnathostomes adaptive immune system as they translocate peptides to the endoplasmic reticulum to aid in the formation of class I major histocompatibility complex molecules (Michalova ). The zebrafish ABCB3 and ABCB7 were proven as efficient metal detoxicants not so long ago (Lerebours ). The ABCB6 proteins participate in the iron metabolism and Fe/S protein precursors translocation (Zutz ). The expression of this protein was found high in muscles and intestines during early metamorphic stages of lampreys (Ren ). Therefore, it is postulated that the ABCB3 and ABCB6 gene duplication in channel and striped catfishes may be essential for the high detoxification needs in fish as compared to mammals (Liu ).Majority of the ABCC members are involved in toxic translocation out of the cells. The ABCC5 proteins are highly expressed in glia, blood-brain barrier and neurons (Jansen ). They are indispensable efflux transporters of glutamate conjugates and analogs, governing the activity of the excitatory neurotransmitters in the brain (Jansen ). Besides, this protein also orchestrates the depth of the anterior chamber and is directly linked toward the primary angle closure glaucoma (Nongpiur ). Human cervical cancer cells expressing moderate ABCC5 protein levels were discovered to experience exponential growth (Eggen ). In fishes, this protein is highly expressed in the muscles, brain, gills and digestive tracts of the zebrafish (Bolbol ). The elevated expression of ABCC5 genes was induced with the presence of Tubifex worms and metallic (copper and cadmium) contamination (Bolbol ). The duplication of this gene in catfishes, tilapia and medaka may have similar implications or a more substantial role in assisting these fishes in their adaptations to chemically harsh environments.Each of the ABCD subfamily members has diverse functions unlike proteins from other ABC subfamilies. The ABCD1 and ABCD2 proteins translocate long and very long chain fatty acids into peroxisomes, while the ABCD4 transfers lysosomal vitamin B12 to the cytosol (Kawaguchi & Morita 2016). On the other hand, the ABCD3 is responsible for the translocation of branched chain acyl-CoA towards the peroxisomes (Kawaguchi & Morita 2016). The price of having defective ABCD1 and ABCD3 is one of the most lethal as this can lead to neurodegenerative disease and hepatosplenomegaly liver disease (Mosser ; Ferdinandusse ). The duplication of ABCD3 gene was observed in zebrafish as well as both the catfishes, therefore it is imperative to decipher its associated metabolic pathways, especially when both catfishes are highly demanded important sources of great quality unsaturated fatty acids.The expression of ABCE1, ABCF1 and ABCF2 was seen in all tissues in lampreys, Sarawak rasbora and zebrafish (Ren ; Lim ; Newman ). The ABCE1 protein has diverse roles such as ribosome recycling, translation initiation, termination and elongation (Chen ; Barthelme ; Pisareva ). The efficacy of the ABCF2 protein in resisting cisplatin-induced apoptosis in ovarian cancer cells has been documented by Bao . Both ABCE1 and ABCF2 genes are duplicated in channel and striped catfishes, while there are three ABCE1 in cod and two ABCF2 in zebrafish, these extra copies would serve as a replacement in times when one of them is defective in nature, mutated or halted of their housekeeping obligations.The ABCG2 protein is one of the most well-studied members of the ABCG subfamily. This protein is also named mitoxantrone-resistance protein, breast cancer resistance associated protein and placenta-specific ABC proteins for the diverse pathways it is involved in (Dermauw & van Leeuwen 2014; Ferreira ). Furthermore, it also facilitates the restricted absorption of pharmaceutics into the intestine, exports urate from the kidney as well as provides shield for haematopoietic stem cells against heme-induced toxins (Kerr ; Vlaming ; Woodward ). In fishes, the rainbow trout ABCG2a was proven an effective guard against environmental toxins (Zaja ). The research on ABCG2 genes of economically valuable fishes like the catfishes is extremely essential to minimise the harvest loss due to the presence of environmentally induced toxins.
CONCLUSION
A sum of 52 ABC transporters were unearthed from the striped catfish genome. The motif analysis has revealed several exclusive characteristics of some catfishes in this study. The phylogenetic analysis has proven its efficacy in the successful annotations of these transporter proteins. Duplicated genes such as ABCA1, ABCB3, ABCB6, ABCC5, ABCD3, ABCE1, ABCF2 as well as ABCG2 were found within the striped and channel catfish genomes. The objectives set for this study have been achieved, leaving some windows for more knowledge gaps to be filled soon. This whole set of ABC transporters channels valuable genomic landscape for future ecotoxicological, biochemical and physiological researches in striped catfish.
Authors: Vera P Pisareva; Maxim A Skabkin; Christopher U T Hellen; Tatyana V Pestova; Andrey V Pisarev Journal: EMBO J Date: 2011-03-29 Impact factor: 11.598
Authors: J Mosser; A M Douar; C O Sarde; P Kioschis; R Feil; H Moser; A M Poustka; J L Mandel; P Aubourg Journal: Nature Date: 1993-02-25 Impact factor: 49.962
Authors: Edouard de Castro; Christian J A Sigrist; Alexandre Gattiker; Virginie Bulliard; Petra S Langendijk-Genevaux; Elisabeth Gasteiger; Amos Bairoch; Nicolas Hulo Journal: Nucleic Acids Res Date: 2006-07-01 Impact factor: 16.971
Authors: Monisha E Nongpiur; Chiea Chuen Khor; Hongyan Jia; Belinda K Cornes; Li-Jia Chen; Chunyan Qiao; K Saidas Nair; Ching-Yu Cheng; Liang Xu; Ronnie George; Do Tan; Khaled Abu-Amero; Shamira A Perera; Mineo Ozaki; Takanori Mizoguchi; Yasuo Kurimoto; Sancy Low; Liza-Sharmini A Tajudin; Ching-Lin Ho; Clement C Y Tham; Ileana Soto; Paul T K Chew; Hon-Tym Wong; Balekudaru Shantha; Masako Kuroda; Essam A Osman; Guangxian Tang; Sujie Fan; Hailin Meng; Hua Wang; Bo Feng; Victor H K Yong; Serena M L Ting; Yang Li; Ya-Xing Wang; Zheng Li; Raghavan Lavanya; Ren-Yi Wu; Ying-Feng Zheng; Daniel H Su; Seng-Chee Loon; Vernon K Y Yong; R Rand Allingham; Michael A Hauser; Nagaswamy Soumittra; Vedam L Ramprasad; Naushin Waseem; Azhany Yaakub; Kee-Seng Chia; Govindasamy Kumaramanickavel; Tina T Wong; Alicia C How; Tran Nguyen Bich Chau; Cameron P Simmons; Jin-Xin Bei; Yi-Xin Zeng; Shomi S Bhattacharya; Mingzhi Zhang; Donald T Tan; Yik-Ying Teo; Saleh A Al-Obeidan; Do Nhu Hon; E-Shyong Tai; Seang-Mei Saw; Paul J Foster; Lingam Vijaya; Jost B Jonas; Tien-Yin Wong; Simon W M John; Chi-Pui Pang; Eranga N Vithana; Ningli Wang; Tin Aung Journal: PLoS Genet Date: 2014-03-06 Impact factor: 5.917
Authors: Oanh T P Kim; Phuong T Nguyen; Eiichi Shoguchi; Kanako Hisata; Thuy T B Vo; Jun Inoue; Chuya Shinzato; Binh T N Le; Koki Nishitsuji; Miyuki Kanda; Vu H Nguyen; Hai V Nong; Noriyuki Satoh Journal: BMC Genomics Date: 2018-10-05 Impact factor: 3.969