Literature DB >> 32631258

Phylogenetic analysis of the ATP-binding cassette proteins suggests a new ABC protein subfamily J in Aedes aegypti (Diptera: Culicidae).

Janaina Figueira-Mansur1, Carlos G Schrago2, Tiago S Salles1,3, Evelyn S L Alvarenga1, Brenda M Vasconcellos1, Ana Claudia A Melo1,3, Monica F Moreira4,5.   

Abstract

BACKGROUND: We performed an in-depth analysis of the ABC gene family in Aedes aegypti (Diptera: Culicidae), which is an important vector species of arthropod-borne viral infections such as chikungunya, dengue, and Zika. Despite its importance, previous studies of the Arthropod ABC family have not focused on this species. Reports of insecticide resistance among pests and vectors indicate that some of these ATP-dependent efflux pumps are involved in compound traffic and multidrug resistance phenotypes.
RESULTS: We identified 53 classic complete ABC proteins annotated in the A. aegypti genome. A phylogenetic analysis of Aedes aegypti ABC proteins was carried out to assign the novel proteins to the ABC subfamilies. We also determined 9 full-length sequences of DNA repair (MutS, RAD50) and structural maintenance of chromosome (SMC) proteins that contain the ABC signature.
CONCLUSIONS: After inclusion of the putative ABC proteins into the evolutionary tree of the gene family, we classified A. aegypti ABC proteins into the established subfamilies (A to H), but the phylogenetic positioning of MutS, RAD50 and SMC proteins among ABC subfamilies-as well as the highly supported grouping of RAD50 and SMC-prompted us to name a new J subfamily of A. aegypti ABC proteins.

Entities:  

Keywords:  ABC protein classification; ABC protein subfamily J; Aedes aegypti; MDR phenotype; MutS, RAD50 and SMC proteins

Mesh:

Substances:

Year:  2020        PMID: 32631258      PMCID: PMC7339416          DOI: 10.1186/s12864-020-06873-8

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

The ATP-binding cassette (ABC) transporters constitute a diverse gene family consisting of proteins found in all cellular organisms and participating in several different biological pathways [1]. Among these processes, the ABC transporters are mostly involved in extra and intracellular trans membrane ATP energy driven traffic of molecules such as lipids, amino acids, hormones and xenobiotics [2, 3]. Members of this family are characterized by two trans membrane domains (TMD) and two nucleotide-binding domains (NBD) characterized by conserved motifs: Walker A, Walker B, ABC signature (LSGGQ-motif), Q loop, and H loop [1, 4]. The TMD domains of the ABC-transporters are composed of five to ten membrane-spanning regions that are involved in substrate translocation. The four domains (two TMD and two NBD) of a functional ABC transporter might be present in a single protein (full transporter) or in dimers of separate proteins that have at least one TMD and one NBD each (half transporter) [3, 5]. The traditional classification is based on sequence similarity and arranged the ABC protein diversity into eight subfamilies (A- H) [6]. The ABCE and ABCF subfamilies are unique among the ABC proteins because they exhibit a pair of linked nucleotide-binding domains while lacking trans membrane domains [3, 6]. The ABCH subfamily was described for protozoa [7] and insects [8, 9], but it has not yet been found in mammals, bacteria, and yeast genomes. Plants, besides presenting eukaryotic ABC subfamilies A to G, exhibit a heterogeneous and extensive group of ABC proteins that bear similarities to the components of prokaryotic multi-subunit ABC transporters. This group was named subfamily I and includes NBD and TMD domains and homologues of soluble cytosolic proteins that interact with NBDs as well as putative substrate-binding proteins similar to the periplasmic binding proteins [10]. Three other groups of proteins not assigned to the subfamilies mentioned above exhibit ABC transporter domains: (1) the MutS proteins that are responsible for DNA mismatch repair and maintenance of genomic stability [11, 12]; (2) the structural maintenance of chromosome proteins (SMC), which are mostly responsible for chromosome condensation and sister chromatid cohesion [13], and (3) the Rad 50 proteins that also function on DNA repair [8, 9, 14]. Although MutS, SMC, and Rad50 proteins show ABC protein characteristics, they have not yet been included in the standard ABC classification for humans, arthropods, and the Caenorhabditis elegans nematode [8, 9, 15, 16]. Nonetheless, in the complete inventory of ABC proteins of the Arabidopsis thaliana plant, SMC proteins were proposed as a new ABC protein subfamily [17]. Some ABC proteins have been associated with multidrug resistance (MDR) phenotype in a variety of organisms. This phenotype is associated with the overexpression of P-glycoproteins (P-gp/MDR/ABCB1), the multidrug resistance protein (MDR/ABCC), and the breast cancer resistance protein (BCRP/ABCG2) [5, 18, 19]. These act as efflux pumps that result in resistance to chemotherapeutics, antibiotics, and antiretroviral drugs [20, 21]. One important control mechanism of vector-borne diseases is vector control, which relies mainly on insecticide treatments of vector populations. In these populations, the insecticide-resistant phenotype arises due to the selection of genetically resistant individuals that exhibit higher fitness under special conditions [22, 23]. Multiple insecticide resistance can be separated into two main categories: cross-resistance—when a single mechanism confers resistance to a range of different insecticides; and multiple resistance—when several coexisting defense mechanisms act in the same organism [24, 25]. The involvement of ABC transporters in insecticide resistance and transport is poorly documented, but an increasing number of studies have shown that ABC transporters have been linked to insecticide and nicotine transport [26-28] and insect resistance to Bacillus thuringiensis toxins and pyrethroids [29, 30]. The high expression of P-gp in insecticide resistant pests such as Heliothis virescens and Helicoverpa armigera has been suggested to be a mechanism of resistance [31, 32]. Recent surveys of the ABC gene family in arthropods included the fruit fly Drosophila melanogaster, the mosquito Anopheles gambiae, the beetle Tribolium castaneum, the honey bee Apis mellifera, the silkmoth Bombyx mori, the water flea Daphia pulex, and the spider mite Tetranychus urticae [16]. Analyses focusing on crustaceans such as the sea lice Caligus rogercresseyi [33] and Lepeophtheirus salmonis [34] were also carried out. These studies left out the A. aegypti mosquito, which is an important vector species of arthropod-borne viral infections such as chikungunya, dengue, and Zika diseases [35]. In 2016, Lu et al. [36] conducted a comparative analysis of the ABC transporter family in three mosquito species (Anopheles gambiae, Aedes aegypti, and Culex pipiens quinquefasciatus) and found 55, 69, and 70 ABC genes, respectively. The search for Aedes aegypti ABC proteins, however, was carried out within a limited evolutionary range because only mosquito sequences were analyzed. In this study, we surveyed the Aedes aegypti genome in a broader evolutionary spectrum, employing human and Drosophila ABC transporters as queries. By including all the putative proteins that exhibit the ABC domain into a phylogenetic analysis, we showed that SMC, Rad 50, and MutS proteins were part of the main ABC gene family diversification, which justifies the proposition of a new subfamily of the ABC proteins.

Results

The BLASTp search on the A. aegypti genome retrieved 62 complete proteins that were identified as ABC transporters when submitted to the NCBI Conserved Domain Database. The ABC gene family phylogeny recovered subfamilies A-H with significant statistical support (Fig. 1). The sizes of gene subfamilies varied significantly with subfamilies A-C and G consisting of the larger groups. Sister group associations between ABC subfamilies were less resolved. The single exception was the clade with subfamilies ABCA and ABCH that were grouped with maximum statistical support. In all ABC subfamilies, A. aegypti proteins had a tendency to be positioned among human and Drosophila sequences suggesting that the duplication events that gave rise to current ABC diversity took place before the evolution of those lineages. Clusters containing ABC genes exclusively from A. aegypti were found in subfamilies ABCA, ABCC, and ABCG. These clusters indicate mosquito-specific duplication events.
Fig. 1

a Maximum likelihood phylogeny of the ABC gene family including SMC, Rad50 and MutS genes. ABC subfamilies are shown with the new mosquito sequences highlighted in blue. b Numbers at branches indicate statistical support (ultra-fast bootstrap) for each subfamily A-J

a Maximum likelihood phylogeny of the ABC gene family including SMC, Rad50 and MutS genes. ABC subfamilies are shown with the new mosquito sequences highlighted in blue. b Numbers at branches indicate statistical support (ultra-fast bootstrap) for each subfamily A-J Classification of ABC proteins subfamilies in Homo sapiens and Drosophila melanogaster The variation of the rate of evolution within each ABC subfamily as measured by the heterogeneity of the distance between the common ancestor of all members of the subfamily and the tips was higher in subfamily ABCA. In this subfamily, an interesting pattern of rate increase along lineages was observed (Fig. 1). As expected, deeper nodes exhibited lower statistical support demonstrating that the evolutionary relationships between these subfamilies were not fully resolved. Surprisingly, root placement using the minimal ancestor deviation (MAD) method suggested that subfamily ABCG is a sister to the remaining ABC transporters including the clades consisting of SMC and Rad50 proteins as well as the MutS proteins that were positioned as a sister to subfamily ABCD (Fig. 1).

Discussion

To investigate ABC transporters in the A. aegypti genome within a broader evolutionary context, we identified A. aegypti ABC homologs employing human and D. melanogaster as queries (Table 1). We also identified the conserved domains of all the putative A. aegypti ABC transporters to investigate the assignment of the putative proteins to the described subfamilies of these transporters. We identified ten members of the A. aegypti ABCA subfamily (Fig. 1 and Table 2). This subfamily contains longer proteins that ranged from 1419 to 1673 amino acid residues. Nine of these members have the topology of full transporters with two NBDs and two TMDS (Table 2). The A. aegypti ABCA subfamily was encoded by genes organized in tandem indicating specific gene duplication events (Table 2). Four members of this cluster have genes organized in tandem in the supercontig 1.726, two members belong to the supercontig 1.321, and four belong to other supercontigs (Table 2). The roles of arthropod ABCA members are unclear [16], but this subfamily has been described as involved with lipid transport in mammals [37].
Table 1

Classification of ABC proteins subfamilies in Homo sapiens and Drosophila melanogaster

Homo sapiensDrosophila melanogaster
Sub-familySeq IDGenBank accessionSub-familySeq IDGenBank accession
AABCA1NP_005493AAAF50836
AABCA2NP_001597AAAF50837
AABCA3CAA65825AAAF50838
AABCA4_ABCRAAC05632AAAF50847
AABCA5NP_061142AAAF53329
AABCA6NP_525023AAAF55726
AABCA7AAK00959AAAF57490
AABCA8BAA74845BAAF45509
AABCA9NP_525022BAAF47525
AABCA10XP_085647BAAF48177
AABCA12NP_056472BAAF50669
AABCA13NP_689914BAAF50670
BABCB1_MDR1_P_gpNP_000918BAAF53736
BABCB2_TAP1CAA40741BAAF53737
BABCB_TAP2AAA59841BAAF55241
BABCB4_MDR3AAA36207BAAF58271
BABCB5AAO73470BAAF58437
BABCB6_MTABC3NP_005680CAAF46706
BABCB7BAA28861CAAF52639
BABCB8_MABC1AAD15748CAAF52648
BABCB9AAF89993CAAF52866
BABCB10_MABC2XP_001871CAAF53223
BABCB11_BSEPAAC77455CAAF53950
CABCC1_MRP1AAB46616CAAF54656
CABCC2_MRP2CAA65259CAAF55707
CABCC3_MRP3BAA28146CAAF56312
CABCC4_MRP4NP_005836CAAF56869
CABCC5_MRP5AAB71758CAAF56870
CABCC6_MRP6AAC79696CAAF58947
CABCC7_CFTRAAC13657DAAF49018
CABCC8_SUR1AAB02278DAAF59367
CABCC9_SUR2AAC16058EAAF50342
CABCC10NP_258261FAAF48069
CABCC11NP_149163FAAF48493
CABCC12NP_150229FAAF49142
DABCD1_ALDPCAA79922GAAF45826
DABCD2_ALDRNP_005155GAAF47020
DABCD3_PMP70CAA41416GAAF49455
DABCD4_PMP69AAB83967GAAF50035
EABCE1_RNAseL1CAA53972GAAF51027
FABCF1AAH34488GAAF51122
FABCF2NP_005683GAAF51130
FABCF3NP_060828GAAF51131
GABCG1_WHITE1AAC51098GAAF51223
GABCG2_BCRPQ9UNQ0GAAF51341
GABCG4_WHITE2NP_071452GAAF51548
GABCG5AAG40003GAAF51551
GABCG8AAG40004GAAF52835
JSMC1AAB34405GAAF56360
JSMC4BAA73535GAAF56361
JSMC3AAC14893HAAF52284
JSMC2AAI44164HAAF56807
JRAD50NP_005723HABC66191
JSMC5CAC39247JSMC1AAF56231
JSMC6CAC39248JSMC4AAF53560
JMSH4_MutSNP_002431JSMC2AAF58197
JMSH3_MutSAAB06045JRAD50AAF46847
JMSH5_MutSNP_079535JSMC5CAD29584
JMSH2_MutSNP_000242JSMC6AAF56254
JMSH6_MutSNP_000170JMSH2_MutSNP_523565
JMSH6_MutSAAF49656
JSMC3AAF48625
Table 2

Characterization of the 62 A. aegypti ABC proteins

Sub-familyNameVectorBase (accession number)Size (amino acids)Predicted topologyLocation(gene)Orientation(gene)
A
AaegABCA3L1AAEL012702-PA1669TMD1-NBD1-TMD2-NBD21.726: 372101–377,726+
AaegABCA3L2AAEL012700-PA1648TMD1-NBD1-TMD2-NBD21.726: 388899–394,375+
AaegABCA3L3AAEL012701-PA1622TMD1-NBD1-TMD2-NBD21.726: 409854–439,050+
AaegABCA3L4AAEL012698-PA1652TMD1-NBD1-TMD2-NBD21.726: 450626–459,977+
AaegABCA3L5AAEL008388-PA1666TMD1-NBD1-TMD2-NBD21.321: 644618–664,804
AaegABCA3L6AAEL008384-PA1660TMD1-NBD1-TMD2-NBD21.321: 675803–697,600
AaegABCA3L7AAEL001938-PA1673TMD1-NBD1-TMD2-NBD21.46: 792516–818,527
AaegABCA5LAAEL004331-PA1419TMD1-NBD1-TMD2-NBD21.115: 240545–271,476+
AaegABCA5LAAEL018040-PA1987TMD1-NBD1-TMD2-NBD23.322: 613800–714,818
AaegABCA18AAEL017572-PA347NBD1.176: 1628836–1,629,879
B
AaegABCB1L/AaegP-gpAAEL010379-PA1307TMD1-NBD1-TMD2-NBD21.474: 313030–327,570+
AaegABCB6LAAEL000434-PA693TMD-NBD1.8: 3711414–3,730,662+
AaegABCB7LAAEL006717-PA734TMD-NBD1.219: 178589–203,717
AaegABCB8LAAEL002468-PA703TMD-NBD1.58: 1203051–1,224,141
AaegABCB10LAAEL008134-PA848TMD-NBD1.302: 73729–107,503+
C
AaegABCC1L1AAEL005026-PA1384TMD0-TMD1-NBD1-TMD2-NBD21.139: 1168407–1,184,363+
AaegABCC1L2AAEL005045-PA1514TMD0-TMD1-NBD1-TMD2-NBD21.139: 1184563–1,195,380
AaegABCC1L3AAEL005030-PA1396TMD0-TMD1-NBD1-TMD2-NBD21.139: 1233513–1,252,972
AaegABCC1L4AAEL004743-PA1089TMD0-TMD1-NBD11.129: 994901–1,030,978+
AaegABCC1L5AAEL017209-PA903TMD0 -TMD1-NBD11.107: 820177–825,969
AaegABCC4L1AAEL013567-PA1311TMD1-NBD1-TMD2-NBD21.871: 281423–317,150+
AaegABCC4L2AAEL005918-PA1312TMD1-NBD1-TMD2-NBD21.180: 664096–681,744
AaegABCC4L3AAEL005937-PA1300TMD1-NBD1-TMD2-NBD21.180: 724473–765,746+
AaegABCC4L4AAEL005929-PA1413TMD1-NBD1-TMD2-NBD21.180: 786121–801,780+
AaegABCC4L5AAEL013834-PA1235TMD1-NBD1-TMD2-NBD21.936: 291553–353,031
AaegABCC4L6AAEL012395-PA1357TMD1-NBD1-TMD2-NBD21.688: 67831–72,390
AaegABCC4L7AAEL012386-PA1351TMD1-NBD1-TMD2-NBD21.688: 87463–91,714+
AaegABCC4L8AAEL012192-PA1345TMD1-NBD1-TMD2-NBD21.664: 660781–670,973
AaegABCC10LAAEL006622-PA1540TMD0-TMD1-NBD1-TMD2-NBD21.213: 838086–915,438+
AaegABCC14AAEL005499-PA1382TMD1-NBD1-TMD2-NBD21.160: 1362499–1,398,139
D
AaegABCD2LAAEL002913-PA659TMD-NBD1.71: 1617561–1,676,168+
AaegABCD3LAAEL010047-PA753TMD-NBD1.449: 843528–895,566+
E
AaegABCE1LAAEL010059-PA609NBD1-NBD21.450: 713084–727,146+
F
AaegABCF1LAAEL001101-PA894NBD1-NBD21.23: 2941514–2,961,984
AaegABCF2LAAEL010977-PA602NBD1-NBD21.529: 122943–143,748
AaegABCF3LAAEL010359-PA609NBD1-NBD21.450: 713084–727,146+
G
AaegWhite*AAEL016999-PA692NBD-TMD1.71: 816675–879,820
AaegABCG/whiteL1AAEL008138-PA773NBD-TMD1.303: 412767–432,830+
AaegABCG/whiteL2AAEL008672-PA689NBD-TMD1.340: 378892–469,513+
AaegABCG/whiteL3AAEL008624-PA593NBD-TMD1.337: 23491–61,022
AaegABCG/whiteL4AAEL008632-PA607NBD-TMD1.337: 68512–71,099
AaegABCG/whiteL5AAEL008628-PA571NBD-TMD1.337: 85665–99,799
AaegABCG/whiteL6AAEL008625-PA606NBD-TMD1.337: 119628–131,014+
AaegABCG/whiteL7AAEL008629-PA723NBD-TMD1.337: 131034–224,797
AaegABCG/whiteL8AAEL008631-PA759NBD-TMD1.337: 276979–394,542
AaegABCG/whiteL9AAEL008635-PA676NBD-TMD1.337: 470559–525,334
AaegABCG/whiteL10AAEL013372-PA599NBD-TMD1.830: 256110–301,680+
AaegABCG/scarletL1AAEL003703-PA616NBD-TMD1.94: 984346–994,398
AaegABCG/scarletL2AAEL017106-PA689NBD-TMD1.1174: 142758–145,023
AaegABCG8L1AAEL012170-PA275NBD1.662: 13565–20,660+
AaegABCG8L2AAEL011265-PA787NBD-TMD1.561: 434947–481,728+
H
AaegABCH1AAEL005249-PA872NBD-TMD1.147: 1132338–1,176,150+
AaegABCH2AAEL005491-PA783NBD-TMD1.159: 901905–920,193+
AaegABCH3AAEL014428-PA727NBD-TMD1.1111: 126773–184,623+
AaegABCH4L5AAEL018334-PA814NBD-TMD1.181:178199–420,930
J
AaegSMC1L1AAEL005802-PA1227NBD1.175: 1418947–1,437,720
AaegSMC1L2AAEL015592-PA594NBD1.3565: 4371–6283
AaegSMC2LAAEL003449-PA1182NBD1.86: 847376–874,751
AaegSMC3LAAEL006937-PA1201NBD1.229: 536438–586,285
AaegSMC4LAAEL001655-PA1347NBD1.38: 1385452–1,432,324+
AaegSMC6LAAEL002581-PA1107NBD1.61: 165187–198,927+
AaegRAD50L1AAEL005245-PA1034NBD1.147: 747831–755,196
AaegRAD50L2AAEL014748-PA1051NBD1.1248: 101280–115,288
AaegMutS6LAAEL011780-PA1130NBD1.612: 217022–240,932+
Characterization of the 62 A. aegypti ABC proteins Five sequences retrieved from the A. aegypti genome were assigned to the ABCB subfamily (Fig. 1, Table 2). This subfamily is composed of putative homologs of the human P-glycoprotein, which plays key physiological roles such as the excretion of toxic compounds and the multidrug resistance phenotype [3, 26, 27, 37, 38]. The identified A. aegypti ABCB proteins are intimately related to the human mitochondrial transporters HsABCB6, HsABCB7, HsABCB8, and HsABCB10 leading us to suppose that these proteins have a similar role associated with the iron metabolism and the transport of Fe/S protein precursors from the mitochondria to the cytoplasm [37, 39]. We also note that one D. melanogaster protein classified as belonging to the ABCB (CG31792_B) subfamily was recovered in the ABCC clade. This may be due to misclassification or to recent duplication and functional change. In either case, this protein should be further investigated. One of the most diverse subfamilies identified in the mosquito genome was the ABCC with 15 members—all full transporters (Table 2). This subfamily presents a high diversity of sequences as well as functional roles when compared with the human ABCC proteins. These functions are related to ion transport, cell surface receptors, toxin secretion, and multidrug resistance [38]. A sub-clade containing all the MRP from humans and D. melanogaster was recovered including four A. aegypti proteins (AaegABCC1L1, AaegABCC1L2, AaegABCC1L4, and AaegABCC1L5) suggesting that these proteins might also be responsible for protection against xenobiotics [40] and for the MDR phenotype [38, 41]. The ABCD and ABCE subfamilies were the least diverse of the groups identified in humans—the former is known to appear as half transporters forming homo or heterodimers in peroxisomes acting in lipid transport [3, 39, 42]. The ABCD subfamily has two members and the ABCE subfamily has only one protein described for most eukaryotes (Table 2) with the exception of A. thaliana [17]. This was consistent with the findings of a single ABCE gene in the A. aegypti genome. These proteins lack the TMD and were first described as the RNAseL protein participating in ribosome biogenesis and protein translation [37–39, 43–46]. Like ABCE proteins, the ABCF subfamily also lacks the TMD and is involved in the ribosome complex formation and activation [46-48]; only three of these proteins were found in the mosquito genome in our analysis. Although only five members of the ABCG proteins were described in humans [3, 37], 15 proteins belonging to this group were identified for A. aegypti (Table 2). This number is greater than the 11 genes previously identified in An. gambiae [9]. This excessive number of ABCG proteins in A. aegypti mosquito is likely due to a series of duplication events that is supported by the tandem organization observed in the supercontig 1.337 of the A. aegypti genome (Table 2). In D. melanogaster, the white gene is the most studied gene from the ABCG subfamily, and the product of this gene can form dimers with the scarlet and brown proteins (scarlet and brown genes, respectively). These dimers are transporters of eye pigment precursors in D. melanogaster [49, 50]. Only one ortholog of the white and scarlet proteins was found in the A. aegypti genome but no ortholog of the brown protein was found. In humans, ABCG5 and ABCG8 are glycoproteins that also form obligate heterodimers. These are useful to limit the absorption of plant sterols and cholesterol from the diet and promote secretion of plant sterols and cholesterol from liver cells into the bile. Based on their head-to-head orientation and clear orthologous relationships with human ABCG5 and ABCG8, these arthropod ABCGs probably have a similar role as their human orthologues [37]. The ABCH subfamily was exclusively found in insects with no reports in mammals, plants, or yeast [9, 37]. Here, four members of the ABCH subfamily were identified in the A. aegypti genome (Fig. 1 and Table 2). This included the sequence AAEL018334, which has been previously assigned to ABCG subfamily. Although these are proteins with unknown function, topological similarities with the ABCG proteins have suggested that the ABCH might be involved in sterol transport and multidrug resistance [51, 52]. Insect P-glycoproteins and multidrug-resistance associated proteins are frequently associated with pesticide resistance as reported in Heliothis virescens and Helicoverpa armigera [30, 31] and insecticide transport. The expression of A. aegypti P-gp (AAEL010379) increases eightfold in the temephos-treated larvae, and silencing of this gene expression significantly increases temephos toxicity [27]. These findings suggested that ABC transport, which consists of ATP-dependent efflux pumps, might be involved with compound traffic and multidrug resistance phenotypes. New insights into insecticide efflux, ATP-dependent efflux pump inhibitors, and/or RNAi associated with pesticides will potentially assist in the development of control strategies for important vectors of infectious diseases like A. aegypti. Rad50 shares topological and sequence features with SMC proteins [52]. Notably, Rad50 has a relatively well-conserved LSGG motif compared to the classic ABC proteins. Moreover, it has an extensive coiled region that facilities dimerization of large molecules restoring the close proximity of the Walker A and B motifs for nucleotide binding [53]. SMCs have more degenerated versions of this signature motif and contain minimal Walker A and B motifs (Supplemental material 1) [54]. Finally, perhaps a distant lineage but still within the ABC diversification [55], are the DNA repair enzymes such as MutS [56]. SMC proteins formed a highly supported clade with the Rad50 proteins. These proteins form dimers and have a conserved mechanism of conformational change observed in the classic ABC proteins. The ATP binding and NBD dimerization promote changes in the substrate-binding domains that are important for the function of the ABC-type ATPases. The substrate-binding domains of the SMC and Rad50 proteins are located in similar positions as the classic ABC proteins [52]. The ABC proteins subfamilies are grouped together based on sequence similarity and proteins belonging to the same subfamily usually have similar functions. Our results showed that ABC subfamilies were always strongly recovered in the gene family phylogeny and that the sequences of SMC and Rad50 proteins formed a well-supported clade (100 bootstrap support), sister to MutS proteins, and ABC transporters excluding ABCG. Functional similarities are also observed within the groups. We know the following: (i) SMC and Rad50 proteins exhibit similar functions on DNA repair and chromosomal maintenance [8, 9, 11, 12, 14], (ii) they form a strongly supported clade with ABC transporters phylogeny, and (iii) they exhibit the structural and sequence characteristics of ABC proteins. Thus, we propose these proteins be included in the ABC gene family with the creation of a new subfamily called J (Fig. 1; Table 2) that includes ABC proteins involved in DNA repair and structural maintenance of the chromosomes.

Conclusions

In summary, we found 53 classic complete ABC proteins annotated in the A. aegypti genome that were classified in traditional ABC subfamilies (A-H) as reported in other species. We also found 9 sequences of the Rad, MutS, and SMC in the Aedes genome database that clustered with human and D. melanogaster orthologs in the same clade. Considering other similarities observed between these enzymes and the classic ABC proteins, we propose these proteins be included in the ABC gene family followed by creation of a new subfamily called J that includes ABC enzymes involved in DNA repair and the structural maintenance of the chromosome.

Methods

Sequence sampling and alignment

We selected protein sequences of ABC protein subfamilies from humans (46 sequences) and Drosophila melanogaster (50 sequences) including sequences from SMC, Rad50, and MutS genes ensuring a broad evolutionary diversity. These sequences were used as query for BLAST searches of A. aegypti ABC proteins. To identify A. aegypti ABC proteins, human and Drosophila sequences were used as queries to search sequences on the mosquito genome (VectorBase) and on the NCBI protein database using BLASTp [57]. Sequences resulting from these searches that had more than 30% identity were considered as homologous (Supplemental material 2). All putative homologous sequences were submitted to the NCBI Conserved Domain Database [58, 59] to confirm the presence of the ATP-binding cassette domain. Datasets were aligned using the MUSCLE software [60] and then pruned for removal of regions with high frequency of indels using TrimAL using the “gappyout” command (Supplemental material 3) [61].

Inference of ABC gene genealogy

The maximum likelihood (ML) tree topology was inferred with the IqTree 1.6 [62] program employing the LG + R10 model of amino acid substitution that was chosen by the Bayesian information criterion. This model uses the LG amino acid replacement matrix [63] coupled with ten relative rate classes to accommodate among-site rate heterogeneity [64]. Branch support was assessed by the ultrafast bootstrap implementation of IqTree using 1000 replicates [65]. IqTree was executed via the command “iqtree -s infile -bb 1000”. Because no outgroup was included in our analysis, rooting of the ABC gene genealogy was performed using the minimal ancestor deviation method of Tria at al. [66]. Rooting is necessary for establishing the chronological direction of the ABC gene family evolution (Supplemental material 4). Additional file 1. Sequences.txt: Rad50, SMC and MutS sequences highlighted Walker A and Walker B motifs and ABC signature. Additional file 2. ABC.txt: Unaligned ABC sequences used in this study. Additional file 3. ABCalin.txt: Aligned and trimmed ABC sequences employed in evolutionary analyses. Additional file 4. ABC.unrooted.txt: Unrooted maximum likelihood phylogenetic tree of the ABC gene family.
  64 in total

Review 1.  Structural maintenance of chromosomes (SMC) proteins: conserved molecular properties for multiple biological functions.

Authors:  A V Strunnikov; R Jessberger
Journal:  Eur J Biochem       Date:  1999-07

2.  Genetical aspects of insecticide resistance.

Authors:  R Milani
Journal:  Bull World Health Organ       Date:  1963       Impact factor: 9.408

3.  Characterization of the human ABC superfamily: isolation and mapping of 21 new genes using the expressed sequence tags database.

Authors:  R Allikmets; B Gerrard; A Hutchinson; M Dean
Journal:  Hum Mol Genet       Date:  1996-10       Impact factor: 6.150

4.  Phylogenetic analysis of the ATP-binding cassette transporter family in three mosquito species.

Authors:  Hong Lu; Yongyu Xu; Feng Cui
Journal:  Pestic Biochem Physiol       Date:  2015-11-17       Impact factor: 3.963

5.  Evolutionary analyses of ABC transporters of Dictyostelium discoideum.

Authors:  Christophe Anjard; William F Loomis
Journal:  Eukaryot Cell       Date:  2002-08

6.  A phylogenomic study of the MutS family of proteins.

Authors:  J A Eisen
Journal:  Nucleic Acids Res       Date:  1998-09-15       Impact factor: 16.971

7.  The high turnover Drosophila multidrug resistance-associated protein shares the biochemical features of its human orthologues.

Authors:  Flóra Szeri; Attila Iliás; Viola Pomozi; Steven Robinow; Eva Bakos; András Váradi
Journal:  Biochim Biophys Acta       Date:  2008-11-20

8.  The N-terminal region of ABC50 interacts with eukaryotic initiation factor eIF2 and is a target for regulatory phosphorylation by CK2.

Authors:  Sonia Paytubi; Nicholas A Morrice; Jerome Boudeau; Christopher G Proud
Journal:  Biochem J       Date:  2008-01-01       Impact factor: 3.857

9.  Cloning and characterization of a RNAse L inhibitor. A new component of the interferon-regulated 2-5A pathway.

Authors:  C Bisbal; C Martinand; M Silhol; B Lebleu; T Salehzada
Journal:  J Biol Chem       Date:  1995-06-02       Impact factor: 5.157

10.  The yeast RAD50 gene encodes a predicted 153-kD protein containing a purine nucleotide-binding domain and two large heptad-repeat regions.

Authors:  E Alani; S Subbiah; N Kleckner
Journal:  Genetics       Date:  1989-05       Impact factor: 4.562

View more
  5 in total

1.  Transcriptomic modulation in response to an intoxication with deltamethrin in a population of Triatoma infestans with low resistance to pyrethroids.

Authors:  Lucila Traverso; Jose Manuel Latorre Estivalis; Gabriel da Rocha Fernandes; Georgina Fronza; Patricia Lobbia; Gastón Mougabure Cueto; Sheila Ons
Journal:  PLoS Negl Trop Dis       Date:  2022-06-29

Review 2.  Multidrug Resistance in Cancer Cells: Focus on a Possible Strategy Plan to Address Colon Carcinoma Cells.

Authors:  Chenmala Karthika; Raman Sureshkumar; Mehrukh Zehravi; Rokeya Akter; Faraat Ali; Sarker Ramproshad; Banani Mondal; Milton Kumar Kundu; Abhijit Dey; Md Habibur Rahman; Angela Antonescu; Simona Cavalu
Journal:  Life (Basel)       Date:  2022-05-30

3.  Comparative and functional genomics of the ABC transporter superfamily across arthropods.

Authors:  Shane Denecke; Ivan Rankić; Olympia Driva; Megha Kalsi; Ngoc Bao Hang Luong; Benjamin Buer; Ralf Nauen; Sven Geibel; John Vontas
Journal:  BMC Genomics       Date:  2021-07-19       Impact factor: 3.969

Review 4.  Expression and Function of ABC Proteins in Fish Intestine.

Authors:  Flavia Bieczynski; Julio C Painefilú; Andrés Venturino; Carlos M Luquet
Journal:  Front Physiol       Date:  2021-12-09       Impact factor: 4.566

5.  A near-chromosome level genome assembly of the European hoverfly, Sphaerophoria rueppellii (Diptera: Syrphidae), provides comparative insights into insecticide resistance-related gene family evolution.

Authors:  Emma Bailey; Linda Field; Christopher Rawlings; Rob King; Fady Mohareb; Keywan-Hassani Pak; David Hughes; Martin Williamson; Eric Ganko; Benjamin Buer; Ralf Nauen
Journal:  BMC Genomics       Date:  2022-03-12       Impact factor: 3.969

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.