Characteristics of 29 novel atypical solute carriers of major facilitator superfamily type: evolutionary conservation, predicted structure and neuronal co-expression.

Emelie Perland1, Sonchita Bagchi2, Axel Klaesson3, Robert Fredriksson2.   

Abstract

Solute carriers (SLCs) are vital as they are responsible for a major part of the molecular transport over lipid bilayers. At present, there are 430 identified SLCs, of which 28 are called atypical SLCs of major facilitator superfamily (MFS) type. These are MFSD1, 2A, 2B, 3, 4A, 4B, 5, 6, 6L, 7, 8, 9, 10, 11, 12, 13A, 14A and 14B; SV2A, SV2B and SV2C; SVOP and SVOPL; SPNS1, SPNS2 and SPNS3; and UNC93A and UNC93B1. We studied their fundamental properties, and we also included CLN3, an atypical SLC not yet belonging to any protein family (Pfam) clan, because of its involvement in the same neuronal degenerative disorders as MFSD8. With phylogenetic analyses and bioinformatic sequence comparisons, the proteins were divided into 15 families, denoted atypical MFS transporter families (AMTF1-15). Hidden Markov models were used to identify orthologues from human to Drosophila melanogaster and Caenorhabditis elegans. Topology predictions revealed 12 transmembrane segments (for all except CLN3), corresponding to the common MFS structure. With single-cell RNA sequencing and in situ proximity ligation assay on brain cells, co-expressions of several atypical SLCs were identified. Finally, the transcription levels of all genes were analysed in the hypothalamic N25/2 cell line after complete amino acid starvation, showing altered expression levels for several atypical SLCs.
© 2017 The Authors.

Keywords:  atypical SLC; family clustering; major facilitator superfamily; nutrition; solute carrier; topology

Year:  2017        PMID: 28878041      PMCID: PMC5627054          DOI: 10.1098/rsob.170142

Source DB:  PubMed          Journal:  Open Biol        ISSN: 2046-2441            Impact factor:   6.411


Introduction

It is essential that transport of nutrients, waste and drugs over lipid bilayers is executed accurately to maintain homeostasis within the body, and disturbances in the transport systems are associated with Mendelian diseases [1,2]. Most transport is carried out by three major types of transporters [3]: channels, primary active transporters and secondary active transporters. With 430 members [4], the secondary active transporters, commonly called solute carriers (SLCs), constitute the largest group of membrane-bound transporters in humans [5]. The SLCs are currently divided into 52 families [6]. SLCs use energy from coupled ions or facilitative diffusion to move substrates via coupled transport, exchange or uniport [7]. SLC transporters are crucial throughout the body, and their importance is particularly prominent in the brain, where they, for example, gate nutrients over the blood–brain barrier [8], terminate neuronal transmission by clearing neurotransmitters from the synaptic cleft [9,10], refill vesicles [11] and maintain the glutamine–glutamate cycle [12]. These mechanisms are exploited in pharmacology, where transporters serve either as direct drug targets [2,10] or indirectly as facilitators of drug distribution to specific tissues [13]. Most SLC proteins can be divided into Pfam clans based on sequence similarity [4,14], where the major facilitator superfamily (MFS; Pfam clan id: CL0015), amino acid/polyamine/organocation (APC; CL0062), cation : proton antiporter/anion transporter (CPA/AT; CL0064) and drug/metabolite transporter superfamily (DMT; CL0184) clans include more than one SLC family [4,14,15]. Approximately one-third of all SLCs belong to the MFS clan [4], making it the largest group of phylogenetically related SLCs. MFS is a large and diverse family of proteins [16], which evolved from a common ancestor [17]. This ancient family has members in several organisms, including bacteria, yeast, insects and mammals [16-20].
As MFS proteins are closely related, they usually share protein topology. MFS proteins are single polypeptides [16], usually composed of 400–600 amino acids [21]. They probably arose by duplication of a unit of six transmembrane segments (TMS), providing the N and C domains, which are connected by a long cytoplasmic loop between TMS 6 and 7 [21], resulting in a 12 TMS protein [17]. It is suggested that transporters containing the MFS fold move substrates via the rocker-switch mechanism [22] or through the updated clamp-and-switch model [23]. Among the 430 human SLCs, 30 proteins are called atypical SLCs as they are evolutionarily connected to SLCs [4], but are yet to be classified into any existing SLC family. Twenty-eight of the atypical SLCs belong to the MFS Pfam clan [4] and are discussed in this article, together with the non-MFS Pfam clan protein ceroid lipofuscinosis, neuronal 3 (CLN3). According to the transporter classification database [24], CLN3 belongs to the equilibrative nucleoside transporter family, which is a subfamily of the larger MFS superfamily. Additional atypical SLCs are TMEM104, which belongs to the APC clan, and OCA2, which clusters with the IT clan [4]. The atypical SLCs of MFS type are the major facilitator superfamily domain containing (MFSD) proteins, MFSD1, 2A, 2B, 3, 4A, 4B, 5, 6, 6L, 8, 9, 10, 11, 12, 13A, 14A and 14B; the synaptic vesicle glycoprotein 2 (SV2) proteins, SV2A, SV2B and SV2C; the SV2-related proteins SVOP and SVOPL; three sphingolipid transporters, SPNS1, SPNS2 and SPNS3; and two unc-93 proteins, UNC93A and UNC93B1 [4]. These proteins were identified as possible SLCs by searching the human proteome using hidden Markov models (HMM) composed of known SLC sequences originating from the MFS Pfam clan [4]. MFSD7 was also included in the analysis and considered as an atypical SLC, because of its status as an orphan protein. However, MFSD7 is already classified into the SLC49 family [25].
Knowledge about atypical SLCs is limited, which is why we aim to present a cohesive study of the basic characteristics of 29 atypical SLCs belonging to the MFS clan. They cluster phylogenetically with SLC families from the MFS Pfam clan, SLC2, 15, 16, 17, 18, 19, SLCO (SLC21), 22, 29, 33, 37, 40, 43, 45, 46 and 49 [4], suggesting that they have transporter properties and are involved in homeostatic maintenance. Since the atypical SLCs are MFS proteins, it is likely that they all consist of the common 12 TMS polypeptide [17], which has been predicted for some (e.g. MFSD1 [19], MFSD2A [26], MFSD8 [27,28], SVOP [29] and UNC93B1 [30]), while CLN3 only has six predicted TMSs [31,32]. Several atypical SLCs are expressed in the brain, where they are found in neurons [19,20,33,34] and the CNS vasculature system [35]. Concerning their subcellular expression, atypical SLCs are expressed both in the plasma membrane [19,36] and intracellular membranes [27,33,37-40] (localizations summarized in table 1). There are also contradictory reports, suggesting that the same protein is located in several subcellular locations; MFSD1 is found in embryonic mouse neuronal plasma membranes [19] and lysosomal membranes in HeLa and rat liver cells [39,41], which could be explained by translocation of the transporters in the cell, serving multiple functions under different conditions or states of the cell. SV2 proteins are identified both at synaptic vesicles [49] and the plasma membrane, possibly because the synaptic vesicles fuse with the plasmalemma during neurotransmitter release. CLN3 is expressed at the plasma membrane as well as on endosome/lysosome membranes [34], where it is involved in neuronal ceroid lipofuscinosis, which leads to neurodegenerative disorders resulting from the accumulation of lipofuscin [57]. This is of interest because MFSD8 (known as CLN7) is also involved in this pathology [58].
Table 1.

Basic facts about atypical SLCs.

| atypical SLC | aliases | Human Genome Nomenclature Committee (HGNC) ID | TCDB ID | protein size | subcellular expression | substrate |
| --- | --- | --- | --- | --- | --- | --- |
| MFSD1 | SMAP4 | HGNC:25874 | | 514 aa | plasma membrane [19] and lysosomes [39,41] | |
| MFSD2A | NLS1 | HGNC:25897 | TC: 2.A.2.3.8 | 543 aa | plasma membrane [42] and ER [37] | sodium-dependent phospholipid transport [43] |
| MFSD2B | | HGNC:37207 | | 504 aa | ER [37] | |
| MFSD3 | | HGNC:25157 | TC: 2.A.1.25.4 | 412 aa | plasma membrane [19] | |
| MFSD4A | MFSD4 | HGNC:25433 | | 514 aa | | |
| MFSD4B | KIAA1919, NAGLT1 | HGNC:21053 | | 518 aa | intracellular [44,45] | sodium-dependent glucose transport [38] |
| MFSD5 | hsMOT2 | HGNC:28156 | | 557 aa | | molybdate-anions transport [46] |
| MFSD6 | MMR2 | HGNC:24711 | TC: 2.A.1.65.6 | 791 aa | | |
| MFSD6L | | HGNC:26656 | TC: 2.A.1.65.10 | 586 aa | | |
| MFSD7 | MYL5, SLC49A3 | HGNC:26177 | TC: 2.A.1.28.2 | 559 aa | | |
| MFSD8 | CLN7 | HGNC:28486 | TC: 2.A.1.2.56 | 518 aa | lysosomal [27] | |
| MFSD9 | | HGNC:28158 | TC: 2.A.1.2.72 | 474 aa | | |
| MFSD10 | TETRAN | HGNC:16894 | TC: 2.A.1.2.73 | 455 aa | plasma membrane [47] and intracellular [36] | organic anions [36] |
| MFSD11 | | HGNC:25458 | TC: 2.A.1.58.3 | 449 aa | | |
| MFSD12 | | HGNC:28299 | | 480 aa | mitochondria [44,45] | |
| MFSD13A | TMEM180 | HGNC:26196 | | 517 aa | | |
| MFSD14A | HIAT1, MF14A | HGNC:23363 | | 490 aa | intracellular [33] | presumed sugar transport [48] |
| MFSD14B | HIATL1, MF14B | HGNC:23376 | TC: 2.A.1.2.30 | 506 aa | intracellular [33] | |
| SV2A | KIAA0736 | HGNC:20566 | | 742 aa | vesicular [49–51] | sugar [52] |
| SV2B | KIAA0735 | HGNC:16874 | | 683 aa | vesicular [51] | |
| SV2C | KIAA1054 | HGNC:30670 | | 727 aa | vesicular [51,53] | |
| SVOP | | HGNC:25417 | TC: 2.A.1.82.3 | 548 aa | vesicular [54,55] | |
| SVOPL | | HGNC:27034 | | 492 aa | | |
| SPNS1 | SPIN1, HSpin1 | HGNC:30621 | TC: 2.A.1.49.2 | 528 aa | mitochondria [56] | |
| SPNS2 | | HGNC:26992 | TC: 2.A.1.49.6 | 549 aa | | |
| SPNS3 | | HGNC:28433 | | 512 aa | | |
| UNC93A | | HGNC:12570 | TC: 2.A.1.58.2 | 457 aa | | |
| UNC93B1 | UNC93, UNC93B | HGNC:13481 | TC: 2.A.1.58.7 | 597 aa | ER [40] | |
| CLN3 | BTS, Battein | HGNC:2074 | TC: 2.A.57.5.1 | 438 aa | plasma membrane [34] and lysosomal [32,34] | |

Several atypical SLCs are affected by food intake and nutritional status, where both high-fat diet and food deprivation alter their expression levels in rodents [19,20,33,37,59]. Furthermore, the expression of Mfsd11 is altered in immortalized mouse hypothalamic N25/2 cells exposed to complete amino acid starvation [60]. This suggests that the atypical SLCs are involved in maintaining the nutritional status both in vivo and in vitro, which reinforces the importance of understanding their fundamental properties. Here, we phylogenetically studied interrelations between the atypical SLCs of MFS type and similarities between the protein sequences. Furthermore, we investigated if the atypical SLCs met the requirements to belong in any of the existing 52 SLC families. SLC families are divided on the basis of homology or phenotype [61], and a protein must share at least 20% sequence identity with another family member [62] to be placed in that family. HMMs were built to search proteomes from several organisms to identify related proteins, showing their evolutionary development. Furthermore, topology predictions were made for the human protein sequences, suggesting 12 TMS for all investigated atypical SLCs, except for CLN3 with its 11 predicted TMS. With single-cell RNA sequencing data retrieved from 10X Genomics (www.10xgenomics.com/), we examined which atypical SLCs were expressed in the same cell in brain tissue from an embryonic day 18 (E18) mouse. We supplemented these results at protein level using in situ proximity ligation assay [63,64], where interactions between proteins were quantified in mouse brain sections. Finally, using microarray data [60], we analysed if and how the atypical SLCs were affected by complete amino acid deprivation in N25/2 cells.

Material and methods

Clustering of human atypical SLCs of MFS type

To study the interrelations between atypical SLCs of MFS type, the longest amino acid sequences for the human MFSD1, 2A, 2B, 3, 4A, 4B, 5, 6, 6L, 7, 8, 9, 10, 11, 12, 13A, 14A, 14B, SV2A, SV2B, SV2C, SVOP, SVOPL, SPNS1, SPNS2, SPNS3, UNC93A, UNC93B1 and CLN3 proteins (for sequences, see electronic supplementary material, table S1) were combined in a multiple sequence alignment using PSI/TM-Coffee [65] before inferring their relationship according to the Bayesian approach, as implemented in MrBayes 3.2.2 [66,67]. The analysis was run via the Beagle library [68] on six chains (five heated and one cold), with two runs in parallel (nruns = 2) for a maximum of 2 000 000 generations. An additional tree was built, including all known SLC and atypical SLC sequences originating from the MFS Pfam clan. After a multiple sequence alignment using PSI/TM-Coffee [65], a phylogenetic tree was built using RAxML [69] on a 14-core Intel CPU workstation. The tree was calculated on protein sequences using the GAMMAJTT amino acid model with 500 bootstrap replicas, and a consensus tree was calculated from these using the built-in consensus tree calculation in RAxML. SLC families are built on homology, function, phenotype [61] and sequence identities [62]. As the atypical SLCs group among SLC families [4], it is possible that they belong to already annotated SLC families or to new families. To study this further, sequence identities were analysed using global pairwise sequence alignment based on the Needleman–Wunsch algorithm [70]. The similarities between human atypical SLCs were analysed, followed by comparison with all SLC members of MFS type (SLC family 2, 15, 16, 17, 18, 19, SLCO, 22, 29, 33, 37, 40, 43, 45, 46 and 49) (matrices in electronic supplementary material, table S1).
To group the atypical proteins into families, the following parameters were considered: (i) at least 20% identity to other atypical SLCs, (ii) phylogenetic clustering among the atypical SLCs, (iii) phylogenetic clustering among SLCs and (iv) at least 20% identity to at least one other SLC family member. Families including atypical SLCs were called atypical MFS transporter families (AMTF).
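The identity criterion rests on global pairwise alignment. The following is a minimal Python sketch of a Needleman–Wunsch alignment followed by a percent identity calculation. The scoring values (match +1, mismatch −1, gap −1), the choice of alignment length as the denominator, and the function names are assumptions made for illustration; the source does not state which scoring matrix, gap penalties or identity definition were used.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Return one optimal global alignment of a and b as two gapped strings.

    The scoring scheme is an illustrative assumption, not the one used
    in the study.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover the alignment.
    i, j = n, m
    out_a, out_b = [], []
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned positions as a percentage of the alignment length."""
    aligned_a, aligned_b = needleman_wunsch(a, b)
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def meets_identity_criterion(a, b, threshold=20.0):
    """Check the 20% identity criterion for one pair of sequences."""
    return percent_identity(a, b) >= threshold
```

With real protein sequences, a substitution matrix and affine gap penalties would normally replace the simple scores used here, and the resulting identities would differ accordingly.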

Hidden Markov models to identify related proteins

Hidden Markov models (HMM) were built for all 29 atypical SLCs by running mammalian sequences through hmmbuild from the HMMER package [71]. The models were used to search the protein datasets (obtained from Ensembl version 86 [72]) listed in table 2, to identify related proteins in yeast, roundworm, fruit fly, zebrafish, chicken, mouse and human. Sequences were manually curated, and proteins originating from the same locus and pseudogenes were removed. Genes not in closest phylogenetic proximity with the human version were also removed, as they either lacked specific orthologues in mammals or clustered phylogenetically with other proteins. Predicted full-length proteins were kept as reliable related hits. As the atypical SLCs are relatively similar in amino acid sequence, some proteins were identified by several HMMs. Phylogenetic analyses were therefore performed, using RAxML, as described above, to determine which were orthologues and which were other related proteins. All identified proteins were annotated and listed with accession number in electronic supplementary material, table S2. Note that some proteins were given names with Like (L) as a suffix, and these were related proteins identified by the HMM, without belonging to the human protein cluster. It is possible that these are orthologues to proteins not studied here, or that they lack equivalents in humans.
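Because one protein can be retrieved by several models, a first bookkeeping step is to list which hits are shared between searches, so that they can be resolved phylogenetically. The sketch below is a hypothetical illustration in Python; the function name and the protein identifiers are invented, and the actual curation in the study was done manually.

```python
from collections import defaultdict


def hits_shared_between_models(hits_by_model):
    """Map each protein ID to the sorted list of models that retrieved it,
    keeping only proteins retrieved by more than one model."""
    models_by_protein = defaultdict(set)
    for model, proteins in hits_by_model.items():
        for protein in proteins:
            models_by_protein[protein].add(model)
    return {protein: sorted(models)
            for protein, models in models_by_protein.items()
            if len(models) > 1}


# Hypothetical hit lists; the protein IDs are made up for illustration.
example = {
    "MFSD14A": ["protA", "protB"],
    "MFSD14B": ["protB", "protC"],
    "MFSD9": ["protD"],
}
print(hits_shared_between_models(example))  # {'protB': ['MFSD14A', 'MFSD14B']}
```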
Table 2.

Datasets searched for related proteins.

| species | common name | dataset version |
| --- | --- | --- |
| S. cerevisiae | yeast | R64-1-1.pep.all |
| C. elegans | roundworm | WBcel235.pep.all |
| D. rerio | zebrafish | GRCz10.pep.all |
| D. melanogaster | fruit fly | BDGP6.pep.all |
| G. gallus | chicken | Galgal4.pep.all |
| H. sapiens | human | GRCh38.pep.all |
| M. musculus | mouse | GRCm38.pep.all |

Structural predictions to study possible transporter properties

For an MFS protein to have optimal transporter properties, 12 transmembrane segments (TMS) are required [17]. To investigate if the proteins of interest possessed the common MFS structures, topology predictions were done using the constrained consensus TOPology prediction server (CCtop) [73,74]. CCtop combines the results from 10 known online topology tools to incorporate parameters like hydrophobicity, charge bias, helix lengths and signal peptides in the predictions [75,76], and further combines the result with structural information from existing experimental and computational sources [73]. Three of the proteins, MFSD13A, SPNS3 and CLN3, were not predicted to contain 12 TMS, and homology models were built to verify these three predictions. The tertiary structures were built using Swiss Model, a fully automated homology program [77], where structurally known MFS transporters were used as templates. MFSD13A was aligned against the bacterial sodium symporter, MelB [78], providing a global model quality estimation (GMQE) of 0.47. GMQE indicates the reliability of models on a scale from 0 to 1, where 1 represents total reliability. For the SPNS3 model, the proton-driven YajR transporter from E. coli was used as template [79], with a GMQE of 0.45. For CLN3, a peptide MFS transporter from bacteria [80] was used as template, providing a score of 0.44. Homology models were adjusted in the open-source Java viewer Jmol [81] (http://www.jmol.org/). Finally, the amino acids in each TMS from the homology models were manually identified and compared with the ones predicted by CCtop.
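The final comparison, which was done manually in the study, amounts to checking which segments of the homology model have an overlapping counterpart in the topology prediction. A minimal Python sketch of that check follows; the residue ranges are hypothetical and the function names are illustrative only.

```python
def overlaps(seg_a, seg_b):
    """True if two inclusive residue ranges (start, end) share a residue."""
    return seg_a[0] <= seg_b[1] and seg_b[0] <= seg_a[1]


def unmatched_segments(model_tms, predicted_tms):
    """Return 1-based indices of model segments that have no overlapping
    segment in the topology prediction."""
    return [index
            for index, segment in enumerate(model_tms, start=1)
            if not any(overlaps(segment, predicted) for predicted in predicted_tms)]


# Hypothetical residue ranges, not taken from the study.
model = [(10, 30), (45, 65), (80, 100)]
predicted = [(12, 31), (82, 99)]
print(unmatched_segments(model, predicted))  # [2]
```

A segment reported here would be a candidate for closer inspection, for example to see whether it is amphipathic.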

RNA analysis from single brain cells, to identify co-expression between atypical SLCs

The complete dataset (9 k brain cells from an E18 Mouse) for single-cell RNA sequencing from E18 mouse brain was downloaded from 10X Genomics (www.10xgenomics.com) under a Creative Commons license. The data were analysed to investigate co-expression of atypical SLCs of MFS type in single brain cells. Of note, 10 289 cells were collected from cortex, hippocampus and subventricular zone of an E18 mouse, and sequenced on an Illumina HiSeq 4000 with approximately 42 000 reads per cell (10X Genomics). A digital expression matrix was constructed from these data to extract information on the atypical SLCs, and cells with fewer than three identified transcripts were removed. Then, cells expressing fewer than two different atypical SLC transcripts were removed. This resulted in 9693 cells co-expressing 21 atypical SLCs. To assess the significance of these observations, we used a bootstrapping approach, implemented in a custom-written Java program. Briefly, as our null hypothesis, we assumed that there was no co-expression in the data beyond what is expected by chance. We created a dataset with the same frequency of each of the transcripts as observed in our actual data and randomly assigned these transcripts to 9693 cells. This process was repeated 1000 times and the mean number of transcripts and the population standard deviation of the number of transcripts for each cell were calculated. We considered any value more than one standard deviation above or below the mean of the bootstrapped data as significantly different from chance.
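The resampling idea can be sketched as follows. This is a Python illustration, not the authors' Java program, and it rests on one interpretation of the description: for a pair of transcripts, each is assigned at random to as many cells as it was observed in, and the number of cells receiving both is recorded in every round. The function names, the pairwise formulation and the fixed seed are assumptions.

```python
import random
import statistics


def null_coexpression(freq_a, freq_b, n_cells, n_rounds=1000, seed=0):
    """Simulate chance co-expression of two transcripts.

    freq_a and freq_b are the numbers of cells expressing each transcript.
    Returns the mean and population standard deviation of the number of
    cells that receive both transcripts under random assignment.
    """
    rng = random.Random(seed)
    counts = []
    for _ in range(n_rounds):
        cells_a = set(rng.sample(range(n_cells), freq_a))
        cells_b = rng.sample(range(n_cells), freq_b)
        counts.append(sum(1 for cell in cells_b if cell in cells_a))
    return statistics.mean(counts), statistics.pstdev(counts)


def differs_from_chance(observed, mean, sd):
    """Apply the one-standard-deviation criterion described in the text."""
    return observed > mean + sd or observed < mean - sd
```

For example, `null_coexpression(50, 40, 200)` simulates two transcripts seen in 50 and 40 of 200 cells; the simulated mean should lie close to the expected overlap of 50 × 40 / 200 = 10 cells.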

In situ proximity ligation assay, sample preparation, execution and analysis

To complement the co-expression analysis, in situ proximity ligation assay (PLA) was performed. Intra-peritoneal injections of sodium pentobarbital (Apoteket Farmaci, Sweden) (10 mg kg−1) were used to anesthetize adult C57BL/6J mice, followed by trans-cardiac perfusion using 4% formaldehyde (Histolab) and then paraffin embedding, as described in [20]. The brains were cut in 7 µm sections using a Microm 355S STS cool cut microtome and attached on Superfrost Plus slides (Menzel-Gläser). Each slide was dried overnight at 37°C before being stored at 4°C. Sections were deparaffinized by 10 min washes in X-TRA solve (Medite, Dalab), followed by an ethanol (Solveco) rehydration series ranging from 100% to water. Antigen retrieval was performed in boiling 0.01 M citric acid (Sigma-Aldrich) at pH 6.0, for 10 min, after which the slides were cooled, washed in PBS, and placed in a humidity chamber throughout the experiment to avoid drying out during incubations at 37°C. Brain sections were blocked for 1 h at 37°C in blocking solution, provided by Duolink II fluorescence kit (orange detection reagents; Olink Biosciences), followed by primary antibody incubation at 4°C overnight (see table 3 for antibody information). The antibodies were diluted in specific antibody diluent provided by Duolink II fluorescence kit (orange detection reagents; Olink Biosciences). The slides were then washed 2 × 5 min in Wash Buffer A, while kept on orbital shaking. Two PLA probes, PLUS and MINUS, were added to each selected primary antibody combination (summarized in table 3). The probes were diluted in antibody diluent, and added to the slides followed by incubation for 1 h at 37°C. Slides were washed for 2 × 5 min in Wash Buffer A, before adding the Ligation-Ligase solution (Duolink II fluorescence kit; Olink Biosciences), followed by 30 min incubation at 37°C.
Slides were washed 2 × 5 min in Wash Buffer A, before adding the Amplification-Polymerase solution (Duolink II fluorescence kit; Olink Biosciences), followed by incubation for 100 min, at 37°C. After tapping the Amplification-Polymerase off, the slides were washed 2 × 10 min in Wash Buffer B, followed by a 1 min washing step using 0.01× Wash Buffer B. Slides were dried under dark conditions, and mounted in Duolink in situ Mounting Medium, including DAPI (Olink Biosciences).
Table 3.

Antibody combinations and concentrations used for the in situ proximity ligation assay.

| protein | origin | concentration | supplier | catalogue number | combined with | PLA probes |
| --- | --- | --- | --- | --- | --- | --- |
| MFSD3 | rabbit | 1 : 50 | Sigma-Aldrich | AV51707 | MFSD11 | + |
| MFSD4A | rabbit | 1 : 50 | Sigma-Aldrich | SAB1305276/AV53395 | MFSD11 | + |
| MFSD6 | goat | 1 : 20 | Sigma-Aldrich | SAB2502050 | MFSD11 | |
| MFSD7 | rabbit | 1 : 100 | Abcam | ab180496 | MFSD11 | + |
| MFSD8 | rabbit | 1 : 50 | Sigma-Aldrich | HPA044802 | MFSD9, MFSD11 | + |
| MFSD9 | goat | 1 : 50 | Santa Cruz | sc-247973 | MFSD8, MFSD10, MFSD14A, MFSD14B | |
| MFSD10 | rabbit | 1 : 20 | Sigma-Aldrich | HPA037398 | MFSD9, MFSD11 | + |
| MFSD11 | goat/rabbit | 1 : 80 | Santa Cruz/Sigma-Aldrich | sc-243472/HPA022001 | MFSD3, MFSD4A, MFSD6, MFSD7, MFSD8, MFSD10, MFSD14A | −/+ |
| MFSD14A | rabbit | 1 : 100 | Sigma-Aldrich | SAB1306449 | MFSD9, MFSD11 | + |
| MFSD14B | rabbit | 1 : 100 | Sigma-Aldrich | SAB2107506 | MFSD9, MFSD11 | + |

Micrographs were taken using a Zeiss Axioplan 2 epifluorescent microscope, and 11 Z-stacks from various brain areas, like cortex and striatum, were acquired for each antibody-pair combination. Filters suitable for the fluorophores used and a filter to detect autofluorescence were applied. The Z-stacked images were transformed using the maximum intensity projection function in ImageJ v. 1.48 [82], to merge the signals into a single-plane image. CellProfiler v. 2.2.0 [83,84] was then used to analyse the signals. The autofluorescence data were used to subtract background from the images, after which the images were cleared using a white tophat filter to remove anything over 10 pixels in diameter, leaving only the amplified signal. DAPI staining was used to define cells to enable automated counting of PLA signals within specific cells, and all signals with pixel intensity above 0.08 were automatically counted. The combined signal from all brain areas was divided by the number of cells, to get an average number of interactions within the brain. A graph was plotted using GraphPad Prism 5 software.
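Two of the steps above are simple enough to illustrate in plain Python: the maximum intensity projection, which keeps the brightest value per pixel across the Z-stack, and the final averaging of counted signals per cell. This sketch does not reproduce ImageJ or CellProfiler; the function names are hypothetical, images are represented as nested lists, and each detected signal is assumed to be summarized by a single intensity value.

```python
def max_intensity_projection(stack):
    """Collapse a Z-stack (a list of equally sized 2D planes) into one plane
    by keeping, for each pixel, the brightest value across all planes."""
    rows, cols = len(stack[0]), len(stack[0][0])
    return [[max(plane[r][c] for plane in stack) for c in range(cols)]
            for r in range(rows)]


def mean_signals_per_cell(signal_intensities, n_cells, threshold=0.08):
    """Count signals whose intensity exceeds the threshold and divide the
    count by the number of cells."""
    counted = sum(1 for value in signal_intensities if value > threshold)
    return counted / n_cells


# Toy example with two 2 x 2 planes.
stack = [
    [[0.0, 0.2], [0.1, 0.0]],
    [[0.3, 0.1], [0.0, 0.05]],
]
print(max_intensity_projection(stack))  # [[0.3, 0.2], [0.1, 0.05]]
```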

Analysis of gene expression after complete amino acid starvation in N25/2 mouse hypothalamic cells

It was previously shown that gene expression of Mfsd11 is altered upon complete amino acid starvation for 1, 2, 3, 5 or 16 h in immortalized N25/2 mouse hypothalamic cells [60]. Here, we reused the data from that microarray analysis (accession number GSE61402) to study if the atypical SLCs were affected by the removal of all amino acids. Data were downloaded and the probes most similar to the human proteins were included in the analysis. Note that two genes (Unc93a and Cln3) had two probes each that correspond to the human protein on the GeneChip, which is why both are presented in the heat map. The duplicated probes are splice variants that are present under different accession numbers in the database used to define the genes on the chip. Genesis version 1.7.6 was used to generate the heat map. For 1, 2, 3 and 16 h, the difference between the log2 values of expression in starved and control cells was used in the analysis. For 5 h of starvation, the log2 fold change value of expression was used. Green colour represents downregulation and red colour represents upregulation, where greater alteration corresponds to greater colour intensity.
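The two quantities used for the heat map are equivalent on the log scale, since log2(starved) − log2(control) = log2(starved / control). A small Python sketch makes this explicit; the function names and the numeric values are illustrative assumptions and do not come from GSE61402.

```python
import math


def log2_difference(log2_starved, log2_control):
    """Difference between log2 expression values (starved minus control)."""
    return log2_starved - log2_control


def log2_fold_change(starved, control):
    """log2 of the ratio between linear-scale expression values."""
    return math.log2(starved / control)


def direction(value):
    """Positive values mean upregulation (red), negative values
    downregulation (green)."""
    if value > 0:
        return "up"
    if value < 0:
        return "down"
    return "unchanged"


# Illustrative values only.
print(log2_fold_change(8, 2))     # 2.0
print(log2_difference(3.0, 1.0))  # 2.0
```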

Results

Interrelations between human SLCs of MFS type

The phylogenetic interrelations between atypical SLCs are presented in the tree in figure 1, which displays the schematic branching order. Some sequences had seemingly diverged from the other proteins, like MFSD3, MFSD6, MFSD6L, MFSD7, MFSD8, MFSD12, MFSD13A and CLN3 (figure 1), while others formed potential families connected by a common node. Grouping of proteins is important as it strengthens the possibility to elucidate evolutionary conservation, mechanism and substrate specificity, because similar sequences usually share these characteristics [85]. To divide the atypical SLCs into families, members had to share phylogenetic closeness and be at least 20% identical to other proteins in the family.
Figure 1.

Interrelations between human atypical SLCs. The Bayesian approach was implemented when inferring the phylogenetic interrelations between the longest splice variants for 29 human atypical SLCs. When combining the phylogenetic clustering with sequence identities, the proteins could be divided into 15 families denoted Atypical MFS Transporter Family (AMTF) 1-15. The tree displays the schematic branching order of the human atypical SLCs of MFS type, together with CLN3.

Among the atypical SLCs we identified 15 possible families that were denoted Atypical MFS Transporter Family 1-15 (AMTF1-15), where seven families contained more than one atypical SLC protein. AMTF1 included MFSD9, MFSD10, MFSD14A and MFSD14B; AMTF3 contained MFSD4A and MFSD4B; and AMTF6 had MFSD1 and MFSD5 as members. MFSD2A and MFSD2B belonged to AMTF8, while SV2A, SV2B, SV2C, SVOP and SVOPL were in AMTF9. AMTF10 included MFSD11, UNC93A and UNC93B1; and AMTF11 consisted of SPNS1, SPNS2 and SPNS3 (figure 1). To examine the plausible family members further, similarities between protein sequences were analysed. All sequence identities were listed in the matrices in supplementary table 1, where 24 of the 29 atypical SLCs had more than 20% identical amino acids to at least one other atypical SLC sequence. MFSD3, MFSD6, MFSD6L, MFSD8 and MFSD13A had less than 20% identity with any other atypical SLC protein. In predicted AMTF1 (for members, see figure 1), all four proteins shared more than 20% identity with at least one other member, as was the case for AMTF9, AMTF10 and AMTF11. In AMTF3, MFSD4A and MFSD4B shared 20% identity, and AMTF8 was constituted by MFSD2A and MFSD2B sharing 37% identity. MFSD1 and MFSD5 did not cluster in closest proximity, yet shared 20% identity, and were considered constituents of the same family. The remaining eight atypical SLCs did not meet the clustering and/or identity criteria and were placed in individual families.
Taken together, the atypical SLCs can be grouped into 15 possible AMTF (summarized in figure 1). The AMTF nomenclature was used instead of the SLC nomenclature to highlight that the functions of the atypical SLCs remain to be elucidated. The distribution of the atypical SLCs among the SLCs of MFS type was investigated through a phylogenetic analysis. It showed that the proteins of interest were placed within the SLC tree, and not as outgroups (figure 2). This strengthens the hypothesis that they are novel transporters of SLC type. When comparing the sequence identities (MFS matrix 2 in supplementary table 1), the following atypical proteins had less than 20% identity with any other SLC: MFSD2A, MFSD4B, MFSD6, SV2A, SV2B, SV2C and UNC93B1. On the other hand, some atypical SLCs had at least 20% identity to members of several families, like MFSD1, which was more than 20% identical with SLC2A8, SLC16A10 and SLC19A2; and MFSD9 and MFSD10, which had 20% or higher identity with members from seven different SLC families each. Finally, no atypical SLC shared more than 20% identity with all members in a single SLC family. Therefore, it is not possible to place the atypical SLCs into existing SLC families based only on sequence identity. However, when combining the sequence identity and phylogenetic clustering (figure 2), possible family clustering is observed; MFSD7 (in AMTF5) is already classified as a member of SLC49 [25], while MFSD9, MFSD10, MFSD14A and MFSD14B (AMTF1) could belong to SLC46, and SV2A, SV2B, SV2C, SVOP and SVOPL (AMTF9) could be members of SLC22. The remaining atypical SLCs would belong to novel SLC families. If we combine the 52 SLC and 15 AMTF families (where AMTF1 is merged with SLC46; AMTF5 with SLC49; and AMTF9 with SLC22), a total of 64 different families including SLC proteins exists.
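The family count follows from simple bookkeeping: each AMTF merged into an existing SLC family removes one family from the combined total. A short Python sketch of that arithmetic is given below; the function name is illustrative.

```python
def total_families(n_slc, n_amtf, merged):
    """Combined number of families after merging some AMTF into SLC families.

    merged maps each merged AMTF to the SLC family that absorbs it.
    """
    return n_slc + n_amtf - len(merged)


# Mergers proposed in the text.
merged = {"AMTF1": "SLC46", "AMTF5": "SLC49", "AMTF9": "SLC22"}
print(total_families(52, 15, merged))  # 64
```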
Figure 2.

Atypical SLCs cluster among known SLCs of the MFS clan. RAxML was used to calculate a phylogenetic tree, showing how the atypical SLCs were related to known SLCs of MFS type. Trees were calculated on a model with 500 bootstrap replicas, and combined into a final tree using the built-in consensus tree calculation in RAxML. The highlighted proteins correspond to the atypical SLCs.

Identification of related proteins in several species

With hidden Markov models, several protein datasets were searched to identify related proteins in various species. The atypical SLCs were identified in human and mouse (figure 3), where UNC93A had duplicated in mouse, resulting in two variants on the same chromosome. All but MFSD3, MFSD6L, SPNS1 and CLN3 were found in chicken (figure 3). Furthermore, MFSD14B was identified in both the MFSD14A and MFSD14B HMM search in the chicken proteome, but it phylogenetically clustered closer to human MFSD14A. Therefore, MFSD14B was not separately included in figure 3 or electronic supplementary material, table S2, but as one of the two proteins found for MFSD14A. All except MFSD5 were detected in zebrafish (figure 3). Eight proteins had two copies each in the zebrafish proteome. Eleven atypical SLCs had related proteins in fruit flies (figure 3), where MFSD1 had two copies, MFSD14A had four copies (equally related to MFSD14B), SV2A had 10 (equally related to SV2B and SV2C) and UNC93A had two copies (equally related to UNC93B1). In the figure, we listed the proteins under the human protein to which they were most similar, and if they were equally related to several proteins we listed them in the first possible position. Identified proteins were sometimes found by several HMMs, but they were included only once in figure 3 and electronic supplementary material, table S2. About half of the atypical SLCs were found in C. elegans, while only CLN3 was identified in yeast. Furthermore, in some proteomes, several related proteins were found that did not cluster phylogenetically with the human proteins, but were still in relative proximity. We call these ‘Like’ (L) proteins, and they are included in electronic supplementary material, table S2, but not in figure 3. There are, for example, 11 proteins related to MFSD8, but none in the human cluster, and they were annotated as MFSD8L1–MFSD8L11.
Figure 3.

Evolutionarily conserved proteins. Hidden Markov models were used to identify proteins related to the atypical SLCs in the listed species. In this schematic description, the coloured boxes indicate the presence of a related atypical SLC, while no box indicates a missing protein. The × n designation corresponds to the number of proteins/variants identified, where n is a specific number. No specific marking was made where only one variant was found.

Atypical SLCs are predicted to have 12 TMS

We used CCtop to predict the topology of the human atypical SLCs. All but MFSD13A (9 TMS), SPNS3 (11 TMS) and CLN3 (11 TMS) were predicted to contain 12 TMS, the common number for MFS proteins [17]. Six TMS have been suggested for CLN3 [31,32,86,87], but different groups have identified different TMS. The general 12 TMS structure is schematically depicted in figure 4a. MFSD6, SV2A, SV2B, SV2C and UNC93B1 were longer than the regular MFS peptide (table 1), and they were all predicted to contain exceptionally long N-termini. Furthermore, MFSD6 had a relatively long extracellular loop between TMS 3 and 4, while the SV2 proteins had a longer loop between TMS 7 and 8. To verify the irregular predictions for MFSD13A, SPNS3 and CLN3, homology models were built, using structurally known MFS proteins as templates. In the homology models, both MFSD13A (figure 4b) and SPNS3 (figure 4c) were predicted to contain the expected 12 TMS, whereas CLN3 (figure 4d) was still composed of 11 TMS. A manual comparison of the amino acids in each TMS identified by CCtop with those in the homology models revealed that MFSD13A contained several amphipathic TMS (figure 4b), which could explain why they were not identified by CCtop. For SPNS3, all TMS overlapped except TMS 11, which was lacking in the secondary structure prediction. As TMS 11 was amphipathic, its hydrophobic stretch could have been considered too short to be identified as a TMS by the CCtop server. Finally, for CLN3, both models predicted the same TMS. In conclusion, we predict all studied atypical SLCs to have 12 TMS, except CLN3, which was predicted to have 11 TMS.
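To illustrate why an amphipathic helix can escape sequence-based detection, the sketch below applies a simple Kyte-Doolittle hydropathy window. This is a swapped-in, much simpler technique and not how the CCtop server works; the 19-residue window and the 1.6 cut-off follow the classical Kyte-Doolittle recommendation, and both example sequences are invented for the illustration.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_means(seq, window=19):
    """Mean hydropathy of every full-length window along the sequence."""
    scores = [KD[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def has_candidate_tms(seq, window=19, threshold=1.6):
    """True if any window exceeds the hydropathy threshold."""
    return any(mean > threshold for mean in window_means(seq, window))


# Invented 19-residue examples (not taken from any of the studied proteins).
hydrophobic = "LIVALLIVAAGLLIVALLV"   # uniformly apolar stretch
amphipathic = "LKELLKKLAEELLKKLAEL"   # apolar residues mixed with charged ones

print(has_candidate_tms(hydrophobic))  # window mean is well above 1.6
print(has_candidate_tms(amphipathic))  # charged residues pull the mean near 0
```

The amphipathic example has as many leucines as a typical membrane helix, but its charged residues lower the window average below the cut-off, which is the kind of segment a template-based homology model can still place in the membrane.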
Figure 4.

Structural prediction of the atypical SLCs. The online tool CCtop [73] was used to predict the topology of the atypical SLCs, where all but three proteins were predicted to possess the N and C domains, connected by a long cytoplasmic loop (MFS loop), resulting in a 12 transmembrane segment (TMS) polypeptide, as schematically depicted in (a). MFSD13A, SPNS3 and CLN3 diverged from the common structure, and homology models were built for them to verify the predictions. The three proteins were aligned against structurally known MFS proteins, using the automated SWISS-MODEL homology program [77]. MFSD13A (b) and SPNS3 (c) both consisted of 12 TMS, while CLN3 (d) had 11 TMS. All three contained the long intracellular loop between TMS 6 and 7.

Several atypical SLC genes are expressed in the same cells

To study co-expression of atypical SLC genes in embryonic mouse brain cells, data from single-cell RNA sequencing were analysed. Co-expression of at least two atypical SLC transcripts was identified in 9693 of the total 10 289 cells analysed. Twenty-one of the atypical SLCs were found to be significantly co-expressed with other atypical SLCs (figure 5). Mfsd1, Mfsd4b, Mfsd5, Mfsd6l, Mfsd9, Mfsd13a, Sv2b and Svop were not detected in the analysis, probably due to the relatively shallow sequencing depth or the cut-off values used. There are three different Mfsd7 genes (Mfsd7a-c) in mice corresponding to human MFSD7, but only Mfsd7c was found in the dataset. Some genes were co-expressed with several other genes, like Mfsd11, which was co-expressed with all studied atypical transcripts except Mfsd14b and Cln3. Others showed more restricted co-expression, like Mfsd14b, which only co-localized with Mfsd8, Mfsd10 and Mfsd12. The sequentially similar Mfsd2a and Mfsd2b displayed complementary co-expression, and together they were co-expressed with all found atypical SLCs except Sv2a and Sv2c. The three Spns genes supplemented each other, and together they were expressed in the same cells as all other genes except Mfsd14b (figure 5). Regarding AMTFs, Mfsd10 showed extensive co-expression with 12 other genes, while its family members were more restricted; Mfsd14a was co-expressed with eight other genes and Mfsd14b with only three, while Mfsd9 was not detected at all. Some of the co-expressions were found in only a few cells, like Unc93a, for which each interaction was found in only 1–2 cells (figure 5). Among the more frequently found co-expressions were Cln3 together with Mfsd10, Mfsd11, Mfsd12 or Sv2a, with co-expression in more than 3000 cells (figure 5).
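The counting step behind such a co-expression map can be sketched in Python as follows. This is only an illustration under stated assumptions: the detection cut-off `min_count`, the function names and the toy count data are hypothetical, and the sketch does not reproduce the significance testing or filtering used in the analysis.

```python
from itertools import combinations


def coexpression_counts(cells, min_count=1):
    """Count, for every gene pair, the cells in which both genes are detected.

    cells: list of dicts mapping gene name -> transcript count in one cell.
    min_count: assumed detection cut-off (hypothetical value).
    """
    pair_counts = {}
    for counts in cells:
        detected = sorted(g for g, c in counts.items() if c >= min_count)
        for pair in combinations(detected, 2):
            pair_counts[pair] = pair_counts.get(pair, 0) + 1
    return pair_counts


def cells_with_coexpression(cells, min_count=1):
    """Number of cells in which at least two genes of interest are detected."""
    return sum(
        1 for counts in cells
        if sum(c >= min_count for c in counts.values()) >= 2
    )


# Made-up counts for four cells (not the 10X Genomics data).
toy_cells = [
    {"Mfsd8": 3, "Mfsd10": 1, "Mfsd12": 0},
    {"Mfsd8": 2, "Mfsd10": 0, "Mfsd12": 5},
    {"Mfsd8": 1, "Mfsd10": 4, "Mfsd12": 2},
    {"Mfsd8": 0, "Mfsd10": 0, "Mfsd12": 2},
]
pairs = coexpression_counts(toy_cells)
n_coexpressing = cells_with_coexpression(toy_cells)
```

Gene names within each pair are sorted as strings, so the key for Mfsd8 and Mfsd10 is `("Mfsd10", "Mfsd8")`. Binning the resulting counts into ranges would give the colour classes of a map like figure 5.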
Figure 5.

Single cells co-express atypical SLC transcripts. Single-cell RNA sequencing data from a mouse embryo brain were retrieved from 10X Genomics, and analysed to study co-expression of atypical SLC genes in single cells. Twenty-one atypical SLCs were co-expressed in various combinations in 9693 cells. In the figure, boxes represent co-expression, and colours represent the number of co-expressing cells. Purple corresponds to 0–100 co-expressing cells, light blue to 100–200 cells, dark blue to 201–300 cells, green to 301–500 cells, yellow to 501–1000 cells, orange to 1001–2000 cells and red to more than 2001 co-expressing cells.

To supplement the co-localization and to detect probable interactions at the protein level, an in situ proximity ligation assay was run. As Mfsd11 was the gene most commonly found to be co-expressed at the transcript level (figure 6a), a subset of its combinations was selected and tested. In all selected combinations, interaction signals were identified, but to different degrees, confirming that co-expressed RNA transcripts were found at the protein level (figure 6b). Even a gene such as Mfsd9, which was not found to be co-expressed in the RNA sequencing, was found in proximity to other atypical SLCs at the protein level (figure 6c).
Figure 6.

Verification of co-expression at protein level. In situ PLA was run on mouse brain sections to study interaction between certain atypical SLC proteins, to verify the single-cell RNA sequencing. (a) Mfsd11 was co-expressed with several atypical SLCs using the RNA sequencing dataset. (b) The co-expression of the corresponding proteins was also detected at protein levels using in situ PLA. Some atypical SLCs were not detected in the single-cell RNA sequencing data, likely due to low transcript detection. However, interactions for those proteins were still found using in situ PLA. (c) Protein–protein interactions detected by PLA between MFSD7, which was not found on transcript level, and its closely related proteins MFSD8, MFSD10, MFSD14A and MFSD14B are shown here.

Transcriptional changes upon amino acid starvation

Mouse hypothalamic N25/2 cells were deprived of amino acids for 1–16 h, followed by gene expression analysis. The alterations in gene expression for the atypical SLCs are depicted in a heat map (figure 7), with corresponding log2 differences listed in table 4. All genes were affected at all times, except Mfsd6l after 16 h, Mfsd9 after 1 h and Mfsd13a after 5 h. Mfsd2a and Spns2 were reduced throughout the experiment, whereas Mfsd11 and one of the Cln3 probes showed increased expression (figure 7). Mfsd8 was reduced up to 5 h, after which the expression was enhanced. The opposite pattern was seen for both Unc93a probes on the array, with upregulation during the first 5 h, followed by reduction after 16 h (figure 7). The duplicated probes for Unc93a and Cln3 on the array are probable splice variants listed under different accession numbers, and the pairs of probes follow the same trend in expression change. At 5 h, adjusted p-values were calculated, showing significant reduction of Mfsd2a (adj. p = 0.00041), while the Mfsd1 (adj. p = 0.0029), Mfsd11 (adj. p = 0.00003) and one Cln3 (adj. p = 0.00007) genes were upregulated (adjusted p-values listed in table 4).
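For readers less used to log2 differences, the short Python sketch below shows how such a value relates to a fold change. It assumes, as is common for normalized microarray data, that the log2 difference is the mean log2 intensity of starved cells minus that of control cells; the function names and the intensity values are hypothetical, while the two log2 differences are taken from table 4.

```python
from statistics import mean


def log2_difference(starved_log2, control_log2):
    """Mean log2 intensity of starved samples minus that of controls."""
    return mean(starved_log2) - mean(control_log2)


def fold_change(log2_diff):
    """Convert a log2 difference to a linear fold change."""
    return 2 ** log2_diff


# Hypothetical log2 intensities for one probe (three replicates each).
example_diff = log2_difference([8.1, 8.3, 8.2], [7.6, 7.5, 7.7])

# Values from table 4: Mfsd11 at 5 h and Mfsd2a at 2 h.
mfsd11_fc = fold_change(0.65)    # roughly a 1.6-fold increase
mfsd2a_fc = fold_change(-1.21)   # expression reduced to less than half
```

A log2 difference of 0 corresponds to a fold change of 1, that is, no change between starved and control cells.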
Figure 7.

Transcription levels of atypical SLCs are changed upon complete amino acid starvation. Mouse hypothalamic N25/2 cells were deprived of all amino acids for 1, 2, 3, 5 and 16 h, followed by microarray analysis to study transcriptional changes [60]. The data accession number is GSE61402. Genesis version 1.7.6 was used to generate the heat map, which depicts the log2 difference between starved and control cells at each time point. Green depicts downregulation while red corresponds to upregulated expression, where larger changes correlate with stronger colour intensity. Note that for Cln3 and Unc93a, two probes were identified corresponding to the human proteins, and both were included in the analysis.

Table 4.

Results from amino acid starvation on N25/2 mouse hypothalamic cells [60]. Asterisk indicates significantly changed expressions.

gene      probe ID   log2 1 h  log2 2 h  log2 3 h  log2 5 h  adj. p-value (5 h)  log2 16 h
Mfsd1     10492499   −0.17     0.11      0.12      0.34      0.00290*            0.15
Mfsd2a    10516064   −0.36     −1.21     −1.01     −0.86     0.00041*            −0.85
Mfsd2b    10399314   0.08      0.09      0.16      −0.01     0.97988             0.09
Mfsd3     10424991   0.09      0.05      0.18      −0.15     0.25122             −0.26
Mfsd4a    10357660   0.02      −0.14     −0.05     −0.13     0.26274             −0.17
Mfsd5     10427162   −0.14     −0.23     0.18      0.04      0.76622             −0.17
Mfsd6     10354506   −0.07     −0.05     −0.23     0.06      0.69534             0.91
Mfsd6l    10377308   −0.19     0.10      −0.10     0.02      0.92248             0.00
Mfsd7a    10532169   0.19      −0.03     0.03      −0.11     0.58350             −0.05
Mfsd8     10497944   −0.36     −0.10     −0.20     −0.08     0.57402             0.36
Mfsd9     10354220   0.00      −0.01     −0.04     −0.09     0.48358             −0.21
Mfsd10    10529410   −0.20     −0.08     0.01      −0.01     0.94898             0.33
Mfsd11    10382852   0.10      0.41      0.47      0.65      0.00003*            0.34
Mfsd12    10365104   −0.01     0.10      0.10      −0.21     0.16511             −0.62
Mfsd13a   10463632   0.09      −0.08     −0.30     0.00      0.99228             −0.12
Mfsd14a   10501676   −0.36     −0.13     −0.12     0.07      0.68940             0.44
Mfsd14b   10410173   −0.09     0.09      −0.05     0.04      0.86899             0.33
Sv2a      10494372   0.21      −0.07     −0.08     −0.12     0.38332             −0.42
Sv2b      10564646   −0.21     −0.03     0.29      −0.01     0.95483             0.09
Sv2c      10411274   −0.10     −0.05     0.12      0.02      0.93726             0.01
Svop      10532784   −0.06     0.30      0.21      0.04      0.85594             −0.09
Svopl     10544017   0.05      0.01      0.12      0.15      0.19576             0.13
Spns1     10567838   0.17      0.08      0.34      0.30      0.04250             0.20
Spns2     10388194   −0.02     −0.22     −0.08     −0.12     0.40316             −0.07
Spns3     10388211   0.13      −0.12     0.12      −0.05     0.81915             −0.16
Unc93a    10447634   0.12      0.53      0.48      0.06      0.89752             −0.50
Unc93a    10447904   0.16      0.57      0.48      0.04      0.92946             −0.88
Unc93b1   10460237   0.02      −0.21     0.01      −0.07     0.61249             −0.02
Cln3      10557434   0.11      0.15      0.65      0.78      0.00007*            0.24
Cln3      10567964   −0.09     0.08      0.11      −0.18     0.12644             −0.41

Discussion

Here we investigated the characteristics of 29 novel predicted transporters, denoted atypical SLCs, to get a comprehensive understanding of their phylogenetic interrelations, family clustering, protein structures, co-expression and how they responded to altered amino acid levels. With phylogenetic trees, we elucidated the interrelations between the atypical SLCs alone, and how they group among the known SLCs of MFS type. Upon closer inspection, the two phylogenetic trees provided similar, but not identical, results. UNC93A, for example, clustered with MFSD11 and UNC93B1 in figure 1, and closest to MFSD12 in figure 2. There could be several reasons for this discrepancy. First, we used different programs for tree calculations. MrBayes is a good tool for small-to-medium alignments, but for larger and more complex datasets, other methods, like the likelihood method implemented in RAxML [69], have to be used. Here, the main reasons for differences lie within the tree-searching algorithms. With the more advanced and computationally intensive models implemented in MrBayes, it is only possible to investigate a smaller proportion of the total number of possible trees than with RAxML. In addition, the more stringent models implemented in MrBayes will not converge in reasonable time for more complex datasets. Second, as more sequences were included when compiling figure 2, there were larger variations, resulting in a less accurate starting alignment. This is why the tree in figure 1 was considered most accurate and primarily used for family clustering, while the second figure showed that the atypical SLCs cluster with SLCs. The atypical SLCs are probably SLC proteins, but most are still orphans with regard to function. Therefore, they were divided into AMTF families instead of using the existing SLC nomenclature. This highlights that the proteins are possible transporters, but that their function remains to be elucidated. 
Once their functions are determined, they can be renamed according to the SLC root system, which could result in 64 SLC families instead of the present 52 SLC families.

In general, proteins within an SLC family usually share mechanism and substrate profiles [85], although exceptions to this rule can be observed. Most proteins in the AMTFs are not well studied, but there seem to be both similarities and differences within the families. AMTF1 (MFSD9, MFSD10, MFSD14A and MFSD14B) and AMTF8 (MFSD2A and MFSD2B) are examples of similarities and dissimilarities. In AMTF1, MFSD10 is identified at both the plasma [47] and intracellular membranes [36], while MFSD14A and MFSD14B have only known intracellular expression [33]. MFSD8, which shares a branching node with the AMTF1 proteins, is also intracellular [27]. Therefore, it is likely that MFSD9 also has an intracellular location. This hypothesis was strengthened as we detected interaction between MFSD9 and MFSD8, MFSD10, MFSD14A and MFSD14B using in situ PLA. This means that MFSD9 is located within 40 nm of these intracellular proteins. Regarding their substrates, they are believed to differ, as MFSD10 transports organic ions [36], while MFSD14A is suggested to be a sugar transporter as it shares several structural characteristics with known sugar transporters [48]. MFSD14B is a predicted sugar transporter due to its high sequence identity (67.7%) to MFSD14A. However, similar response patterns to amino acid deprivation were found, where small changes were detected until 5 h for all four members, followed by upregulation of all but Mfsd9 after 16 h. If we instead consider AMTF8, both MFSD2A and MFSD2B are localized to the endoplasmic reticulum [37], while MFSD2A is also detected in the plasmalemma [42]. As they are nearly 40% identical, it is likely that they share a substrate and mechanism, and as MFSD2A transports lipids in a sodium-dependent manner [43], it is possible that MFSD2B does so as well. 
The genes were expressed together in some cells, and their combined transcripts were found with all atypical SLCs, except the Sv2s, suggesting they could have similar effects. Mfsd2a was co-expressed with 14 atypical SLCs, while Mfsd2b was co-expressed with 12 genes, and they shared co-expression with seven genes. This suggests that MFSD2B could function as the back-up system for MFSD2A in specific cells or that it may have a more direct and specific function. They responded differently to amino acid starvation, where Mfsd2a was significantly reduced, while Mfsd2b remained unaffected. It is possible that Mfsd2b functions as a housekeeping gene, and hence lacks alteration upon diet change. On the other hand, Mfsd2a could have a direct function in energy balance, and is therefore found to be affected by starvation. Taken together, there are both similarities and differences between AMTF members, and it is not yet possible to elucidate their expression or functions, but the family clusters are good suggestions on which further investigations can be based.

To understand how single cells maintain their homeostasis, preserve ion balances, keep optimal sugar levels and so on, we must figure out which transporters are expressed together. By studying single-cell transcriptomes, we identified genes that seem to be co-expressed with several other atypical SLCs, like Mfsd8, Mfsd11 and Mfsd12, suggesting that they are needed for basic maintenance, while other genes displayed more restricted co-expression, like Mfsd14b and Unc93a. In the RNA sequencing analysis, there were approximately 42 000 reads per cell, meaning that low-expressed genes are probably missing from the dataset. This is why undetected but anticipated co-expressed transcripts, like Mfsd9, could still be found as an interacting partner of other proteins in vitro. There were detectable PLA signals even though the corresponding genes were not present in the sorted RNA dataset. 
This can be explained by the fact that low levels of mRNA can result in high protein translation in mammalian cells [88]. In many cases, mRNA and protein levels do not correlate completely because of different regulation controls. From the experiments we conclude that if genes were co-expressed according to the RNA sequencing, they were indeed found in the same cell. However, we cannot deduce anything about the interactions that were not found; even if Mfsd2b has fewer gene co-expressions than Mfsd2a, transcripts could have been missed. For the in situ PLA, interactions were considered accurate and as confirmation that the proteins co-exist in the same cell, but comparisons between protein combinations were not performed. If we were able to understand the complete transporter co-expression map, it would facilitate the understanding of pharmacokinetics and human diseases.

Most MFS proteins are similar in structure [17], despite their relatively low sequence identities. Therefore, we found it convincing that the predictions of atypical SLCs containing 12 TMS were accurate. This was in accordance with previous publications describing the structure of some atypical SLCs based on other topology prediction tools [27,29,30] or homology models [19,26]. As the predictions for MFSD13A, SPNS3 and CLN3 did not support our hypothesis, we built homology models to verify their predicted structures. When building homology models, the sequences were aligned against a structurally known MFS protein, providing higher reliability to the model than the prediction tool, which is based only on amino acid sequences. This is why we feel confident in suggesting that MFSD13A and SPNS3 have 12 TMS each. Interestingly, we identified only 11 TMS for CLN3 using CCtop and homology modelling, while previous reports have postulated conflicting results [31], where a six TMS protein is seemingly accepted [31,32,86,87]. However, the six TMS predicted differ between the previous publications [31]. 
We have identified all previously predicted TMS, and additionally two regions, TMS 8 and 11, which have not been suggested so far. To our knowledge, no homology models have previously been built for CLN3. Since it does not belong to any Pfam clan, but is clustered as a member of the MFS superfamily according to the Transporter Classification Database [24], and because it shared between 10 and 20% sequence identity with many MFS proteins, we decided to align it against an MFS template. As the predicted TMS corresponded with those found by CCtop, we considered it a reliable three-dimensional model. Therefore, we deviate from previous reports, and propose an 11 TMS structure for CLN3. Among SLCs belonging to other Pfam clans, 11 TMS is a common structure (e.g. the SLC38 family belongs to the APC Pfam clan, and its members are all predicted to contain 11 TMS [89]). It is thus possible for an atypical SLC to have such a structure.

Since the atypical SLCs phylogenetically group among SLCs of MFS type, share the MFS transporter topology and are affected by complete amino acid deprivation in cell cultures, it is likely that these proteins are novel transporters. As there has been a call for systematic research on transporters [6], we suggest that the atypical SLCs should be included in this. They could interact with drugs and be associated with diseases.
  86 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

Review 2.  Understanding transport by the major facilitator superfamily (MFS): structures pave the way.

Authors:  Esben M Quistgaard; Christian Löw; Fatma Guettou; Pär Nordlund
Journal:  Nat Rev Mol Cell Biol       Date:  2016-01-13       Impact factor: 94.444

3.  An extended proteome map of the lysosomal membrane reveals novel potential transporters.

Authors:  Agnès Chapel; Sylvie Kieffer-Jaquinod; Corinne Sagné; Quentin Verdon; Corinne Ivaldi; Mourad Mellal; Jaqueline Thirion; Michel Jadot; Christophe Bruley; Jérôme Garin; Bruno Gasnier; Agnès Journet
Journal:  Mol Cell Proteomics       Date:  2013-02-24       Impact factor: 5.911

4.  Direct observation of individual endogenous protein complexes in situ by proximity ligation.

Authors:  Ola Söderberg; Mats Gullberg; Malin Jarvius; Karin Ridderstråle; Karl-Johan Leuchowius; Jonas Jarvius; Kenneth Wester; Per Hydbring; Fuad Bahram; Lars-Gunnar Larsson; Ulf Landegren
Journal:  Nat Methods       Date:  2006-10-29       Impact factor: 28.547

Review 5.  Evolutionary origin of amino acid transporter families SLC32, SLC36 and SLC38 and physiological, pathological and therapeutic aspects.

Authors:  Helgi B Schiöth; Sahar Roshanbin; Maria G A Hägglund; Robert Fredriksson
Journal:  Mol Aspects Med       Date:  2013 Apr-Jun

6.  Characterization of mouse synaptic vesicle-2-associated protein (Msvop) specifically expressed in the mouse central nervous system.

Authors:  Eun Young Cho; Chae Jin Lee; Keun Su Son; Yoo Jung Kim; Sun Jung Kim
Journal:  Gene       Date:  2008-11-01       Impact factor: 3.688

7.  Mfsd2a is a transporter for the essential omega-3 fatty acid docosahexaenoic acid.

Authors:  Long N Nguyen; Dongliang Ma; Guanghou Shui; Peiyan Wong; Amaury Cazenave-Gassiot; Xiaodong Zhang; Markus R Wenk; Eyleen L K Goh; David L Silver
Journal:  Nature       Date:  2014-05-14       Impact factor: 49.962

8.  Profiling solute carrier transporters in the human blood-brain barrier.

Authors:  E G Geier; E C Chen; A Webb; A C Papp; S W Yee; W Sadee; K M Giacomini
Journal:  Clin Pharmacol Ther       Date:  2013-09-05       Impact factor: 6.875

9.  The Novel Membrane-Bound Proteins MFSD1 and MFSD3 are Putative SLC Transporters Affected by Altered Nutrient Intake.

Authors:  Emelie Perland; Sofie V Hellsten; Emilia Lekholm; Mikaela M Eriksson; Vasiliki Arapi; Robert Fredriksson
Journal:  J Mol Neurosci       Date:  2016-12-16       Impact factor: 3.444

10.  Major facilitator superfamily domain-containing protein 2a (MFSD2A) has roles in body growth, motor function, and lipid metabolism.

Authors:  Justin H Berger; Maureen J Charron; David L Silver
Journal:  PLoS One       Date:  2012-11-29       Impact factor: 3.240
