| Literature DB >> 36164651 |
Evandro Ferrada1, Giulio Superti-Furga1,2.
Abstract
Solute carriers are an operationally defined diverse family of membrane proteins involved in the transport of nutrients, metabolites, xenobiotics, and drugs. Here, we provide an integrative classification of solute carriers by combining evolutionary information with proteome-wide structure models recently made available through the AlphaFold resource. Analyses of orthologous relations among 455 protein-coding genes currently classified as human solute carriers, over the fully sequenced genomes of 2,100 species, suggest no more than approximately 180 independent evolutionary origins. Structural comparative analyses provided further insight revealing a total of 24 structurally distinct transmembrane folds, increasing by approximately 40% the number of previously described SLC structural folds. In addition, a structural comparative analysis identified a new human solute carrier member and revealed details of noncanonical ones. Our analyses uncover new ancestral relations between solute carrier genes, provide insights into the evolution of remote homologs and a platform to test hypotheses of functional deorphanization.Entities:
Keywords: Biochemistry; Biological sciences; Classification of proteins; Evolutionary biology
Year: 2022 PMID: 36164651 PMCID: PMC9508557 DOI: 10.1016/j.isci.2022.105096
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Summary of SLC genes, orphans, and families sharing common ancestors
| SLC Families | SLC Members |
|---|---|
| orphans, 22 | MFSD9, MFSD14A, MFSD14B, SLC22A18 |
| orphan, 32, 38 | TMEM104; SLC32A1; SLC38A7; SLC38A8 |
| orphan, 59 | MFSD12; SLC59A1; SLC59A2 |
| Orphans | MFSD11; UNC93A |
| 17, 37 | SLC17A9; SLC17A5; SLC37A2; SLC37A4; SLC37A3; SLC37A1 |
| 17, 63 | SLC17A6; SLC17A7; SLC17A2; SLC17A3; SLC17A1; SLC17A4; SLC17A8; SLC63A1; SLC63A3; SLC63A2 |
| 8, 24 | SLC8B1; SLC24A4; SLC24A2; SLC24A3; SLC24A5 |
| 36, 38 | SLC36A1; SLC36A4; SLC36A3; SLC36A2; SLC38A5; SLC38A1; SLC38A2; SLC38A4; SLC38A6; SLC38A3 |
List of SLC genes from different SLC families and/or orphans sharing a common ancestor.
Figure 1Hierarchical clustering based on structural similarity between structure models of human solute carrier proteins
Structure models for 450 human SLCs were structurally aligned against each other and against models for the entire human proteome. A set of 49 experimentally solved structures were included (leaves labeled in red, and red with a green background for those that did not exactly cluster with their respective structure models). A dissimilarity matrix was built and used for a hierarchical clustering analysis based on Euclidean distance and the group average method. Branch colors were assigned to members of the same SLC families. Dashed branches were used to highlight groups of nested families suggestive of common ancestry. A scan over the entire human proteome identified new putative SLC genes close to the 35 family (highlighted with a red dot). Stars, highlight noncanonical SLCs. For an interactive, online version of this figure see: https://itol.embl.de/tree/91141792154601629031391. See also Tables S1 and S7.
Figure 2Hierarchical clustering based on structural similarity between transmembrane domains extracted from structure models of human solute carrier proteins
The transmembrane (TM) domains of 436 human SLCs, with at least 6 transmembrane (TM) segments, were structurally aligned against each other. The dendrogram summarizes a hierarchical clustering analysis based on pairwise structural similarity (STAR Methods). Branch colors and branch labels highlight clusters of relatively independent structural similarity. Colored rectangles (and legend) represent 14 CATH superfamilies annotated over the TM domains of 45 of the 60 SLC families in the dendrogram. Bars on the dendrogram tips show the number of TM segments per SLC gene, and the three blue lines indicate a scale highlighting 5, 10, and 15 TM segments. For an interactive, online version of this figure see: https://itol.embl.de/tree/911416913099241661370120. See also Tables S8 and S9.
Summary of fold classification
| Fold | Numer of SLC genes | TM segments | SLC Families/or orphans |
|---|---|---|---|
| MFS | 138 | 12 | MFSD6, UN93B, MFS6L, MFSD8, MF13A, MF14B, CLN3, MFS12, MFSD9, MF14A, MFSD1, MFS11, UN93A, SLC2, SLC15, SLC16, SLC17, SLC18, SLC18B1, SLC19, SLCO5, SLCO4C1, SLCO1B3, SLCO3, SLCO1C1, SLCO4, SLCO1, SLCO1B7, SLCO2B1, SLCO1B1, SLCO6, SLCO2, SLC22B3, SLC22B1, SLC22, SLC22B2, SLC22, SLC22B4, SLC22, SLC22B5, SLC22, SLC29, SLC33, SLC37, SLC40, SLC43, SLC45, SLC46, SLC49, SLC52, SLC59, SLC60, SLC61, SLC63 |
| LeuT | 72 | 12 | TM104, SLC5, SLC6, SLC7, SLC11, SLC12, SLC32, SLC36, SLC38 |
| MitC | 53 | 6 | SLC25 |
| DMT | 39 | 10 | SLC35E2, SLC35F4, SLC35F5, SLC35E1, SLC35, SLC35D3, SLC35G2, SLC35F1, SLC35F3, SLC35E2B, SLC35, SLC35B3, SLC35G1, SLC35B2, SLC35C1, SLC35F2, SLC35E4, SLC35F6, SLC35D1, SLC35G6, SLC35C2, SLC35G7, SLC35G4, SLC35G3, SLC35, SLC35G5, SLC35, SLC35D2, SLC35B1, SLC35B4, SLC35D4, SLC35, SLC35E3, SLC57 |
| UraA | 24 | 12 | SLC4, SLC23, SLC26 |
| NhaA | 20 | 9 | SLC9C1, SLC9C2, SLC9, SLC9B2, SLC9B1, SLC10 |
| ZIP | 14 | 6,8 | SLC39 |
| YiiP | 10 | 6 | SLC30 |
| NCX | 9 | 9,11 | SLC8, SLC8B1, SLC24 |
| Glt | 7 | 8 | SLC1 |
| AmtB | 7 | 12 | SLC14, SLC42 |
| MtN3 | 6 | 7 | SLC50, SLC66 |
| IT | 6 | 13 | P_HUMAN, SLC13 |
| SLC56 | 5 | 3 | SLC56 |
| SLC44 | 5 | 9 | SLC44 |
| SLC51 | 3 | 7 | SLC51 |
| SLC41 | 3 | 10 | SLC41 |
| MATE | 3 | 13 | SLC47, SLC62 |
| CNT2 | 3 | 10 | SLC34 |
| CNT1 | 3 | 11 | SLC28 |
| PiT | 2 | 10,12 | SLC20 |
| NPC1 | 2 | 13 | SLC65 |
| SLC64 | 1 | 6 | SLC64 |
| SLC53 | 1 | 8 | SLC53 |
A total of 436 SLC genes were classified into 24 structure folds according to structure dissimilarity and topology criteria.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Raw and analyzed data | This paper | Mendeley Data: |
| iToL interactive | This paper | |
| iToL interactive | This paper | |
| HGNC | ||
| TCDB | ||
| OPM | ||
| OMA | ||
| HMMER | ||
| Pfam | ||
| CATH | ||
| BLAST | ||
| PDB | ||
| Alpha-Fold | ||
| IUpred2 | ||
| TopMatch | ||
| STAT R package | ||
| iToL | ||
| ape R package | ||
| CCTOP | ||