| Literature DB >> 34925370 |
Jacob Bauss1, Michele Morris2, Rama Shankar1, Rosemary Olivero1,3, Leah N Buck1,4, Cynthia L Stenger4, David Hinds1,2, Joshua Mills5, Alexandra Eby6, Joseph W Zagorski7, Caitlin Smith8, Sara Cline8, Nicholas L Hartog1,9, Bin Chen1,10, John Huss11, Joseph A Carcillo12, Surender Rajasekaran1,7,13, Caleb P Bupp1,14, Jeremy W Prokop1,10.
Abstract
In the age of genomics, public understanding of complex scientific knowledge is critical. To combat reductionistic views, it is necessary to generate and organize educational material and data that keep pace with advances in genomics. The view that CCR5 is solely the receptor for HIV gave rise to demand to remove the gene in patients to create host HIV resistance, underestimating the broader roles and complex genetic inheritance of CCR5. A program aimed at providing research projects to undergraduates, known as CODE, has been expanded to build educational material for genes such as CCR5 in a rapid approach, exposing students and trainees to large bioinformatics databases and previous experiments for broader data to challenge commitment to biological reductionism. Our students organize expression databases, query environmental responses, assess genetic factors, generate protein models/dynamics, and profile evolutionary insights into a protein such as CCR5. The knowledgebase generated in the initiative opens the door for public educational information and tools (molecular videos, 3D printed models, and handouts), classroom materials, and strategy for future genetic ideas that can be distributed in formal, semiformal, and informal educational environments. This work highlights that many factors are missing from the reductionist view of CCR5, including the role of missense variants or expression of CCR5 with neurological phenotypes and the role of CCR5 and the delta32 variant in complex critical care patients with sepsis. When connected to genomic stories in the news, these tools offer critically needed Ethical, Legal, and Social Implication (ELSI) education to combat biological reductionism.Entities:
Keywords: CCR5; educational material generation; evolutionary profiling; expression analysis; microglia; molecular dynamic simulations; viral infections
Mesh:
Substances:
Year: 2021 PMID: 34925370 PMCID: PMC8674737 DOI: 10.3389/fimmu.2021.790041
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Human CCR5 paralogs. (A) Phylogenetic tree of the top 100 human paralogs for CCR5 protein. Values at each node represent the percent of trees that cluster out of 500 bootstrap analyses. CCR5 is marked with a red box.(B) Model of human CCR5 with conservation colored. The transmembrane is colored cyan, amino acids conserved >90% in 100 GPCRs in red, conserved >90% in 16 most related CCR5 paralogs in orange, and conserved >90% in CCR5 vertebrate orthologs in yellow. The top left shows the view of CCR5 from extracellular space looking into the GPCR. The top right shows the side view with the transmembrane to visualize exposed and conserved residues, which are labeled. The bottom shows a 180° rotation of side view of CCR5.
Top functional amino acids of CCR5 from amino acid knowledgebase.
| AA | Codon | AA | Inclusion | CCR5 Conserved (%) | GPCR Conserved | Top 16 GPCR (E<1E-50) | Secondary Structure | RMSF (Å) | mds DCCM >0.9 |
|---|---|---|---|---|---|---|---|---|---|
| 3 | TAT | Y | PTM | 82.65 | 2.02 | 0.00 | – | – | – |
| 6 | TCA | S | PTM | 78.57 | 14.14 | 13.33 | C | 12.161 | 2 |
| 7 | AGT | S | PTM | 80.61 | 27.27 | 6.67 | C | 9.246 | 2 |
| 10 | TAT | Y | PTM | 80.61 | 4.04 | 13.33 | T | 4.615 | 1 |
| 14 | TAT | Y | PTM | 98.98 | 6.06 | 33.33 | T | 3.63 | 0 |
| 15 | TAT | Y | PTM | 19.39 | 4.04 | 20.00 | T | 4.047 | 1 |
| 16 | ACA | T | PTM | 27.55 | 3.03 | 6.67 | C | 2.959 | 1 |
| 17 | TCG | S | PTM | 69.39 | 7.07 | 33.33 | C | 3.384 | 0 |
| 20 | TGC | C | PTM | 98.97 | 11.11 | 53.33 | C | 1.325 | 0 |
| 46 | GTG | V | Top 10 | 97.96 | 19.19 | 40.00 | H | 1.786 | 4 |
| 55 | CTG | L | Top 10 | 100.00 | 23.23 | 60.00 | H | 1.738 | 10 |
| 60 | AGG | R | Top 10 | 100.00 | 25.25 | 33.33 | C | 2.658 | 8 |
| 63 | AGC | S | Top 10 | 94.90 | 21.21 | 46.67 | C | 1.85 | 8 |
| 73 | GCC | A | Top 10 | 97.96 | 82.83 | 80.00 | H | 1.229 | 6 |
| 101 | TGT | C | PTM | 100.00 | 100.00 | 100.00 | H | 1.202 | 7 |
| 106 | GGG | G | Top 10 | 91.75 | 13.13 | 46.67 | H | 1.277 | 4 |
| 121 | CTC | L | Top 10 | 100.00 | 13.13 | 40.00 | H | 1.262 | 5 |
| 131 | GTC | V | Top 10 | 100.00 | 52.53 | 80.00 | T | 2.255 | 0 |
| 178 | TGC | C | PTM | 100.00 | 10.10 | 53.33 | E | 1.055 | 1 |
| 223 | CGG | R | Top 10 | 89.80 | 20.20 | 40.00 | C | 3.459 | 0 |
| 269 | TGC | C | PTM | 98.97 | 47.47 | 93.33 | H | 1.181 | 2 |
| 288 | ACG | T | Top 10 | 88.78 | 9.09 | 26.67 | H | 1.222 | 1 |
| 321 | TGC | C | PTM | 89.69 | 2.02 | 13.33 | C | 3.514 | 0 |
| 323 | TGC | C | PTM | 26.80 | 0.00 | 0.00 | C | 5.452 | 4 |
| 324 | TGT | C | PTM | 97.94 | 2.02 | 13.33 | C | 6.36 | 5 |
| 336 | AGC | S | PTM | 83.67 | 13.13 | 40.00 | C | 9.765 | 4 |
| 337 | TCA | S | PTM | 100.00 | 22.22 | 46.67 | C | 9.244 | 3 |
| 342 | TCC | S | PTM | 100.00 | 27.27 | 40.00 | T | 7.834 | 1 |
| 349 | TCT | S | PTM | 98.98 | 13.13 | 26.67 | C | 4.292 | 1 |
Sites were included based on known posttranslational modifications (PTMs) or for being a top 10 scoring variant.
Figure 2Amino acid knowledgebase of CCR5 used for educational insights. CODE students generated protein models, which were embedded into a lipid membrane and run for molecular dynamics simulations. These values were combined with multiple species analysis of CCR5 evolution and genomic variant extractions from gnomAD, TOPmed, COSMIC, and ClinVar. This amino acid knowledgebase was then used to assess the delta 32 variant and generate various educational handouts, videos, and 3D printed models.
Figure 3Genomic architecture around CCR5. The genome browser view (hg38) of around one million bases near CCR5 (A) or zoomed into around 40,000 bases (B). Tracks shown include the Gencode transcripts, known variants from GWAS/OMIM/ClinVar, the high-risk COVID-19 loci (higher dots are the strongest signal), various gene regulation insights (ENCODE cCTEs, H3K27Ac, CpG Islands), and evolutionary conservation (Cons 100 Verts).
Traits associated with variants within the CCR5 gene region (chr3:46,353,419-46,409,888, hg38) of .
| Study ID | Trait | P-value | Beta | Publication |
|---|---|---|---|---|
| GCST004433 | Macrophage inflammatory protein 1b levels | 7.57E-115 | 0.4985 | PMID:27989323 |
| GCST90002340 | Monocyte count | 1.11E-75 | 0.035585 | PMID:32888493 |
| NEALE2_30190_raw | Monocyte percentage | 4.35951E-50 | 0.0963857 | UKB Neale v2 |
| GCST004609 | Monocyte percentage of white cells | 1.051E-33 | 0.04394357 | PMID:27863252 |
| NEALE2_6149_1 | Mouth ulcers | mouth/teeth dental problems | 4.03372E-32 | 0.093964676 | UKB Neale v2 |
| GCST004608 | Granulocyte percentage of myeloid white cells | 2.358E-25 | -0.03789086 | PMID:27863252 |
| NEALE2_30130_raw | Monocyte count | 1.90975E-22 | 0.00516017 | UKB Neale v2 |
| GCST004625 | Monocyte count | 5.757E-22 | 0.03506334 | PMID:27863252 |
| GCST004438 | Monocyte chemoattractant protein-1 levels | 1.05E-19 | 0.2902 | PMID:27989323 |
| GCST90002292 | Basophil count | 4.85E-17 | -0.018406 | PMID:32888493 |
| GCST90002316 | Lymphocyte counts | 4.44E-14 | 0.021045 | PMID:32888493 |
| NEALE2_30300_raw | High light scatter reticulocyte count | 7.77011E-14 | 0.000252296 | UKB Neale v2 |
| NEALE2_30180_raw | Lymphocyte percentage | 2.13306E-11 | 0.159232 | UKB Neale v2 |
| NEALE2_30120_raw | Lymphocyte count | 2.24757E-10 | 0.0233973 | UKB Neale v2 |
| NEALE2_30290_raw | High light scatter reticulocyte percentage | 3.2664E-10 | 0.00704487 | UKB Neale v2 |
| GCST003045 | Ulcerative colitis [EA] | 1.3229E-08 | 0.0757873 | PMID:26192919 |
| NEALE2_6149_100 | None of the above | mouth/teeth dental problems | 2.22533E-08 | -0.027486511 | UKB Neale v2 |
| NEALE2_30150 | Eosinophill count | 2.81799E-08 | 0.00995217 | UKB Neale v2 |
| NEALE2_30260_raw | Mean reticulocyte volume | 7.16666E-08 | -0.137991 | UKB Neale v2 |
| NEALE2_30250_raw | Reticulocyte count | 8.74646E-08 | 0.000690186 | UKB Neale v2 |
Top eQTL for CCR5 expression.
| Tissue | SNPs | Lowest P-Value | NES | rsID |
|---|---|---|---|---|
| Whole Blood | 206 | 1.1E-09 | -0.24 | rs76258812 |
| Lung | 123 | 3.7E-06 | -0.14 | rs9110 |
| Brain - Caudate (basal ganglia) | 10 | 1.3E-05 | 1.1 | rs9862021 |
| Brain - Cortex | 2 | 1.9E-05 | -1.6 | rs140177427 |
| Skin - Sun Exposed (Lower leg) | 3 | 1.9E-05 | 0.17 | rs1388604 |
| Esophagus - Mucosa | 47 | 2.1E-05 | -0.17 | rs202207288 |
| Esophagus - Muscularis | 26 | 2.4E-05 | 0.16 | rs2133660 |
| Colon - Sigmoid | 114 | 2.7E-05 | -0.39 | rs6765904 |
| Skin - Not Sun Exposed (Suprapubic) | 51 | 2.8E-05 | -0.18 | rs9872946 |
| Nerve - Tibial | 4 | 1.0E-04 | -0.71 | rs80257961 |
Figure 4Expression of CCR5 in 1,063 mouse single-cell datasets. (A) PanglaoDB analysis of experiments that CCR5 was detected in various cells based on single-cell analysis. (B, C) The number of times sample types showed CCR5 expression for macrophages/monocytes (B) or microglia (C).
Mouse knock-out phenotypes.
| Phenotype | Publication | Neurological |
|---|---|---|
| abnormal astrocyte physiology | ( | Yes |
| abnormal glial cell physiology | ( | Yes |
| abnormal long term spatial reference memory | ( | Yes |
| abnormal nervous system physiology | ( | Yes |
| abnormal spatial learning | ( | Yes |
| decreased microglial cell activation | ( | Yes |
| abnormal CD4-positive, alpha beta T cell morphology | ( | |
| abnormal CD8-positive, alpha beta T cell morphology | ( | |
| abnormal cytokine level | ( | |
| abnormal hepatocyte physiology | ( | |
| abnormal Ito cell morphology | ( | |
| abnormal Kupffer cell morphology | ( | |
| abnormal locomotor behavior | ( | |
| abnormal NK T cell physiology | ( | |
| decreased NK cell number | ( | |
| decreased susceptibility to induced colitis | ( | |
| decreased susceptibility to Retroviridae infection | ( | |
| impaired macrophage chemotaxis | ( | |
| increased NK T cell number | ( | |
| increased susceptibility to fungal infection | ( | |
| liver failure | ( |
Figure 5Human brain CCR5 and microglia. (A) The correlation of genes to CCR5 from the Allen Brain Atlas Human Brain microarray data for 500 samples relative to their expression in mouse microglia single-cell experiments. The x-axis shows the genes R2 from microarray relative to CCR5 expression, while the y-axis shows the number of experiments the gene is detected in mouse single-cell datasets for microglia. (B) STRING protein network for genes in panel (A) showing the enrichment of synapse pruning (green), microglia/glioblastoma (red), and microglia/aging (blue) genes. (C) Expression of CCR5 in transcripts per million (TPM) from seven BioProjects of human microglia cells or related experiments.
Figure 6CCR5 protein network highlighting microglia factors. The STRING protein network for the top 50 proteins interacting with CCR5. In green/yellow are various genes annotated to chemokine biology, and in red/blue are genes connected to microglia biology.
Neurological phenotypes for the top human CCR5 missense variants from Geno2MP.
| Variant | Het | Hom | Phenotype |
|---|---|---|---|
| L55Q | 7 | 1 | Abnormality of brain morphology |
| L55Q | 4 | 0 | Epileptic encephalopathy |
| L55Q | 4 | 0 | Abnormality of nervous system physiology |
| L55Q | 3 | 0 | Neurodevelopmental abnormality |
| R223Q | 3 | 0 | Intellectual disability |
| L55Q | 2 | 0 | Microcephaly |
| L55Q | 2 | 0 | Dystonia |
| L55Q | 2 | 0 | Fatigable weakness |
| R223Q | 2 | 0 | Agenesis of corpus callosum |
| R223Q | 2 | 0 | Abnormality of brain morphology |
| R223Q | 2 | 0 | Cerebral cortical atrophy |
| R223Q | 2 | 0 | Abnormality of nervous system morphology |
| A73V | 2 | 0 | Epileptic encephalopathy |
| S63C | 2 | 0 | Intellectual disability |
| L55Q | 1 | 0 | Autism, Intellectual disability |
| L55Q | 1 | 0 | Spastic paraplegia |
| L55Q | 1 | 0 | Abnormality of hindbrain morphology |
| L55Q | 1 | 0 | Seizures |
| L55Q | 1 | 0 | Abnormality of nervous system morphology |
| L55Q | 1 | 0 | Intellectual disability |
| L55Q | 1 | 0 | Global developmental delay, Autism |
| L55Q | 1 | 0 | Global developmental delay |
| L55Q | 1 | 0 | Abnormality of the nervous system |
| R223Q | 1 | 0 | Behavioral abnormality |
| R223Q | 1 | 0 | Seizures |
| R223Q | 1 | 0 | Neurodegeneration |
| R223Q | 1 | 0 | Abnormality of hindbrain morphology |
| R223Q | 1 | 0 | Seizures |
| R223Q | 1 | 0 | Epileptic encephalopathy |
| R223Q | 1 | 0 | Abnormality of nervous system physiology |
| R223Q | 1 | 0 | Intellectual disability |
| R223Q | 1 | 0 | Intellectual disability |
| R223Q | 1 | 0 | Global developmental delay |
| A73V | 1 | 0 | Intellectual disability |
| A73V | 1 | 0 | Agenesis of corpus callosum |
| A73V | 1 | 0 | Abnormality of hindbrain morphology |
| A73V | 1 | 0 | Abnormality of brain morphology |
| A73V | 1 | 0 | Intellectual disability |
| V131F | 1 | 0 | Abnormality of hindbrain morphology |
| V131F | 1 | 0 | Abnormality of brain morphology |
| V131F | 1 | 0 | Fatigable weakness |
| V131F | 1 | 0 | Abnormality of nervous system physiology |
| S63C | 1 | 0 | Spastic paraplegia |
| S63C | 1 | 0 | Microcephaly |
| S63C | 1 | 0 | Abnormality of brain morphology |
| S63C | 1 | 0 | Seizures |
| S63C | 1 | 0 | Abnormality of movement |
The Het is the number of heterozygous individuals with the variant and the annotated phenotype and Hom are homozygous individuals with phenotype.
Figure 7CCR5 expression in blood PAXgene tube RNAseq. (A) Box and whisker plots for the expression (TPM) of CCR5 from various NCBI BioProjects that were generated by Illumina paired-end RNA-Seq from human blood collected PAXgene tube samples. Listed next to each BioProject code is the number of samples, average CCR5 expression, and standard deviation of CCR5 expression. (B) The TPM expression for each sample (x-axis) of panel (A) relative to the BioProject normalized Z-score (y-axis). The top seven samples based on Z-score are labeled. (C) The Z-scores for our three pilot precision transcriptome datasets, with outlier samples labeled. (D) The COVID-19 study analysis of CCR5 normalized expression relative to CIBERSORTx absolute CD8 T-cell values. (E) The percent of transcripts containing the delta 32 variant relative to wild type (x-axis) for samples of panel (C) relative to the BioProject normalized Z-Score. All homozygous samples for delta 32 are labeled as is the heterozygous sample with the highest study Z-score.