| Literature DB >> 23953889 |
Scott Sherrill-Mix1, Mary K Lewinski, Marylinda Famiglietti, Alberto Bosque, Nirav Malani, Karen E Ocwieja, Charles C Berry, David Looney, Liang Shan, Luis M Agosto, Matthew J Pace, Robert F Siliciano, Una O'Doherty, John Guatelli, Vicente Planelles, Frederic D Bushman.
Abstract
BACKGROUND: HIV infection can be treated effectively with antiretroviral agents, but the persistence of a latent reservoir of integrated proviruses prevents eradication of HIV from infected individuals. The chromosomal environment of integrated proviruses has been proposed to influence HIV latency, but the determinants of transcriptional repression have not been fully clarified, and it is unclear whether the same molecular mechanisms drive latency in different cell culture models.Entities:
Mesh:
Year: 2013 PMID: 23953889 PMCID: PMC3765678 DOI: 10.1186/1742-4690-10-90
Source DB: PubMed Journal: Retrovirology ISSN: 1742-4690 Impact factor: 4.602
HIV-1 integration datasets from models of latency
| Jurkat | Jurkat cells | HIV vector pEV731(LTR-Tat-IRES-GFP) | 2 weeks | Sanger | TNF | [ | 463inducible | 643 |
| Bcl-2 transduced CD4 + | Primary CD4 +T cells (Bcl-2transduced) | HIV NL4-3- | 3 days + 3-4weeks + 3 days | Sanger | Anti-CD3, anti-CD28 antibodies, GFP expression | [ | 446inducible | 273 |
| Active CD4 + | Primary activeCD4 + T cells | HIV NL4-3 | 3 days | 454 | High vs. low Gag | [ | 1604silent | 1274 |
| Resting CD4 + | Primary restingCD4 + T cells | HIV NL4-3 | 3 days | 454 | High vs. low Gag | [ | 1942silent | 784 |
| Central Memory CD4 + | Primary centralmemory CD4 +T cells | HIV NL4-3 | 2 days/9 days | IonTorrent | Anti-CD3, anti-CD28 antibodies, GFP expression | This paper | 1729inducible | 3278 |
Genomic data available for comparison to HIV integration sites
| T cell expression | RNA-Seq | Unpublished | 1 | RNA |
| Jurkat expression | RNA-Seq | Encode [ | 1 | wgEncodeHudsonalphaRnaSeq |
| Integration sites | Locations | Unpublished | 1 | Sites |
| DNase sensitivity | DNA-Seq/peaks | Encode [ | 1 | wgEncodeOpenChromDnase |
| Methylation | DNA-Seq | [ | 1 | Methyl |
| CpG | Locations | UCSC [ | 1 | cpgIslandExt |
| Sequence-based | Continuous | — | 4 | % GC, HIV PWM score, distance to centrosome, chromosomal position |
| Repeats | Locations | UCSC [ | 16 | DNA, LINE, Low_complexity, LTR, Other, RC, RNA, rRNA, Satellite, scRNA, Simple_repeat, SINE, snRNA, srpRNA, tRNA, alphoid |
| Histone acetylation | ChIP-Seq/Peaks | [ | 18 | H2AK5ac, H2AK9ac, H2BK120ac, H2BK12ac, H2BK20ac, H2BK5ac, H3K14ac, H3K18ac, H3K23ac, H3K27ac, H3K36ac, H3K4ac, H3K9ac, H4K12ac, H4K16ac, H4K5ac, H4K8ac, H4K91ac |
| Histone methylation and other proteins | ChIP-Seq/Peaks | [ | 23 | CTCF, H2AZ, H2BK5me1, H3K27me1, H3K27me2, H3K27me3, H3K36me1, H3K36me3, H3K4me1, H3K4me2, H3K4me3, H3K79me1, H3K79me2, H3K79me3, H3K9me1, H3K9me2, H3K9me3, H3R2me1, H3R2me2, H4K20me1, H4K20me3, H4R3me2, PolII |
| Chromatin state | Binary | [ | 51 | State1,state2,…,state51 |
| HATs and HDACs | ChIP-Seq | [ | 11 | Resting-HDAC1, Resting-HDAC2, Resting-HDAC3, Resting-HDAC6, Resting-p300, Resting-CBP, Resting-MOF, Resting-PCAF, Resting-Tip60, Active-HDAC6, Active-Tip60 |
| Nucleosome | ChIP-Seq | [ | 2 | Resting-Nucleosomes, Active-Nucleosomes |
| UCSC genes | Locations | [ | 4 | In gene, in gene (same strand), gene count, distance to nearest gene, in exon, in intron |
Figure 1Correlations of genomic features and latency. Spearman rank correlation between proviral expression status and genomic features. Only genomic features with at least one correlation with latency with a false discovery rate q-value >0.01 (marked by asterisks) are shown.
Figure 2Lasso regressions predicting latency. Misclassification error from cross validation for lasso regressions of silent/inducible status on genomic features as a function of λ, the regularization coefficient for the lasso regression, for all cell culture models combined (A) and each individual cell culture model (B-F). The number of variables included and size of coefficients in the model increases to the left. Whiskers show the standard error of mean misclassification error. Dashed vertical lines indicate the minimum misclassification error and the simplest model within one standard error. Dotted horizontal line indicates the misclassification error expected from random guessing.
Figure 3Cellular expression and latency. Predictions from a logistic regression of silent/inducible status on cellular RNA expression. High y-axis values are predicted to be silent/inducible. Dotted line shows where equal odds of silent/inducible and expressed are predicted. Solid lines show predictions from the regression for each sample and shaded regions indicate one standard error from the modeled predictions.
Figure 4Strand orientation and latency. The proportion of provirus integrated in the opposite strand compared to cellular genes in silent/inducible (blue) and expressed (red) samples. Error bars show the 95% Clopper-Pearson binomial confidence interval. Dotted line shows the 0.5 proportion expected by chance.
Figure 5Genes and latency.(A) The proportion of provirus integrated outside genes in silent/inducible (blue) and expressed (red) samples. Error bars show the 95% Clopper-Pearson binomial confidence interval. (B) The nearest distance to any gene for integration sites (points) outside genes in the five samples. Points are spread in proportion to kernel density estimates. Horizontal lines indicate sample means where there was a significant difference in means between silent/inducible and expressed provirus (black) or no significant difference (grey).
Figure 6Alphoid repeats and latency. The proportion of integration sites with matches in alphoid repeats in silent/inducible (blue) and expressed (red) cells in five samples. Error bars show the 95% Clopper-Pearson binomial confidence interval. Asterisks indicate significant associations between integrations within an alphoid repeat and proviral expression status (Bonferroni corrected Fisher’s exact test p>0.05).
Figure 7Acetylation and latency.(A) The number of ChIP-seq reads for H4K12ac, the histone mark with the lowest Fisher’s method p-value for correlation with latency, within 50,000 bases across the five samples. Integration sites (points) are spread in proportion to kernel density estimates. Horizontal lines indicate sample means where there was a significant difference (black) in means between silent/inducible and expressed provirus or no significant difference (grey). (B) The correlation (points) and its 95% confidence interval (vertical lines) between principal components of acetylation and silent/inducible status for each of the five samples. Red indicates correlations with a Bonferroni-corrected p-value >0.05.
Figure 8Shared expression status between near neighbors. The ratio of the number of pairs of proviruses with matching expression status to the number of matches expected by random pairings given the frequency of silent/inducible proviruses. All possible pairs of proviruses integrated within a given distance of each other on the same chromosome (red line) were separated into two sets; one with both proviruses from within the same cell culture model and one with proviruses paired between two different cell culture models (black lines). The shaded region shows the 95% Clopper-Pearson binomial confidence interval for within and between sample pairings. The dashed horizontal line shows the ratio of 1 expected if there is no association between the expression status of neighboring proviruses.