| Literature DB >> 28739859 |
Mi Ni Huang1,2, Willie Yu1,2, Wei Wei Teoh3, Maude Ardin4, Apinya Jusakul1,5, Alvin Wei Tian Ng2,6, Arnoud Boot1,2, Behnoush Abedi-Ardekani7, Stephanie Villar4, Swe Swe Myint5, Rashidah Othman3, Song Ling Poon1,5, Adriana Heguy8, Magali Olivier4, Monica Hollstein4, Patrick Tan1, Bin Tean Teh1,5, Kanaga Sabapathy1,3, Jiri Zavadil4, Steven G Rozen1,2.
Abstract
Aflatoxin B1 (AFB1) is a mutagen and IARC (International Agency for Research on Cancer) Group 1 carcinogen that causes hepatocellular carcinoma (HCC). Here, we present the first whole-genome data on the mutational signatures of AFB1 exposure from a total of >40,000 mutations in four experimental systems: two different human cell lines, in liver tumors in wild-type mice, and in mice that carried a hepatitis B surface antigen transgene-this to model the multiplicative effects of aflatoxin exposure and hepatitis B in causing HCC. AFB1 mutational signatures from all four experimental systems were remarkably similar. We integrated the experimental mutational signatures with data from newly sequenced HCCs from Qidong County, China, a region of well-studied aflatoxin exposure. This indicated that COSMIC mutational signature 24, previously hypothesized to stem from aflatoxin exposure, indeed likely represents AFB1 exposure, possibly combined with other exposures. Among published somatic mutation data, we found evidence of AFB1 exposure in 0.7% of HCCs treated in North America, 1% of HCCs from Japan, but 16% of HCCs from Hong Kong. Thus, aflatoxin exposure apparently remains a substantial public health issue in some areas. This aspect of our study exemplifies the promise of future widespread resequencing of tumor genomes in providing new insights into the contribution of mutagenic exposures to cancer incidence.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28739859 PMCID: PMC5580708 DOI: 10.1101/gr.220038.116
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Study design. (A) We experimentally elucidated the mutational signature of AFB1 based on whole-genome sequence in four experimental systems: two different AFB1-exposed human cell lines and liver tumors in AFB1-exposed wild-type mice and in AFB1-exposed mice that carried a hepatitis B surface antigen transgene. In total, we examined 48,000 mutations in experimental systems including mutations in cell lines and somatic mutations in mouse tumors. (B) We integrated these experimental results with newly generated genomic HCC data from a geographical region of known aflatoxin exposure and with additional, publicly available human HCC data.
Figure 2.(A) Representative human-cell line trinucleotide mutation spectra grouped by mutations from guanine (G > T, G > C, G > A) and adenine (e.g., A > T, A > G, A > C). The most frequent G > T mutations are indicated (TGG > TTG, TGC > TTC, AGC > ATC). The number of mutations in in each mutation class (e.g., G > T) are indicated in parentheses. As there was little variation between replicates within each cell line, we show all individual spectra in Supplemental Figure S1. (B) Extreme transcription-strand bias for genes with high expression levels; see Supplemental Figure S2 for transcription-strand bias for all cell line replicates. Transcribed strand (Tr); nontranscribed strand (NT).
Figure 3.Somatic mutation spectra from HCC-like liver tumors from (A) three AFB1-exposed mice and (B) three AFB1-exposed mice with an HBsAg transgene. The latter have only one-tenth as many mutations as tumors from the AFB1-only mice. Spectra in panels A and B were normalized to the trinucleotide frequencies in the human genome. (C) Extreme transcription-strand bias for all G > N mutations in highly expressed genes in mouse M1. See Supplemental Figure S3 for other mice. Because of low mutation count, transcription-strand bias is evident only in the G > T mutations in the tumors from the AFB1 + HBsAg mice. Transcribed strand (Tr); nontranscribed strand (NT). (D) Principal components analysis (PCA) on G > N mutations in trinucleotide context. Replicates of each of the cell lines cluster together, while the mouse tumors are more dispersed in principal components space. The greater dispersion among the HBsAg tumors (M4, M5, M6) is likely due to higher stochastic variance because of much lower mutation counts combined with greater relative contributions from other mutational processes that arose during tumor development.
Figure 4.Selection of likely aflatoxin-associated HCCs by principal components analysis on the proportions of G > T mutations in trinucleotide context. Vertical green line indicates the value of M3 in PC1, which was used as a threshold for selecting WGS HCCs likely exposed to aflatoxins for further study: One HCC identified from WES data in Schulze et al. (2015) for which WGS data was subsequently available (DO23048), six from Sung et al. (2012), and one HCC from Fujimoto et al. (2016). As expected due to the higher relative sampling variance in WES spectra due to small numbers of mutations, these were more variable than WGS spectra.
Figure 5.(A) Somatic mutation spectra and transcriptional-strand bias for initial study of aflatoxin signatures in example human HCCs with likely aflatoxin exposure. Transcribed strand (Tr); nontranscribed strand (NT). Principal component analysis of (B) G > T and (C) G > N mutations for AFB1 experimental data (enclosed in irregular ovals) and HCCs with likely high aflatoxin exposure.
Figure 6.Aflatoxin signatures in human HCCs. (A) NMF decomposes the G > N spectra from experimental AFB1 exposure and likely aflatoxin-associated HCCs into two components, denoted AFB1sigG>N and AFsig2G>N. The left-hand side of the panel shows the signatures, as is conventional, based on trinucleotide frequencies in the human genome; the right-hand side shows them in frequencies per trinucleotide—equivalent to assuming all trinucleotides are equally common. (B,C) AFB1sigG>N (black) almost completely captures the spectra of HepaRG cell lines; AFsig2G>N (red) almost completely captures the spectra of some HCCs (e.g., HK067, HK203), while the mutations in HepG2, the mouse tumors, and some HCCs are most accurately reconstructed by a mixture of AFB1sigG>N and AFsig2G>N. (D,E) Nonnegative matrix factorization of selected HCCs using mutational signatures known to occur in HCCs, plus extensions of AFB1sigG>N and AFsig2G>N with A > N proportions set to 0, the latter denoted AFB1sig and AFsig2. (D) Absolute mutation numbers assigned to each signature. (E) Proportions of mutations assigned to each signature. (F) Reconstruction accuracy is generally good, with the exception of RK206, which has few mutations and a hard-to-reconstruct mutation spectrum outside of G > T mutations, partly due to spikes at TCA:TGA > TTA:TAA and CTG:CAG > CCG:CGG.