| Literature DB >> 29101368 |
Eric Letouzé1,2,3,4, Jayendra Shinde5,6,7,8, Victor Renault9, Gabrielle Couchy5,6,7,8, Jean-Frédéric Blanc10,11, Emmanuel Tubacher9, Quentin Bayard5,6,7,8, Delphine Bacq12, Vincent Meyer12, Jérémy Semhoun9, Paulette Bioulac-Sage10,13, Sophie Prévôt14, Daniel Azoulay15,16, Valérie Paradis17, Sandrine Imbeaud5,6,7,8, Jean-François Deleuze12, Jessica Zucman-Rossi18,19,20,21,22.
Abstract
Genomic alterations driving tumorigenesis result from the interaction of environmental exposures and endogenous cellular processes. With a diversity of risk factors, liver cancer is an ideal model to study these interactions. Here, we analyze the whole genomes of 44 new and 264 published liver cancers and we identify 10 mutational and 6 structural rearrangement signatures showing distinct relationships with environmental exposures, replication, transcription, and driver genes. The liver cancer-specific signature 16, associated with alcohol, displays a unique feature of transcription-coupled damage and is the main source of CTNNB1 mutations. Flood of insertions/deletions (indels) are identified in very highly expressed hepato-specific genes, likely resulting from replication-transcription collisions. Reconstruction of sub-clonal architecture reveals mutational signature evolution during tumor development exemplified by the vanishing of aflatoxin B1 signature in African migrants. Finally, chromosome duplications occur late and may represent rate-limiting events in tumorigenesis. These findings shed new light on the natural history of liver cancers.Entities:
Mesh:
Year: 2017 PMID: 29101368 PMCID: PMC5670220 DOI: 10.1038/s41467-017-01358-x
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Mutational processes are modulated by risk factors and preferentially alter specific hotspots. a Summary of the 10 signatures (COSMIC nomenclature) known to be operative in liver cancers analyzed in our combined series of 299 HCC. Each signature is displayed according to the 96-substitution classification defined by the substitution type and sequence context immediately 5′ and 3′ to the mutated base. b Proportion of samples in which each signature was detected (top) and distribution of mutation counts for each signature in the relevant samples (bottom). c, d Correlation of mutational signatures with risk factors and age. The number of mutations attributed to signatures 4 c and 16 d is represented as a function of age for patient groups stratified according to the risk factors significantly associated with each signature (Supplementary Table 5). e Distribution of mutational signatures associated with driver gene mutations. We estimated the probability of each driver gene mutation being due to each mutational process. We then summed these probabilities over all mutations and signatures to obtain the cumulative probabilities across all driver gene mutations (pie chart) and for each driver gene separately (barplot). f CTNNB1 mutations (left) overall have higher probabilities being due to signature 16 than other mutations in the same samples (middle) and in other samples (right). The violin plots represent the distribution of probabilities for each group of mutations and horizontal segments highlight median values. g Distribution of mutational signatures associated with CTNNB1 mutations across the main oncogenic hotspots. The probability of each CTNNB1 mutation being due to each mutational process was estimated and summed across each position between amino acids S29 and S45, the main oncogenic hotspots of CTNNB1. The DNA and protein sequence are depicted below the barplot
Fig. 2Gene expression strongly modulates mutational processes in liver cancer. a Number of single nucleotide variants (SNVs, left) and small insertions and deletions (indels, right) per megabase in genes as a function of expression level. Genes with expression between 0 and 100 fragments per kilobase of exons per million reads (FPKM) were divided in 5 gene expression quintiles. A separate group was created for very highly expressed genes (FPKM ≥ 100). Error bars indicate the 95% confidence intervals of the estimated mutation rates. b Per-gene correlation of indel rate with gene expression. The normalized indel rate represents the number of indels per megabase in each gene divided by the indel rate in unexpressed genes (FPKM = 0). Very highly expressed genes with a high indel rate are represented in a zoomed box on the right. c Distribution of indel types and sizes in very highly expressed (FPKM ≥ 100) vs. all other genes. d Correlation between gene expression and SNV rates broken down by mutational signature. Mutation rates were normalized against the mutation rate in unexpressed genes. e Evolution of mutation rates when crossing transcription start sites (TSS) for the 4 mutational signatures with the strongest transcriptional strand bias. Each dot represents the average mutation rate in a 1-kb window between 50 kb upstream and 100 kb downstream transcription start sites, normalized against the mutation rate in intergenic regions. In transcribed regions, the mutation rate was estimated separately on the transcibed (light) and non-transcribed (dark) DNA strands. Black and gray arrows represent the shifts in mutation rate attributed to transcription-coupled repair (TCR) and transcription-coupled damage (TCD). f Transcription-coupled damage was quantified in each tumor as the increase of A>G mutations between low and high expression genes (top). Bottom: Correlation of TCD with clinical and molecular features
Fig. 3Structural rearrangement signatures in liver cancer. a Six rearrangement signatures identified by non-negative matrix factorization. Structural rearrangements were classified in 38 categories considering their type (del: deletion, dup: tandem duplication, inv: inversion, trans: interchromosomal translocation) and size, and distinguishing clustered from non-clustered events. The probability of each rearrangement category in each signature is represented, with rearrangement types indicated above and rearrangement sizes below. b Unsupervised classification reveals HCC subgroups with similar rearrangement signatures. Particular phenotypes, defined by the presence of ≥50 rearrangements attributed to a same signature, are indicated, with the percentage of tumors displaying the phenotype in parenthesis. Below, CIRCOS plots represent the structural rearrangement profiles of 5 tumors representative of particular structural rearrangement phenotypes
Fig. 4Clonal architecture and natural history of liver tumors. a Number of clonal and subclonal mutations identified in 44 liver tumors. A color code indicates the number of mutations with a cancer cell fraction (proportion of tumor cells harboring the mutation) between 0 and 1 (clonal mutations). Tumors are ordered according to their proportion of subclonal mutations and significantly associated clinical features are represented below. b Clonal history of an hepatocellular carcinoma displaying 3 different subclonal mutations of CTNNB1. Driver mutations and copy-number alterations in the clonal and subclonal compartments are indicated with a color code indicating the type of event and the most likely signature of origin, as represented in the legend. Copy-number alterations are indicated and duplications are positioned according to their timing in point mutation time. c Clonal histories of two cases of adenoma-to-carcinoma progressions. In CHC361T (CTNNB1-related adenoma), few clonal mutations distinguish the adenoma (HCA) and carcinoma (HCC) samples, but 3 subclonal driver mutations were acquired in the carcinoma part. In CHC465T (HNF1A-related adenoma), the carcinoma developed from a subclone representing 14% of cells in the adenoma sample that further acquired 5905 mutations. In both cases, chromosome duplications occurred shortly before the carcinoma was operated
Fig. 5Mutational processes evolve along liver tumorigenesis. a Proportion of mutations attributed to each mutational signature in the clonal and subclonal mutations of each tumor (connected with a line). Clon. clonal; Sub. subclonal. b Average contribution of each signature to clonal and subclonal mutations in our series of 44 liver tumors. c Clonal history of 3 HCC developed in African migrants with aflatoxin B1 exposure and HBV infection. Driver alterations in the clonal and subclonal compartments are indicated with a color code describing the type of event and the most likely signature of origin, as represented in the legend. The R249s mutation of TP53, hallmark of aflatoxin B1 exposure, is encountered in two patients. TERT promoter is altered by mutation in one patient and HBV insertion in two patients. The two TP53-mutated tumors display many chromosome deletions, and the 3 cases show an accumulation of chromosome gains shortly before the last selective sweep. Signature 24 (aflatoxin B1) is dominant in clonal mutations but vanishes in subclonal mutations
Fig. 6Chromosome duplications are late events in hepatocellular carcinomas. a Timing of copy-number gains in 33 informative HCC. The timing of chromosome gains was estimated from the ratio of duplicated over non-duplicated mutations. Each arrow represents the point mutation time, 0 being the time when no mutation had been acquired yet and 100 the time when all clonal mutations were present. Chromosome gains are placed on this scale with a color code for each chromosome. Tumors are grouped based on the presence in their history of whole genome duplications or multiple synchronous gains defined as the co-occurrence of ≥4 duplications in a window shorter than 30% mutation time. When several copies of a same chromosome were gained, only the first event is represented for clarity. b Distribution of duplication timings in point mutation time for the most recurrent scattered duplications, synchronous gains of ≥4 chromosome regions and whole genome duplications. c Tumor size distribution in tumor groups showing different patterns of chromosome duplication acquisition, as defined in a