| Literature DB >> 30865299 |
Michael D Kessler1,2,3,4, Nicholas W Bateman5,6, Thomas P Conrads5,6,7, George L Maxwell5,6,7, Julie C Dunning Hotopp1,4,8, Timothy D O'Connor1,2,3,4.
Abstract
BACKGROUND: Although cell lines are an essential resource for studying cancer biology, many are of unknown ancestral origin, and their use may not be optimal for evaluating the biology of all patient populations.Entities:
Keywords: African ancestry; Asian ancestry; admixture; cancer genomics; clinical genetics; genomic ancestry; population genetics; precision medicine
Mesh:
Substances:
Year: 2019 PMID: 30865299 PMCID: PMC6541501 DOI: 10.1002/cncr.32020
Source DB: PubMed Journal: Cancer ISSN: 0008-543X Impact factor: 6.860
Figure 1Genomic Ancestry estimates for 1009 cell lines from the Catalogue of Somatic Mutations in Cancer (COSMIC) database are illustrated. Each vertical bar represents a different cell line, and the height of the bar (y‐axis) represents the total genomic ancestry proportion. The height of each color represents the proportion of the ancestry represented by that color. The group labeled “European” on the x‐axis represents cells lines for which the ancestry was reported by COSMIC as European (n = 244). The vast majority of these cell lines are comprised of European ancestry (red), with some cell lines exhibiting predominantly African ancestry (blue) and some exhibiting small amounts of South Asian ancestry (gold). Cell lines reported as African (n = 26) exhibit predominantly African ancestry (blue) as well as a gradation of European ancestry proportion (red). Cell lines reported as East Asian (n = 38) are almost exclusively of East Asian ancestry, except for 1 inaccurately reported cell line that exhibits exclusively European ancestry (red). Among the 701 cell lines for which ancestry was reported as “unknown,” 453 were of predominantly European ancestry (red), 30 were of predominantly African ancestry (blue), and 215 were of predominantly East Asian ancestry. Within this group of cell lines with previously unknown ancestry, the predominantly African cell lines were admixed the most, followed by the predominantly European cell lines, and then the mostly nonadmixed cell lines of East Asian origin.
Figure 2Ancestry estimates for NCI60 cell lines are illustrated. Boxplots represent the distribution of ancestry proportion estimates across 59 of the 70 nonredundant cell lines belonging to the expanded NCI60 anticancer drug screening resource. Nearly all of the cell lines are of almost entirely European ancestry: only 3 cell lines exhibit predominantly African ancestry, and none of the cell lines exhibit notable proportions of East Asian, South Asian, or Native American ancestry.
Figure 3Distributions of genetic ancestry estimates across 10 racially disparate cancers are illustrated. Boxplots represent the distributions of ancestry proportion estimates for cell lines belonging to each of 10 cancer types. These 10 representative cancers, which are indicated above each chart (with the numbers of cell lines shown below the name of each tumor type), exhibit different incidence and/or mortality rates across ancestrally distinct populations (for all analyzed cancer types, see Supporting Fig. 2). Most cancer cell lines are of predominantly European (red), East Asian (green), or European and East Asian ancestry. Exceptions to this are lung, hematopoietic and lymphoid, and breast tumors, which have significant numbers of cells lines with predominantly African ancestry (blue). Cell lines from prostate cancer, which is 1 of the most common cancers in all men, have only European ancestry. Few or no cell lines from cancers that have significantly high incidence rates among individuals of African ancestry, like cancers of the stomach, liver, pancreas, and kidney, have African ancestry (blue).
Figure 4Associations between ancestry proportion and gene expression levels are illustrated. Results shown are from analyses estimating the relation between ancestry proportions and gene expression levels for 16,681 genes across 959 cancer cell lines. After the most conservative multiple testing correction, the NBEAL1 gene (arrow) is associated significantly with gene expression and African ancestry (Supporting Table 5). Three additional genes, SLC6A19, HEATR6, and ECT2L (arrow) are associated significantly with European ancestry at a threshold set by a more moderate multiple testing correction.
Associations Between Ancestry Proportion and Single Nucleotide Mutation Proportions
| Mutation | Ancestry | |||||||
|---|---|---|---|---|---|---|---|---|
| European | African | East Asian | Unidentified | |||||
| βa |
| βa |
| βa |
| βa |
| |
| A>C | 0.001 | .287 | 0.001 | .703 | −0.002 | .088 | .009 | .330 |
| A>G | −0.013 | 1.675 × 10−8
| 0.010 | .041 | 0.012 | 1.267 × 10−6
| −0.009 | .573 |
| A>T | 0.002 | .123 | −0.002 | .509 | −0.002 | .182 | 0.008 | .417 |
| C>A | 0.009 | 1.688 × 10−3
| 0.008 | .156 | −0.012 | 6.9 × 10−5
| −0.015 | .426 |
| C>G | 0.002 | .188 | −0.005 | .232 | −0.002 | .448 | 0.038 | .004 |
| C>T | −0.000 | .935 | −0.020 | .027 | 0.006 | .173 | −0.008 | .796 |
| G>A | −0.006 | .168 | −0.009 | .286 | 0.008 | .057 | −0.020 | .502 |
| G>C | 0.003 | .103 | −0.005 | .169 | −0.001 | .505 | 0.022 | .103 |
| G>T | 0.012 | 1.814 × 10−4
| 0.007 | .221 | −0.015 | 6.984 × 10−7
| −0.022 | .276 |
| T>A | 0.002 | .181 | 0.002 | .389 | −0.003 | .048 | 0.010 | .304 |
| T>C | −0.013 | 2.78 × 10−8
| 0.011 | .024 | 0.011 | 7.625 ×10−6
| −0.016 | .352 |
| T>G | 0.000 | .805 | 0.001 | .734 | −0.001 | .458 | 0.004 | .651 |
Abbreviations: A, adenosine; βa, correlation coefficient; C, cytosine; G, guanine; T, thymidine.
Results are from association analyses estimating the effect of ancestry on the proportion of single nucleotide mutations made up by each of 12 possible mutation types. The proportions of 4 mutation types differ significantly by ancestry across the 1009 cancer cell lines for which mutation and other data were available.
bThese 4 mutation types represent 2 independent mutation classes after accounting for reverse complementation (ie, A>G implies T>C), with significance levels and βa values representing concordant associations between complimentary mutation types.
cThese P values were significant after corrections for multiple testing.