| Literature DB >> 31882973 |
Gian-Andri Thun1, Sophia Derdak1, Francesc Castro-Giner1, Katherine Apunte-Ramos1, Lidia Águeda1, Matthias Wjst2,3, Anne Boland4, Jean-François Deleuze4, Umme Kolsum5, Marion S Heiss-Neumann6, Adam Nowinski7, Dorota Gorecka7, Jens M Hohlfeld8,9, Tobias Welte9, Christopher E Brightling10, David G Parr11, Antje Prasse9,12, Joachim Müller-Quernheim12, Timm Greulich13, Mariarita Stendardo14, Piera Boschetto14, Imre Barta15, Balázs Döme16, Marta Gut1,17, Dave Singh5, Loems Ziegler-Heitbrock6, Ivo G Gut18,19.
Abstract
Chronic obstructive pulmonary disease (COPD) is induced by cigarette smoking and characterized by inflammation of airway tissue. Since smokers with COPD have a higher risk of developing lung cancer than those without, we hypothesized that they carry more mutations in affected tissue. We called somatic mutations in airway brush samples from medium-coverage whole genome sequencing data from healthy never and ex-smokers (n = 8), as well as from ex-smokers with variable degrees of COPD (n = 4). Owing to the limited concordance of resulting calls between the applied tools we built a consensus, a strategy that was validated with high accuracy for cancer data. However, consensus calls showed little promise of representing true positives due to low mappability of corresponding sequence reads and high overlap with positions harbouring known genetic polymorphisms. A targeted re-sequencing approach suggested that only few mutations would survive stringent verification testing and that our data did not allow the inference of any difference in the mutational load of bronchial brush samples between former smoking COPD cases and controls. High polyclonality in airway brush samples renders medium-depth sequencing insufficient to provide the resolution to detect somatic mutations. Deep sequencing data of airway biopsies are needed to tackle the question.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31882973 PMCID: PMC6934450 DOI: 10.1038/s41598-019-56618-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Characteristics of the study population.
| Subject | Status | Sex | Age, [y] | Pack years | FEV1/FVC, [%] | GOLD stage |
|---|---|---|---|---|---|---|
| 1 | Case | m | 70 | 35 | 55.7 | 2 |
| 2 | Matched FS | m | 68 | 32 | 76.6 | 0 |
| 3 | Case | m | 66 | 36 | 42.7 | 3 |
| 4 | Matched FS | m | 56 | 37 | 78.7 | 0 |
| 5 | Case | f | 61 | 27 | 67.0 | 1 |
| 6 | Matched FS | f | 61 | 59 | 85.5 | 0 |
| 7 | NS | m | 57 | 0 | 87.2 | 0 |
| 8 | NS | f | 68 | 0 | 84.1 | 0 |
| 9 | NS | f | 62 | 0 | 85.0 | 0 |
| 10 | Case | m | 68 | 35 | 41.0 | 3 |
| 11 | FS | m | 73 | 54 | 76.4 | 0 |
| 12 | FS | m | 64 | 55 | 98.0 | 0 |
Subjects 1–9 were recruited at a first stage of the study with only one lung brush sample collected, subjects 10–12 were later added, providing two lung brush samples each. Pack years is the number of cigarette packs smoked by day multiplied by the number of years the person has smoked. A COPD case is defined by FEV1/FVC < 70%. The GOLD stage is a measure of magnitude of the airflow obstruction in cases (1: mild; 2: moderate; 3: severe). None of the subjects has smoked in the past two years. FS = former smoker, NS = never smoker, FEV1 = forced expiratory volume in one second, FVC = forced vital capacity.
Figure 1Numbers of somatic single-nucleotide mutation (SSM) calls in lower lobe bronchial brushings per subject. Numbers represent the intersected results of three somatic mutation calling tools. Shades of blue represent COPD cases (dark), ex-smokers without COPD (azure) and never smokers (light).
Number of somatic single-nucleotide mutation calls per Mb in lower lobe lung brushings of COPD cases and controls. Genome size was calculated based on positions with read depth ≥10 in both, reference and affected tissue. Ranges are given due to the small number of individuals per category.
| N | MuTect | VarScan 2 | Strelka | Intersection | |
|---|---|---|---|---|---|
| COPD cases | 4 | 1.97–3.00 | 2.26–9.03 | 0.10–0.54 | 0.02–0.19 |
| former smokers | 5 | 1.36–12.17 | 1.79–5.00 | 0.14–0.95 | 0.03–0.12 |
| never smokers | 3 | 2.01–2.81 | 1.91–7.94 | 0.18–0.25 | 0.03–0.04 |
Figure 2Characteristics of somatic single-nucleotide mutation calls in lung brushings of subjects from the EvA study (N = 12) as well as in tumour tissue of two cancer cases based on the same calling procedure. Coverages and VAFs were based on the Strelka results. MB = medulloblastoma, CLL = chronic lymphocytic leukemia, DHS = DNase I hypersensitive sites, IQR = interquartile range, VAF = variant allele frequency.
Verification analysis of selected somatic single-nucleotide mutation (SSM) calls, stratified by confidence category. Only one of 85 SSM calls, targeted for verification, could be confirmed. WGS = whole genome sequencing.
| Verification result of variant allele | high-confidence (N = 28) | medium-confidence (N = 29) | low-confidence (N = 28) | Interpretation of original WGS call |
|---|---|---|---|---|
| found in target tissue (subject-specific) | 1 | 0 | 0 | correct |
| found in target and reference tissue (subject-specific or non-specific) | 5 | 1 | 4 | mosaicism (if subject-specific verification); allele-specific sequencing bias |
| fully present in target and reference tissue (≈50 or 100%) | 0 | 1 | 9 | single nucleotide polymorphism |
| unlikely or unclearly present (e.g. few high-quality reads) | 1 | 6 | 3 | position difficult to call |
| missing in target and reference tissue | 21 | 21 | 12 | sequencing or mapping inaccuracies |
Figure 3Representation of mutant clones in the bronchial epithelium. Coloured cylinders represent different cellular clones of recent origin in the airway epithelium, each containing cells that share the same somatic mutations. Such a clone (shown for the one in salmon at the bottom) may contain a mixture of cell types, such as differentiated secretory cells (shown with granules in the cytoplasm), ciliated cells (depicted with cilia at the outer surface) and unstratified basal cells (square shaped), all sitting on the basement membrane (line at the very bottom). Such a cellular arrangement is named pseudostratified columnar epithelium and is typical for the airways. Clone size may vary, but presumably lies in the sub-millimeter range, which is much smaller than the diameter and the length of a brush used in bronchoscopy (shown on the right). Consequently, a brush sample contains many different clones and the proportion of cells carrying the same somatic mutations is very small and falls below the detection limit. Successful somatic mutation calling requires sampling a smaller area (i.e. coming closer to the detection limit by harvesting a higher proportion of cells carrying the same somatic mutations) and/or sequencing to a higher read depth (i.e. lowering the detection limit).