| Literature DB >> 34155260 |
Manuel Solís-Moruno1,2, Anna Mensa-Vilaró3,4, Laura Batlle-Masó1,2, Irene Lobón1, Núria Bonet2, Tomàs Marquès-Bonet1,5,6,7, Juan I Aróstegui3,4,8, Ferran Casals9,10.
Abstract
There are increasing evidences showing the contribution of somatic genetic variants to non-cancer diseases. However, their detection using massive parallel sequencing methods still has important limitations. In addition, the relative importance and dynamics of somatic variation in healthy tissues are not fully understood. We performed high-depth whole-exome sequencing in 16 samples from patients with a previously determined pathogenic somatic variant for a primary immunodeficiency and tested different variant callers detection ability. Subsequently, we explored the load of somatic variants in the whole blood of these individuals and validated it by amplicon-based deep sequencing. Variant callers allowing low frequency read thresholds were able to detect most of the variants, even at very low frequencies in the tissue. The genetic load of somatic coding variants detectable in whole blood is low, ranging from 1 to 2 variants in our dataset, except for one case with 17 variants compatible with clonal haematopoiesis under genetic drift. Because of the ability we demonstrated to detect this type of genetic variation, and its relevant role in disorders such as primary immunodeficiencies, we suggest considering this model of gene mosaicism in future genetic studies and considering revisiting previous massive parallel sequencing data in patients with negative results.Entities:
Year: 2021 PMID: 34155260 PMCID: PMC8217568 DOI: 10.1038/s41598-021-92381-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Samples and mutations included in the study.
| Sample | Coordinate (hg38) | Gene | Change in DNA | Change in protein | WES | ADS | ||
|---|---|---|---|---|---|---|---|---|
| VAF (%) | DP/VD | Mean coverage | VAF (%) | |||||
| S1a (PB) | chr1:247,424,492 | c.1049C>T | p.Thr350Met | 0 | 192/0 | 232 | 2.80 | |
| S1b (OM) | chr1:247,424,492 | c.1049C>T | p.Thr350Met | 7.22 | 97/7 | 153 | 6.90 | |
| S2 (PB) | chr1:247,424,357 | c.914A>C | p.Asp305Ala | 36.26 | 171/62 | 274 | 34.80 | |
| S3a (PB) | chr16:50,710,912 | c.1001G>A | p.Arg334Gln | 10.13 | 592/60 | 220 | 9.40 | |
| S3b (OM) | chr16:50,710,912 | c.1001G>A | p.Arg334Gln | 5.46 | 1171/64 | 349 | 4.90 | |
| S4a (PB) | chr16:50,710,912 | c.1001G>A | p.Arg334Gln | 46.44 | 618/287 | 231 | – | |
| S4 (OM) | chr16:50,710,912 | c.1001G>A | p.Arg334Gln | 5.21 | 576/30 | 179 | 8.50 | |
| S5a (PB) | chr1:247,425,355 | c.1912C>G | p.Gln638Glu | 19.67 | 422/83 | 318 | 18.40 | |
| S5b (OM) | chr1:247,425,355 | c.1912C>G | p.Gln638Glu | 8.72 | 390/34 | 274 | 6.00 | |
| S6 (PB) | chr1:247,424,367 | c.924A>T | p.Gln308His | 8.57 | 175/15 | 308 | 5.10 | |
| S7a (PB) | chrX:71,109,309 | c.676C>T | p.Arg226Cys | 18.75 | 192/36 | 247 | 17.80 | |
| S7b (UR) | chrX:71,109,309 | c.676C>T | p.Arg226Cys | 11.24 | 169/19 | 213 | 8.30 | |
| S8 (PB) | chr1:247,424,356 | c.913G>A | p.Asp305Asn | 8.00 | 125/10 | 234 | 7.20 | |
| S9 (PB) | chr16:50,710,912 | c.1001G>A | p.Arg334Gln | 2.12 | 1038/22 | 312 | 2.70 | |
| S10 (PB) | chr14:35,007,365 | c.338G>T | p.Gly113Val | 2.34 | 128/3 | 146 | 2.30 | |
| S11 (PB) | chr19:855,967 | c.607G>C | p.Gly203Arg | 9.10 | 99/9 | 219 | 16.20 | |
VAFs from ADS were extracted from a previous publication[21].
DP total depth; VD variant depth.
Figure 1Previously reported causal somatic mutations detected by each variant caller (in green), assessed as the presence of the variant in the raw VCF files. The germline variant in S4a was detected in Strelka germline mode but not in the somatic one. All VAF were extracted from a previous publication[21].
Numer of called variants and after sequential variant filtering process for each sample.
| Filtering | S1a | S1b | S2 | S3a | S3b | S4 | S5a | S5b | S6 | S7a | S7b | S8 | S9 | S10 | S11 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| On target (VarDict − VarScan2) | 298,250 173,072 | 363,209 101,637 | 239,526 266,542 | 286,381 200,507 | 241,135 453,494 | 382,519 119,025 | 231,469 263,604 | 312,808 245,467 | 276,040 273,296 | 293,494 176,720 | 315,825 171,718 | 274,664 317,568 | 191,603 720,791 | 223,731 261,383 | 302,540 172,629 |
| Intersection | 48,715 | 44,246 | 49,871 | 51,066 | 70,617 | 49,399 | 52,849 | 61,889 | 62,644 | 53,951 | 51,705 | 58,201 | 64,366 | 50,684 | 53,268 |
| 6pb indels | 48,187 | 43,598 | 49,246 | 50,477 | 69,885 | 48,451 | 52,286 | 61,126 | 61,954 | 53,382 | 50,880 | 57,646 | 63,757 | 50,157 | 52,621 |
| 1000G mask | 35,864 | 31,825 | 37,231 | 37,437 | 55,243 | 35,592 | 39,619 | 47,635 | 48,200 | 40,621 | 38,732 | 44,829 | 50,966 | 37,963 | 39,983 |
| DP > 50 | 34,091 | 27,692 | 36,459 | 35,706 | 54,052 | 31,989 | 38,734 | 45,779 | 46,668 | 37,906 | 35,927 | 42,349 | 49,958 | 33,123 | 36,816 |
| QUAL > 25 | 33,771 | 27,346 | 35,991 | 35,272 | 53,184 | 31,542 | 38,295 | 45,035 | 45,851 | 37,388 | 35,353 | 41,439 | 48,835 | 32,584 | 36,282 |
| LoF & missense | 18,476 | 15,154 | 19,998 | 18,929 | 31,534 | 18,100 | 21,148 | 26,962 | 27,596 | 22,103 | 20,585 | 24,283 | 28,888 | 18,518 | 21,472 |
| gnomAD < 0.001 | 12,135 | 10,119 | 13,562 | 11,980 | 24,001 | 12,871 | 14,427 | 20,553 | 21,178 | 16,281 | 14,781 | 17,977 | 21,910 | 12,711 | 15,634 |
| CADD > 15 | 9,035 | 7,904 | 10,085 | 8,864 | 19,044 | 9,994 | 10,828 | 16,271 | 12,662 | 11,077 | 13,887 | 17,086 | 9,547 | 12,155 | |
| GERP > 2 | 7,787 | 6,771 | 8,703 | 7,604 | 16,486 | 8,582 | 9,374 | 13,979 | 14,498 | 10,953 | 9,528 | 11,976 | 14,633 | 8,161 | 10,409 |
| VD ≥ 3 | 6,977 | 6,086 | 7,446 | 6,528 | 14,473 | 7,720 | 8,560 | 12,509 | 13,231 | 9,764 | 8,181 | 9,719 | 11,024 | 5,162 | 8,991 |
| Candidate genes | 174 | 275 |
The last step where the causal somatic variant is retained is shown in bold.
Figure 2Filtering process followed to obtain somatic candidate variants. We got around 40 variants per blood sample that we then experimentally validated by ADS.
VAF of the 20 somatic candidate variants studied in S5 patient.
| Chr | Position | Gene | Type | Whole blood | Whole blood post | Urine | Oral mucosa | Neutrophils | Monocytes | B cells | TCD4 | TCD8 | Control1 | Control2 | Validated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| chr1 | 153,003,501 | Missense | 7.7216 | 3.7419 | 7.3138 | 1.6876 | 7.165 | 9.5398 | YES | ||||||
| chr1 | 247,425,355 | Missense | 24.6228 | 12.4922 | 25.3693 | 7.0918 | 24.3422 | 33.993 | YES | ||||||
| chr2 | 24,300,108 | Missense | 4.7359 | 3.4122 | 5.0154 | 1.8296 | 5.6246 | 2.2179 | YES | ||||||
| chr2 | 209,888,127 | Synonimous | 26.5932 | 13.452 | 28.2002 | 6.9989 | 26.8757 | 34.7143 | YES | ||||||
| chr2 | 219,251,622 | Synonimous | 4.5508 | 3.2531 | 4.6625 | 1.3055 | 5.3645 | 1.9301 | YES | ||||||
| chr3 | 52,913,506 | Missense | 1.4477 | 1.2013 | 1.3699 | 0.4748 | 1.7193 | 1.453 | 2.5658 | YES | |||||
| chr4 | 143,695,587 | Missense | 24.3856 | 12.176 | 24.6201 | 6.1085 | 23.192 | 31.5614 | YES | ||||||
| chr4 | 165,059,454 | Intergenic | |||||||||||||
| chr6 | 36,270,463 | Missense | 4.8759 | 4.0553 | 5.5917 | 1.7713 | 6.4759 | 2.6609 | YES | ||||||
| chr6 | 52,082,518 | Missense | 26.3027 | 12.7771 | 27.3696 | 6.8415 | 26.0161 | 35.482 | YES | ||||||
| chr6 | 151,349,029 | Missense | 1.496 | 1.2648 | 1.6739 | 0.4203 | 1.8367 | 1.2469 | 2.7175 | YES | |||||
| chr7 | 157,614,060 | Intronic | 47.7712 | 48.8889 | 46.9676 | 46.2124 | 49.503 | 48.0925 | 48.1535 | 43.1712 | 46.6121 | NO | |||
| chr9 | 91,410,553 | Missense | |||||||||||||
| chr11 | 111,853,480 | Synonimous | 4.2723 | 3.1181 | 4.6411 | 1.3842 | 5.2542 | 2.2404 | YES | ||||||
| chr12 | 128,705,237 | Missense | 2.4998 | 1.2072 | 2.4715 | 0.5706 | 2.2107 | 3.4175 | YES | ||||||
| chr13 | 24,912,928 | Missense | 1.5069 | 1.1153 | 1.8446 | 0.6136 | 1.8944 | 0.9268 | 1.2501 | YES | |||||
| chr17 | 50,840,691 | Missense | 4.1908 | 2.7357 | 4.4201 | 1.3373 | 4.4152 | 1.9862 | YES | ||||||
| chr19 | 16,529,871 | Synonimous | 4.6041 | 2.2909 | 4.4651 | 1.1656 | 4.6677 | 5.4904 | YES | ||||||
| chr20 | 13,915,139 | Intronic | 24.5724 | 11.9959 | 24.142 | 6.1982 | 23.3935 | 33.2716 | YES | ||||||
| chrX | 71,537,899 | Missense | 49.978 | 25.1551 | 48.1124 | 12.4303 | 48.9245 | 32.2739 | YES |
In bold, values below the sequencing error threshold.
Figure 3VAF of validated somatic variants in S5 patient per tissue and cell type. Green is used for the group of variants with higher VAF (around 24%), red for those with intermediate VAF (around 4%) and blue for those with low VAF (around 1.5%, the only group present in B cells). Of note, there is one variant in the X chromosome whose frequency has been divided by 2 in order to visualize it grouped with the others.