| Literature DB >> 34716697 |
Frederic Tran Mau-Them1,2, Yannis Duffourd2,3, Antonio Vitobello1,2, Ange-Line Bruel1,2, Anne-Sophie Denommé-Pichon1,2, Sophie Nambot4, Julian Delanne4, Sebastien Moutton4, Arthur Sorlin4, Victor Couturier1,2, Valentin Bourgeois1,2, Martin Chevarin1,2, Charlotte Poe1,2, Anne-Laure Mosca-Boidron5, Patrick Callier5, Hana Safraou1,2, Laurence Faivre2,4, Christophe Philippe1,2, Christel Thauvin-Robinet1,2,3,6.
Abstract
BACKGROUND: Exome sequencing (ES) has become the most powerful and cost-effective molecular tool for deciphering rare diseases with a diagnostic yield approaching 30%-40% in solo-ES and 50% in trio-ES. We applied an innovative parental DNA pooling method to reduce the parental sequencing cost while maintaining the diagnostic yield of trio-ES.Entities:
Keywords: cost effectiveness; exome sequencing; rare diseases; trio-like strategy; parental-pool strategy
Mesh:
Substances:
Year: 2021 PMID: 34716697 PMCID: PMC8683640 DOI: 10.1002/mgg3.1836
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
Next generation sequencing and pooling methods in the literature
| Disorder | Individuals | NGS method | Pooling methods | Number of individual per pool | Mean depth | Diagnostic rate (number) | |
|---|---|---|---|---|---|---|---|
| Zhu et al | IBD, depression | 70 | ES | index case | 35 | NR | NA |
| Bansal et al | Diabetes | 6058 | Targeted sequencing | index case | 20–32 | NR | 0.6% (40) |
| Calvo et al | Mitochondrial | 60 | Targeted sequencing | index case | 20–21 | 3360X | 22% (13) |
| Popp et al | DD/ID | 96 | ES | index case | 12 | 324 to 491X | 28% (27) |
| Ryu et al | NA | 1125 | Targeted sequencing | index case | 25 | 1068X | NA |
Abbreviations: DA, developmental anomaly; IBD, inflammatory bowel disease; ID, intellectual disability, NA, not applicable; NGS, next generation sequencing.
FIGURE 1(a) Number of individuals per rare disease (neurologic and developmental anomaly focus). DA: developmental anomaly; ID: intellectual disability. (b) Expected allelic balance (AB) in %, depending on the sequencing depth (from 20X to 100X) and number of parents per pool (from 4 to 7 parents). (c) Molecular results in the second‐tier (top) and first‐tier cohorts (bottom) VUS: variants of unknown significance. (d) Sequencing cost in $ depending on the solo, trio, or first‐tier parental‐pool strategies (5 or 6 parents) and based on a sequencing cost of solo‐ES at 500$. In gray shades: prices of first‐tier strategies
FIGURE 2Workflow of the parental‐pool ES strategy with IGV pictures of the variants. On the left side are positive controls with six parental pools and CRE‐v2 kit. On the right side are compound heterozygous examples of five parental pools with TWIST‐HCE kit
List of variants used as positive controls with their AB in parental pools (1 to 6) and two examples (7 and 8) of compound heterozygous variants with the 5 and 6 parents parental‐pool strategies
| Individual | Gene | MIM number | Genotype | Genomic position | c.DNA | Protein | Paternal‐pool allelic balance | Maternal‐pool allelic balance | Sanger segregation |
|---|---|---|---|---|---|---|---|---|---|
| 1 |
| 604491 | hm | chr3:g.105572421G>A | NM_170662.3:c.256C>T | p.(Arg86*) | 0.09 | 0.08 | Maternally and paternally inherited |
| 2 |
| 602858 | htc | chr11:g.71146710C>T | NM_001360.2:c.1139G>A | p.(Cys380Tyr) | 0 | 0.09 | Maternally inherited |
| chr11:g.71146886C>G | NM_001360.2:c.964‐1G>C | p.? | 0.08 | 0 | Paternally inherited | ||||
| 3 | RPS6KA3 | 300075 | hmi | chrX:g.20206617G>A | NM_004586.2:c.629C>T | p.(Thr210Ile) | 0 | 0.11 | Maternally inherited |
| 4 |
| 614787 | ht | chr1:g.151378304_151378305del | NM_015100.3:c.3206_3207del | p.(Tyr1069*) | 0 | 0 | De novo |
| 5 |
| 300401 | hmi | chrX:g.103043401T>G | NM_001128834.1:c.658T>G | p.(Cys220Gly) | 0 | 0.7 | Maternally inherited |
| 6 |
| 106180 | hm | chr17:g.61573878G>C | NM_000789.2:c.3503+1G>C | p.? | 0.10 | 0 | Paternally inherited |
| 7 |
| 608473 | htc | chr2:g.112541514C>T | NM_022662.3:c.5023G>A | p.(Glu1675Lys) | 0 | 0.09 | Maternally inherited |
| chr2:g.112621271delA | NM_022662.3:c.1033delT | p.(Ser345Profs*6) | 0.07 | 0 | Paternally inherited | ||||
| 8 |
| 600024 | htc | chr1:g.225591096C>T | NM_002296.2:c.1757G>A | p.(Arg586His) | 0 | 0.07 | Maternally inherited |
| chr1:g.225594482_225594483delinsGC | NM_002296.3:c.1366_1367delinsGC | p.(Leu456Ala) | 0.08 | 0 | Paternally inherited |
Abbreviations: Hm, homozygous; hmi, hemizygous; ht, heterozygous; htc, compound heterozygous.
Version of the exome enrichment kits used in individuals and parental pools in first and second‐tier strategies
| Capture kit version | Second‐tier parental pool | First‐tier parental pool | |||
|---|---|---|---|---|---|
| Agilent‐CRE‐v2 | TWIST‐HCE | Agilent‐CRE‐v2 | TWIST_HCE | ||
| Index case | Agilent‐v5‐51Mb | 3 | 8 | ||
| Agilent‐v6 | |||||
| Agilent‐CRE | 13 | ||||
| Agilent‐CRE‐v2 | 6 | 26 | |||
| TWIST‐HCE | 12 | 317 | |||
Abbreviations: CRE, Clinical Research Exome; HCE, Human Core Exome.
FIGURE 3IGV examples of false positive and negative variants. (a) IGV pictures of the BRD7 (left) and GPDP4 (right) strand bias. (b) IGV pictures of the OTOA poor quality mapping. (c and d) IGV pictures of the TADAD2B and ALG8 variants. Note the absence of strand bias. (e) IGV picture of the possible cDNA contamination with misaligned read at the exon‐intron junction in the individual solo‐ES (top) but not in solo‐GS (bottom; two distinct samples from the same individual). (f) IGV picture of the HNRNPAB variant. Note the absence of variant at this position. (g) IGV picture of the SHANK3 variant in the individual (top) and the parental pools (middle and bottom). Note the presence of the variant in both parental pools
FIGURE 4(a) Distribution of the AB of rare variants in the entire parental pools. Note the Gaussian curve with a maximum around 9%. (b) Comparison of the mean number of rare de novo variants in 295 pools (light gray; 288 TWIST‐HCE and 7 CRE‐v2) and 281 trios (dark gray; 280 TWIST‐HCE; and 1 CRE‐v2) with two different enrichment kits (multiallelic, SnpCluster and outliers >500 de novo excluded; individual depth ≥10). (c) Comparison of the mean number of rare de novo variants 281 in pools (light gray) and 291 trios (dark gray; multiallelic, SnpCluster and outliers >50 de novo excluded; individual depth ≥10; rare <1%; NSSI only). NS, not significant. (d) Comparison of the mean number of de novo variants in six (265; light gray) and five (50; dark gray) parental pools. NS: not significant
Individuals with double hits and/or double diagnosis
| Individual | OMIM‐morbid | Gene | Genotype | Segregation | Genomic position | c.DNA | protein | CADD | polyphen | GERP | misZ | pLI |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | + |
| ht | De novo | chr20:g.49508824_49508827del | NM_015339.2:c.2424_2427del | p.(Lys809Serfs*19) | 2.22 | 1 | |||
| + |
| hmi | Maternally inherited | chrX:g.49645630C>A | NM_001145073.1:c.720C>A | p.(Phe240Leu) | 3.28 | 0.93 | ||||
| 2 | + |
| hm | Paternally and maternally inherited | chr7:g.65548091C>T | NM_000048.3:c.376C>T | p.(Arg126Trp) | 14.6 | 1 | 3.98 | 0.95 | 0 |
| ‐ |
| hm | Paternally and maternally inherited | chr2:g.97278215C>T | NM_001115016.2:c.998G>A | p.(Arg333Lys) | 23.9 | 0.77 | 5.29 | 2.58 | 1 | |
| 3 | + |
| ht | De novo | chr11:g.119148891T>C | LRG_608t1:c.1111T>C | p.(Tyr371His) | 25.3 | 1 | 5.65 | 1.16 | 0 |
| + |
| ht | Maternally inherited | chr12:g.112926909A>G | LRG_614t1:c.1529A>G | p.(Gln510Arg) | 23.9 | 1 | 5.13 | |||
| 4 | + |
| ht | NR | chr17:g.41245479_41245482del | LRG_292t1:c.2066_2069del | p.(Ser689Lysfs*11) | 0.85 | 0 | |||
| Deletion 16q24.3 | ht | Maternally inherited | chr16:g.89590412–89599046 | |||||||||
| 5 | + |
| ht | NR | chr13:g.32971125_32971126del | LRG_293t1:c.9592_9593 | p.(Cys3198Tyrfs*23) | |||||
| Duplication 16p11.2 | Maternally inherited | chr16:g.8836685–29001335 | ||||||||||
| 6 | + |
| ht | NR | chr13:g.32914699_32914702del | LRG_293t1:c.6207_6210del | p.(Glu2070Valfs*10) | |||||
| ‐ |
| ht | De novo | chr15:g.22962436C>G | NM_014608.2:c.2160‐4C>G | p.? | 2.73 | 0.97 | ||||
| 7 | + |
| ht | Maternally inherited | chr12:g.9000236G>T | NM_144670.4:c.1775G>T | p.(Arg592Leu) | 28.1 | 0.97 | 2.74 | 0.46 | 0 |
| + |
| ht | Maternally inherited | chr5:g.14482785G>A | NM_007118.2:c.6560G>A | p.(Arg2187His) | 24 | 0.94 | 5 | 5.6 | 1 | |
| 8 | + |
| htc | Paternally inherited | chr1:g.64100539T>G | NM_002633.2:c.722T>G | p.(Leu241Arg) | 18.9 | 1 | 5.13 | −0.06 | 0 |
| Maternally inherited | chr1:g.64114218T>G | NM_002633.2:c.1175T>G | p.(Leu392Arg) | 32 | 1 | 6.06 | ||||||
| ‐ |
| ht | De novo | chr9:g.114184413C>T | NM_001080398.1:c.1866+1G>A | p.? | 25.4 | 4.22 | 1.9 | 1 | ||
| 9 | + |
| ht | Paternally inherited | chr5:g.89949539A>G | NM_032119.3:c.4148A>G | p.(Tyr1383Cys) | 26.4 | 1 | 5.48 | 0.07 | 0 |
| Duplication 5q14.3 | Maternally inherited | chr5:89969869–89986860 | ||||||||||
| 10 | + |
| hm | Maternally inherited | chr15:g.43897539G>A | NM_153700.2:c.3853C>T | p.(Gln1285*) | 21 | 5.46 | 1.41 | 0 | |
| Deletion 15q15.3 | ht | Paternally inherited | chr5:g.43888604–43940261 |
Abbreviations, Hm, homozygous, hmi: hemizygous; ht, heterozygous; htc, compound heterozygous; NR, not reported.
False positive and false negative variants identified in the cohort
| Strategy | Gene | Genotype | genomic position | Index case allelic depth | Index case allelic balance | Paternal pool depth | Paternal pool allelic balance | Maternal pool depth | Maternal pool allelic balance | Sanger segregation |
|---|---|---|---|---|---|---|---|---|---|---|
| Second‐tier |
| ht | chr4:g.7056620C>T | 125 | 0.51 | 87 | 0 | 99 | 0 | Paternally inherited |
|
| ht | chr16:g.50354575T>C | 16 | 0.67 | 46 | 0 | 50 | 0 | Maternally inherited | |
|
| ht | chr11:g.76979979C>A | 24 | 0.50 | 52 | 0 | 48 | 0 | Maternally inherited | |
| First‐tier |
| ht | chr11:g.77825024G>A | 45 | 0.33 | 54 | 0 | 37 | 0 | Maternally inherited |
|
| ht | chr5:g.177637237_177637274del | 107 | 0.40 | 132 | 0 | 131 | 0 | Maternally inherited | |
|
| hm | chr16:g.21763731_21763732del | 6 | 1 | 2 | 0 | 11 | 0 | Maternally inherited | |
|
| ht | chr22:g.51159940dup | 72 | 0.5 | 113 | 0.02 | 119 | 0.02 |
| |
|
| ht | chr16:g.69056767A>C | 91 | 0.28 | 99 | 0 | 103 | 0 | Absent |
Abbreviations: Hm, homozygous, hmi, hemizygous; ht, heterozygous; htc, compound heterozygous.