| Literature DB >> 29504914 |
Maxim Ivanov1,2, Alina Matsvay3,4, Olga Glazova3, Stanislav Krasovskiy5, Mariya Usacheva5, Elena Amelina5, Aleksandr Chernyak5, Mikhail Ivanov3, Sergey Musienko6, Timofey Prodanov7, Sergey Kovalenko8,9, Ancha Baranova3,6,10,11, Kamil Khafizov3,4.
Abstract
BACKGROUND: Cystic fibrosis (CF) is one of the most common life-threatening genetic disorders. Around 2000 variants in the CFTR gene have been identified, with some proportion known to be pathogenic and 300 disease-causing mutations have been characterized in detail by CFTR2 database, which complicates its analysis with conventional methods.Entities:
Keywords: CFTR cystic fibrosis; NGS next generation sequencing
Mesh:
Substances:
Year: 2018 PMID: 29504914 PMCID: PMC5836842 DOI: 10.1186/s12920-018-0328-z
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Clinical and Demographic characteristics of patients
| Basic patient characteristics | |
| Number of Patients | 84 |
| Gender | |
| Male | 40 (48%) |
| Female | 44 (52%) |
| Age, mean (range) | 26 (19-47) |
| Height, cm mean (range) | 168 (142-187) |
| Weight, kg mean (range) | 52 (33-75) |
| Clinical characteristics | |
| FVC, % mean (range) | 67 (28-120) |
| FEV, % mean (range) | 47 (15-111) |
| Phenotype | |
| Mild | 38 (45%) |
| Severe | 46 (55%) |
| Diabetes | |
| Yes | 10 (12%) |
| No | 74 (88%) |
| Pancreatic sufficiency | |
| Sufficient | 37 (44%) |
| Insufficient | 47 (56%) |
| Bacterial flora characteristics | |
| S.aureus (yes - no) | 12 - 72 |
| P.aeruginosa (yes - no) | 41 - 43 |
| Achromobacter spp. (yes - no) | 7 - 77 |
| Stenotrophomonas spp. (yes - no) | 1 - 83 |
| E.coli (yes - no) | 1 - 83 |
| B.cepacia (yes - no) | 22 - 62 |
Fig. 1NGS data analysis results. a. Mean and median coverage for each CFTR exon across 84 samples b. Identified complex alleles mapped into CFTR gene product. MSD - membrane-spanning domains, NBD - nucleotide-binding domain 1 c. Total and unique mutation calls count by CFTR exon across 84 samples
Fig. 2Capability of the semiconductor technology to detect mutations near homopolymer regions. a. Mutation c.2052dupA occur in (A)7 homopolymer region of the CFTR which is complicated to accurately discriminate with the semiconductor NGS technology b. Single sample carrying c.2052dupA mutation demonstrates presence of the variant (A)8 allele in sequencing reads (indicated by arrows) alongside with the (A)7, (A)6 and (A)5 alleles in contrast to sample without mutation, harboring predominantly (A)7, (A)6 and (A)5 alleles c. Distribution of (A)8 variant allele frequency across sequencing among all samples. The majority of samples are grouped below 10% allele frequency, while mutation carries have frequencies of 16, 18, 21 and 42%
Fig. 3Impact of the primer trimming on variant calling. Detection of the p.Glu92Lys variant was complicated by presence of the technical sequences in data, which lead to false negative calls employing standard pipelines: a. Technical sequences (primers) result in reference allele overestimation and influence variant calling b. Trimming primers allow to detect mutation in all samples
Causative mutations identified in 84 patients with diagnosed CF
| Allele 1* | Allele 2* | n | PT | ||
|---|---|---|---|---|---|
| PANCREATIC SUFFICIENT DISEASE ( | |||||
| 3849 + 10kbC > T | F508del | 8 | M | ||
| 3849 + 10kbC > T | F508del;L467F | 1 | M | ||
| 3849 + 10kbC > T;R668C | F508del | 1 | M | ||
| 3849 + 10kbC > T;R668C | R1066C | 1 | M | ||
| 3849 + 10kbC > T;R668C | 2143delT | 1 | M | ||
| 3849 + 10kbC > T | I1295Ffs | 1 | M | ||
| 3849 + 10kbC > T | S1226* | 1 | M | ||
| 3272-16 T > A† | R347P† | 394delTT† | 1 | M | |
| 3272-16 T > A | F508del | 3 | M | ||
| 2789 + 5G > A | F508del;L467F | 1 | M | ||
| 2789 + 5G > A | ex2,3del | 1 | M | ||
| 3272-11A > G | F508del | 1 | M | ||
| E92K | E92K | 1 | M | ||
| E92K | F508del | 7 | M | ||
| E92K | ex2,3del | 1 | M | ||
| S1159P | F508del | 1 | S | ||
| R334W | F508del | 1 | M | ||
| R347P | W1282R | 1 | M | ||
| L138ins | F508del | 2 | M | ||
| L1335P | F508del | 1 | M | ||
| I1295Ffs | F508del | 1 | S | ||
| PANCREATIC INSUFFICIENT DISEASE ( | |||||
| Y569H | F508del | 1 | S | ||
| G461E | N1303 K | 1 | S | ||
| S1159F | S1159F | 1 | M | ||
| Q98R | 2184insA | 1 | M | ||
| Q98R | G542* | 1 | M | ||
| F508del | F508del | 6 | S | ||
| F508del | W1282R | 2 | S | ||
| F508del | S945 L | 2 | S | ||
| F508del | R1070Q;S466* | 1 | S | ||
| F508del | ex2,3del | 4 | S | ||
| F508del;L467F | ex2,3del | 1 | S | ||
| F508del | 2143delT | 3 | S | ||
| F508del;L467F | 2143delT | 1 | S | ||
| F508del | 2184insA | 1 | S | ||
| F508del | N415* | 1 | S | ||
| F508del | R1239= | 2 | S | ||
| F508del | R785* | 1 | S | ||
| F508del | dup6b,10 | 1 | S | ||
| F508del | R709* | 1 | S | ||
| F508del | 1898 + 1G > C | 1 | S | ||
| F508del | 4374 + 1G > A | 1 | S | ||
| F508del | 3821delT | 1 | S | ||
| F508del | L15Ffs | 1 | S | ||
| N1303 K | N1303 K | 1 | S | ||
| N1303 K | ex2,3del | 1 | S | ||
| R785* | R1070Q;S466* | 1 | S | ||
| ex2,3del | ex2,3del | 1 | S | ||
| ex2,3del | 2184insA | 1 | S | ||
| ex2,3del | N415* | 1 | S | ||
| 394delTT | 2184insA | 1 | S | ||
| 4374 + 1G > T | 4374 + 1G > T | 1 | S | ||
| dup6b,10 | 712-1G > T | 1 | S | ||
| 1716 + 1G > A | 2043delG | 1 | S | ||
| 2118del4 | 1248 + 1G > A | 1 | S | ||
Phenotype is displayed in the PT columns (M – “mild”; S – “severe”, where “mild” is defined as homozygous or heterozygous for class IV-V mutations and “severe” as homozygous for class I-III mutations). cDNA variant names according to HGVS nomenclature are given below the table for each variant. Variants were referenced using NM_000492.3. *Identified variants: 3849 + 10kbC > T (c.3718-2477C > T); 3272-16 T > A (c.3140-16 T > A); 2789 + 5G > A (c.2657 + 5G > A); 3272-11A > G (c.3140-11A > G); E92K (c.274G > A); S1159F (c.3476C > T); R334W (c.1000C > T); R347P (c.1040G > C); Q98R (c.293A > G); L138ins (c.412_413insACT); L1335P (c.4004 T > C); Y569H (c.1705 T > C); G461E (c.1382G > A); S1159P (c.3476C > T); F508del (c.1521_1523delCTT); N1303 K (c.3909C > G); W1282R (c.3844 T > C); S945 L (c.2834C > T); R1070Q (c.3209G > A); R1066C (c.3196C > T); ex2,3del (p.Ser18_Gly91del); 2143delT (c.2012delT); 394delTT (c.262_263delTT); 2184insA (c.2052dupA); 1295Ffs (c.3883delA); R1239 = (c.3717G > A);N415* (c.1240_1244delCAAAA); R785* (c.2353C > T); 4374 + 1G > T (c.4242 + 1G > T); R668C (c.2002C > T);dup6b,10 (c.(743 + 1_744-1)_(1584 + 1_1585-1)dup); S466* (c.1397C > A); R709* (c.2125C > T); S1226* (c.3587C > G); 1898 + 1G > C (c.1766 + 1G > C); 1716 + 1G > A (c.1584 + 1G > A); 4374 + 1G > A (c.4242 + 1G > A); 3874-14C > T (c.3874-14C > G); 3821delT (c.3691delT); 2118del4 (c.1984_1987delCTAA); 1248 + 1G > A (c.1116 + 1G > A); L15Ffs (c.43delC); G542* (c.1624G > T); 2043delG (c.1911delG); L467F (c.1399C > T); † Cis/trans position of alleles could not be identified
Fig. 4Missense substitutions and in-frame deletions/insertions annotated in accordance with used variant databases and computational tools. VUS stands for Variant of Uncertain Significance; D – pathogenic; N – benign/non CF-causing; U – annotation could not be assessed; VVS – variant of varying clinical significance according to CFRT2 variant classification