Mohamad Saad1, Younes Mokrab2, Najeeb Halabi3, Jingxuan Shan4, Rozaimi Razali2, Khalid Kunji1, Najeeb Syed5, Ramzi Temanni6, Murugan Subramanian7, Michele Ceccarelli8, Arash Rafii Tabrizi4, Davide Bedognetti9, Lotfi Chouchane10. 1. Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar. 2. Department of Human Genetics, Sidra Medicine, Doha, Qatar. 3. Genetic Intelligence Laboratory, Weill Cornell Medicine-Qatar, Doha, Qatar. 4. Genetic Intelligence Laboratory, Weill Cornell Medicine-Qatar, Doha, Qatar; Department of Genetic Medicine, Weill Cornell Medicine, New York, NY, USA. 5. Applied Bioinformatics Core, Integrated Genomics Services, Research Branch, Sidra Medicine, Doha, Qatar. 6. Janssen Research and Development, Paris, France. 7. Genetic Intelligence Laboratory, Weill Cornell Medicine-Qatar, Doha, Qatar; Department of Microbiology and Immunology, Weill Cornell Medicine, New York, NY, USA. 8. Department of Electrical Engineering and Information Technology (DIETI), University of Naples "Federico II", Naples, Italy; Biogem, Istituto di Biologia e Genetica Molecolare, Ariano Irpino, Italy. 9. Cancer Research Department, Research Branch, Sidra Medicine, Doha, Qatar. 10. Genetic Intelligence Laboratory, Weill Cornell Medicine-Qatar, Doha, Qatar; Department of Genetic Medicine, Weill Cornell Medicine, New York, NY, USA; Department of Microbiology and Immunology, Weill Cornell Medicine, New York, NY, USA. Electronic address: loc2008@med.cornell.edu.
Abstract
BACKGROUND: Disparities in the genetic risk of cancer among various ancestry groups and populations remain poorly defined. This challenge is even more acute for Middle Eastern populations, where the paucity of genomic data could affect the clinical potential of cancer genetic risk profiling. We used data from the phase 1 cohort of the Qatar Genome Programme to investigate genetic variation in cancer-susceptibility genes in the Qatari population. METHODS: The Qatar Genome Programme generated high-coverage genome sequencing on DNA samples collected from 6142 native Qataris, stratified into six distinct ancestry groups: general Arab, Persian, Arabian Peninsula, Admixture Arab, African, and South Asian. In this population-based, cohort study, we evaluated the performance of polygenic risk scores for the most common cancers in Qatar (breast, prostate, and colorectal cancers). Polygenic risk scores were trained in The Cancer Genome Atlas (TCGA) dataset, and their distributions were subsequently applied to the six different genetic ancestry groups of the Qatari population. Rare deleterious variants within 1218 cancer susceptibility genes were analysed, and their clinical pathogenicity was assessed by ClinVar and the CharGer computational tools. FINDINGS: The cohort included in this study was recruited by the Qatar Biobank between Dec 11, 2012, and June 9, 2016. The initial dataset comprised 6218 cohort participants, and whole genome sequencing quality control filtering led to a final dataset of 6142 samples. Polygenic risk score analyses of the most common cancers in Qatar showed significant differences between the six ancestry groups (p<0·0001). Qataris with Arabian Peninsula ancestry showed the lowest polygenic risk score mean for colorectal cancer (-0·41), and those of African ancestry showed the highest average for prostate cancer (0·85). Cancer-gene rare variant analysis identified 76 Qataris (1·2% of 6142 individuals in the Qatar Genome Programme cohort) carrying ClinVar pathogenic or likely pathogenic variants in clinically actionable cancer genes. Variant analysis using CharGer identified 195 individuals carriers (3·17% of the cohort). Breast cancer pathogenic variants were over-represented in Qataris of Persian origin (22 [56·4%] of 39 BRCA1/BRCA2 variant carriers) and completely absent in those of Arabian Peninsula origin. INTERPRETATION: We observed a high degree of heterogeneity for cancer predisposition genes and polygenic risk scores across ancestries in this population from Qatar. Stratification systems could be considered for the implementation of national cancer preventive medicine programmes. FUNDING: Qatar Foundation.
BACKGROUND: Disparities in the genetic risk of cancer among various ancestry groups and populations remain poorly defined. This challenge is even more acute for Middle Eastern populations, where the paucity of genomic data could affect the clinical potential of cancer genetic risk profiling. We used data from the phase 1 cohort of the Qatar Genome Programme to investigate genetic variation in cancer-susceptibility genes in the Qatari population. METHODS: The Qatar Genome Programme generated high-coverage genome sequencing on DNA samples collected from 6142 native Qataris, stratified into six distinct ancestry groups: general Arab, Persian, Arabian Peninsula, Admixture Arab, African, and South Asian. In this population-based, cohort study, we evaluated the performance of polygenic risk scores for the most common cancers in Qatar (breast, prostate, and colorectal cancers). Polygenic risk scores were trained in The Cancer Genome Atlas (TCGA) dataset, and their distributions were subsequently applied to the six different genetic ancestry groups of the Qatari population. Rare deleterious variants within 1218 cancer susceptibility genes were analysed, and their clinical pathogenicity was assessed by ClinVar and the CharGer computational tools. FINDINGS: The cohort included in this study was recruited by the Qatar Biobank between Dec 11, 2012, and June 9, 2016. The initial dataset comprised 6218 cohort participants, and whole genome sequencing quality control filtering led to a final dataset of 6142 samples. Polygenic risk score analyses of the most common cancers in Qatar showed significant differences between the six ancestry groups (p<0·0001). Qataris with Arabian Peninsula ancestry showed the lowest polygenic risk score mean for colorectal cancer (-0·41), and those of African ancestry showed the highest average for prostate cancer (0·85). Cancer-gene rare variant analysis identified 76 Qataris (1·2% of 6142 individuals in the Qatar Genome Programme cohort) carrying ClinVar pathogenic or likely pathogenic variants in clinically actionable cancer genes. Variant analysis using CharGer identified 195 individuals carriers (3·17% of the cohort). Breast cancer pathogenic variants were over-represented in Qataris of Persian origin (22 [56·4%] of 39 BRCA1/BRCA2 variant carriers) and completely absent in those of Arabian Peninsula origin. INTERPRETATION: We observed a high degree of heterogeneity for cancer predisposition genes and polygenic risk scores across ancestries in this population from Qatar. Stratification systems could be considered for the implementation of national cancer preventive medicine programmes. FUNDING: Qatar Foundation.