| Literature DB >> 29776329 |
Simon Jonas Larsen1, Luisa Matos do Canto2, Silvia Regina Rogatto3, Jan Baumbach4,5.
Abstract
BACKGROUND: Copy number variations (CNVs) are large segments of the genome that are duplicated or deleted. Structural variations in the genome have been linked to many complex diseases. Similar to how genome-wide association studies (GWAS) have helped discover single-nucleotide polymorphisms linked to disease phenotypes, the extension of GWAS to CNVs has aided the discovery of structural variants associated with human traits and diseases.Entities:
Keywords: Association studies; Copy number variation
Mesh:
Year: 2018 PMID: 29776329 PMCID: PMC5960125 DOI: 10.1186/s12864-018-4732-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Example of a query specified through the web interface for the query-based model. This query specifies that the method should look for regions where at least 20% of the samples in the positive group have a gain in copy numbers and at least 90% of the samples in the negative group have no variation
Fig. 2Example of output from gene set enrichment. The table shows KEGG pathways that are significantly enriched with the genes from a set of CNV regions
Fig. 3Example of chromosome segmented into ten CNV regions by CNVs from five patients. A new region is started whenever a segment from any patient starts or ends
Fig. 4Example of matching query to CNV regions. The query ((≥,0.60,=,gain),(≤,0.40,≠,normal)) is evaluated against the ten CNV regions generated in Fig. 3. LOH frequencies have been left out for simplicity. Four regions match the predicate for group 1 (R6- R9), and eight regions match the predicate for group 2 (R1- R4, R7- R10). Three regions match the full query (R7, R8 and R9)
CNVRs extracted from penile cancer data set using the statistical model with p-value ≤ 0.05
| HPV-pos. | HPV-neg. | ||||||
|---|---|---|---|---|---|---|---|
| Chr | Start | End | Type | Q-value | freq. (%) | freq. (%) | |
| 4 | 9729740 | 24650257 | Loss | 0.0341 | 0.0955 | 21.4 | 0-3.7 |
| 2 | 230554659 | 234415376 | Loss | 0.0387 | 0.3650 | 28.6-35.7 | 0 |
| 17 | 15537019 | 18617236 | Loss | 0.0341 | 0.4632 | 21.4 | 0-3.7 |
| 2 | 204245506 | 207036312 | Loss | 0.0341 | 0.5235 | 21.4 | 0-3.7 |
| 2 | 237307835 | 238724893 | Loss | 0.0387 | 0.6505 | 28.6-35.7 | 0-3.7 |
| 9 | 33911175 | 34589574 | Gain | 0.0387 | 0.6977 | 28.6 | 3.7 |
| 4 | 40058630 | 40957235 | Loss | 0.0341 | 0.6977 | 21.4 | 0 |
| 17 | 19272468 | 20059509 | Loss | 0.0341 | 0.7198 | 21.4 | 0 |
| 17 | 7496965 | 8209436 | Loss | 0.0341 | 0.7295 | 21.4 | 3.7 |
| 8 | 37606006 | 38160563 | Gain | 0.0341 | 0.7325 | 21.4 | 0 |
| 19 | 51889824 | 52236621 | Gain | 0.0341 | 0.7465 | 21.4 | 0 |
| 16 | 779112 | 798699 | Gain | 0.0387 | 0.7907 | 28.6 | 3.7 |
| 5 | 130994540 | 131251586 | Loss | 0.0341 | 0.8083 | 21.4 | 0 |
| 2 | 218474389 | 218676793 | Loss | 0.0341 | 0.8163 | 21.4 | 0 |
| 3 | 53066401 | 53145339 | Loss | 0.0350 | 0.8387 | 35.7 | 0 |
| 2 | 240601010 | 240608450 | Loss | 0.0387 | 0.8415 | 28.6 | 3.7 |
Regions are sorted by q-value. The two rightmost columns contain the frequency of variations of the type corresponding to the type of event (column 4) for the case and control groups, respectively
CNVRs extracted from penile cancer data set using the query-based model with the Qloss query
| HPV-pos. | HPV-neg. | ||||
|---|---|---|---|---|---|
| Chr | Start | End | Q-value | loss (%) | loss (%) |
| 2 | 204245506 | 240688770 | 0.0120 | 21.4-35.7 | 0-7.41 |
| 4 | 9729740 | 24650257 | 0.2037 | 21.4 | 0 |
| 4 | 40058630 | 47037351 | 0.3377 | 21.4 | 0-7.41 |
| 5 | 140934280 | 147709268 | 0.3785 | 21.4-28.6 | 3.7-7.41 |
| 5 | 60380235 | 66930240 | 0.3835 | 21.4 | 3.7-7.41 |
| 5 | 76699391 | 82660330 | 0.3955 | 21.4 | 7.41 |
| 17 | 13330532 | 18617236 | 0.4525 | 21.4 | 0-7.41 |
| 5 | 126804776 | 131251586 | 0.4948 | 21.4 | 0-7.41 |
| 5 | 72910129 | 75996874 | 0.5863 | 21.4 | 7.41 |
| 17 | 19272468 | 22200000 | 0.6252 | 21.4 | 0-7.41 |
| 17 | 7262327 | 9931292 | 0.6633 | 21.4 | 0-7.41 |
| 17 | 567713 | 3136246 | 0.6793 | 21.4 | 7.41 |
| 3 | 46788991 | 49028973 | 0.6957 | 28.6 | 7.41 |
| 17 | 4566909 | 6490288 | 0.7335 | 21.4 | 3.7-7.41 |
| 5 | 108507593 | 110029337 | 0.7552 | 21.4-28.6 | 7.41 |
| 2 | 241709000 | 242951149 | 0.7762 | 21.4 | 3.7-7.41 |
| 3 | 52397990 | 53145339 | 0.8732 | 21.4-35.7 | 7.41 |
| 5 | 148184554 | 148894433 | 0.8770 | 21.4 | 3.7 |
| 3 | 14191317 | 14755952 | 0.8862 | 21.4 | 7.41 |
| 5 | 140011232 | 140459066 | 0.8912 | 21.4 | 3.7-7.41 |
| 5 | 92966197 | 93334626 | 0.8935 | 21.4 | 7.41 |
| 5 | 55167345 | 55283138 | 0.9103 | 21.4 | 7.41 |
| 5 | 122689458 | 122739532 | 0.9113 | 28.6 | 7.41 |
Searching for regions with a loss of copy number in at least 20% of cases and at most 10% of controls. The two rightmost columns contain the frequency of copy nuber loss for the case and control groups, respectively
CNVRs extracted from penile cancer data set using the query-based model with the Qgain query
| HPV-pos. | HPV-neg. | ||||
|---|---|---|---|---|---|
| Chr | Start | End | Q-value | gain (%) | gain (%) |
| 9 | 70091642 | 90376569 | 0.2762 | 21.4 | 7.41 |
| 9 | 13045258 | 21368309 | 0.4562 | 21.4 | 3.7-7.41 |
| 9 | 119273838 | 124799603 | 0.4885 | 21.4-28.6 | 7.41 |
| 9 | 33871385 | 37132743 | 0.5633 | 21.4-28.6 | 3.7-7.41 |
| 20 | 43179131 | 45592378 | 0.6395 | 21.4 | 7.41 |
| 6 | 13768413 | 16036680 | 0.6512 | 21.4 | 3.7 |
| 19 | 50012054 | 52236621 | 0.6655 | 21.4 | 0-7.41 |
| 9 | 91369725 | 93273440 | 0.7047 | 28.6 | 7.41 |
| 16 | 463336 | 1664507 | 0.7648 | 21.4-28.6 | 3.7-7.41 |
| 17 | 77682387 | 78774742 | 0.7680 | 21.4 | 7.41 |
| 6 | 31191394 | 31875972 | 0.7963 | 21.4 | 3.7 |
| 8 | 37606006 | 38160563 | 0.8532 | 21.4 | 0 |
| 20 | 33204027 | 33688992 | 0.8565 | 21.4 | 7.41 |
| 7 | 54865060 | 55296001 | 0.8600 | 21.4 | 7.41 |
| 20 | 42433913 | 42557332 | 0.8710 | 21.4 | 7.41 |
Searching for regions with at least 20% of cases having a gain and at most 10% of controls having any kind of variation. The two rightmost columns contain the frequency of copy number gain for the case and control groups, respectively
CNVRs extracted from penile cancer data set using they query-based model with the query
| HPV-positive | HPV-negative | ||||||
|---|---|---|---|---|---|---|---|
| Chr | Start | End | Q-value | Gain (%) | Loss (%) | Gain (%) | Loss (%) |
| 2 | 233245552 | 234392866 | 0.0382 | 0 | 35.7 | 0 | 3.7 |
| 2 | 237307836 | 238138001 | 0.0473 | 0-7.14 | 35.7 | 0 | 3.7 |
Searching for regions with a copy number loss in at least 30% of cases and any kind of variation in at most 5% of controls. Columns 5 and 6 show the frequency of copy number gain and loss for the case group, and columns 7 and 8 shows the frequency of gain and loss for the control group