| Literature DB >> 28985730 |
Petr Ponomarenko1, Alex Ryutov2, Dennis T Maglinte2, Ancha Baranova3,4,5, Tatiana V Tatarinova6,7,8, Xiaowu Gai9,10.
Abstract
BACKGROUND: With 15,949 markers, the low-density Infinium QC Array-24 BeadChip enables linkage analysis, HLA haplotyping, fingerprinting, ethnicity determination, mitochondrial genome variations, blood groups and pharmacogenomics. It represents an attractive independent QC option for NGS-based diagnostic laboratories, and provides cost-efficient means for determining gender, ethnic ancestry, and sample kinships, that are important for data interpretation of NGS-based genetic tests.Entities:
Keywords: Clinical exome sequencing; Ethnicity; Infinium QC Array-24; Kinship; NGS-based molecular diagnostic tests; Quality control; Sample identity
Mesh:
Year: 2017 PMID: 28985730 PMCID: PMC5639583 DOI: 10.1186/s12920-017-0297-7
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Infinium QC Array-24 variants sorted by their category and source
| Marker Category | Category Description | Number of Markers |
|---|---|---|
| ADME | Pharmacogenomics, from | 1009 |
| AIM | Ancestry Informative markers from exome array ( | 2910 |
| Blood group | From NCBI’s | 1659 |
| Fingerprint | High MAF SNPs unlikely to be in LD with each other, from | 477 |
| Linkage | Linkage Panel by Illumina, contains heterozygous SNPs to test for Mendelian disorders, from Linkage 12 array | 5486 |
| Extended MHC | Variants from extended major histocompatibility complex MHC covering 8 Mb region containing immune markers | 930 |
| Mitochondrial | Determination of mtDNA haplogroups | 141 |
| Sex chromosomes | X-chromosome specific | 1840 |
| Y-chromosome specific | 1401 | |
| Pseudoautosomal Regions | 535 |
Fig. 1Concordance for same samples between Infinium QC and 1000 Genomes data (purple), and for different unrelated samples (black), between parent and child, siblings, and other relatives
Fig. 2Concordance histogram for all possible pairs of samples from iScan and CES experiments
Kinship (estimated by KING) 1000 Genomes
| Relatedness | Median Kinship | Sample size | Theoretical kinship | Min KIN | Max KIN |
|---|---|---|---|---|---|
| Siblings | 0.2354 | 9 | 0.25 | −0.0081 | 0.3029 |
| Parent-Child | 0.2441 | 221 | 0.25 | 0.1712 | 0.2620 |
| Second Order | 0.1107 | 9 | 0.125–0.1875 | 0.0714 | 0.1475 |
| Unrelated | −0.1300 | 1679 | <0.001 | −0.3074 | 0.0443 |
Ethnic composition of the subset of the 1000 Genomes samples genotyped on the Infinium QC array
| Population code | Number of samples | Population |
|---|---|---|
| ASW | 90 | Americans of African Ancestry in SW USA |
| CEU | 88 | Utah Residents (CEPH) with Northern and Western Ancestry |
| CHB | 38 | Han Chinese in Beijing, China |
| GIH | 77 | Gujarati Indian from Houston, Texas |
| JPT | 45 | Japanese in Tokyo, Japan |
| MXL | 82 | Mexican Ancestry from Los Angeles, USA |
| PUR | 72 | Puerto Ricans from Puerto Rico |
| TSI | 83 | Toscani in Italy |
| YRI | 88 | Yoruba in Ibadan, Nigeria |
reAdmix assignments, average number of ethnicities
| Population | Average number of ethnic assignments per individual | Weight of the most significant ethnic assignment |
|---|---|---|
| PUR | 1.78 | 0.59 |
| CEU | 1.58 | 0.67 |
| MXL | 1.39 | 0.65 |
| ASW | 1.28 | 0.76 |
| TSI | 1.25 | 0.74 |
| GIH | 1.18 | 0.82 |
| CHB | 1.13 | 0.90 |
| YRI | 1.01 | 0.99 |
reAdmix assignments, grouped by 1000 Genomes categories
| 1000 Genomes | Global | Number of assignments | Total | Fraction |
|---|---|---|---|---|
| ASW | AFRICA | 86 | 90 | 0.955556 |
| ASW | NATIVE AMERICAN | 2 | 90 | 0.022222 |
| ASW | EUROPE | 1 | 90 | 0.011111 |
| ASW | MIX AFRICAN/EUROPEAN | 1 | 90 | 0.011111 |
| CEU | EUROPE | 83 | 88 | 0.943182 |
| CEU | NORTH ASIA | 3 | 88 | 0.034091 |
| CEU | NATIVE AMERICAN | 2 | 88 | 0.022727 |
| CHB | EAST ASIA | 68 | 84 | 0.809524 |
| CHB | INDIA | 14 | 84 | 0.166667 |
| CHB | NORTH ASIA | 2 | 84 | 0.02381 |
| GIH | INDIA | 77 | 77 | 1 |
| MXL | NATIVE AMERICAN | 64 | 82 | 0.780488 |
| MXL | EUROPE | 9 | 82 | 0.109756 |
| MXL | NORTH ASIA | 6 | 82 | 0.073171 |
| MXL | NEAR EAST | 3 | 82 | 0.036585 |
| PUR | NEAR EAST | 17 | 72 | 0.236111 |
| PUR | EUROPE | 18 | 72 | 0.25 |
| PUR | NATIVE AMERICAN | 12 | 72 | 0.166667 |
| PUR | AFRICA | 12 | 72 | 0.166667 |
| PUR | MIX AFRICAN/EUROPEAN | 9 | 72 | 0.125 |
| PUR | NORTH ASIA | 3 | 72 | 0.041667 |
| PUR | NEAR EAST | 1 | 72 | 0.013889 |
| TSI | EUROPE | 70 | 83 | 0.843373 |
| TSI | NEAR EAST | 13 | 83 | 0.156627 |
| YRI | AFRICA | 88 | 88 | 1 |
Haplogroups for 33 in-house samples using 143 markers from the Infinium QC array (HaploGrep 2) and all sequence data (Phy-Mer)
| Sample ID | HaploGrep 2 (Human QC array) | Phy-Mer (CES data) |
|---|---|---|
| CPM10 | C1d1 | C1d1c1 |
| CPM11 | HV | B4a1a1 |
| CPM12 | N | A2w1 |
| CPM13 | H | H5a3b |
| CPM14 | N | A2d1 |
| CPM15 | N | A2–64 |
| CPM16 | N | W1 |
| CPM17 | N | A2–64-@153 |
| CPM18 | C1b14 | C1b14 |
| CPM19 | HV2 | B4c1b2a2 |
| NA12878 | H | H13a1a1a |
| CPM20 | N | A2r |
| CPM21 | D4 | D1h1 |
| CPM22 | D4 | D1 |
| CPM23 | C1c | C1c |
| CPM24 | B2 | B2v |
| CPM25 | T | T2b |
| CPM26 | C | C1b7a |
| CPM27 | H | H48 |
| CPM28 | N | A2 |
| CPM29 | M | M7c1a4a |
| CPM30 | HV | R9b2 |
| CPM31 | M | M7c1a4a |
| CPM32 | C | C1d |
| CPM33 | C | C1d |
| CPM34 | C | C1d-194 |
| CPM36 | N | A2 |
| CPM4 | L3 | L3b1a |
| CPM5 | D4j | D4j5 |
| CPM6 | L2a1c | L2a1c5 |
| CPM7 | A5 | A5a |
| CPM8 | N | A2w1 |
| CPM9 | K | K1a4b1 |
1000 Genomes records showing error in the database
| Family ID | Individual ID | Paternal ID | Maternal ID | Gender | Phenotype | Population | Relationship | Siblings | Second Order | Third Order | Other Comments |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2484 | NA20334 | 0 | 0 | 2 | 0 | ASW | mother | NA20336 | NA20337 | 0 | 0 |
| 2484 | NA20335 | 0 | NA20334 | 1 | 0 | ASW | child | 0 | NA20336 | 0 | 0 |
| 2484a | NA20355 | 0 | 0 | 2 | 0 | ASW | unrel | 0 | 0 | 0 | 0 |
| 2485 | NA20336 | 0 | 0 | 2 | 0 | ASW | mother | NA20344 | NA20335 | 0 | 0 |