| Literature DB >> 27222811 |
Bo Yuan1, Pengfei Liu1, Jeffrey Rogers2, James R Lupski3.
Abstract
Array comparative genomic hybridization (aCGH) has been widely used to detect copy number variants (CNVs) in both research and clinical settings. A customizable aCGH platform may greatly facilitate copy number analyses in genomic regions with higher-order complexity, such as low-copy repeats (LCRs). Here we present the aCGH analyses focusing on the 45 kb LCRs [1] at the NPHP1 region with diverse copy numbers in humans. Also, the interspecies aCGH analysis comparing human and nonhuman primates revealed dynamic copy number transitions of the human 45 kb LCR orthologues during primate evolution and therefore shed light on the origin of complexity at this locus. The original aCGH data are available at GEO under GSE73962.Entities:
Keywords: Array comparative genomic hybridization; Low-copy repeats; NPHP1; Primate evolution
Year: 2016 PMID: 27222811 PMCID: PMC4872932 DOI: 10.1016/j.gdata.2016.04.008
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Fig. 1aCGH design at the human NPHP1 locus. The aCGH design is shown as a UCSC Genome Browser custom track (top) together with the tracks of UCSC genes (middle) and Segmental Dups (bottom). The tracks were aligned according to genomic coordinates in the window Chr2:110488479-111402265. As illustrated, only the LCR copies on the left side of NPHP1 were extensively covered with aCGH probes. In the bottom Segmental Dups track, multiple LCRs including the 45 kb LCRs and 358 kb LCRs are presented for this region. We focus on the three copies of the 45 kb LCRs (highlighted in red) and two copies of the 358 kb LCRs (highlighted in blue) [1] in the haploid reference genome. The different copies of each LCR group were annotated on the Segmental Dups track with red and blue numbers, respectively.
Fig. 2Assessment of aCGH data quality. A. Value distribution of log2 ratios (LR) shown in box plots. Thirty-two samples from six species were analyzed. The wide value distribution observed in PT7 may be due to low DNA quality. The color annotations for each species are shown on the top right of the figure. B. Correlation between sequence identity and aCGH data quality. DLRS (derivative log ratio spread) is a measurement of standard deviation of the differences between adjacent points (noisiness) in log ratio data. The DLRS (Y-axis) is plotted against sequences identity (X-axis). Each gold triangle represents a sample. A strong correlation (R2 = 0.9717) is observed.
Fig. 3Examples of aCGH data for each species. Shown are aCGH LR plots of four human DNA samples with different copy number of the 45 kb LCRs and one nonhuman primate sample of each species. Red dots, probes with LR ≥ 0.25; black dots, probes with − 0.25 < LR < 0.25; green dots, probes with LR ≤ − 0.25. The human 45 kb LCRs is in the region flanked by the shaded areas. The open green boxes surrounding plots in the 5th and 6th aCGH panels include probes with LR value lower than − 2.0, indicating homozygous DNA losses. The species, copy number indicated by previous studies (CN*), theoretical LR calculated using CN*, experimental LR value based on the actual aCGH data from the current study, and the estimated copy number from the experimental LR value (CN#) are shown on the right side of the aCGH plots. The symbol “?” denotes unknown copy number before aCGH analysis.
| Specifications | |
| Organism/cell line/tissue | |
| Sex | NA |
| Sequencer or array type | Agilent customized aCGH with 8x60K format |
| Data format | Analyzed |
| Experimental factors | Normal |
| Experimental features | Copy number estimation of the human 45 kb LCRs or its nonhuman primate orthologues in the test samples using NA10851 as control |
| Consent | NA |
| Sample source location | Houston, TX USA |