| Literature DB >> 34723317 |
Cheng Quan1, Jie Ping1, Hao Lu1, Gangqiao Zhou1, Yiming Lu1.
Abstract
The rapid development of single-molecule long-read sequencing (LRS) and single-cell assay for transposase accessible chromatin sequencing (scATAC-seq) technologies presents both challenges and opportunities for the annotation of noncoding variants. Here, we updated 3DSNP, a comprehensive database for human noncoding variant annotation, to expand its applications to structural variation (SV) and to implement variant annotation down to single-cell resolution. The updates of 3DSNP include (i) annotation of 108 317 SVs from a full spectrum of functions, especially their potential effects on three-dimensional chromatin structures, (ii) evaluation of the accessible chromatin peaks flanking the variants across 126 cell types/subtypes in 15 human fetal tissues and 54 cell types/subtypes in 25 human adult tissues by integrating scATAC-seq data and (iii) expansion of Hi-C data to 49 human cell types. In summary, this version is a significant and comprehensive improvement over the previous version. The 3DSNP v2.0 database is freely available at https://omic.tech/3dsnpv2/.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34723317 PMCID: PMC8728236 DOI: 10.1093/nar/gkab1008
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Effects of structural variations (SVs) on local chromatin architecture. From top to bottom, the three chromatin loop tracks represent (i) the normal chromatin loops without considering any SV in the genome, (ii) the altered chromatin loops considering all possible SVs located in the relevant topologically associating domain (TAD) at the population scale and (iii) the altered chromatin loops considering only the presence of the query SV. The two tracks below the chromatin loop tracks are the RefSeq gene track and HGSVC v2 structural variation track. Red blocks in the SV track indicate deletions; blue blocks in the SV track indicate insertions; and the vertical blue line indicates the query SV.
Figure 2.Annotation of SNP or SV target cell types/subtypes using scATAC-seq data. (A) Pie chart of the percentages of cells with the nearest accessible chromatin peaks across 126 cell types/subtypes in 15 fetal tissues. (B) The Uniform Manifold Approximation and Projection (UMAP) plot of an open chromatin region across single cells. A total of 86 685 single cells across 126 cell types/subtypes are plotted. Single cells are represented by scattered hollow circles; cells with open chromatin states of the peak are marked with solid circles; and different colors represent the 126 cell types/subtypes.
Figure 3.New interface for visualizing chromatin architecture and accessibility surrounding the query variant. The IGV.js plugin is utilized to interactively display multiple genomic features in different tracks. The query SNP is marked by a vertical blue line; LD-linked SNPs of the query are marked by vertical gray lines. From top to bottom, the tracks represent: adjacent RefSeq genes, adjacent SNPs, structural variation in HGSVC2 or ClinVar CNV, fixation index statistics FST of surrounding 1000G SNPs, and scATAC-seq peaks averaged across cells for three cell types in three fetal tissues.