| Literature DB >> 31182012 |
Junyi Li1, Li Zhang1, Huinian Li1, Yuan Ping1, Qingzhe Xu1, Rongjie Wang2, Renjie Tan2, Zhen Wang3, Bo Liu2, Yadong Wang4,5.
Abstract
BACKGROUND: Numerous essential algorithms and methods, including entropy-based quantitative methods, have been developed to analyze complex DNA sequences since the last decade. Exons and introns are the most notable components of DNA and their identification and prediction are always the focus of state-of-the-art research.Entities:
Keywords: DNA sequences; Exon and intron prediction; Generalized topological entropy; Genomic signal processing; Information entropy
Mesh:
Substances:
Year: 2019 PMID: 31182012 PMCID: PMC6557737 DOI: 10.1186/s12859-019-2772-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Mean entropy value and number (in parentheses) of exons, introns and promoters on each chromosome in human genome
| Entropy (number) of exon | Entropy (number) of promoter | Entropy (number) of intron | |
|---|---|---|---|
| chr1 | 0.9653 (18043) | 0.9643 (13010) | 0.9689 (42806) |
| chr2 | 0.9677 (13911) | 0.9619 (9180) | 0.9687 (39446) |
| chr3 | 0.9651 (11456) | 0.9648 (7992) | 0.9707 (32834) |
| chr4 | 0.9656 (7087) | 0.9622 (5016) | 0.9697 (20301) |
| chr5 | 0.9668 (8036) | 0.9621 (5834) | 0.9707 (22176) |
| chr6 | 0.9653 (17918) | 0.9636 (12728) | 0.9687 (31005) |
| chr7 | 0.9652 (8159) | 0.9631 (6140) | 0.9678 (22410) |
| chr8 | 0.9646 (7170) | 0.9640 (5280) | 0.9682 (22011) |
| chr9 | 0.9642 (7084) | 0.9637 (5230) | 0.9681 (19486) |
| chr10 | 0.9677 (8529) | 0.9636 (6342) | 0.9690 (24883) |
| chr11 | 0.9651 (10006) | 0.9640 (7504) | 0.9690 (23462) |
| chr12 | 0.9660 (9533) | 0.9633 (6886) | 0.9695 (25561) |
| chr13 | 0.9651 (3589) | 0.9638 (2684) | 0.9699 (10396) |
| chr14 | 0.9666 (5948) | 0.9632 (4244) | 0.9691 (14017) |
| chr15 | 0.9644 (6857) | 0.9643 (4706) | 0.9691 (18634) |
| chr16 | 0.9636 (7303) | 0.9647 (5300) | 0.9691 (16214) |
| chr17 | 0.9642 (10218) | 0.9649 (7118) | 0.9687 (2295) |
| chr18 | 0.9656 (3041) | 0.9646 (2202) | 0.9697 (9062) |
| chr19 | 0.9689 (15539) | 0.9647 (8016) | 0.9705 (25077) |
| chr20 | 0.9634 (4724) | 0.9646 (3594) | 0.9702 (10692) |
| chr21 | 0.9639 (2326) | 0.9612 (1732) | 0.9692 (7033) |
| chr22 | 0.9626 (3985) | 0.9621 (2866) | 0.9675 (9221) |
| chrX | 0.9665 (6836) | 0.9615 (5392) | 0.9685 (14929) |
| chrY | 0.9647 (1121) | 0.9588 (1872) | 0.9671 (3320) |
Fig. 1Modified generalized topological entropy values of introns, exons and promoters
Fig. 2A portion of SVD curve and locations of real exons presented as red box
Fig. 3ROC curve for exon and intron prediction in gene AJ229040