| Literature DB >> 20622839 |
Mauricio A Arias, Shengdong Ke, Lawrence A Chasin.
Abstract
Entities:
Mesh:
Substances:
Year: 2010 PMID: 20622839 PMCID: PMC3010375 DOI: 10.1038/nbt0710-686
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Fig. 1Scheme for associating RNA sequence features with splicing outcomes. Top left: More than 1000 diverse features were used; the examples shown here were chosen to illustrate their diversity. Each feature was also defined by the region in which it occurs, as indicated on the map on the lower left, where the alternatively spliced exon is red. Upper right: Exon inclusion data were originally measured in 27 mouse tissues or cell lines using microarrays and then consolidated into four tissue types: C, central nervous system; M, striated and cardiac muscle; D, digestion related tissues; E, embryonic tissue and stem cells. A machine learning algorithm was devised to associate particular features with particular splicing outcomes; the latter being categorized as increased exon inclusion, increased exon exclusion, or no difference in comparing two tissue types. After training on a set of ~3000 exons, the algorithm was able to reliably predict these splicing outcomes in a set of test exons.