Literature DB >> 23118488

HEXEvent: a database of Human EXon splicing Events.

Anke Busch1, Klemens J Hertel.   

Abstract

HEXEvent (http://hexevent.mmg.uci.edu) is a new database that permits the user to compile genome-wide exon data sets of human internal exons showing selected splicing events. User queries can be customized based on the type and the frequency of alternative splicing events. For each splicing version of an exon, an ESTs count is given, specifying the frequency of the event. A user-specific definition of constitutive exons can be entered to designate an exon exclusion level still acceptable for an exon to be considered as constitutive. Similarly, the user has the option to define a maximum inclusion level for an exon to be called an alternatively spliced exon. Unlike other existing splicing databases, HEXEvent permits the user to easily extract alternative splicing information for individual, multiple or genome-wide human internal exons. Importantly, the generated data sets are downloadable for further analysis.

Entities:  

Mesh:

Year:  2012        PMID: 23118488      PMCID: PMC3531206          DOI: 10.1093/nar/gks969

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The process of pre-mRNA splicing is essential for the expression of most metazoan genes. It is carried out by the spliceosome that catalyzes the removal of non-coding intronic sequences and concatenates remaining exons to form mature mRNAs (1). Of the approximately 25 000 genes encoded by the human genome (2), >90% are believed to produce transcripts that are alternatively spliced (3,4). The process of alternative splicing results in the production of multiple mRNA isoforms from a single pre-mRNA and thereby significantly enriches the proteomic diversity of higher eukaryotic organisms. Major splice events include exon skipping (also referred to as cassette exon events), exons with alternative 3′- and/or 5′-splice sites, intron retention, mutually exclusive exons, as well as alternative first and last exons. Given the complexity of higher eukaryotic genes and the relatively low conservation of splice sites, the precision of the splicing machinery is impressive. Defects in splicing lead to many human genetic diseases (5–7) and splicing mutations in a number of genes involved in growth control have been implicated in multiple types of cancer (8–12). The vast majority of alternative splicing events are biased toward high (>80%) or low (<20%) inclusion levels (Figure 1). This distribution is not only observed for alternative splicing events defined by Expressed Sequence Tag (ESTs), but also for alternative splicing events derived from deep sequencing data (13). Interestingly, alternatively spliced exons with high inclusion levels (>90%) display physical characteristics indistinguishable from constitutive exons (13), even when using machine learning techniques (our unpublished data). Thus, an exon that is associated with an extremely rare alternative splicing event may behave much more like a constitutive exon. These observations, in combination with the demonstration that even constitutive exons display low levels of alternative splicing (4), challenge the definition of constitutive and alternative exons. Yet, all genome-wide analyses of alternative splicing carried out in the past have relied on comparing sets of alternatively spliced exons with constitutive exons based on simple yes/no decisions, thus potentially introducing large errors. HEXEvent permits the user to define the inclusion level required for an exon to be considered as alternatively or constitutively spliced. This definition can range from the strictest to more relaxed constitutive splicing interpretations.
Figure 1.

Cassette exon inclusion levels determined from HEXEvent. The plot shows the relationship between exon inclusion levels and the cumulative number of events.

Cassette exon inclusion levels determined from HEXEvent. The plot shows the relationship between exon inclusion levels and the cumulative number of events. To carry out large-scale or genome-wide analyses on exons of a certain type, extensive exon data sets of that alternative splicing type are needed. ASPicDB (14) is a recently published alternative splicing database that reports a list of exons within a certain region or gene. However, this list is not made available for download. While the University of California, Santa Cruz (UCSC) Genome Browser offers sets of all alternatively spliced exons for download (15), those sets are missing three important pieces of information: (i) no set of constitutively spliced exons is available; (ii) inclusion and/or usage levels are not assigned to exons or splice sites; and (iii) all sets have a splice event centric view, meaning they list all exons that show a certain splicing event, but do not list all splicing events for an individual exon. Finally, neither ASPicDB nor the UCSC Genome Browser allow user-specific definitions for constitutive and alternative exons. In contrast, HEXEvent allows the user to tailor splicing categorization based on exon inclusion levels. In addition, the queried data is downloadable for further analysis regardless of whether it covers individual or genome-wide sets of internal human exons. Finally, all known alternative splicing events per exon are reported in the output file.

DATABASE GENERATION, CONTENT AND DEFINITIONS

HEXEvent is a new database, which assists the user in compiling genome-wide exon data sets. Its exon information is based on known mRNA isoforms as defined by the UCSC Genome Browser (GRCh37/hg19) (15) as well as available EST information. For each internal exon in the human genome, the number of EST events that include or exclude the queried exon or an alternative version of it was computed. Based on this information, inclusion/exclusion levels of each exon as well as usage frequencies of alternative splice sites are defined. The major splice version of each exon was defined, whereas all minor splice variants are indicated as alternative splicing events of that exon. HEXEvent includes information about exon skipping/inclusion and alternative 3′ and/or 5′-splice sites. At this stage, we do not include information on intron retention events because HEXEvent is currently based on EST information. Due to the short length of the ESTs, intron retention information would be biased toward short retained introns. If a new splice variant was found only in ESTs, but it was not included in the UCSC isoform list, we accepted it as a real new splice alternative (indicated as ‘onlyEST’ instead of assigning a gene name), if it showed the canonical splice sites (GU at the 5′-splice site, AG at the 3′-splice site). Although this splice site requirement might reduce the identification of new alternative splicing events associated with the minor spliceosome, it significantly reduces the number of false positives. Seemingly, alternative versions of an exon where the annotated splice sites differ only by 1 or 2 nt were combined to a single version of the exon because the addition of one or two more nucleotides rarely defines an alternative splice site (16). For each alternative splice version of an exon, the number of supporting ESTs is given. For cassette exons, the number of ESTs including and not including the exon is specified. For exons with one or more alternative splice sites, the location of the alternative sites are reported as well as the number of ESTs supporting them. The basic EST counts as defined in Table 1 are represented as follows: Based on these numbers several values are calculated for each exon. Based on the input, exons are filtered according to the user settings and a customized set of exons will be extracted and prepared for download. A summary of the basic workflow of the creation of the HEXEvent database is shown in Figure 2.
Table 1.

Definition of the columns in the output format of HEXEvent for a randomly chosen exon

No.Column nameExampleDescription
1chromochrXReference sequence chromosome name
2strand++ or − for strand
3start101 854 639First position of the exon (0-based)
4end101 854 775Last position of the exon (1-based)
5count15Number of ESTs that include the exon as given in columns [chromo], [strand], [start], and [end]
6alt310Number of ESTs that include the exon with an alternative 3′-splice site
7alt50Number of ESTs that include the exon with an alternative 5′-splice site
8alt3+52Number of ESTs that include the exon with an alternative 3′ and an alternative 5′-splice site simultaneously
9skip0Number of ESTs in which the exon is skipped
10constitLevel0.556Constitutive level of the exon (=)
11inclLevel1.000Inclusion level of the exon (= )
123usageLevel0.556Usage level of the major 3′-splice site of the exon )
135usageLevel0.926Usage level of the major 5′-splice site of the exon )
14alt3singleCount10Number(s) of ESTs for different alternative 3′- splice site
15alt3singleLoc101 854 633Location(s) of alternative 3′-splice sites
16alt5singleCount0Number(s) of ESTs for different alternative 5′-splice site
17alt5singleLoc#Location(s) of alternative 5′-splice sites
18alt3and5singleCount2Number(s) of ESTs for different alternative 3′- and 5′-splice sitecombinations
19alt3and5singleLocName101 854 633–101 854 787Location(s) of alternative 3′ and 5′-splice site combinations
20OnlyESTexonCount0Number of ESTs in which the exon is overlapped by an alternative version of it, that is not included in the human isoform list of the UCSC Genome Browser yet, but has at least one EST supporting it
21OnlyESTexons#Location(s) of alternative version(s) of the exon, that is/are not included in the human isoform list of the UCSC Genome Browser yet, but has/have at least one EST supporting it
22genenameARMCX5Name of the gene the exon is part of, if no gene name was assignedyet, it is indicated by ‘onlyEST’

The first four columns describe the location of the exon, whereas columns 5–9 give EST counts for inclusion (as given in columns 1–4), alternative splice site usage and exclusion of the exon. Column 10 specifies the constitutive level of the exon. Here, the grade of being constitutive is calculated by comparing the occurrence of the exon as specified in columns 1–4 with all other alternative events. In column 11, the inclusion level of the exon is given. Here, inclusion is calculated as the sum of ESTs showing the exon with the coordinates given in columns 1–4 and EST counts for alternative version of the exons showing an alternative 3′- and/or 5′-splice site, whereas exclusion is represented by the number of ESTs having this exon skipped. Columns 12 and 13 show the usage level of the major 3′-splice site and the major 5′-splice site (as given in columns 3 and 4 or columns 4 and 3 when on the negative strand), respectively. The usage level of the major 3′-splice site of the exon is calculated as the ratio of the number of ESTs showing this 3′-splice site, i.e. all ESTs showing the exons as given in columns 1–4 as well as all ESTs showing the exon with an alternative 5′-splice site, and the number of ESTs that include the exon with any splice site. The usage level of the major 5′-splice site is calculated analogously. The location of all alternative 3′-splice sites of the exon can be found in column 15, whereas the EST counts for each single one are given in column 14. Respective entries can be found in columns 16 and 17 for alternative 5′-splice sites, as well as in columns 18 and 19 for mutually occurring 3′- and 5′-splice sites. The EST count and location of the new versions of the exon that have EST evidence but are not confirmed events in the UCSC Genome Browser yet, are shown in columns 20 and 21. The location is given in the form ‘chromosomeSTRANDstart-end’. The last column shows the name of the gene the exon is part of. If none was assigned yet, ‘onlyEST’ is specified.

Figure 2.

Workflow during the creation of HEXEvent. We downloaded the UCSC Genes track, the spliced ESTs track, as well as the human mRNAs track from the UCSC Genome Browser. Using all three data sets, we extracted all known versions of human internal exons. An EST count was assigned to each version of each exon, specifying inclusion and exclusion levels. In a last step, overlapping exons were combined and indicated as alternative versions of each other.

ccount: the number of ESTs including the exon with the major coordinates; calt3: number of ESTs including the exon with an alternative 3′-splice site; calt5: number of ESTs including the exon with an alternative 5′-splice site; calt3+5: number of ESTs including the exon with both; an alternative 3′-splice site as well as an alternative 5′-splice site; and cskip: number of ESTs excluding the exon The constitutive level evaluates the inclusion of the major version of the exon and compares it to the number of occurrences of any alternative event. The constitutive level is defined as: . The alternative count includes all counts of possible alternative events, . The inclusion level compares the presence of the exon and any alternative version of it with its exclusion. The inclusion level is defined as: . The inclusion count equals the sum of ESTs showing the exon with major coordinates plus the number of ESTs showing the exon with an alternative 3′-splice site , with an alternative 5′-splice site or with an alternative 3′- and 5′-splice site (, see Table 1). The exclusion count equals the number of ESTs supporting exon skipping. The usage level of the 3′-splice site represents the usage ratio of the major 3′-splice site and any used 3′-splice site. It is defined as: . The number of ESTs showing the usage of the major 3′-splice site equals the sum of ESTs including the major version of the exon and the number of ESTs including the exon with an alternative 5′-splice site but the same 3′-splice site calt5 (). In contrast, the number of ESTs not showing the exon with its major 3′-splice site equals the sum of ESTs that show the exon either only with an alternative 3′-splice site or with mutually occurring 3′- and 5′-splice sites calt3+5 (). The usage level of the 5′-splice site is defined analogously to the usage level of the 3′-splice site as: . Workflow during the creation of HEXEvent. We downloaded the UCSC Genes track, the spliced ESTs track, as well as the human mRNAs track from the UCSC Genome Browser. Using all three data sets, we extracted all known versions of human internal exons. An EST count was assigned to each version of each exon, specifying inclusion and exclusion levels. In a last step, overlapping exons were combined and indicated as alternative versions of each other. Definition of the columns in the output format of HEXEvent for a randomly chosen exon The first four columns describe the location of the exon, whereas columns 5–9 give EST counts for inclusion (as given in columns 1–4), alternative splice site usage and exclusion of the exon. Column 10 specifies the constitutive level of the exon. Here, the grade of being constitutive is calculated by comparing the occurrence of the exon as specified in columns 1–4 with all other alternative events. In column 11, the inclusion level of the exon is given. Here, inclusion is calculated as the sum of ESTs showing the exon with the coordinates given in columns 1–4 and EST counts for alternative version of the exons showing an alternative 3′- and/or 5′-splice site, whereas exclusion is represented by the number of ESTs having this exon skipped. Columns 12 and 13 show the usage level of the major 3′-splice site and the major 5′-splice site (as given in columns 3 and 4 or columns 4 and 3 when on the negative strand), respectively. The usage level of the major 3′-splice site of the exon is calculated as the ratio of the number of ESTs showing this 3′-splice site, i.e. all ESTs showing the exons as given in columns 1–4 as well as all ESTs showing the exon with an alternative 5′-splice site, and the number of ESTs that include the exon with any splice site. The usage level of the major 5′-splice site is calculated analogously. The location of all alternative 3′-splice sites of the exon can be found in column 15, whereas the EST counts for each single one are given in column 14. Respective entries can be found in columns 16 and 17 for alternative 5′-splice sites, as well as in columns 18 and 19 for mutually occurring 3′- and 5′-splice sites. The EST count and location of the new versions of the exon that have EST evidence but are not confirmed events in the UCSC Genome Browser yet, are shown in columns 20 and 21. The location is given in the form ‘chromosomeSTRANDstart-end’. The last column shows the name of the gene the exon is part of. If none was assigned yet, ‘onlyEST’ is specified.

USAGE

To use the HEXEvent database, characteristics of the queried exons have to be specified by the user. These specifications include defining which types of alternative splicing events should be investigated and designating an inclusion level to classify constitutive and alternative exons. Additionally, the user is asked to specify whether exons are allowed to show alternative splicing events other than the selected ones. For each queried exon, HEXEvents reports the genome location, the type(s) of alternative splicing and their location, as well as the number of ESTs in support of the events. An EST count is given for each alternative version.

Input

All input selections are used to specify the type of exons the user is interested in. The basic input can be a genomic region, a list of genes or the whole genome. Second, the type of alternative splicing the user is interested in needs to be defined. The user can choose among options including all exon types, only constitutive exons, or any combination of cassette, alternative 3′-splice site, alternative 5′-splice site and simultaneously alternative 3′- and 5′-splice sites exons. Third, the definition of a constitutive exon can be specified, i.e. the user decides what exon constitutive level (as defined in ‘Database Generation, Content and Definitions’ section) is acceptable for an exon to still be called constitutive. Furthermore, the user can restrict the set of alternatively spliced exon events by defining an upper inclusion level, which, for instance, is useful when analyzing only low-inclusion exons. Analogously, the user has the option to restrict the 3′- and/or 5′-splice site usage level. Fourth, the user chooses whether selected alternative splicing events should be unique or can occur in combination with other splicing events. To do so, the user will be asked whether selected exon types should be ‘strict’. Here, a ‘non-strict’ exon definition means that the exon has to show at least one of the selected types of alternative splicing, but it may also be associated with any of the not selected types. In contrast, when a strict exon definition is chosen, the exon has to be involved in at least one of the selected alternative splicing types, but must not show any of the not selected types. The database will compile a final list of exons based on these input parameters to be displayed in the browser or to be downloaded and saved to a file.

Output

The output of each query will be a table of all requested exons showing the frequencies of their known splicing events. Depending on the user choice, the results will be displayed in the browser or they will be written to a downloadable text file. All output columns are described in Table 1.

Example applications

In an example application of the HEXEvent database, we are interested in all exons of the gene ARMCX4 and all their known splicing events. To get this information, the gene name needs to be specified in the input mask of the database. Furthermore, an interest in all types of exons, i.e. all alternative events, needs to be selected. The refinement of the exon type as well as a strict or non-strict exon definition has no effect on the output because we requested all exons, no matter what type. Selecting all exons overrides any other selections made. The output of this query is shown in Table 2.
Table 2.

Examples of HEXEvent database outputs

chromostrandstartendcountalt3alt5alt3+5skipconstit Levelincl Level3usage Level5usage Levelalt3single Countalt3single Localt5single Countalt5single Localt3and5single Countalt3and5 singleLocOnlyEST exonCountOnlyEST exonsgenename
All exons of gene ARMCX4, all alternative eventsa
chrX+100 673 897100 673 9881501000.9381.0001.0000.9380#1100 673 9840#0#ARMCX4
chrX+100 699 039100 699 143900070.5620.5621.0001.0000#0#0#0#ARMCX4
chrX+100 700 985100 701 032700050.5830.5831.0001.0000#0#0#0#ARMCX4
chrX+100 741 009100 741 0793355110.7330.9780.8640.8645100 741 0125100 741 0851100 741 012– 100 741 085b0#ARMCX4
chrX+100 742 179100 742 2594510000.9781.0000.9781.0001100 742 1910#0#1chrX+100 742 179– 100 742 255cARMCX4
chrX+100 742 594100 742 6784400001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 743 030100 743 0863870000.8441.0000.8441.0007100 742 9940#0#0#ARMCX4
chrX+100 743 430100 744 3025000190.2080.2081.0001.0000#0#0#0#ARMCX4
chrX+100 753 131100 754 353502000.7141.0001.0000.7140#2100 753 3180#0#ARMCX4
chrX+100 759 923100 760 342500001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 764 345100 764 414330000.5001.0000.5001.00031007643500#0#0#ARMCX4
chrX+100 764 576100 764 665700001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 766 011100 766 042700001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 779 190100 779 419210030.3330.5000.6671.00011007792720#0#0#ARMCX4
chrX+100 786 630100 786 999300001.0001.0001.0001.0000#0#0#0#ARMCX4
All exons with an alternative 3′-splice site of gene ARMCX4, no other alternative events allowedd
chrX+100 742 179100 742 2594510000.9781.0000.9781.0001100 742 1910#0#1chrX+100 742 179– 100 742 255cARMCX4
chrX+100 743 030100 743 0863870000.8441.0000.8441.0007100 742 9940#0#0#ARMCX4
chrX+100 764 345100 764 414330000.5001.0000.5001.0003100 764 3500#0#0#ARMCX4
All exons with an alternative 3′-splice site of gene ARMCX4, other alternative events possiblee
chrX+100 741 009100 741 0793355110.7330.9780.8640.8645100 741 0125100 741 0851100 741 012– 100 741 085b0#ARMCX4
chrX+100 742 179100 742 2594510000.9781.0000.9781.0001100 742 1910#0#1chrX+100 742 179– 100 742 255cARMCX4
chrX+100 743 030100 743 0863870000.8441.0000.8441.0007100 742 9940#0#0#ARMCX4
chrX+100 764 345100 764 414330000.5001.0000.5001.0003100 764 3500#0#0#ARMCX4
chrX+100 779 190100 779 419210030.3330.5000.6671.0001100 779 2720#0#0#ARMCX4

Columns are as defined in Table 1. All ‘hash’ symbol indicate a non-existing value, meaning that there are no alternative 3′- or 5′-splice sites known.

aOutput files were generated when searching for all internal human exons of the gene ARMCX4 including all of their known alternative splicing events.

bLocation of both alternative splice sites (3′- and 5′-splice site).

cLocation of an alternative version of the exon not found in the UCSC Gene list, but in ESTs. It is given in the format ‘chromosomeSTRANDstart-end’.

dOutput files were generated when searching for internal human exons of the gene ARMCX4 that have an alternative 3′-splice site, but do not show any other alternative event.

eOutput files were generated when searching for internal human exons of the gene ARMCX4 that have an alternative 3′-splice site and possibly show other alternative splicing events.

Examples of HEXEvent database outputs Columns are as defined in Table 1. All ‘hash’ symbol indicate a non-existing value, meaning that there are no alternative 3′- or 5′-splice sites known. aOutput files were generated when searching for all internal human exons of the gene ARMCX4 including all of their known alternative splicing events. bLocation of both alternative splice sites (3′- and 5′-splice site). cLocation of an alternative version of the exon not found in the UCSC Gene list, but in ESTs. It is given in the format ‘chromosomeSTRANDstart-end’. dOutput files were generated when searching for internal human exons of the gene ARMCX4 that have an alternative 3′-splice site, but do not show any other alternative event. eOutput files were generated when searching for internal human exons of the gene ARMCX4 that have an alternative 3′-splice site and possibly show other alternative splicing events. If the user is only interested in exons of the gene ARMCX4 that have an alternative 3′-splice site, only alternative 3′-splice site exons should be selected in the exon-type specification. If a strict exon-type definition is chosen, exons that have an alternative 3′-splice site, but can also be skipped or have an alternative 5′-splice site will not be shown (Table 2). To show those, a non-strict exon definition should be chosen (Table 2). In case the user requests only constitutive exons, a constitutive exon needs to be defined. In the most conservative scenario, where the user allows no alternative events to call an exon constitutive, the HEXEvent database will report the five exons shown in Table 3. In contrast, if the user allows up to 5% of the ESTs showing alternative splicing, the database will report one additional exon that shows one ESTs with an alternative 3′-splice site in addition to 45 ESTs including the major version of the exon (Table 3).
Table 3.

Output of constitutive exons of the gene ARMCX4

chromostrandstartendcountalt3alt5alt3+5skipconstit LevelinclLevel3usage Level5usage Levelalt3single Countalt3single Localt5single Countalt5single Localt3and5single Countalt3and5single LocOnlyESTexon CountOnlyEST exonsgenename
All constitutive exons of gene ARMCX4, no alternative eventsa
chrX+100 742 594100 742 6784400001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 759 923100 760 342500001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 764 576100 764 665700001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 766 011100 766 042700001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 786 630100 786 999300001.0001.0001.0001.0000#0#0#0#ARMCX4
All constitutive exons of gene ARMCX4, up to 5% alternative events allowedb
chrX+100 742 179100 742 2594510000.9781.0000.9781.0001100 742 1910#0#1chrX+100 742 179– 100 742 255ARMCX4
chrX+100 742 594100 742 6784400001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 759 923100 760 342500001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 764 576100 764 665700001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 766 011100 766 042700001.0001.0001.0001.0000#0#0#0#ARMCX4
chrX+100 786 630100 786 999300001.0001.0001.0001.0000#0#0#0#ARMCX4

Columns are as in Table 2 and as defined in Table 1.

aOutput of constitutive exons of the gene ARMCX4 with no alternative versions allowed.

bOutput of constitutive exons of the gene ARMCX4 with at most 5% of ESTs showing alternative versions of the exon.

Output of constitutive exons of the gene ARMCX4 Columns are as in Table 2 and as defined in Table 1. aOutput of constitutive exons of the gene ARMCX4 with no alternative versions allowed. bOutput of constitutive exons of the gene ARMCX4 with at most 5% of ESTs showing alternative versions of the exon.

DISCUSSION

HEXEvent allows users to apply their own definition of alternative/constitutive exons and generates a list of exons matching the input criteria. For each queried exon, HEXEvents reports the genome location, the type(s) of alternative splicing, their location and EST counts supporting each alternative version. Based on these entries, an exon inclusion and splice site usage levels are reported. The HEXEvent database is a valuable tool to customize future bioinformatic analyses of alternative splicing. While HEXEvent is currently based on UCSC Genome Browser and available EST information, we plan to add alternative splicing and exon inclusion data derived from deep sequencing reactions in the very near future. Furthermore, we intend to expand the database to other species. Comparison with other databases: while the UCSC Genome Browser offers sets of alternatively spliced exons for download (15), those sets do not include inclusion/usage-level information and no set of constitutively spliced exons is made available. Furthermore, all available sets of alternatively spliced exons have a splice event centric view, i.e. they include all exons that show a certain splicing event, but do not list all splicing events for an individual exon. ASPicDB is a database providing information about the splicing pattern of human genes (14). While ASPicDB also reports a list of exons in a certain region or gene, this list cannot be downloaded. In contrast, the lists of exons generated by HEXEvent are downloadable to provide the foundation for subsequent bioinformatic analyses. Thus, HEXEvent is suitable for local single-gene analyses as well as for more complex or genome-wide analyses using downloaded lists. Furthermore, in HEXEvent, users can set their own definition of constitutive and alternative exons by specifying inclusion levels up to which an exon is considered a member of either category. Finally, HEXEvent summarizes alternative splice site versions of the same exons to one entry, thereby specifying all possible alternative splice sites. In comparison to HEXEvent, ASPicDB has one entry per alternative splice site version, which inherently makes the output harder to view and process. While this is a comparison to the most recent alternative splicing database published in NAR, it is worth noting that several other useful databases exist with prior publication date, such as Hollywood (17) or ASD/ASTD (18). Alternative Splicing Database/Alternative Splicing and Transcript Diversity database (ASD/ASTD) was closed in the beginning of 2012. Instead, their features have been integrated in Ensembl (19). These databases are excellent venues to interrogate the splicing patterns of individual genes, especially in light of their excellent accompanying graphics. However, all of the existing databases are limited by (i) not offering the ability to download multiple splicing events at a time and by (ii) not permitting user definitions of alternative splicing. The most useful features of HEXEvent bridge this gap, thus permitting users to custom design genome-wide exon data sets.

AVAILABILITY

This HEXEvent database is freely available at http://hexevent.mmg.uci.edu and open to all users. There is no login requirement.

FUNDING

Funding for open access charge: National Institutes of Health [RO1 GM62287 and R21 CA149548 to K.J.H.]; Postdoc Programme of the German Academic Exchange Service, DAAD (fellowship to A.B.). Conflict of interest statement. None declared.
  19 in total

1.  The mutational spectrum of single base-pair substitutions in mRNA splice junctions of human genes: causes and consequences.

Authors:  M Krawczak; J Reiss; D N Cooper
Journal:  Hum Genet       Date:  1992 Sep-Oct       Impact factor: 4.132

2.  Genomic splice-site analysis reveals frequent alternative splicing close to the dominant splice site.

Authors:  Yimeng Dou; Kristi L Fox-Walsh; Pierre F Baldi; Klemens J Hertel
Journal:  RNA       Date:  2006-10-19       Impact factor: 4.942

Review 3.  Combinatorial control of exon recognition.

Authors:  Klemens J Hertel
Journal:  J Biol Chem       Date:  2007-11-16       Impact factor: 5.157

4.  Alternative splicing of fibroblast growth factor receptor 2 (FGF-R2) in human prostate cancer.

Authors:  R P Carstens; J V Eaton; H R Krigman; P J Walther; M A Garcia-Blanco
Journal:  Oncogene       Date:  1997-12-18       Impact factor: 9.867

5.  Finishing the euchromatic sequence of the human genome.

Authors: 
Journal:  Nature       Date:  2004-10-21       Impact factor: 49.962

6.  Discovery of novel splice forms and functional analysis of cancer-specific alternative splicing in human expressed sequences.

Authors:  Qiang Xu; Christopher Lee
Journal:  Nucleic Acids Res       Date:  2003-10-01       Impact factor: 16.971

Review 7.  Splice variants as cancer biomarkers.

Authors:  Brigitta M N Brinkman
Journal:  Clin Biochem       Date:  2004-07       Impact factor: 3.281

8.  Computational analysis and experimental validation of tumor-associated alternative RNA splicing in human cancer.

Authors:  Zhining Wang; H Shuen Lo; Howard Yang; Sheryl Gere; Ying Hu; Kenneth H Buetow; Maxwell P Lee
Journal:  Cancer Res       Date:  2003-02-01       Impact factor: 12.701

9.  Ensembl 2012.

Authors:  Paul Flicek; M Ridwan Amode; Daniel Barrell; Kathryn Beal; Simon Brent; Denise Carvalho-Silva; Peter Clapham; Guy Coates; Susan Fairley; Stephen Fitzgerald; Laurent Gil; Leo Gordon; Maurice Hendrix; Thibaut Hourlier; Nathan Johnson; Andreas K Kähäri; Damian Keefe; Stephen Keenan; Rhoda Kinsella; Monika Komorowska; Gautier Koscielny; Eugene Kulesha; Pontus Larsson; Ian Longden; William McLaren; Matthieu Muffato; Bert Overduin; Miguel Pignatelli; Bethan Pritchard; Harpreet Singh Riat; Graham R S Ritchie; Magali Ruffier; Michael Schuster; Daniel Sobral; Y Amy Tang; Kieron Taylor; Stephen Trevanion; Jana Vandrovcova; Simon White; Mark Wilson; Steven P Wilder; Bronwen L Aken; Ewan Birney; Fiona Cunningham; Ian Dunham; Richard Durbin; Xosé M Fernández-Suarez; Jennifer Harrow; Javier Herrero; Tim J P Hubbard; Anne Parker; Glenn Proctor; Giulietta Spudich; Jan Vogel; Andy Yates; Amonida Zadissa; Stephen M J Searle
Journal:  Nucleic Acids Res       Date:  2011-11-15       Impact factor: 16.971

10.  HOLLYWOOD: a comparative relational database of alternative splicing.

Authors:  Dirk Holste; George Huo; Vivian Tung; Christopher B Burge
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

View more
  19 in total

1.  TET-catalyzed oxidation of intragenic 5-methylcytosine regulates CTCF-dependent alternative splicing.

Authors:  Ryan J Marina; David Sturgill; Marc A Bailly; Morgan Thenoz; Garima Varma; Maria F Prigge; Kyster K Nanan; Sanjeev Shukla; Nazmul Haque; Shalini Oberdoerffer
Journal:  EMBO J       Date:  2015-12-28       Impact factor: 11.598

2.  Comparative analysis of housekeeping and tissue-selective genes in human based on network topologies and biological properties.

Authors:  Lei Yang; Shiyuan Wang; Meng Zhou; Xiaowen Chen; Yongchun Zuo; Dianjun Sun; Yingli Lv
Journal:  Mol Genet Genomics       Date:  2016-02-20       Impact factor: 3.291

3.  Pre-mRNA Splicing Modulation by Antisense Oligonucleotides.

Authors:  Natalia N Singh; Diou Luo; Ravindra N Singh
Journal:  Methods Mol Biol       Date:  2018

4.  Splice site proximity influences alternative exon definition.

Authors:  Francisco Carranza; Hossein Shenasa; Klemens J Hertel
Journal:  RNA Biol       Date:  2022-01       Impact factor: 4.766

5.  Alternative RNA structures formed during transcription depend on elongation rate and modify RNA processing.

Authors:  Tassa Saldi; Kent Riemondy; Benjamin Erickson; David L Bentley
Journal:  Mol Cell       Date:  2021-02-24       Impact factor: 17.970

6.  Splicing predictions reliably classify different types of alternative splicing.

Authors:  Anke Busch; Klemens J Hertel
Journal:  RNA       Date:  2015-03-24       Impact factor: 4.942

Review 7.  Biological databases for human research.

Authors:  Dong Zou; Lina Ma; Jun Yu; Zhang Zhang
Journal:  Genomics Proteomics Bioinformatics       Date:  2015-02-21       Impact factor: 7.691

8.  MiasDB: A Database of Molecular Interactions Associated with Alternative Splicing of Human Pre-mRNAs.

Authors:  Yongqiang Xing; Xiujuan Zhao; Tao Yu; Dong Liang; Jun Li; Guanyun Wei; Guoqing Liu; Xiangjun Cui; Hongyu Zhao; Lu Cai
Journal:  PLoS One       Date:  2016-05-11       Impact factor: 3.240

9.  IsoPlot: a database for comparison of mRNA isoforms in fruit fly and mosquitoes.

Authors:  I-Man Ng; Jia-Hsin Huang; Shang-Chi Tsai; Huai-Kuang Tsai
Journal:  Database (Oxford)       Date:  2017-01-01       Impact factor: 3.451

10.  ExoLocator--an online view into genetic makeup of vertebrate proteins.

Authors:  Aik Aun Khoo; Mario Ogrizek-Tomas; Ana Bulovic; Matija Korpar; Ece Gürler; Ivan Slijepcevic; Mile Šikic; Ivana Mihalek
Journal:  Nucleic Acids Res       Date:  2013-11-23       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.