| Literature DB >> 31744472 |
Stanislav A Listopad1, Trina M Norden-Krichmar2.
Abstract
BACKGROUND: Researchers commonly analyze lists of differentially expressed entities (DEEs), such as differentially expressed genes (DEGs), differentially expressed proteins (DEPs), and differentially methylated positions/regions (DMPs/DMRs), across multiple pairwise comparisons. Large biological studies can involve multiple conditions, tissues, and timepoints that result in dozens of pairwise comparisons. Manually filtering and comparing lists of DEEs across multiple pairwise comparisons, typically done by writing custom code, is a cumbersome task that can be streamlined and standardized.Entities:
Keywords: Differential expression analysis; Differentially expressed genes (DEGs); Differentially expressed proteins (DEPs); Differentially methylated regions/positions (DMRs/DMPs); Python
Mesh:
Year: 2019 PMID: 31744472 PMCID: PMC6862834 DOI: 10.1186/s12859-019-3121-x
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Data and control flow diagram of A-Lister
Example of a Name List File
| Control | Treated1 | Treated2 |
|---|---|---|
| AADACL2 | AADACP1 | AADACP1 |
| AADACL4 | DUSP5P1 | AMICA1 |
Example of a DE-Sample File
| gene | locus | sample1 | sample2 | log2(FC) | |
|---|---|---|---|---|---|
| FAM3A | chrX:154506158–154,516,242 | q1 | q2 | 2.73 | 0.0023 |
| FAM3A | chrX:154506158–154,516,242 | q3 | q4 | 0.0649976 | 0.81 |
Example of a DE-Series File
| gene | locus | log2(FC) | log2(FC)2 | ||
|---|---|---|---|---|---|
| FAM3A | chrX:154506158–154,516,242 | 2.73 | 0.0023 | 0.0649976 | 0.81 |
A-Lister Command Line Interface (CLI)
| Command | Argument | Brief Description |
|---|---|---|
| name-list | Full path to the input file. | |
| The query to be performed over input name lists. | ||
| -id | Delimiter used in input file. | |
| -o | Output directory. | |
| -od | Delimiter used in output file. | |
| -v | Verbose flag. | |
| -e, −-examples | Show examples and exit. | |
| -h, −-help | Show help/manual and exit. | |
| diff-expression | Full path to the input file. | |
| The directional query to be performed over pairwise comparisons. | ||
| Specifies the layout of pairwise comparison within the file. | ||
| -n | Specifies the ID column in file. | |
| -fc | Specifies the fold change column(s) in file. | |
| -s1 | Specifies sample1 column in file. | |
| -s2 | Specifies sample2 column in file. | |
| -f | Filter parameter used to filter the files by columns/attributes. | |
| -id | Delimiter used in input file. | |
| -o | Output directory. | |
| -od | Delimiter used in output file. | |
| -v | Verbose flag. | |
| -e, −-examples | Show examples and exit. | |
| -h, −-help | Show help/manual and exit. |
Bolded parameters are mandatory
Example of Name-List Command with Intersection (AND) and Fuzzy Intersection (FAND) Query
| Set1 | Set2 | AND (same) | FAND (similar) |
|---|---|---|---|
| ACTA2 | ACTA2 | ACTA2 | ACTA2 |
| ACTG1 | ACTN1 | ACTN1 | ACTG1 |
| ACTN1 | ADAMTS1 | ADAMTS1 | ACTN1 |
| ACAD9 | ERVV-2 | FLNB | ADAMTS1 |
| ADAMTS1 | FLNB | KLF4 | ADAMTS9 |
| ADAMTS9 | KDM4A | PIK3R3 | FLNA |
| EXD1 | KLF4 | SLC20A1 | FLNB |
| FLNA | KLF6 | SLC25A25 | KLF10 |
| FLNB | MYOM1 | KLF2 | |
| KLF10 | PIK3R3 | KLF4 | |
| KLF2 | SLC19A2 | KLF6 | |
| KLF4 | SLC20A1 | PIK3IP1 | |
| OR1S1 | SLC25A25 | PIK3R3 | |
| PIK3IP1 | TNFRSF12A | SLC12A2 | |
| PIK3R3 | ZIM2 | SLC19A2 | |
| SLC12A2 | SLC20A1 | ||
| SLC20A1 | SLC25A25 | ||
| SLC25A25 | TNFRSF12A | ||
| TNFSF10 | TNFSF10 | ||
| ZFAT-AS1 |
Fig. 2Data and process flow chart for use case 2. Input Files M2, M4, and M5 each contain 65,678 rows of differentially expressed genes. A-Lister is used to filter these files by abs(log2(foldchange)) > 1.0. The filtered files are then processed by A-Lister with the OR and DIFF set operators, resulting in 36 genes
Fig. 3Heatmap visualization of significantly differentially expressed genes for use case 3. a All genes that are significantly differentially expressed for both LeanPre vs. LeanPost and OvobPre vs. OvobPost pairwise comparisons. b All genes that are significantly upregulated in both pairwise comparisons. c All genes that are significantly downregulated in both pairwise comparisons
Fig. 4Example of screenshot of Graphical User Interface (GUI) version of A-Lister
Comparison to existing software
| Application | Filters | Query | Set Operators | Input Type | Interface |
|---|---|---|---|---|---|
| A-Lister | yes | yes | AND, FAND, OR, DIFF | DE data, lists | Command Line and Graphical User Interface |
| Functional Heatmap [ | yes | no | AND (implicit) | DE timeseries data | Web App |
| Galaxy (Text Manipulation) [ | yes | no | AND, OR, DIFF | Tabular data | Web App |
| Intervene [ | no | no | AND | Genomic regions, binary, counts, lists | Command Line, Web App |
| VennPainter [ | no | no | AND | Lists | Graphical User Interface |
| InteractiVenn [ | no | no | AND | Lists | Web App |
| SuperExactTest [ | no | no | AND, OR | Lists | R |
| Gene-Overlap [ | no | no | AND, OR | Lists | R |