| Literature DB >> 32313420 |
Shesh N Rai1,2,3,4, Chen Qian1,2, Jianmin Pan1, Marion McClain3,4,5, Maurice R Eichenberger6, Craig J McClain3,4,5,7, Susan Galandiuk6.
Abstract
The analysis of plasma microRNAs (miRNAs) has been widely used as a method for finding potential biomarkers for human diseases, especially those with a link to cancer. Methods of analyzing plasma miRNA have been thoroughly discussed from sample extraction to data modeling. However, some issues exist within the process that have rarely been talked about. Rice et al. discussed some issues in plasma miRNA studies, such as the lack of standard methodology including the use of different cycle threshold, time to plasma extraction, among others. These issues can lead to inconsistent data, and thus impact the result and assay reproducibility. Other external issues, such as batch effect and operator effect, may also indirectly impact the statistical analysis. Here, we discuss issues in plasma miRNA studies from a statistical point of view. The interaction effect of different ways of calculating fold-change, the choice of housekeeping genes, and methods of normalization are among the issues we discuss, with data demonstrations. P values are calculated and compared to determine the effect of those issues on statistical conclusions. Statistical methods such as analysis of variance and analysis of covariance are crucial in the analysis of miRNA but investigators are often confused about them; therefore, a brief explanation of these statistical methods is also included. In addition, 3-group classification is discussed, as it is often challenging, compared with 2-group classification.Entities:
Keywords: ANCOVA; ANOVA; batch effect; classification; fold-change; housekeeping genes; normalization; operator effect; plasma miRNA; quantile normalization; varying threshold
Year: 2020 PMID: 32313420 PMCID: PMC7157974 DOI: 10.1177/1176934320913338
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1.Flowchart of the data processing. BC indicates breast cancer; CAA, colorectal advanced adenoma; CRC, colorectal cancer; FC, fold-change; LC, lung cancer; PC, pancreatic cancer.
List of statistical issues in plasma miRNA analysis.
| Varying threshold |
| Batch effect |
| Operator effect |
| Normalization methods |
| Choice of housekeeping genes |
| Methods of fold-change |
| Sample size issue |
| Group classification |
Abbreviation: miRNA, microRNA.
Comparison between fixed threshold and variable threshold.
| MicroRNAs | Fixed threshold (0.03) | Variable threshold |
|---|---|---|
| miR374 |
|
|
| miR142-3p |
|
|
| miR523 |
| .0718 |
| miR374-5p |
|
|
| miR376c | .3540 | .2097 |
| miR27a |
|
|
| miR520d-5p | .7852 | .4820 |
| miR122 | .5186 | .6140 |
| miR485-3p | .2369 | .5905 |
| miR21 |
|
|
| miR218 |
|
|
Significance level = 0.05.
Comparison between batch effect and operator effect.
| MicroRNAs | Fixed threshold = 0.03 | |||
|---|---|---|---|---|
| % change in | ||||
| miR374 | .9930 | .5532 |
| 10.8 |
| miR142-3p | .6838 | .9329 |
| 22.4 |
| miR523 | .5541 | .0549 |
| –3.79 |
| miR374-5p | .7785 | .1278 |
| –23.4 |
| miR376c | .4627 | .2647 | .4848 | 36.9 |
| miR27a | .1894 | .5129 |
| 2.3 |
| miR520d-5p | .2651 | .0707 | .9788 | 24.7 |
| miR122 | .6168 | .7082 | .4998 | –3.6 |
| miR485-3p | .7991 | .7192 | .2403 | 1.4 |
| miR21 | .9481 |
|
| –50.8 |
| miR218 | .7525 | .4029 |
| 24.6 |
Significance level = 0.05.
Interaction effect between normalizations, fold-changes, and housekeeping genes.
| MicroRNAs | Delta-Ct normalization | Quantile normalization | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HKG = miR520-5p | HKG = RNU6 | HKG = miR520-5p | HKG = RNU6 | |||||||||
| FC1 | FC2 | FC1 | FC2 | FC1 | FC2 | FC1 | FC2 | |||||
| miR192 | .9377 | 0.9763 | 0.1758 | .2516 | 0.7494 | 0.3513 | .3342 | 0.8350 | 0.3916 | .7616 | 0.9232 | 0.5166 |
| miR29c |
| 0.4801 | 0.1246 |
| 0.3612 | 0.2693 |
| 0.3848 | 0.1947 |
| 0.4014 | 0.2212 |
| miR21 |
| 0.4185 | 0.1895 |
| 0.3292 | 0.2242 |
| 0.3535 | 0.1774 |
| 0.3943 | 0.2043 |
| miR19a |
| 0.5574 | 0.196 |
| 0.4177 | 0.2241 |
| 0.3841 | 0.1693 |
| 0.4061 | 0.1889 |
| miR150 | .3785 | 1.2773 | 0.3885 | .9134 | 0.9793 | 0.9782 | .4749 | 0.8639 | 0.6431 | .7932 | 0.9342 | 0.9943 |
| miR374 |
| 0.2648 | 0.0989 |
| 0.2045 | 0.1254 |
| 0.2528 | 0.1968 |
| 0.2794 | 0.1723 |
| miR193a-5p | .4168 | 1.2684 | 0.3363 | .9311 | 0.9796 | 0.6821 | .6508 | 0.8921 | 0.0397 | .6539 | 1.1342 | 1.0682 |
| miR346 | .8836 | 1.0265 | 0.7007 | .2965 | 0.7321 | 0.5721 | .9617 | 1.0091 | 0.4971 | .7547 | 0.8985 | 0.5378 |
| miR372 |
| 2.1137 | 2.4506 | .1037 | 1.7181 | 1.7517 | .2731 | 1.2768 | 0.8368 |
| 1.8760 | 1.0264 |
| miR122 | .0525 | 0.4163 | 0.0563 |
| 0.3289 | 0.043 |
| 0.2106 | 0.0051 |
| 0.2451 | 0.0503 |
Abbreviations: FC, fold-change; HKG, housekeeping genes.
Significance level = 0.05.
Sample size required at power of 90% at different FDR level.
| FDR | Adjusted α | Sample size required | |
|---|---|---|---|
| One-sided | Two-sided | ||
| 1% | 0.00033 | 23 | 25 |
| 5% | 0.0018 | 18 | 20 |
| 10% | 0.0036 | 16 | 18 |
Abbreviation: FDR, false discovery rate.
Sample size required for classification at α = 0.05.
| Number of miRNAs | Power | Sample size required per group |
|---|---|---|
| 10 | 90 | 120 |
| 5-8 | 80 | 96 |
| 5-8 | 68 | 72 |
Abbreviation: miRNA, microRNAs.
Comparison between 2 quantile normalization methods.
| MicroRNAs | Quantile normalization | Quantile normalization | ||||
|---|---|---|---|---|---|---|
| Approach 1 | Approach 2 | |||||
| FC1 | FC2 | FC1 | FC2 | |||
| miR192 | .4079 | 0.8880 | 0.9904 |
| 1.6047 | 1.6992 |
| miR29c |
| 0.3805 | 0.3813 |
| 0.7718 | 0.7869 |
| miR21 |
| 0.3687 | 0.3425 |
| 0.8310 | 0.7633 |
| miR19a |
| 0.3864 | 0.2960 |
| 0.7875 | 0.5904 |
| miR150 | .4707 | 0.8844 | 0.7971 |
| 1.8415 | 1.6381 |
| miR374 |
| 0.2592 | 0.1898 |
| 0.5377 | 0.3501 |
| miR193a-5p | .8408 | 1.0438 | 1.4808 |
| 1.5312 | 1.5241 |
| miR346 | .8474 | 0.9666 | 0.8776 |
| 1.4904 | 1.1032 |
| miR372 |
| 1.6193 | 1.6126 | .5117 | 1.0806 | 1.0344 |
| miR122 |
| 0.2278 | 0.0646 |
| 0.4137 | 0.0805 |
Abbreviation: FC, fold-change.
Significance level = 0.05.
P value from normality test.
| MicroRNAs | Delta-Ct normalization | Quantile normalization | ||||||
|---|---|---|---|---|---|---|---|---|
| HKG = miR520-5p | HKG = RNU6 | HKG = miR520-5p | HKG = RNU6 | |||||
| CRC + CAA | BC + LC + PC | CRC + CAA | BC + LC + PC | CRC + CAA | BC + LC + PC | CRC + CAA | BC + LC + PC | |
| miR192 | .2357 | .9302 | .9789 | .2407 |
|
| .5066 |
|
| miR29c | .7084 | .4263 |
|
|
|
| .0895 |
|
| miR21 | .6869 | .0623 | .0785 |
|
|
| .0767 |
|
| miR19a | .4524 | .6339 | .5364 |
|
|
| .2869 |
|
| miR150 | .9878 | .5045 | .0839 | .4572 | .1634 |
|
| .0748 |
| miR374 | .9261 | .0506 | .0546 |
|
|
|
|
|
| miR193a-5p | .6837 | .1162 | .4115 | .9972 |
|
|
|
|
| miR346 | .4568 | .8454 | .3524 | .2321 |
|
|
|
|
| miR372 | .1057 |
| .2187 | .6227 |
|
|
|
|
| miR122 | .0897 | .1458 | .1309 | .6836 |
|
|
|
|
Abbreviations: BC, breast cancer; CAA, colorectal advanced adenoma; CRC, colorectal cancer; HKG, housekeeping genes; LC, lung cancer; PC, pancreatic cancer.
Significance level = 0.05.
Comparison of P value between 2-sample t test and Wilcoxon rank-sum test.
| MicroRNAs | Delta-Ct normalization | Quantile normalization | ||||||
|---|---|---|---|---|---|---|---|---|
| HKG = miR520-5p | HKG = RNU6 | HKG = miR520-5p | HKG = RNU6 | |||||
| Wilcoxon | Wilcoxon | Wilcoxon | Wilcoxon | |||||
| miR192 | .9377 | .7935 | .2516 | .2506 | .3342 | .7789 | .7616 | .3693 |
| miR29c |
|
|
|
|
|
|
|
|
| miR21 |
|
|
|
|
|
|
|
|
| miR19a |
|
|
|
|
|
|
|
|
| miR150 | .3785 | .2656 | .9134 | .5197 | .4749 | .7894 | .7932 | .6294 |
| miR374 |
|
|
|
|
|
|
|
|
| miR193a-5p | .4168 | .8023 | .9311 | .7545 | .6508 | .9152 | .6539 | .2019 |
| miR346 | .8836 | .7202 | .2965 | .2124 | .9617 | .4269 | .7547 | .1838 |
| miR372 |
|
| .1037 | .1998 | .2731 |
|
| .3617 |
| miR122 | .0525 | .1916 |
|
|
|
|
|
|
Abbreviation: HKG, housekeeping genes.
Significance level = 0.05.
P value from normality test for mean normalization.
| MicroRNAs | Mean normalization | |
|---|---|---|
| CRC + CAA | BC + LC + PC | |
| miR192 |
| .0510 |
| miR29c | .5102 | .1721 |
| miR21 | .7943 |
|
| miR19a | .1265 | .7713 |
| miR150 | .1578 | .6555 |
| miR374 | .1670 | .2009 |
| miR193a-5p |
| .6890 |
| miR346 | .2017 | .5728 |
| miR372 |
| .0978 |
| miR122 | .1909 | .7966 |
Abbreviations: BC, breast cancer; CAA, colorectal advanced adenoma; CRC, colorectal cancer; LC, lung cancer; PC, pancreatic cancer.
Significance level = 0.05.
Mean normalization fold-change and comparison of P value between 2-sample t test and Wilcoxon rank-sum test.
| MicroRNAs | Mean normalization | |||
|---|---|---|---|---|
| FC1 | FC2 | Wilcoxon | ||
| miR192 | .0530 | 1.2982 | 1.2687 | .1746 |
| miR29c |
| 0.6296 | 0.6266 |
|
| miR21 |
| 0.5855 | 0.6065 |
|
| miR19a |
| 0.728 | 0.6767 |
|
| miR150 |
| 1.7068 | 1.6342 |
|
| miR374 |
| 0.3564 | 0.2399 |
|
| miR193a-5p |
| 1.7073 | 1.6767 |
|
| miR346 | .2896 | 1.2759 | 0.5676 | .2018 |
| miR372 |
| 3.121 | 3.0901 |
|
| miR122 | .0807 | 0.5732 | 0.2223 | .1343 |
Abbreviation: FC, fold-change.
Significance level = 0.05.