Literature DB >> 35218669

Multiple cancer type classification by small RNA expression profiles with plasma samples from multiple facilities.

Kuno Suzuki1, Hideyoshi Igata2, Motoki Abe2, Yusuke Yamamoto3.   

Abstract

Liquid biopsy is expected to be a promising cancer screening method because of its low invasiveness and the possibility of detecting multiple types in a single test. In the last decade, many studies on cancer detection using small RNAs in blood have been reported. To put small RNA tests into practical use as a multiple cancer type screening test, it is necessary to develop a method that can be applied to multiple facilities. We collected samples of eight cancer types and healthy controls from 20 facilities to evaluate the performance of cancer type classification. A total of 2,475 cancer samples and 496 healthy control samples were collected using a standardized protocol. After obtaining a small RNA expression profile, we constructed a classification model and evaluated its performance. First, we investigated the classification performance using samples from five single facilities. Each model showed areas under the receiver curve (AUC) ranging from 0.67 to 0.89. Second, we performed principal component analysis (PCA) to examine the characteristics of the facilities. The degree of hemolysis and the data acquisition period affected the expression profiles. Finally, we constructed the classification model by reducing the influence of these factors, and its performance had an AUC of 0.76. The results reveal that small RNA can be used for the classification of cancer types in samples from a single facility. However, interfacility biases will affect the classification of samples from multiple facilities. These findings will provide important insights to improve the performance of multiple cancer type classifications using small RNA expression profiles acquired from multiple facilities.
© 2022 The Authors. Cancer Science published by John Wiley & Sons Australia, Ltd on behalf of Japanese Cancer Association.

Entities:  

Keywords:  NGS; liquid biopsy; machine learning; multiple cancer type classification; multiple facilities; small RNA

Mesh:

Substances:

Year:  2022        PMID: 35218669      PMCID: PMC9207371          DOI: 10.1111/cas.15309

Source DB:  PubMed          Journal:  Cancer Sci        ISSN: 1347-9032            Impact factor:   6.518


INTRODUCTION

Globally, the number of cancer patients has been increasing. It is important to detect and treat cancer at an early stage because the survival rate decreases significantly as the disease progresses. ,  Various methods are used for cancer screening, including computed tomography, endoscopy, X‐ray, and PET. These screening tests are effective in reducing cancer deaths. , , However, undergoing numerous screening tests is physically hard, time‐consuming, and costly for patients. As a solution to these problems, liquid biopsy is expected to be a promising cancer‐screening method because of its low invasiveness and ability to detect multiple cancer types simultaneously. Circulating tumor DNA (ctDNA) and small RNAs (microRNA [miRNA] and PIWI‐interacting RNA [piRNA]) are the focus of cancer screening tests. , , , , In a study of multiple cancer classification using ctDNA, eight types of cancers were successfully classified, with a median sensitivity of 70% and a specificity of over 99%. However, the classification performance of stage 1 was lower than that of stages 2 and 3. The performance of cancer classification by miRNAs has been reported. In gastric cancer, the sensitivity and specificity were 73% and 89% ; in pancreatic and biliary tract cancers, they were 82% and 97% ; in breast cancer, they were 97% and 83% ; in cervical cancer, they were 84% and 90% ; and in prostate cancer, they were 95% and 87%. In addition, several studies have reported that miRNAs can detect even early‐stage cancers with high sensitivity, and it is expected that miRNAs will be useful as an early cancer screening test. , , , , , However, issues remain for miRNA‐based multiple cancer type screening tests. First, the possibility of identifying cancer types by collecting samples from multiple facilities has not been determined. Second, there have been reports that small RNAs specific to cancer types have been identified, but the variety of small RNAs is not consistent among reports. , , , , , ,  To overcome these issues, we collected cancer and healthy control samples from multiple facilities with a standardized protocol. Then, the classification accuracy was evaluated by machine learning models constructed with the small RNA expression profiles.

MATERIALS AND METHODS

Clinical samples

In this study, 2475 cancer samples and 496 healthy control samples were collected from 20 facilities: BE, Kobe University; CH, Chiba University; DA, Saitama Medical University, International Medical Center; EH, National Cancer Center Hospital East; GM, Center Hospital of the National Center for Global Health and Medicine; HC, National Cancer Center Hospital, Cancer Screening Center; HS, Harasanshin Hospital; JD, Juntendo University; KB, Kyoto University; KC, Kanagawa Cancer Center; KH, Kurosawa Hospital; NC, National Cancer Center Hospital; NY, Nagoya University; OC, Osaka International Cancer Institute; OY, Okayama University; SG, Sagara Hospital; SU, Showa University; TA, Osaka City University; TM, Tokyo Medical University; and WG, Saitama Medical University, Saitama Medical Center. Esophageal cancer (ESO), gastric cancer (GAS), colorectal cancer (COL), liver and biliary tract cancer (LBI), pancreatic cancer (PAN), lung cancer (LUN), breast cancer (BRE), and prostate cancer (PRO) samples from stage 0 to 4 were obtained. The stage classification was based on the 8th edition of the TNM classification published by the Union for International Cancer Control. The ICD codes for each cancer type were ESO (C15), GAS (C16), COL (C18, 19, 20), LBI (C22, 23, 24), PAN (C25), LUN (C34), BRE (C50), and PRO (C61). All cancer samples were collected before the start of treatments. Healthy control (CTR) samples were obtained according to the criteria, with no obvious suspicion based on a comprehensive evaluation of imaging studies, blood tests, and clinical findings. Samples were collected by KB from 2013 to 2016; HC from 2016 to 2018; and NC, EH, OC, KC, JD, TM, HS, SG, GM, KH, WG, DA, OY, BE, CH, TA, and SU from 2020 to 2021. The details are described in Table S1. Informed consent was obtained from the participants. This research was approved by the National Cancer Center Hospital Institutional Review Board (approval number: 2018–200) and was conducted according to the guidelines of the Declaration of Helsinki.

Plasma sample collection

Blood samples were collected in vacuum tubes with EDTA‐2Na and refrigerated within 30 min. The blood samples were centrifuged only once at 1,500–1,900 ×g for 10 min within 12 h of venipuncture, and plasma fractions were collected. The plasma samples were stored at −80°C until further analysis, and more than two freeze–thaw cycles were avoided.

RNA purification

Small RNAs were automatically extracted from 300 μL of plasma using the Maxwell RSC miRNA Plasma and Serum Kit (Promega) with the Equipment Maxprep Liquid Handler (Promega) and Maxwell RSC (Promega). The purifications were performed according to the manufacturer’s instruction manual.

Small RNA sequence data acquisition

cDNA libraries were synthesized from 5 μL of RNA eluate with 22 cycles of PCR using the QIAseq miRNA Library Kit (QIAGEN) with the Biomek i5 Automated Liquid Handling Workstation (Beckman Coulter). The concentration of each amplification product was measured with a QuantiFluor ONE ds DNA System (Promega). cDNA libraries were merged in one tube at a final concentration of 1.0–1.5 pmol/L. Small RNA sequence data were obtained using NextSeq 550Dx with the NextSeq 500/550 High Output Kit v2.5, 75 cycles (Illumina) and the NextSeq Phix Control Kit (Illumina). An automated program was set up for each instrument according to the manufacturer’s instruction manual, and all the samples were processed using the same program.

Sequence data acquisition

To avoid bias toward certain cancer types in a particular experimental batch, data acquisition was conducted under the following conditions. First, a batch consisted of at least three categories of samples. Second, the number of samples from a single category should be <50% of the total.

Small RNA expression analysis

The raw sequence data were processed using fastp (version 0.19.6) to remove the amplification adapter sequence. The unique molecular index (UMI) sequences and the probe adapter were trimmed using umi_tools (version 1.0.1). Mapping was performed using Bowtie (1.3.0) with reference to the human genome reference (hg19). The following options were applied: ‐n0, ‐v0, ‐l18, ‐k1, and ‐‐best. After sorting with SAMtools (version 1.7), an index was created and deduplicated with umi_tools. Then, using featureCounts (version 2.0.0), 2,576 miRNAs and 27,683 piRNAs were identified, and the expression levels of each gene were calculated using the following annotation files: miRBase (version 20) with QIAseq Spikeins and piRNAdb (version 1.7.5). The average total count of miRNAs and piRNAs was 4.27 ± 1.09 × 106 (mean ± SD; min: 440,695, max: 8,890,949). Samples with a total count of <1,000,000 miRNAs and piRNAs were omitted from subsequent analyses. For each of 2,576 miRNAs and 27,683 piRNAs, DESeq2 , was used for normalization. Features were removed if they met the following criteria: (i) genes with a minimum expression level of 50 or less in all samples,and (ii) Pearson correlation coefficient with miR‐369‐3p of |r|>0.5 to exclude miRNAs that are under the control of genomic imprinting.  The remaining 534 mi/piRNAs were normalized by DESeq2 and used for subsequent analyses.

Machine learning for multiple cancer classification

XGBoost was used to classify the cancer types. Each experiment was conducted 50 times with different random seeds, and the dataset was randomly divided into 90% and 10% partitions for training and testing, respectively. Hyperparameters were optimized by Optuna to maximize the fivefold cross‐validation accuracy or the area under the receiver operating characteristic (ROC) curve (AUC) score. The search ranges of the parameters were as follows: learning_rate [0.01, 0.4], max_depth [3, 10], min_child_weight [0.1, 10], gamma [1e‐8, 1.0], and colsample_bytree [0.6, 0.95]. For the constructing classifier, the samples were randomly selected so that each class would have the same number of samples in each experiment. The SHAP value, and the contribution of all small RNAs was quantified for each cancer type. of the cancer type classification model was calculated

Statistical analysis

The correlation analysis, principal component analysis (PCA), Cohen’s d value and box plots were performed using the statistical analysis software R (version 3.6.3).  The effect size was calculated from Cohen’s d value.

Calculation of the hemolysis index

The hemolysis index was calculated from the small RNA expression counts of each sample using miR‐23a‐3p for the miR‐451a express ion levels. , , The following formula was used: Hemolysis index = Log2 (miR‐451a) − Log2 (miR‐23a‐3p).

Feature selection based on principal component analysis loading factors

For the first and second principal components in the PCA, the factor loadings of each miRNA were calculated, the threshold values were set, and the small RNAs were removed from the data according to the threshold values. The small RNA lists for 10%, 15%, 20%, and 25% removal are shown in Table S2.

One‐dimensional nearest neighbor matching

The x‐axis (PC1) and y‐axis (PC2) of each sample in the PCA were extracted, and the matching rate was defined as the probability that both sides of an arbitrary point projected on the x‐axis or y‐axis were in the same class.

RESULTS

Study design

Cancer and healthy control plasma samples were collected from multiple facilities to construct and validate the cancer type classification model. A total of 2,475 cancer samples and 496 healthy control samples were collected. Information on the age, sex, and stage of the collected samples is summarized in Table 1.
TABLE 1

Characteristics of the participants

CategoryNumber of samplesAgeSex (%)Stage (%)
MeanSDMaleFemale01234
ESO25165.78.9182 (73)69 (27)10 (4)78 (31)51 (20)65 (26)47 (19)
GAS28265.312.8171 (61)111 (39)0144 (51)43 (15)41 (15)54 (19)
COL33564.711.8181 (54)154 (46)11 (3)86 (26)83 (25)96 (29)59 (18)
LBI32169.99.2214 (67)107 (33)3 (1)69 (21)101 (31)72 (22)76 (24)
PAN31566.910.5174 (55)141 (45)5 (2)32 (10)75 (24)56 (18)147 (47)
LUN33567.910.8216 (64)119 (36)9 (3)129 (39)36 (11)60 (18)101 (30)
BRE35056.012.50350 (100)42 (12)161 (46)111 (32)26 (7)10 (3)
PRO28668.67.0286 (100)0045 (15)192 (67)35 (12)14 (5)
CTR49649.610.9252 (51)244 (49)
Characteristics of the participants For these 2,971 samples, the expression levels of small RNAs (miRNAs and piRNAs) were measured by next‐generation sequencing (NGS), and 583 types of small RNAs were detected. Thirty‐four imprinted genes located on chromosome 14q32.31 were removed from our analysis because these genes were differentially expressed in healthy controls. A total of 534 small RNAs were used to construct the cancer classification models. First, samples of the multiple cancer types were collected from a single facility, and the performance of the cancer type classification in each of the five different facilities was examined using Models 1–5. Next, samples collected from multiple facilities were used to evaluate the performance of the classification model (Model 6) for multiple cancer types to determine whether the same results could be obtained across facilities (Figure 1).
FIGURE 1

Schematic of cancer type classification model establishment. Plasma samples were obtained from 2971 subjects, including 496 healthy controls (CTR) and 2475 cases. The small RNA sequences were matched to a database, and 583 genes were identified. Imprinted genes and genes with low expression were removed, and 534 small RNAs were retained as features for the machine learning models. Cancer classification Models 1–5 were constructed using samples obtained from five facilities, and Model 6 was constructed using samples from all facilities

Schematic of cancer type classification model establishment. Plasma samples were obtained from 2971 subjects, including 496 healthy controls (CTR) and 2475 cases. The small RNA sequences were matched to a database, and 583 genes were identified. Imprinted genes and genes with low expression were removed, and 534 small RNAs were retained as features for the machine learning models. Cancer classification Models 1–5 were constructed using samples obtained from five facilities, and Model 6 was constructed using samples from all facilities

Verification of the reproducibility of data acquisition

To identify cancer type‐specific expression profiles, it is necessary to have high reproducibility in data acquisition. To investigate the reproducibility of this study, six different reference samples were examined. For these reference samples, freezing and thawing were performed in the same cycle numbers to avoid unexpected sample alterations. The correlation coefficients of the small RNA expression profiles between the experiments were calculated. Among the six reference samples, the highest mean value of the correlation coefficient was 0.993 (minimum 0.990, maximum 0.995), as shown in Figure 2A,C, and the lowest mean value was 0.981 (minimum 0.969, maximum 0.992), as shown in Figure 2B,D. In the series of experiments, the correlation coefficients ranged from 0.969 to 0.995 (Figure S1). These results verified that the data acquisition was highly reproducible.
FIGURE 2

Evaluation of data acquisition reproducibility using reference samples. The distribution of correlation analysis using small RNA expression profiles measured from the same reference sample is shown. Reference samples were always measured at fixed positions of the 96‐well plate. (Reference samples 1 (A) and 2 (B) are shown. Each data acquisition experiment was assigned a number, and the correlation coefficients between experiments were color‐coded in matrices. The histogram of the correlation coefficients of Reference sample 1 is shown in (C) and that of Reference sample 2 is shown in D)

Evaluation of data acquisition reproducibility using reference samples. The distribution of correlation analysis using small RNA expression profiles measured from the same reference sample is shown. Reference samples were always measured at fixed positions of the 96‐well plate. (Reference samples 1 (A) and 2 (B) are shown. Each data acquisition experiment was assigned a number, and the correlation coefficients between experiments were color‐coded in matrices. The histogram of the correlation coefficients of Reference sample 1 is shown in (C) and that of Reference sample 2 is shown in D)

Classification performance using each single‐facility sample

The small RNA expression profiles of samples from multiple cancer types that were collected from five single facilities were obtained to evaluate the classification performance (Models 1–5). The classification models were constructed by downsampling with the fewest number of samples to equalize the cancer types. The facilities, cancer types, and number of samples used to construct the models are summarized in Table 2. The cancer type classification performance outcomes of Models 1–5 are shown in Figure 3. The AUCs of each model are shown in Figure S2. The sensitivities of Model 1 using COL, ESO, LBI, PAN, GAS, and CTR samples collected at KB were 69%, 58%, 47%, 52%, 45%, and 73%, respectively. These were higher than the sensitivity of 17% for six‐class random classification. The AUC for the KB samples was 0.89 (95% confidence interval [CI], 0.88–0.89), as shown in Figure 3A. The sensitivities of Model 2 using BRE, COL, LUN, PAN, and GAS samples collected at OC were 83%, 30%, 38%, 34%, and 50%, respectively. These were higher than the sensitivity of 20% for five‐class random classification. The AUC for the OC samples was 0.77 (95% CI, 0.75–0.78), as shown in Figure 3B. The sensitivities of Model 3 using BRE, LUN, PRO, and GAS samples collected at KC were 56%, 42%, 52%, and 56%, respectively. These were higher than the sensitivity of 25% for four‐class random classification. The AUC was 0.76 (95% CI, 0.74–0.78), as shown in Figure 3C. The sensitivities of Model 4 using COL, ESO, PAN, and GAS samples collected at NC were 31%, 52%, 38%, and 47%, respectively. These were higher than the sensitivity of 25% for four‐class random classification. The AUC was 0.67 (95% CI, 0.63–0.70), as shown in Figure 3D. The sensitivities of Model 5 using BRE, COL, LBI, and LUN samples collected at JD were 59%, 40%, 48%, and 52%, respectively. These were higher than the sensitivity or 25% for four‐class random classification. The AUC was 0.77 (95% CI, 0.74–0.79), as shown in Figure 3E. Based on the above classification performance outcomes of the five facilities, Models 1–5 using samples from a single facility showed higher classification performance outcomes than random classification. These data suggest that a machine learning model with small RNA profiles can classify multiple cancer types at least within a single facility.
TABLE 2

Number of samples used in each classification model

ModelFacilityCategory and number of samples
ESOGASCOLLBIPANLUNBREPROCTR
1KB100100100134100100
2OC66806672104
3KC481159251
4NC46334540
5JD37312849
FIGURE 3

Classification performance outcomes of models constructed from single‐facility samples. Classification models of multiple cancer types were constructed only for cancer types with a sample size of 25 or more. The number of samples for each cancer type was downsampled to 100 in Model 1 for KB (A), 66 in Model 2 for OC (B), 48 in Model 3 for KC (C), 33 in Model 4 for NC (D), and 28 in Model 5 for JD (E)

Number of samples used in each classification model Classification performance outcomes of models constructed from single‐facility samples. Classification models of multiple cancer types were constructed only for cancer types with a sample size of 25 or more. The number of samples for each cancer type was downsampled to 100 in Model 1 for KB (A), 66 in Model 2 for OC (B), 48 in Model 3 for KC (C), 33 in Model 4 for NC (D), and 28 in Model 5 for JD (E) Next, we evaluated the influence of stage on performance in cancer classification. Stages 0, 1, and 2 were grouped together as early stages, and stages 3 and 4 were grouped together as advanced stages. We selected KB and OC based on sample numbers and compared their performance outcomes (Table S3). KB samples showed the following sensitivities: COL (early 58%, advanced 44%), ESO (early 42%, advanced 67%), GAS (early 71%, advanced 51%), PAN (early 53%, advanced 65%), and LBI (early 56%, advanced 54%), as shown in Figure S3A. OC samples showed the following sensitivities: COL (early 57%, advanced 57%), LUN (early 67%, advanced 31%), and PAN (early 21%, advanced 53%), as shown in Figure S3B. Early stages showed equal to or higher sensitivities than advanced stages in five out of eight classifications.

principal component analysis using multiple‐facility samples

To explore facility‐specific biases, we performed PCA using multiple‐facility samples. The color‐coded PCA by cancer type, facility, and data acquisition period are shown in Figure 4A,B,C. No clear clusters of cancer types were observed. The data acquisition facilities and the first principal component (PC1) were related, and clusters were formed in some facilities. Regarding the data acquisition period, clusters were formed in relation to the second principal component (PC2).
FIGURE 4

PCA using samples from multiple facilities. The results of PCA using 2967 samples from all facilities. The results are color‐coded to indicate the categories (A), facilities with a sample size of 150 or more (B), and data acquisition periods (C)

PCA using samples from multiple facilities. The results of PCA using 2967 samples from all facilities. The results are color‐coded to indicate the categories (A), facilities with a sample size of 150 or more (B), and data acquisition periods (C) To further clarify the reason for the clustering in some facilities, the characteristics of the top 10 miRNAs of the factor loadings of PC1 of each small RNA were investigated. Hsa‐miR‐144‐3p, hsa‐miR‐32‐5p, and hsa‐miR‐96‐5p were included in the top 10 miRNAs. These small RNAs are known to be abundant in red blood cells (RBCs). , , Furthermore, other RBCs‐derived miRNAs, such as hsa‐miR‐451a, hsa‐miR‐486‐5p, hsa‐miR‐4732‐3p, and hsa‐miR‐363‐3p, were enriched in higher rankings in PC1 factor loadings (Table S4). The formation of clusters at different facilities was thought to be due to differences in the degree of hemolysis in the samples. Therefore, the hemolysis index of each sample was colored in the plot. The results showed that the increase or decrease in PC1 was linked to the hemolysis index (Figure 5A). Then, the distributions of the hemolysis index at collection facilities were compared. The distribution range of the hemolytic index was different among facilities (Figure 5B). In fact, the distribution of the hemolytic index in KC was different from that in KH, and the clusters of both in PCA were also clearly separated (Figure 5C). The distribution of the hemolytic index in KB was near that in KC, and the clusters of both in PCA were also close (Figure 5D). These results suggest that PC1 reflects the effect of hemolysis and that there were differences in the hemolysis index among the facilities, which led to the formation of clusters.
FIGURE 5

Comparison of the hemolysis index between facilities. The results of PCA performed on 2967 samples are color‐coded to indicate the hemolysis index (A). A higher value in red indicates a higher degree of hemolysis. Distribution of the hemolysis index of each facility (B). PCA plot positions are indicated in KC and KH (C) and in KB‐KC (D). Differences in these facility pair positions had a correlation in their distribution of the hemolytic index

Comparison of the hemolysis index between facilities. The results of PCA performed on 2967 samples are color‐coded to indicate the hemolysis index (A). A higher value in red indicates a higher degree of hemolysis. Distribution of the hemolysis index of each facility (B). PCA plot positions are indicated in KC and KH (C) and in KB‐KC (D). Differences in these facility pair positions had a correlation in their distribution of the hemolytic index

Classification performance using multiple‐facility samples

It was suggested that the hemolysis index and data acquisition period could affect the cancer classification performance. Therefore, we attempted to reduce these effects by removing the small RNAs that have large contributions to the PC1 and PC2 scores. PCA was performed again using the data from which 10%, 15%, 20%, and 25% of small RNAs in both positive and negative directions ranked by factor loadings were removed. The numbers of remaining features were 534 for no removal (Figure 6A), 324 for 10% removal (Figure 6B), 232 for 15% removal (Figure 6C), 160 for 20% removal (Figure 6D), and 98 for 25% removal (Figure 6E). The retained small RNAs are listed in Table S2. As the removal rate increased, these clusters converged into a group (Figure 6E).
FIGURE 6

Improvement of the clustering results by feature selection. (A) The results of PCA using 2967 samples from all facilities without any feature selection. The PCA result was shown that excluding the top 10%, 15%, 20%, and 25% of small RNAs from PC1 and PC2 factor loadings in both positive and negative directions. The number of retained features was 324 small RNAs (B), 232 small RNAs (C), 160 small RNAs (D), and 98 small RNAs (E), respectively. Each category of the matching rate for PC1 was shown in (F), and that for PC2 is shown in (G)

Improvement of the clustering results by feature selection. (A) The results of PCA using 2967 samples from all facilities without any feature selection. The PCA result was shown that excluding the top 10%, 15%, 20%, and 25% of small RNAs from PC1 and PC2 factor loadings in both positive and negative directions. The number of retained features was 324 small RNAs (B), 232 small RNAs (C), 160 small RNAs (D), and 98 small RNAs (E), respectively. Each category of the matching rate for PC1 was shown in (F), and that for PC2 is shown in (G) To quantitate the change in cluster formation with/without the removal process, the probability that the nearest neighbor points of an arbitrary point were in the same class was calculated. In PC1, the matching rates focusing on cancer type were almost the same with/without small RNA removal, but the matching rates focusing on facility were reduced: 5.0% (without removal), 6.2% (10% removal), 5.7% (15% removal), 3.6% (20% removal), and 2.6% (25% removal). The matching rates of the data acquisition period were reduced to 5.5% (without removal), 3.0% (10% removal), 3.0% (15% removal), 2.4% (20% removal), and 2.2% (25% removal), as shown in Figure 6F. The reducing effect was largest under 25% removal. In PC2, the matching rates of cancer types and facilities were almost the same with/without small RNA removal, but the matching rates of the data acquisition period were reduced to 7.6% (without removal), 13.0% (10% removal), 11.0% (15% removal), 4.1% (20% removal), and 4.3% (25% removal), as shown in Figure 6G. The reducing effect was largest under 25% removal. These results indicate that the removal of small RNAs, which have a large contribution to the PC1 and PC2 scores, can reduce the impact on the hemolysis index and the data acquisition period. Therefore, we evaluated the sensitivities of the multiple cancer type classification model constructed using samples from multiple facilities after the removal of small RNAs. The sensitivities before the removal of small RNAs were as follows: CTR 67%, BRE 59%, COL 32%, ESO 44%, LBI 53%, LUN 27%, PAN 29%, PRO 54%, and GAS 34% (Figure 7A). The AUC was 0.83 (95% CI, 0.83–0.84). The sensitivities after the removal of 25% small RNAs were as follows (Model 6): CTR 58%, BRE 47%, COL 19%, ESO 31%, LBI 44%, LUN 21%, PAN 23%, PRO 39%, and GAS 23% (Figure 7B). The AUC was 0.76 (95% CI, 0.76–0.76). The random classification sensitivity for nine classes was 11%. The results showed that the sensitivity of the classification of multiple cancer type samples from multiple facilities was higher than that of random classification.
FIGURE 7

Classification performance of the selected feature model from multiple‐facility samples. The confusion matrix for the XGBoost classifier with all 534 features is shown in (A), and the confusion matrix with 98 selected features is shown in (B)

Classification performance of the selected feature model from multiple‐facility samples. The confusion matrix for the XGBoost classifier with all 534 features is shown in (A), and the confusion matrix with 98 selected features is shown in (B)

DISCUSSION

In this study, the performance of the cancer classification models was evaluated by collecting samples of multiple cancer types from multiple facilities. As a result, the performance of the models was higher than that of random classification in both the single‐facility and multiple‐facility models. This finding suggests the existence of miRNAs that could identify each cancer type. However, the data acquisition period and the degree of hemolysis in each facility could affect the cancer type classification performance. One of the possible disturbing factors for the cancer type classification is batch effects. The KB confusion matrix showed that the six‐class classification results were divided into roughly two groups (Figure 3A). Depending on the data acquisition periods, there were differences in the reagent lots, including polymerases and ligases. This suggested that an amplification bias of small RNA may have occurred. Small RNAs with high variability between the two groups were extracted by effect sizes (see Materials and Methods 2.8) from the single reference sample. The top five were hsa‐miR‐21‐5p, hsa‐miR‐425‐5p, hsa‐piR‐33057, hsa‐piR‐32993, and hsa‐piR‐32871; the effect sizes were 5.00, 2.60, 2.26, 2.06, and 2.04; and the fold changes were 1.64, 1.19, 1.77, 2.02, and 2.07, respectively. Although hsa‐miR‐21‐5p is known as a representative cancer‐associated miRNA, it showed an effect size of 0.59 and a fold change of 1.07 in GAS versus CTR and 0.47 and 1.07 in PAN versus CTR when comparing the same data acquisition periods. It was one‐half to one‐third smaller than that reported in similar studies. , , ,  These results indicate that amplification bias occurred due to differences in the data acquisition period, which was thought to be the cause of the division into two groups in the KB confusion matrix. The change in small RNA expression of each cancer type in the same acquisition condition was smaller compared to the amplification bias. This situation could have affected the performance of the cancer type classification. The consistency of cancer type‐associated small RNAs between our classification models and previously reported models was investigated (Table S5). To avoid interfacility variability, we extracted cancer type‐associated small RNAs from Models 1–5 by quantifying their contribution. ESO, GAS, COL, LBI, PAN, and BRE were selected because these cancer types were tested in more than two facilities. We narrowed down to the common miRNAs in multiple facilities and compared these common miRNAs with previously reported miRNAs. Two or three miRNAs in our common miRNAs were also reported in previous studies of cancer type‐associated miRNAs. Our results were consistent with previous reports to some extent. Regarding the effect of hemolysis on the classification performance, we collected samples with a standardized sample collection protocol (see Materials and Methods 2.2). Therefore, we expected that the interfacility variation in the hemolysis index would not exceed the intrafacility variation. However, it was found that the interfacility variability exceeded the intrafacility variability (Figure 5B). Thus, the differences in the hemolysis index among the facilities became noise, and the accuracy of cancer type classification decreased. At the same time, the differences between facilities could be misinterpreted as differences in cancer types, resulting in a higher accuracy than the values with minimized hemolysis influences (Figure 7A,B). As a result, the difference in the hemolysis index among facilities not only affected the performance of the cancer classification but also suggested the possibility that the differences among the facilities could be confounded by the differences in the cancer types. Although the classification performance was higher than random classification in multiple cancer type classifications, further improvement is needed when considering the application as a cancer screening test. The expression levels of miRNAs in plasma and serum are reportedly similar, and we decided to use plasma samples to avoid differences in clot formation between samples when preparing serum. To improve the performance, two essential issues exist. First, the expression changes of small RNAs characterizing cancer types are not significant enough. To overcome this issue, it is necessary to efficiently extract cancer‐associated small RNAs to increase expression changes. Second, improvement of the sample preparation method is necessary to reduce the differences in the degree of hemolysis among different facilities. A practical approach for efficiently extracting cancer‐associated small RNAs is to purify and analyze small RNAs within exosomes after preventing the contamination of red blood cells in plasma samples using the double centrifugation method , or prefiltering. Although the ultracentrifugation protocol is used as a standard method for exosome purification, it has poor throughput and is not suitable for automated sample preparation. Therefore, for practical use, the use of size exclusion chromatography , or chemical precipitation would be helpful. As a sample preparation method to reduce the degree of hemolysis, it should be effective to collect blood samples using a blood collection tube that contains a reagent to inhibit the hemolysis of red blood cells. It is expected that these approaches will further improve the classification performance of multiple cancer types, thus establishing a cancer screening test with practical performance. In conclusion, to develop a robust diagnosis test for the cancer type screening using small RNAs, we found that small RNAs can be used for the classification of different cancer types in samples from a single facility; however, interfacility noise affects the classification of samples from multiple facilities. This finding provides insight into the importance of enhancing the signal‐to‐noise ratio to overcome interfacility biases for the classification of multiple cancer types from samples derived from multiple facilities.

DISCLOSURE

KS, MK, MT, TN and TY are employees of PFDeNA, HI and MA are employees of Preferred Networks. All other authors have no conflicts of interest to declare. Fig S1 Click here for additional data file. Fig S2 Click here for additional data file. Fig S3 Click here for additional data file. Tab S1 Click here for additional data file. Tab S2 Click here for additional data file. Tab S3 Click here for additional data file. Tab S4 Click here for additional data file. Tab S5 Click here for additional data file. App S1 Click here for additional data file.
First authorCorresponding authorAuthor nameORCIDAffiliationContribution 1Contribution 2Contribution 3
Co‐firstCo‐correspondingKuno Suzuki0000‐0001‐9776‐9008Healthcare Business Department, PFDeNA, Inc., Tokyo, JapanSample selectionData analysisProject administration
Co‐firstHideyoshi Igata0000‐0001‐8342‐0614Preferred Networks, Inc., Tokyo, JapanData analysisProject administration
Terunao IwanagaDepartment of Gastroenterology, Graduate School of Medicine, Chiba University, Chiba, JapanSample selectionSample collection
Hiroaki KanzakiDepartment of Gastroenterology, Graduate School of Medicine, Chiba University, Chiba, JapanSample selectionSample collectionProject administration
Naoya KatoDepartment of Gastroenterology, Graduate School of Medicine, Chiba University, Chiba, JapanSupervisor
Nobuko TanakaDepartment of Laboratory Medicine, Chiba University Hospital, Chiba, JapanSample collection
Kenji KawasakiDepartment of Laboratory Medicine, Chiba University Hospital, Chiba, JapanSample selectionSample collection
Kazuyuki Matsushita0000‐0001‐5812‐2818Department of Laboratory Medicine & Division of Clinical Genetics and Proteomics/ Center for Cancer Genomes/Center for Ultrasound, Chiba University HospitalSupervisor
Ryu SamashimaDepartment of Urology, Harasanshin Hospital, Fukuoka, JapanSample selectionSample collection
Keiji TsukinoDepartment of Urology, Harasanshin Hospital, Fukuoka, JapanSample selectionSample collection
Akira Yokomizo0000‐0002‐5741‐6280Department of Urology, Harasanshin Hospital, Fukuoka, JapanSupervisor
Yosuke MiyashitaDepartment of Respiratory Medicine, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selectionSample collection
Issei SumiyoshiDepartment of Respiratory Medicine, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selectionSample collection
Kazuhisa TakahashiDepartment of Respiratory Medicine, Juntendo University Graduate School of Medicine, Tokyo, JapanSupervisor
Nobuko SerizawaDepartment of Gastroenterologyerology, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selection
Ko TomishimaDepartment of Gastroenterologyerology, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selectionSample collection
Akihito NagaharaDepartment of Gastroenterologyerology, Juntendo University Graduate School of Medicine, Tokyo, JapanSupervisor
Yumiko Ishizuka0000‐0002‐1588‐3067Department of Breast and Endocrine Surgery, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selectionSample collection
Yoshiya Horimoto0000‐0001‐8935‐0768Department of Breast and Endocrine Surgery, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selectionProject administration
Masayoshi Nagata0000‐0002‐3077‐191XDepartment of Urology,Juntendo University Graduate School of Medicine, Tokyo, JapanSample selection
Keisuke IshikawaDepartment of Urology,Juntendo University Graduate School of Medicine, Tokyo, JapanSample selectionSample collection
Shigeo HorieDepartment of Urology,Juntendo University Graduate School of Medicine, Tokyo, JapanSupervisor
Shuichiro ShiinaDepartment of Gastroenterological Imaging and Interventional Oncology, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selectionSample collectionProject administration
Motomi NasuDepartment of Esophageal and Gastroenterological Surgery, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selectionSample collection
Takashi Hashimoto0000‐0001‐8967‐3815Department of Esophageal and Gastroenterological Surgery, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selection
Shinji Mine0000‐0002‐6232‐3732Department of Esophageal and Gastroenterological Surgery, Juntendo University Graduate School of Medicine, Tokyo, JapanSupervisor
Shingo Kawano0000‐0001‐8547‐1140Department of Coloproctological Surgery, Juntendo University Faculty of Medicine, Tokyo, JapanSample selectionSample collection
Kiichi Sugimoto0000‐0003‐1210‐2920Department of Coloproctological Surgery, Juntendo University Faculty of Medicine, Tokyo, JapanSample selection
Kazuhiro Sakamoto0000‐0002‐7087‐1667Department of Coloproctological Surgery, Juntendo University Faculty of Medicine, Tokyo, JapanSupervisor
Hiroyuki TakemuraDepartment of Clinical Laboratory, Juntendo University Hospital, Tokyo, JapanSample selectionSample collection
Mitsuru WakitaDepartment of Clinical Laboratory, Juntendo University Hospital, Tokyo, JapanSample selectionSample collection
Yoko TabeDepartment of Clinical Laboratory Medicine, Juntendo University Graduate School of Medicine, Tokyo, JapanSample selectionSample collection
Shunsuke Kato0000‐0002‐4860‐1687Department of Clinical Oncology, Juntendo University Graduate School of Medicine, Tokyo, JapanSupervisorSample selection
Yohei MiyagiMolecular Pathology and Genetics Division, Kanagawa Cancer Center Research Institute, Yokohama, JapanSupervisor
Hiroyuki AdachiDepartment of Thoracic Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Tetsuya IsakaDepartment of Thoracic Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Hiroyuki ItoDepartment of Thoracic Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Takashi YamanakaDepartment of Breast and Endocrine Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Tatsuya YoshidaDepartment of Breast and Endocrine Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Toshinari YamashitaDepartment of Breast and Endocrine Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Takashi OgataDepartment of Gastrointestinal Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Takanobu YamadaDepartment of Gastrointestinal Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Takashi OshimaDepartment of Gastrointestinal Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Naoto YamamotoDepartment of Gastrointestinal Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Masaaki MurakawaDepartment of Gastrointestinal Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Soichiro MorinagaDepartment of Gastrointestinal Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Satoshi KobayashiDepartment of Gastroenterology, Hepatobiliary and Pancreatic Medical Oncology Division, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Shun TezukaDepartment of Gastroenterology, Hepatobiliary and Pancreatic Medical Oncology Division, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Makoto UenoDepartment of Gastroenterology, Hepatobiliary and Pancreatic Medical Oncology Division, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Mitsuyuki KoizumiDepartment of Urology, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Kimito OsakaDepartment of Urology, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Takeshi KishidaDepartment of Urology, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Sumito SatoDepartment of Gastrointestinal Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Yo MikayamaDepartment of Gastrointestinal Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Manabu ShiozawaDepartment of Gastrointestinal Surgery, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Yasuhiro InokuchiDepartment of Gastroenterology, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Mitsuhiro FurutaDepartment of Gastroenterology, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Nozomu MachidaDepartment of Gastroenterology, Kanagawa Cancer Center, Yokohama, JapanSample selectionSample collection
Shinya SatoMolecular Pathology and Genetics Division, Kanagawa Cancer Center Research Institute, Yokohama, JapanSample selectionProject administration
Yoshihiko YanoDivision of Gastroenterology, Department of Internal Medicine, Kobe University Graduate School of Medicine, Kobe, JapanSample selectionSample collectionProject administration
Atsushi MiwaInstitute for Preventive Medicine, Kurosawa Hospital, Takasaki, JapanProject administrationSample selectionSample collection
Kazuto Ito0000‐0002‐3735‐2769Departmentof Urology, Kurosawa Hospital, Takasaki, JapanSupervisor
Isao KurosawaDepartmentof Urology, Kurosawa Hospital, Takasaki, JapanSupervisor
Osamu KikuchiDepartment of Therapeutic Oncology, Kyoto University Graduate School of Medicine, Kyoto, JapanProject administration
Hiromitsu TazawaClinical bioresource Center, Kyoto University Hospital, Kyoto, JapanSample selectionSample collection
Manabu MutoDepartment of Clinical Oncology, Kyoto University Hospital, Kyoto, JapanSupervisor
Takashi HondaDepartment of Gastroenterology and Hepatology, Nagoya University Graduate School of Medicine, Nagoya, JapanProject administrationSample selectionSample collection
Hiroki KawashimaDepartment of Gastroenterology and Hepatology, Nagoya University Graduate School of Medicine, Nagoya, JapanSample selectionSample collectionSupervisor
Masatoshi IshigamiDepartment of Gastroenterology and Hepatology, Nagoya University Graduate School of Medicine, Nagoya, JapanSample selectionSample collectionSupervisor
Yutaka SaitoDepartment of Endoscopy, National Cancer Center Hospital, Tokyo, JapanSample selectionProject administration
Hiroyuki DaikoDepartment of Esophageal Surgery, National Cancer Center Hospital, Tokyo, JapanSample selectionProject administration
Takaki YoshikawaDepartment of Gastric Surgery, National Cancer Center Hospital, Tokyo, JapanSample selectionProject administration
Yukihide KanemitsuDepartment of Colorectal Surgery, National Cancer Center Hospital, Tokyo, JapanSample selectionProject administration
Ken Kato0000‐0002‐1733‐5072Department of Head and Neck, Esophageal Medical Oncology / Department of Gastrointestinal Medical Oncology, National Cancer Center Hospital, Tokyo, JapanSample selectionProject administration
Minoru EsakiDepartment of Hepatobiliary and Pancreatic Surgery, National Cancer Center Hospital, Tokyo, JapanSample selectionSample collection
Takuji OkusakaDepartment of Hepatobiliary and Pancreatic Oncology, National Cancer Center Hospital, Tokyo, JapanSample selectionSample collection
Hiromi SakamotoDepartment of Biobank and Tissue Resources, Fundamental Innovative Oncology Core, National Cancer Center Research Institute, Tokyo, JapanSample selectionSample collection
Teruhiko YoshidaDepartment of Biobank and Tissue Resources, Fundamental Innovative Oncology Core, National Cancer Center Research Institute, Tokyo, JapanSample selectionSample collection
Takahiro Ochiya0000‐0002‐0776‐9918Laboratory of Integrative Oncology, National Cancer Center Research Institute, Tokyo, JapanSupervisor
Department of Molecular and Cellular Medicine, Institute of Medical Science, Tokyo Medical University, Tokyo, JapanSupervisor
Mitsuhito SasakiDepartment of Hepatobiliary and Pancreatic Oncology, National Cancer Center Hospital East, Kashiwa, JapanSample selectionSample collectionProject administration
Masafumi Ikeda0000‐0002‐4050‐2086Department of Hepatobiliary and Pancreatic Oncology, National Cancer Center Hospital East, Kashiwa, JapanSample selectionSample collection
Masashi Kudo0000‐0003‐2161‐1807Department of Hepatobiliary and Pancreatic Surgery, National Cancer Center Hospital East, Kashiwa, JapanSample selection
Naoto Gotohda0000‐0002‐4468‐5844Department of Hepatobiliary and Pancreatic Surgery, National Cancer Center Hospital East, Kashiwa, JapanSample selection
Shuichi Mitsunaga0000‐0001‐9496‐3278Department of Hepatobiliary and Pancreatic Oncology, National Cancer Center Hospital East, Kashiwa, JapanSample selectionSupervisor
Takeshi KuwataDepartment of Genetic Medicine and Services, National Cancer Center Hospital East, Kashiwa, JapanSample selection
Takashi KojimaDepartment of Gastrointestinal Oncology, National Cancer Center Hospital East, Kashiwa, JapanSample selection
Tatsuro Murano0000‐0003‐0423‐2151Department of Gastroenterology and Endoscopy, National Cancer Center Hospital East, Kashiwa, JapanSample selectionProject administration
Tomonori YanoDepartment of Gastroenterology and Endoscopy, National Cancer Center Hospital East, Kashiwa, JapanSample selectionProject administration
Taiki Yamaji0000‐0002‐4799‐8782Division of Epidemiology, National Cancer Center Institute for Cancer Control, National Cancer Center, Tokyo, JapanSample selectionProject administration
Takahisa Matsuda0000‐0002‐9244‐2820Cancer Screening Center, National Cancer Center Hospital, Tokyo, JapanSample selectionProject administration
Shoichiro Tsugane0000‐0003‐4105‐2774Division of Cohort Research, National Cancer Center Institute for Cancer Control, National Cancer Center, Tokyo, JapanSupervisor
Kazuki Hashimoto0000‐0002‐8324‐2202Department of Breast and Endocrine Surgery, National Center for Global Health and Medicine, Tokyo, JapanSample selectionSample collectionProject administration
Kazuhiko YamadaDepartment of Surgery, National Center for Global Health and Medicine, Tokyo, JapanSample collectionSample selection
Nobuyuki TakemuraHepato‐Biliary‐Pancreatic Surgery Division, Department of Surgery, National Center for Global Health and Medicine, Tokyo, JapanSample collectionSample selection
Kyoji ItoHepato‐Biliary‐Pancreatic Surgery Division, Department of Surgery, National Center for Global Health and Medicine, Tokyo, JapanSample collectionSample selection
Fuminori MiharaHepato‐Biliary‐Pancreatic Surgery Division, Department of Surgery, National Center for Global Health and Medicine, Tokyo, JapanSample collectionSample selection
Akihiko Shimomura0000‐0002‐2557‐8170Department of Breast and Medical Oncology, National Center for Global Health and Medicine, Tokyo, JapanSupervisor
Kunitoshi Shigeyasu0000‐0002‐1266‐2875Department of Gastroenterological Surgery, Dentistry and Pharmaceutical Sciences, Okayama University Graduate School of Medicine, Okayama, JapanSample selectionSample collection
Kazuhiro Noma0000‐0002‐5584‐3908Department of Gastroenterological Surgery, Dentistry and Pharmaceutical Sciences, Okayama University Graduate School of Medicine, Okayama, JapanSample selectionSample collection
Toshiyoshi Fujiwara0000‐0002‐5377‐6051Department of Gastroenterological Surgery, Dentistry and Pharmaceutical Sciences, Okayama University Graduate School of Medicine, Okayama, JapanSample selectionSample collection
Hideki Yamamoto0000‐0003‐4919‐3483Department of Clinical Genomic Medicine, Dentistry and Pharmaceutical Sciences, Okayama University Graduate School of Medicine, Okayama, JapanSample selectionProject administration
Mizuki Morita0000‐0001‐8592‐5499Okayama University Hospital Biobank, Okayama University Hospital, Okayama, JapanSample collectionProject administration
Shinichi Toyooka0000‐0002‐7588‐6745Department of General Thoracic Surgery and Breast and Endocrinological Surgery, Dentistry and Pharmaceutical Sciences, Okayama University Graduate School of Medicine, Okayama, JapanSupervisor
Akihiro Tamori0000‐0001‐9119‐4864Department of Hepatology, Graduate School of Medicine, Osaka City University, Osaka, JapanSample selectionProject administrationProject administration
Tasuku Nakabori0000‐0002‐9486‐850XDepartment of Hepatobiliary and Pancreatic Oncology, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Kenji Ikezawa0000‐0001‐7517‐8960Department of Hepatobiliary and Pancreatic Oncology, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Kazuyoshi Ohkawa0000‐0001‐9700‐2472Department of Hepatobiliary and Pancreatic Oncology, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Kei KunimasaDepartment of Thoracic Oncology, Osaka international Cancer Institute, Osaka, JapanSample selectionSample collection
Kazumi NishinoDepartment of Thoracic Oncology, Osaka international Cancer Institute, Osaka, JapanSample selectionSample collection
Toru KumagaiDepartment of Thoracic Oncology, Osaka international Cancer Institute, Osaka, JapanSample selectionSample collection
Toshihiro KudoDepartment of Medical Oncology, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Naotoshi SugimotoDepartment of Medical Oncology, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Masayoshi YasuiDepartment of Gastroenterological Surgery, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Takeshi OmoriDepartment of Gastroenterological Surgery, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Hiroshi MiyataDepartment of Gastroenterological Surgery, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Toru KimuraDepartment of General Thoracic Surgery, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Tomohiro ManiwaDepartment of General Thoracic Surgery, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Jiro OkamiDepartment of General Thoracic Surgery, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Hiroki KusamaDept. of Breast and Endocrine Surgery, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Nobuyoshi Kittaka0000‐0002‐0906‐1155Dept. of Breast and Endocrine Surgery, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Takahiro Nakayama0000‐0002‐3713‐1624Dept. of Breast and Endocrine Surgery, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Masashi NakayamaDepartment of, Urology, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Yasutomo NakaiDepartment of, Urology, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Kazuo NishimuraDepartment of, Urology, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Shoji YotsuiClinical Laboratory, Osaka International Cancer Institute, Osaka, JapanSample collection
Takashi YamamotoClinical Laboratory, Osaka International Cancer Institute, Osaka, JapanSample collection
Tomoyuki YamasakiClinical Laboratory, Osaka International Cancer Institute, Osaka, JapanSample collection
Emi YamashitaDepartment of Medical Informatics, Osaka International Cancer Institute, Osaka, JapanSample selectionProject administration
Kazune SaitoNext‐generation Precision Medicine Research Center, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Keiichi YoshidaNext‐generation Precision Medicine Research Center, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collectionProject administration
Masayuki OhueNext‐generation Precision Medicine Research Center, Osaka International Cancer Institute, Osaka, JapanSample selectionSample collection
Masakazu Kohda0000‐0001‐5970‐2224Healthcare Business Department, PFDeNA, Inc., Tokyo, JapanData analysisProject administrationSample selection
Tatsuya YamaguchiHealthcare Business Department, PFDeNA, Inc., Tokyo, JapanSample collectionProject administrationData analysis
Masami TanakaHealthcare Business Department, PFDeNA, Inc., Tokyo, JapanSample selectionData analysisProject administration
Takashi NishizawaHealthcare Business Department, PFDeNA, Inc., Tokyo, JapanSample selectionSupervisorData analysis
Tetsuhiko TairaSagara Hospital, Department of Medical Oncology, Sagara Hosipital, Kagoshima, JapanSample selectionSample collection
Junko Kawano0000‐0002‐4289‐1492Sagara Hospital, Department of Breast and Thyroid Surgical Oncology, Sagara Hosipital, Kagoshima, JapanSample selectionSample collection
Yasuaki Sagara0000‐0001‐5705‐1739Sagara Hospital, Department of Breast and Thyroid Surgical Oncology, Sagara Hosipital, Kagoshima, JapanSupervisor
Yosuke HoritaDepartment of Medical Oncology, International Medical Center, Saitama Medical University, Hidaka, JapanSample selectionSample collection
Yoshiaki Mihara0000‐0002‐3752‐0494Department of Medical Oncology, International Medical Center, Saitama Medical University, Hidaka, JapanSample selectionSample collection
Tetsuya Hamaguchi0000‐0002‐8353‐5371Department of Medical Oncology, International Medical Center, Saitama Medical University, Hidaka, JapanProject administration
Okihide SuzukiDepartment of Digestive Tract and General Surgery, Saitama Medical Center, Saitama Medical University, Kawagoe, JapanSample selectionSample collectionProject administration
Yoichi KumagaiDepartment of Digestive Tract and General Surgery, Saitama Medical Center, Saitama Medical University, Kawagoe, JapanSample selectionSample collection
Hideyuki IshidaDepartment of Digestive Tract and General Surgery, Saitama Medical Center, Saitama Medical University, Kawagoe, JapanSupervisor
Motoki YamagishiDepartment of Urology, Showa University Fujigaoka Hospital, Yokohama, JapanSample selectionSample collectionProject administration
Hideaki ShimoyamaDepartment of Urology, Showa University Fujigaoka Hospital, Yokohama, JapanSample selectionSample collection
Haruaki SasakiDepartment of Urology, Showa University Fujigaoka Hospital, Yokohama, JapanSupervisor
Takehiko Nakasato0000‐0003‐2777‐3022Depertment of Urology, Kikuna Memorial Hospital, Yokohama, JapanSample selectionSample collectionProject administration
Takeshi ShichijoDepertment of Urology, Showa university, Tokyo, JapanSample selectionSample collection
Takashi Fukagai0000‐0003‐0261‐6303Depertment of Urology, Showa university, Tokyo, JapanSupervisor
Kota NishimuraDepartment of Urology, Showa University Koto Toyosu Hosupital, Tokyo, JapanSample selectionSample collection
Kidai HirayamaDepartment of Urology, Showa University Koto Toyosu Hosupital, Tokyo, JapanSample selectionSample collection
Masashi MoritaDepartment of Urology, Showa University Koto Toyosu Hosupital, Tokyo, JapanSample selectionSample collection
Yujin Kudo0000‐0003‐3187‐9970Department of Thoracic and Thyroid Surgery, Tokyo Medical University, Tokyo, JapanSample selectionSample collectionProject administration
Susumu TakeuchiDepartment of Thoracic and Thyroid Surgery, Tokyo Medical University, Tokyo, JapanSample selectionSample collection
Norihiko IkedaDepartment of Thoracic and Thyroid Surgery, Tokyo Medical University, Tokyo, JapanSupervisor
Naohiro KamodaDepartment of Urology, Tokyo Medical University, Tokyo, JapanSample selectionProject administration
Kazunori NamikiDepartment of Urology, Tokyo Medical University, Tokyo, JapanSample selectionSample collection
Yoshio OhnoDepartment of Urology, Tokyo Medical University, Tokyo, JapanSupervisor
Tomohiro Umezu0000‐0002‐0123‐8921Department of Molecular Pathology, Tokyo Medical University, Tokyo, JapanSample selectionSample collectionProject administration
Yoshiki MurakamiDepartment of Molecular Pathology, Tokyo Medical University, Tokyo, JapanSample selection
Masahiko Kuroda0000‐0001‐7052‐4289Department of Molecular Pathology, Tokyo Medical University, Tokyo, JapanSupervisor
Co‐correspondingMotoki AbePreferred Networks, Inc., Tokyo, JapanData analysisSupervisor
Co‐correspondingYusuke Yamamoto0000‐0002‐5262‐8479Laboratory of Integrative Oncology, National Cancer Center Research Institute, Tokyo, JapanSupervisorSample selectionData analysis
  53 in total

1.  Isolation and characterization of exosomes from cell culture supernatants and biological fluids.

Authors:  Clotilde Théry; Sebastian Amigorena; Graça Raposo; Aled Clayton
Journal:  Curr Protoc Cell Biol       Date:  2006-04

2.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features.

Authors:  Yang Liao; Gordon K Smyth; Wei Shi
Journal:  Bioinformatics       Date:  2013-11-13       Impact factor: 6.937

3.  Novel Approach to Fecal Occult Blood Testing by Assay of Erythrocyte-Specific microRNA Markers.

Authors:  Chung Wah Wu; Xiaoming Cao; Calise K Berger; Patrick H Foote; Douglas W Mahoney; Julie A Simonson; Bradley W Anderson; Tracy C Yab; William R Taylor; Lisa A Boardman; John B Kisiel; David A Ahlquist
Journal:  Dig Dis Sci       Date:  2017-06-28       Impact factor: 3.199

4.  Circulating microRNAs as novel minimally invasive biomarkers for breast cancer.

Authors:  Helen M Heneghan; Nicola Miller; Aoife J Lowery; Karl J Sweeney; John Newell; Michael J Kerin
Journal:  Ann Surg       Date:  2010-03       Impact factor: 12.969

5.  Plasma microRNAs are promising novel biomarkers for early detection of colorectal cancer.

Authors:  Zhaohui Huang; Dan Huang; Shujuan Ni; Zhilei Peng; Weiqi Sheng; Xiang Du
Journal:  Int J Cancer       Date:  2010-07-01       Impact factor: 7.396

6.  Clinical utility of circulating tumor DNA sequencing in advanced gastrointestinal cancer: SCRUM-Japan GI-SCREEN and GOZILA studies.

Authors:  Yoshiaki Nakamura; Hiroya Taniguchi; Masafumi Ikeda; Hideaki Bando; Ken Kato; Chigusa Morizane; Taito Esaki; Yoshito Komatsu; Yasuyuki Kawamoto; Naoki Takahashi; Makoto Ueno; Yoshinori Kagawa; Tomohiro Nishina; Takeshi Kato; Yoshiyuki Yamamoto; Junji Furuse; Tadamichi Denda; Hisato Kawakami; Eiji Oki; Takako Nakajima; Naohiro Nishida; Kensei Yamaguchi; Hisateru Yasui; Masahiro Goto; Nobuhisa Matsuhashi; Koushiro Ohtsubo; Kentaro Yamazaki; Akihito Tsuji; Wataru Okamoto; Katsuya Tsuchihara; Takeharu Yamanaka; Izumi Miki; Yasutoshi Sakamoto; Hiroko Ichiki; Masayuki Hata; Riu Yamashita; Atsushi Ohtsu; Justin I Odegaard; Takayuki Yoshino
Journal:  Nat Med       Date:  2020-10-05       Impact factor: 53.440

7.  fastp: an ultra-fast all-in-one FASTQ preprocessor.

Authors:  Shifu Chen; Yanqing Zhou; Yaru Chen; Jia Gu
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

8.  Pre-profiling factors influencing serum microRNA levels.

Authors:  Sara A MacLellan; Calum MacAulay; Stephen Lam; Cathie Garnis
Journal:  BMC Clin Pathol       Date:  2014-06-21

9.  Circulating miRNA panels for specific and early detection in bladder cancer.

Authors:  Wataru Usuba; Fumihiko Urabe; Yusuke Yamamoto; Juntaro Matsuzaki; Hideo Sasaki; Makiko Ichikawa; Satoko Takizawa; Yoshiaki Aoki; Shumpei Niida; Ken Kato; Shin Egawa; Tatsuya Chikaraishi; Hiroyuki Fujimoto; Takahiro Ochiya
Journal:  Cancer Sci       Date:  2018-12-12       Impact factor: 6.716

10.  Plasma Level Of miR-21 And miR-451 In Primary And Recurrent Breast Cancer Patients.

Authors:  Maryam Motamedi; Morteza Hashemzadeh Chaleshtori; Sorayya Ghasemi; Fariborz Mokarian
Journal:  Breast Cancer (Dove Med Press)       Date:  2019-10-25
View more
  1 in total

1.  Multiple cancer type classification by small RNA expression profiles with plasma samples from multiple facilities.

Authors:  Kuno Suzuki; Hideyoshi Igata; Motoki Abe; Yusuke Yamamoto
Journal:  Cancer Sci       Date:  2022-03-14       Impact factor: 6.518

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.