Huanfei Ma, Siyang Leng, Kazuyuki Aihara, Wei Lin, Luonan Chen.
Abstract
Future state prediction for nonlinear dynamical systems is a challenging task, particularly when only a few time series samples of high-dimensional variables are available from real-world systems. In this work, we propose a model-free framework, named randomly distributed embedding (RDE), to achieve accurate future state prediction from short-term, high-dimensional data. Specifically, from the observed data of high-dimensional variables, the RDE framework randomly generates a sufficient number of low-dimensional "nondelay embeddings" and maps each of them to a "delay embedding," which is constructed from the data of a target variable to be predicted. Each of these mappings serves as a low-dimensional weak predictor of the future state, and together they generate a distribution of predicted future states. This distribution patches the pieces of association information carried by the various embeddings, whether biased or unbiased, into the whole dynamics of the target variable; processed with appropriate estimation strategies, it yields a stronger predictor that makes prediction more reliable and robust. By applying the RDE framework to data from both representative models and real-world systems, we show that high dimensionality is no longer an obstacle but rather a source of information crucial to accurate prediction from short-term data, even under noise deterioration.
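The ensemble scheme described in the abstract can be sketched in a few lines. This is a simplified illustration, not the authors' implementation: each weak predictor here is a plain linear least-squares map from a random variable subset (a "nondelay embedding") to the target's next value, whereas the paper fits nonlinear maps; the function name `rde_predict` and the median aggregation are illustrative choices.

```python
import numpy as np

def rde_predict(X, y, n_embeddings=200, embed_dim=3, rng=None):
    """Sketch of RDE-style one-step prediction.

    X : (T, D) array of high-dimensional observations.
    y : (T,) time series of the target variable.
    Returns the aggregated prediction of y at time T and the full
    distribution of weak predictions.
    """
    rng = np.random.default_rng(rng)
    T, D = X.shape
    preds = []
    for _ in range(n_embeddings):
        # A random "nondelay embedding": a small subset of the variables.
        idx = rng.choice(D, size=embed_dim, replace=False)
        A = np.column_stack([X[:-1, idx], np.ones(T - 1)])  # states at t = 0..T-2
        coef, *_ = np.linalg.lstsq(A, y[1:], rcond=None)    # weak map to y(t+1)
        x_now = np.append(X[-1, idx], 1.0)                  # current state (t = T-1)
        preds.append(x_now @ coef)                          # weak prediction of y(T)
    preds = np.asarray(preds)
    # Aggregate the prediction distribution; the median is one robust choice.
    return np.median(preds), preds
```

On a linear benchmark system every random subset admits an exact linear map, so the weak predictions cluster tightly; on nonlinear systems the spread of the distribution is what the estimation step must exploit.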
Keywords: high-dimensional data; nonlinear dynamics; prediction; short-term data; time series
Year: 2018 PMID: 30297422 PMCID: PMC6205453 DOI: 10.1073/pnas.1802987115
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Sketch of embedding the original attractor in a high-dimensional space into a reconstructed attractor in a low-dimensional space.
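The reconstruction sketched in Fig. 1 is the classical delay-coordinate (Takens-style) embedding of a scalar series. A minimal sketch, assuming a uniform delay `tau` and embedding dimension `dim` (the helper name `delay_embed` is hypothetical, not from the paper):

```python
import numpy as np

def delay_embed(series, dim, tau):
    """Map a scalar series s(t) to delay vectors
    [s(t), s(t+tau), ..., s(t+(dim-1)*tau)], reconstructing the
    attractor in a dim-dimensional space."""
    s = np.asarray(series, dtype=float)
    n = len(s) - (dim - 1) * tau          # number of complete delay vectors
    if n <= 0:
        raise ValueError("series too short for this (dim, tau)")
    return np.column_stack([s[i * tau : i * tau + n] for i in range(dim)])
```

In the RDE framework such a delay embedding of the target variable is the common codomain that every randomly chosen nondelay embedding is mapped to.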
Fig. 2.Distribution of prediction errors by random embedding under different noise levels for a benchmark model of a linear system. (A–C) Probability distributions under different noise strengths.
Fig. 3.General principle of the RDE framework.
Fig. 4.Ten time points of one-step prediction for one variable in the benchmark model of a linear system. (A) The distribution of predictions by random embeddings, from which the final predicted values are obtained. (B) The original data and the predicted data, together with box plots of the distribution showing median values, upper and lower quartiles, bounds, and outliers.
Fig. 5.Validations with synthetic data. (A) Prediction for the nonlinear 90D coupled Lorenz system, where multistep prediction up to 30 steps for six components is shown. (B) For the coupled Lorenz system of 90 dimensions, the training data as well as the predicted data cover only small segments of the underlying attractor. Each circle represents a data point corresponding to the data in A; some data points are buried within the attractor. (C) Prediction for grids of a spiral pattern in an ISCAM model, where four selected samples of one-step prediction for the pattern are shown.
Fig. 6.Real-world data. (A) One-step prediction for five probes from the gene dataset. (B) A 1-h prediction for the wind speed in the Tokyo capital region based on data collected from five geographically local stations and delays up to 5 h. (C) A 1-d prediction for the daily cardiovascular disease admissions with trend, where the standard deviation (STD) is shown as a shaded area.
Fig. 7.Performance comparisons of different methods with different lengths of training data and different levels of noise. Two criteria are used to evaluate the prediction quality: the correlation and the rms error between the predicted series and the test data. (A and B) The length test based on 100 randomly chosen sections for each length of training data. (C and D) The noise test based on 100 independent trials. Here, the median, the upper quartile, and the lower quartile are shown. SNR, signal-to-noise ratio.