| Literature DB >> 28325916 |
Qing-Xia Zheng1, Hai-Yan Fu2, He-Dong Li3, Bing Wang1, Cui-Hua Peng1, Sheng Wang1, Jun-Lan Cai1, Shao-Feng Liu1, Xiao-Bing Zhang1, Yong-Jie Yu4,5,6,7.
Abstract
Time shift among samples remains a significant challenge in data analysis, such as quality control of natural plant extracts and metabolic profiling analysis, because this phenomenon may lead to invalid conclusions. In this work, we propose a new time shift alignment method, namely, automatic time-shift alignment (ATSA), for complicated chromatographic data analysis. This technique comprised the following alignment stages: (1) automatic baseline correction and peak detection stage for providing useful chromatographic information; (2) preliminary alignment stage through adaptive segment partition to correct alignment for the entire chromatogram; and (3) precise alignment stage based on test chromatographic peak information to accurately align time shift. In ATSA, the chromatographic peak information of both reference and test samples can be completely employed for time-shift alignment to determine segment boundaries and avoid loss of information. ATSA was used to analyze a complicated chromatographic dataset. The obtained correlation coefficients among samples and data analysis efficiency indicated that the influences of time shift can be considerably reduced by ATSA; thus accurate conclusion could be obtained.Entities:
Year: 2017 PMID: 28325916 PMCID: PMC5428255 DOI: 10.1038/s41598-017-00390-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Illustration of difference between correlation coefficient and total peak correlation coefficient (TPC). (A) Original reference and test chromatograms with chromatographic peaks by circle. (B) Aligned chromatogram with maximal Pearson product-moment correlation coefficient. (C) Obtained aligned chromatogram with maximal TPC.
Figure 2Illustration time shift alignment of the ATSA. (A) Candidate peaks in the test segment. (B) Precise alignment based on the segment of test chromatographic peaks. (C) Aligned chromatograms. Values in (B) indicated time-shift values for precise alignment.
Figure 3(A) Reference chromatograms with and without baseline correction. Left column: Original chromatograms without time-shift alignment. Right column: Chromatograms with time-shift alignment.
Figure 4(A) Correlation coefficients with various initial segment sizes. (B) Correlation coefficients with various initial time-shift values. Values in (A) are the number of the final segments for preliminary alignment with different segment size. Values in (B) are time consumed for alignment (unit/second).
Figure 5(A) Correlation coefficients calculated based on peak area of original and time-shift alignment chromatograms. (B) Relative area deviation (RD%) vs area. Inserted plot in (A) shows the relationship between original chromatographic areas and those after time-shift alignment. Inserted plot in (B) shows statistical parameter under various RD%. For instance, the marker RD% of 5%, the x axis indicated that 95% peaks with RD% is no more than 5%, and the value in y axis indicate that their area is 97% of the total area. RD% = (Areaaligned − Areaoriginal)/Areaoriginal × 100%.
Figure 6(A) Correlation coefficients of aligned chromatograms by COW and ATSA.
Figure 7Detailed difference between COW and ATSA in the aligned chromatograms. (A) Elution range from 9 to 10.5 min. (B) Range from 28.5 to 32.5 min.