| Literature DB >> 30044822 |
Hikaru Watanabe1, Issei Nakamura1, Sayaka Mizutani2, Yumiko Kurokawa3, Hiroshi Mori4, Ken Kurokawa1,4, Takuji Yamada1,5.
Abstract
The human skin microbiome can vary over time, and inter-individual variability of the microbiome is greater than the temporal variability within an individual. The skin microbiome has become a useful tool to identify individuals, and one type of personal identification using the skin microbiome has been reported in a community of less than 20 individuals. However, identification of individuals based on the skin microbiome has shown low accuracy in communities larger than 80 individuals. Here, we developed a new approach for personal identification, which considers that minor taxa are one of the important factors for distinguishing between individuals. We originally established a human skin microbiome for 66 samples from 11 individuals over two years (33 samples each year). Our method could classify individuals with 85% accuracy beyond a one-year sampling period. Moreover, we applied our method to 837 publicly available skin microbiome samples from 89 individuals and succeeded in identifying individuals with 78% accuracy. In short, our results investigate that (i) our new personal identification method worked well with two different communities (our data: 11 individuals; public data: 89 individuals) using the skin microbiome, (ii) defining the personal skin microbiome requires samples from several time points, (iii) inclusion of minor skin taxa strongly contributes to the effectiveness of personal identification.Entities:
Mesh:
Year: 2018 PMID: 30044822 PMCID: PMC6059399 DOI: 10.1371/journal.pone.0199947
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Overview of samples and personal identification.
A) The collection of samples, which were taken from the forehead of 11 individuals at three time points in each of two years. Likewise, public data were sampled from forehead at 6 to 15 time points over 3 months (ERP00512) [7]. For each sample, we analyzed the microbial community composition using 16S rDNA amplicon sequences. B) The personal identification flow is described. We then developed a classifier to identify individuals based on microbial community compositions. i) X is the query, and known samples are defined as references. ii) The mean distance between query and reference is calculated. iii) We assign query X to a reference individual in the according to the distance, and assess the individual is true owner of sample X or not.
Fig 2Microbial communities of 11 individuals.
A) Shannon diversity index values for the six time-point samples of individuals A to K. B) Relative abundance of OTUs in samples from the six time points for individuals A to K. For each individual, the first three columns represent first-year samples, and the remaining three columns represent the second-year samples. C) Each point represents the Canberra distance between a pair of samples grouped by three comparison schemes. *P ≤ 0.05 (Wilcoxon rank-sum test). N.S., not significant.
Results obtained using the personal identification method.
| Data | Query | Reference | Number of individuals | Number of samples | Accuracy |
|---|---|---|---|---|---|
| Our data | all samples | all samples | 11 | 66 | 0.95 |
| Our data | first year | second year | 11 | 33 | 0.85 |
| Our data | second year | first year | 11 | 33 | 0.85 |
| Public data | all samples | all samples | 89 | 837 | 0.78 |
Fig 3Boxplot for the accuracy of assessing the effects of each factor.
A) Accuracies derived from reference samples from within the same year as the query and from different years. B) Accuracy of identifying individuals with consideration of different numbers of reference samples. *P ≤ 0.05 (Wilcoxon rank-sum test). C) Effect of the cut-off value for bacterial relative abundance on personal identification accuracy. D) Effect of the number of reads for bacterial relative abundance on personal identification accuracy.