| Literature DB >> 23936105 |
Sándor Spisák1, Norbert Solymosi, Péter Ittzés, András Bodor, Dániel Kondor, Gábor Vattay, Barbara K Barták, Ferenc Sipos, Orsolya Galamb, Zsolt Tulassay, Zoltán Szállási, Simon Rasmussen, Thomas Sicheritz-Ponten, Søren Brunak, Béla Molnár, István Csabai.
Abstract
Our bloodstream is considered to be an environment well separated from the outside world and the digestive tract. According to the standard paradigm large macromolecules consumed with food cannot pass directly to the circulatory system. During digestion proteins and DNA are thought to be degraded into small constituents, amino acids and nucleic acids, respectively, and then absorbed by a complex active process and distributed to various parts of the body through the circulation system. Here, based on the analysis of over 1000 human samples from four independent studies, we report evidence that meal-derived DNA fragments which are large enough to carry complete genes can avoid degradation and through an unknown mechanism enter the human circulation system. In one of the blood samples the relative concentration of plant DNA is higher than the human DNA. The plant DNA concentration shows a surprisingly precise log-normal distribution in the plasma samples while non-plasma (cord blood) control sample was found to be free of plant DNA.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23936105 PMCID: PMC3728338 DOI: 10.1371/journal.pone.0069805
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The accession numbers and names of the plants.
| Accession number | Name | Accession number | Name |
| NC_008101.1 |
| NC_008097.1 |
|
| NC_008099.1 |
| NC_008100.1 |
|
| NC_008117.1 |
| NC_008116.1 |
|
| NC_008114.1 |
| NC_008115.1 |
|
| NC_008155.1 |
| NC_008235.1 |
|
| NC_008289.1 |
| NC_008326.1 |
|
| NC_008325.1 |
| NC_008334.1 |
|
| NC_008335.1 |
| NC_008336.1 |
|
| NC_001840.1 |
| NC_000925.1 |
|
| NC_002186.1 |
| NC_001319.1 |
|
| NC_001320.1 |
| NC_001568.1 |
|
| NC_001675.1 |
| NC_001713.1 |
|
| NC_000926.1 |
| NC_000927.1 |
|
| NC_008359.1 |
| NC_001799.1 |
|
| NC_002202.1 |
| NC_008372.1 |
|
| NC_008407.1 |
| NC_008408.1 |
|
| NC_008454.1 |
| NC_008456.1 |
|
| NC_008457.1 |
| NC_008535.1 |
|
| NC_008588.1 |
| NC_008589.1 |
|
| NC_008591.1 |
| NC_008590.1 |
|
| NC_008602.1 |
| NC_008641.1 |
|
| NC_001666.2 |
| NC_008788.1 |
|
| NC_008796.1 |
| NC_008822.1 |
|
| NC_008829.1 |
| NC_002652.1 |
|
| NC_009143.1 |
| NC_002694.1 |
|
| NC_009270.1 |
| NC_009259.1 |
|
| NC_009269.1 |
| NC_009273.1 |
|
| NC_009275.1 |
| NC_009266.1 |
|
| NC_009267.1 |
| NC_009268.1 |
|
| NC_009271.1 |
| NC_009272.1 |
|
| NC_009274.1 |
| NC_009265.1 |
|
| NC_002762.1 |
| NC_004677.2 |
|
| NC_009573.1 |
| NC_007898.2 |
|
| NC_009598.1 |
| NC_009601.1 |
|
| NC_009600.1 |
| NC_009599.1 |
|
| NC_009618.1 |
| NC_009681.1 |
|
| NC_009766.1 |
| NC_009765.1 |
|
| NC_009808.1 |
| NC_009949.1 |
|
| NC_009950.1 |
| NC_009962.1 |
|
| NC_009963.1 |
| NC_010093.1 |
|
| NC_010109.1 |
| NC_002693.2 |
|
| NC_010323.1 |
| NC_010358.1 |
|
| NC_010359.1 |
| NC_010360.1 |
|
| NC_010361.1 |
| NC_010362.1 |
|
| NC_010433.1 |
| NC_010442.1 |
|
| NC_010548.1 |
| NC_003119.6 |
|
| NC_010601.1 |
| NC_010654.1 |
|
| NC_003386.1 |
| NC_010772.1 |
|
| NC_010776.1 |
| NC_011032.1 |
|
| NC_011031.1 |
| NC_011163.1 |
|
| NC_011395.1 |
| NC_011600.1 |
|
| NC_011828.1 |
| NC_011930.1 |
|
| NC_011942.1 |
| NC_011954.1 |
|
| NC_012052.1 |
| NC_012099.1 |
|
| NC_012097.1 |
| NC_012101.1 |
|
| NC_012224.1 |
| NC_012568.1 |
|
| NC_012575.1 |
| NC_004115.1 |
|
| NC_012615.1 |
| NC_011157.3 |
|
| NC_011152.3 |
| NC_012898.1 |
|
| NC_012903.1 |
| NC_012818.1 |
|
| NC_012927.1 |
| NC_012978.1 |
|
| NC_011713.2 |
| NC_013086.1 |
|
| NC_013088.1 |
| NC_013273.1 |
|
| NC_013359.1 |
| NC_013498.1 |
|
| NC_013553.1 |
| NC_004543.1 |
|
| NC_004561.1 |
| NC_013703.1 |
|
| NC_013707.1 |
| NC_013823.1 |
|
| NC_013843.1 |
| NC_008096.2 |
|
| NC_014062.1 |
| NC_014063.1 |
|
| NC_014057.1 |
| NC_014056.1 |
|
| NC_014267.1 |
| NC_014287.1 |
|
| NC_013991.2 |
| NC_014348.1 |
|
| NC_014340.1 |
| NC_014345.1 |
|
| NC_004766.1 |
| NC_001603.2 |
|
| NC_014346.1 |
| NC_004799.1 |
|
| NC_014592.1 |
| NC_014569.1 |
|
| NC_014573.1 |
| NC_014582.1 |
|
| NC_014575.1 |
| NC_014589.1 |
|
| NC_014570.1 |
| NC_014697.1 |
|
| NC_014699.1 |
| NC_014674.1 |
|
| NC_014675.1 |
| NC_014676.1 |
|
| NC_004823.1 |
| NC_014808.1 |
|
| NC_014807.1 |
| NC_014874.1 |
|
| NC_015083.1 |
| NC_015104.1 |
|
| NC_015113.1 |
| NC_015084.1 |
|
| NC_015139.1 |
| NC_004993.1 |
|
| NC_011153.4 |
| NC_011154.4 |
|
| NC_011155.4 |
| NC_011156.4 |
|
| NC_011158.4 |
| NC_011159.4 |
|
| NC_015206.1 |
| NC_015204.1 |
|
| NC_015308.1 |
| NC_015402.1 |
|
| NC_015401.1 |
| NC_015403.1 |
|
| NC_015359.1 |
| NC_015605.1 |
|
| NC_015610.1 |
| NC_015604.1 |
|
| NC_015608.1 |
| NC_015621.1 |
|
| NC_015623.1 |
| NC_015543.1 |
|
| NC_005086.1 |
| NC_005087.1 |
|
| NC_005353.1 |
| NC_005878.2 |
|
| NC_005973.1 |
| NC_006050.1 |
|
| NC_006084.1 |
| NC_006137.1 |
|
| NC_006290.1 |
| NC_006861.1 |
|
| NC_007144.1 |
| NC_007288.1 |
|
| NC_001631.1 |
| NC_001865.1 |
|
| NC_000932.1 |
| NC_007407.1 |
|
| NC_007500.1 |
| NC_007499.1 |
|
| NC_007578.1 |
| NC_001879.2 |
|
| NC_007602.1 |
| NC_007758.1 |
|
| NC_007932.1 |
| NC_007944.1 |
|
| NC_007943.1 |
| NC_007942.1 |
|
| NC_007957.1 |
| NC_007977.1 |
|
The chloroplast genomes of these species were obtained from the NCBI archive and their presence was tested in the sequenced human samples.
The initial number of sequence reads and the ones matching the chloroplast genome collection.
| Sample ID | Fragmentsize (bp) | Total numberof reads | Bowtie | BLAST | ||
| Bacteria | Mammalia | Plantsonly | ||||
| AD1 |
| 88,223,059 | 58 (0.657 ppm) | 11 | 0 | 44 |
| AD2 | 200…10 k | 69,413,572 | 33 (0.475 ppm) | 4 | 0 | 19 |
| AD3 |
| 129,543,409 | 28 (0.216 ppm) | 4 | 0 | 18 |
| CRC1 |
| 211,165,918 | 248 (1.174 ppm) | 70 | 1 | 154 |
| CRC2 | 200…10 k | 214,141,527 | 261 (1.218 ppm) | 42 | 0 | 193 |
| CRC3 |
| 104,843,894 | 184 (1.754 ppm) | 44 | 0 | 112 |
| IBD1 |
| 163,948,523 | 23319 (142.234 ppm) | 820 | 39 | 21565 |
| IBD2 | 200…10 k | 148,952,652 | 237 (1.591 ppm) | 32 | 2 | 190 |
| IBD3 |
| 165,613,909 | 93 (0.561 ppm) | 4 | 0 | 85 |
| NEG1 |
| 153,790,123 | 257 (1.67 ppm) | 82 | 4 | 161 |
| NEG2 | 200…10 k | 137,374,304 | 275 (2.00 ppm) | 47 | 1 | 215 |
| NEG3 |
| 143,953,730 | 90 (0.625 ppm) | 21 | 0 | 69 |
BLAST against the complete NCBI reference sequence database confirms that the chloroplast matching reads identified by the NGS alignment software Bowtie match only plant genomes and just in very few cases mammal genomes. The bacterial alignments are partly the result of the chloroplasts' genetic homology to them or may indicate the presence of circulating bacterial DNA. Among the IBD fractions, the one with largest DNA fragment size has the largest concentration.
Figure 1Coverage of the tomato chloroplast in the IBD sample.
Small gray dots indicate the counts of alignments at individual nucleotide positions, darker shades are the result of several overlapping points. The orange line is the smoothed coverage of the tomato chloroplast, while the short gray dash indicates the average coverage level of the human genome for the same sample.
Figure 2Brassica rapa chloroplast coverage pileup for the DRP000446 study.
The gray spikes shows the counts of alignments at individual nucleotide positions (vertical scale is logarithmic). 27742 nucleotide positions of the total 180852 are covered. There are two regions around 100,000 and 135,000 where the coverage is more than 10 times than at other parts of the chloroplast. These are the regions where the ribosomal RNA genes are found which share very similar sequence with other chloroplasts and bacterial genomes. Indeed if we BLAST all the 1634 reads that matched the chloroplast genomes against the NCBI reference sequence database, 733 of them also match various bacterial genomes, but 897 does not match any other organisms, just plants. Removing those alignments that match bacterial genomes too, (gray spikes) makes the distribution more uniform.
Figure 3The cumulative distribution of plant DNA amount for over 900 subjects.
It (black dots) can be fitted with log-normal distribution (red curve) above the sensitivity cutoff (0.35). The gray shaded band is the result of the simulation of 300 realizations of the log-normal process with taking into account the varying sizes of the samples. Among the independent samples (larger dots), the ones from patients with inflammatory diseases (IBD, DRP000446) have the largest concentration. For the SRP016573 sample only the maternal plasma concentration is shown, full blood samples with 0.001 ppm and 0.004 ppm and cord blood samples with zero alignments are omitted from the figure.
Figure 4This heatmap shows the number of chloroplast matching reads on a scale for the SRP009039 study.
From the total 903 subjects the ones with the largest number of matches are shown (only the plant genomes with more than 50, and only the subjects with more than 10 matching reads), the rows are the plant species, the columns are the samples. The automatic clustering recovers the related plant species and the subjects can be also grouped by the food types.
The number of sequence reads in the samples and the number and ratio of chloroplast matching ones.
| Study accession number | Total number of reads | Bowtie | BLAST | ||
| Bacteria | Mammalia | Plants only | |||
| AD | 287,180,040 | 119 (0.414 ppm) | 19 | 0 | 81 |
| CRC | 530,151,339 | 693 (1.307 ppm) | 156 | 1 | 459 |
| IBD | 478,515,084 | 23649 (49.421 ppm) | 856 | 41 | 21840 |
| NEG | 435,118,157 | 622 (1.429 ppm) | 150 | 5 | 445 |
| DRP000446 | 3,284,956 | 1634 (497.419 ppm) | 733 | 5 | 894 |
| SRP009039 | 2,551,402,380 | 3781 (1.481 ppm) | 908 | 20 | 2746 |
| SRP016573 maternal plasma | 1,578,181,738 | 1110 (0.703 ppm) | 169 | 12 | 851 |
| SRP016573 father full blood | 672,693,854 | 3 (0.004 ppm) | 0 | 0 | 2 |
| SRP016573 mother full blood | 696,977,900 | 1 (0.001 ppm) | 0 | 0 | 1 |
| SRP016573 cord blood | 560,035,524 | 0 (0.000 ppm) | 0 | 0 | 0 |
Though some reads which were identified by the aligner software (Bowtie) can be fitted to other species like bacteria, most of them match plants only.