| Literature DB >> 26698853 |
Margaret Staton1, Teodora Best2, Sudhir Khodwekar3, Sandra Owusu3, Tao Xu4, Yi Xu4, Tara Jennings5, Richard Cronn5, A Kathiravetpilla Arumuganathan6, Mark Coggeshall7, Oliver Gailing3, Haiying Liang4, Jeanne Romero-Severson8, Scott Schlarbaum9, John E Carlson2.
Abstract
Forest health issues are on the rise in the United States, resulting from introduction of alien pests and diseases, coupled with abiotic stresses related to climate change. Increasingly, forest scientists are finding genetic/genomic resources valuable in addressing forest health issues. For a set of ten ecologically and economically important native hardwood tree species representing a broad phylogenetic spectrum, we used low coverage whole genome sequencing from multiplex Illumina paired ends to economically profile their genomic content. For six species, the genome content was further analyzed by flow cytometry in order to determine the nuclear genome size. Sequencing yielded a depth of 0.8X to 7.5X, from which in silico analysis yielded preliminary estimates of gene and repetitive sequence content in the genome for each species. Thousands of genomic SSRs were identified, with a clear predisposition toward dinucleotide repeats and AT-rich repeat motifs. Flanking primers were designed for SSR loci for all ten species, ranging from 891 loci in sugar maple to 18,167 in redbay. In summary, we have demonstrated that useful preliminary genome information including repeat content, gene content and useful SSR markers can be obtained at low cost and time input from a single lane of Illumina multiplex sequence.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26698853 PMCID: PMC4689444 DOI: 10.1371/journal.pone.0145031
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Coordinates for sampled trees.
| Species | Location | °N Latitude | °W Longitude |
|---|---|---|---|
| Acer saccharum | Boone County, MO | 38° 56’46.27” | 92° 19’27.99” |
| Juglans nigra | Boone County, MO | 39° 01’05.04” | 92° 45’42.81” |
| Liquidambar styraciflua | Boone County, MO | 38° 56’44.25” | 92° 19’24.66” |
| Nyssa sylvatica | Boone County, MO | 38° 57’02.75” | 92° 18’33.07” |
| Gleditisia triacanthos | DeKalb, TN | 35° 54’46.607” | 85° 54’32.083” |
| Quercus alba | Ava, MO | 36057’17” | 92039’40” |
| Persea borbonia | St. Stephen, SC | 33021’538” | 79° 58’136” |
Genome size estimates obtained from flow cytometry.
| Species name | 2C DNA in pg (mean value) | Estimated 1n in Mbp | # genotypes; # replicates measured |
|---|---|---|---|
|
| 1.42 | 695 | 4; 4 |
|
| 2.53 | 1,238 | 4; 4 |
|
| 1.99 | 975 | 3; 4 |
|
| 2.57 | 1,255 | 2; 4 |
|
| 1.63 | 799 | 6; 4 |
|
| 1.90 | 930 | 3; 4 |
Measurements are provided in picograms (pg) and millions of nucleotide base pairs (Mbp).
Statistics from low coverage whole genome sequencing and microsatellite discovery.
| Species | Genome coverage (X-fold depth) | GC% | # gSSRs | # PALs | gSSRs per Mb | PALs per Mb | % of PALs matching a plant gene | % of trinucleotide PALs matching a plant gene |
|---|---|---|---|---|---|---|---|---|
| Black cherry | 2.63 | 36.8 | 30,818 | 8,932 | 24.0 | 6.9 | 6.8% | 18.0% |
| Black walnut | 1.71 | 34.7 | 44,577 | 12,751 | 37.4 | 10.7 | 2.7% | 5.6% |
| Blackgum | 0.04 | 34.0 | 8,154 | 1,103 | 153.8 | 20.8 | 1.4% | 2.5% |
| Green ash | 0.77 | 34.7 | 13,590 | 2,650 | 18.1 | 3.5 | 2.4% | 2.5% |
| Honeylocust | 1.71 | 31.6 | 30,997 | 4,715 | 14.4 | 2.2 | 2.4% | 5.9% |
| Redbay | 1.49 | 37.9 | 56,887 | 18,167 | 30.4 | 9.7 | 1.3% | 1.8% |
| Sugar maple | N/A | 38.1 | 6,051 | 891 | 59.0 | 8.7 | 1.3% | 1.9% |
| Sweetgum | 0.14 | 36.9 | 7,340 | 1,889 | 63.7 | 16.4 | 1.2% | 1.5% |
| White ash | 0.34 | 33.9 | 5,325 | 1,079 | 16.8 | 3.4 | 3.2% | 2.2% |
| White oak | 0.11 | 33.6 | 6,995 | 1,005 | 84.0 | 12.1 | 1.6% | 4.8% |
Genome coverage and GC percentages calculated from reconstructed fragments, i.e. overlapping paired ends joined to create a single sequence. Potentially amplifiable loci (PALs) are gSSRs (genomic SSRs) that have flanking primers, allowing them to be tested for polymorphism. The rates of these markers per million bases (Mb) are calculated based on the total bases of reconstructed fragments for each species.
Fig 1Identified repetitive elements and genes in genomic reads.
The percent of reconstructed fragments with sequence similarity to known plant repetitive elements and gene sequences vary across species. The majority of identified repetitive elements originate from the retrotransposon classes of Gypsy and Copia.
Percent of genes represented in low coverage reconstructed fragments, based on comparison to three model plant species.
| % | % | % | |
|---|---|---|---|
| Black Cherry | 41% | 52% | 55% |
| Black Walnut | 39% | 49% | 53% |
| Blackgum | 12% | 14% | 15% |
| Green Ash | 39% | 49% | 51% |
| Honeylocust | 41% | 51% | 54% |
| Redbay | 41% | 49% | 53% |
| Sugar Maple | 18% | 21% | 23% |
| Sweetgum | 21% | 24% | 27% |
| White Ash | 32% | 40% | 41% |
| White Oak | 13% | 15% | 17% |
Fig 2Number of PALs (potentially amplifiable loci) for each of ten hardwood tree species.
Hundreds to thousands of PALs were identified for each species sequenced. For all species the most commonly identified repeat motif was 2 bases, followed by 3 base motifs. Reptitive motifs of 4 bases were found the least often.
Frequency of repeat patterns for two base pair and three base pair motif gSSRs.
For both 2- and 3-base pair repeat motifs, repeats with a lower GC% was more commonly found. Shifted and reverse complement motifs were merged into a single category; e.g. AG shifted by a single base is GA, and on the reverse strand is CT or TC. Abbreviations: BC = black cherry, BW = black walnut, BG = blackgum, GA = green ash, HL = honeylocust, RB = redbay, SM = sugar maple, SG = sweetgum, WA = white ash, WO = white oak.
|
|
|
|
|
|
|
|
|
|
|
|
| AT|TA | 38% | 56% | 74% | 23% | 64% | 55% | 51% | 39% | 32% | 55% |
| AG|GA|CT|TC | 57% | 39% | 14% | 42% | 22% | 39% | 36% | 50% | 49% | 37% |
| AC|CA|TG|GT | 5% | 5% | 12% | 35% | 14% | 6% | 13% | 11% | 19% | 8% |
| GC|CG | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
|
|
|
|
|
|
|
|
|
|
|
|
| AAT|ATA|TAA|ATT|TTA|TAT | 52% | 59% | 78% | 53% | 64% | 37% | 51% | 38% | 65% | 58% |
| ATG|TGA|GAT|CAT|ATC|TCA | 10% | 5% | 3% | 8% | 1% | 14% | 8% | 4% | 8% | 7% |
| AAG|AGA|GAA|CTT|TTC|TCT | 19% | 26% | 13% | 22% | 21% | 32% | 27% | 46% | 14% | 23% |
| AGT|GTA|TAG|ACT|CTA|TAC | 1% | 1% | 0% | 3% | 0% | 2% | 0% | 0% | 2% | 1% |
| AAC|ACA|CAA|GTT|TTG|TGT | 5% | 2% | 3% | 5% | 11% | 6% | 7% | 6% | 3% | 9% |
| CCA|CAC|ACC|TGG|GTG|GGT | 3% | 1% | 0% | 6% | 0% | 2% | 2% | 2% | 5% | 2% |
| AGC|GCA|CAG|GCT|CTG|TGC | 4% | 2% | 1% | 1% | 1% | 2% | 1% | 0% | 1% | 0% |
| AGG|GAG|GGA|CCT|CTC|TCC | 5% | 4% | 3% | 2% | 1% | 4% | 2% | 3% | 2% | 0% |
| ACG|CGA|GAC|CGT|GTC|TCG | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| GGC|GCG|CGG|GCC|CCG|CGC | 0% | 0% | 0% | 0% | 1% | 0% | 0% | 0% | 0% | 0% |