| Literature DB >> 29385572 |
Marie-Ka Tilak1, Fidel Botero-Castro1, Nicolas Galtier1, Benoit Nabholz1.
Abstract
Standard Illumina libraries are biased toward sequences of intermediate GC-content. This results in an underrepresentation of GC-rich regions in sequencing projects of genomes with heterogeneous base composition, such as mammals and birds. We developed a simple, cost-effective protocol to enrich sheared genomic DNA in its GC-rich fraction by subtracting AT-rich DNA. This was achieved by heating DNA up to 90 °C before applying Illumina library preparation. We tested the new approach on chicken DNA and found that heated DNA increased average coverage in the GC-richest chromosomes by a factor up to six. Using a Taq polymerase supposedly appropriate for PCR amplification of GC-rich sequences had a much weaker effect. Our protocol should greatly facilitate sequencing and resequencing of the GC-richest regions of heterogeneous genomes, in combination with standard short-read and long-read technologies.Entities:
Keywords: GC content; GC enrichment; bird; high-throughput sequencing
Mesh:
Substances:
Year: 2018 PMID: 29385572 PMCID: PMC5808798 DOI: 10.1093/gbe/evy022
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Gene content and GC-content computed in 100-kb nonoverlapping windows across the chicken genomes (Gallus_gallus-5.0). Linear and quadratic regression lines are shown in blue and red, respectively.
Pretreatments, Melting Temperature (Tm) and GC Content (FastQC) for Each Library
| Library | Heating (°C) | Shearing | Polymerase | DMSO (%) | Tm (°C) | GC Content (%) |
|---|---|---|---|---|---|---|
| CHK1 | No | No | Phusion | 0 | 86 | 41 |
| CHK1 | No | No | Kapa | 0 | 86 | 41 |
| CHK2-75 | 75 | No | Phusion | 0 | 86 | 41 |
| CHK2-75 | 75 | No | Kapa | 0 | 86 | 41 |
| CHK3-75 | 75 | 5 min | Phusion | 0 | 86 | 41 |
| CHK3-75 | 75 | 5 min | Kapa | 0 | 86 | 41 |
| CHK2-85 | 85 | No | Phusion | 0 | 88 | 51 |
| CHK2-85 | 85 | No | Kapa | 0 | 89.5 | 52 |
| CHK3-85 | 85 | 5 min | Phusion | 0 | 88 | 51 |
| CHK3-85 | 85 | 5 min | Kapa | 0 | 89.5 | 52 |
| CHK-90 | 90 | 5 min | Kapa | 3 | 86, 91, 94 | 59 |
. 2.—Melting curves of a standard (CHK1, blue) and three heated (CHK2-85, CHK3-85, CHK-90) Illumina libraries.
Mapping of Reads from Standard (CHK1) and Heated (CHK2-85, CHK-90) Libraries to the Reference Chicken Genome
| CHK1 | CHK2-85 | CHK-90 | ||||||
|---|---|---|---|---|---|---|---|---|
| Chromosomes (% GC) | (%) Mapped Reads | (%) GC Mapped Reads | (%) Mapped Reads | (%) GC Mapped Reads | Coverage Increase | (%) Mapped Reads | (%) GC Mapped Reads | Coverage Increase |
| 1–5 (40.3%) | 58.5 | 39 | 43.4 | 51 | <1 | 41.1 | 60 | <1 |
| 6–10 (42.3%) | 13.4 | 41 | 14 | 52 | ∼1 | 13.1 | 66 | ∼1 |
| 11–15 (42.8%) | 7.6 | 43 | 10.4 | 53 | ∼1.4 | 10.9 | 67 | ∼1.4 |
| 16–20 (47.7%) | 3.4 | 46 | 6.9 | 54 | ∼2 | 7.3 | 68 | ∼2.1 |
| 21–25 (50%) | 2.2 | 48 | 5.2 | 55 | ∼2.4 | 5.2 | 68 | ∼2.4 |
| 26–31 (53%) | 1.4 | 51 | 3.9 | 56 | 2.8 | 5.8 | 69 | ∼4.1 |
| 32–33 (54.9%) | 0.06 | 53 | 0.4 | 58 | 6.6 | 0.4 | 70 | 6.6 |
| W-Z-LGE64 (41.3%) | 4.6 | 40 | 4.7 | 52 | ∼1 | 4.27 | 60 | <1 |
Coverage increase was calculated by dividing the percentage of mapped reads of CHK2-85 (respectively, CHK-90) by that of CHK1.
. 3.—(A) Proportion of reads mapped to the various chromosomes of the chicken genome. Colors represent the different libraries. Chromosomes are sorted according to average GC-content. (B) Average GC-content of chromosomes.
. 4.—Relationship between GC-content and coverage recorded on the small contigs of the chicken genome. Contigs are divided into 29 groups (represented by 29 dots with an equal number of contigs) according to their GC-content.