| Literature DB >> 31861815 |
Jinghao Chen1,2, Chao Xing1,2, Xin Zheng1, Xiaofang Li1.
Abstract
Functional (meta) genomics allows the high-throughput identification of functional genes in a premise-free way. However, it is still difficult to perform Sanger sequencing for high GC DNA templates, which hinders the functional genomic exploration of a high GC genomic library. Here, we developed a procedure to resolve this problem by coupling the Sanger and PacBio sequencing strategies. Identification of cadmium (Cd) resistance genes from a small-insert high GC genomic library was performed to test the procedure. The library was generated from a high GC (75.35%) bacterial genome. Nineteen clones that conferred Cd resistance to Escherichia coli subject to Sanger sequencing directly. The positive clones were in parallel subject to in vivo amplification in host cells, from which recombinant plasmids were extracted and linearized by selected restriction endonucleases. PacBio sequencing was performed to obtain the full-length sequences. As the identities, partial sequences from Sanger sequencing were aligned to the full-length sequences from PacBio sequencing, which led to the identification of seven unique full-length sequences. The unique sequences were further aligned to the full genome sequence of the source strain. Functional screening showed that the identified positive clones were all able to improve Cd resistance of the host cells. The functional genomic procedure developed here couples the Sanger and PacBio sequencing methods and overcomes the difficulties in PCR approaches for high GC DNA. The procedure can be a promising option for the high-throughput sequencing of functional genomic libraries, and realize a cost-effective and time-efficient identification of the positive clones, particularly for high GC genetic materials.Entities:
Keywords: Cellulomonas; PacBio Sequel; functional genomics; high GC DNA library
Mesh:
Year: 2019 PMID: 31861815 PMCID: PMC7016576 DOI: 10.3390/genes11010007
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Workflow of the functional genomic procedure for a high (G + C)% library described in this study.
Statistical information of the Sanger and PacBio sequencing.
| Clone ID | Partial Length by Sanger Sequencing (Bp) (Forward, Reverse) | Full Length by PacBio Sequencing (Bp) | Genome Locus | GC Content (%) | Identity (%) |
|---|---|---|---|---|---|
| 3,4,18 | 15, 604 | 1487 | 3835167–3836672 | 79.73 | 95.32 |
| 6,7,9,15, 16,29,30,35 | 100, 981 | 841 | 4380530–4381372 | 77.65 | 97.34 |
| 8,27 | 314, 57 | 1447 | 1213476–1214931 | 79.22 | 96.72 |
| 13,28 | 51, 618 | 930 | 1608787–1609730 | 78.16 | 94.32 |
| 14 | 572, 933 | 1258 | 2627868–2629126 | 81.09 | 99.53 |
| 24 | 7, 746 | 900 | 1780961–1781871 | 77.77 | 97.72 |
| 40 | 1, 93 | 1017 | 1342804–1343830 | 79.01 | 95.85 |
Figure 2Screening of restriction endonucleases for the linearization of plasmids carrying the high GC amplicons. (A) Agarose gel (1% w/v) electrophoresis of the plasmids. Lanes: x18 to x29, plasmid x18 to x29; M, molecular weight ladder (1 kbp, Boehringer). (B) HindIII digestion of all plasmids, analyzed by gel electrophoresis. Lanes: M, molecular weight ladder (1 kbp, Boehringer); x18 to x29, plasmid x18 to x29. (C) XbaI digestion of x13. Lanes: x13, plasmid x13; M, molecular weight ladder (1 kbp, Boehringer).
Figure 3Schematic organization of the open reading frames identified in the Cadmium-resistance plasmids. Arrows indicate the directions of transcription of each ORFs in the plasmids. Left-facing ORFs are denoted by dark grey arrows; right facing ORFs are black. Truncated ORFs are indicated as incomplete arrows.
Figure 4Cadmium resistance test of the candidate Cd-resistant clones identified in this study. Lane: pUC118, E. coli strain DH5α carried an empty vector pUC118 as control; x29 to x28, strain DH5α carried vector x29 to x28, respectively. Row: 1 to 10−4, the initial OD 600 nm value of the inoculated strain.