| Literature DB >> 25233860 |
Shuguang Lu, Shuai Le, Yinling Tan, Ming Li, Chang Liu, Kebin Zhang, Jianjun Huang, Haimei Chen, Xiancai Rao, Junmin Zhu, Lingyun Zou, Qingshan Ni, Shu Li, Jing Wang, Xiaolin Jin, Qiwen Hu, Xinyue Yao, Xia Zhao, Lin Zhang, Guangtao Huang, Fuquan Hu1.
Abstract
BACKGROUND: Whole-genome sequencing is an important method to understand the genetic information, gene function, biological characteristics and survival mechanisms of organisms. Sequencing large genomes is very simple at present. However, we encountered a hard-to-sequence genome of Pseudomonas aeruginosa phage PaP1. Shotgun sequencing method failed to complete the sequence of this genome.Entities:
Mesh:
Year: 2014 PMID: 25233860 PMCID: PMC4177049 DOI: 10.1186/1471-2164-15-803
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Bacterial strains and plasmids used in this study
| Strains or plasmids | Relevant characteristics a | Source or reference |
|---|---|---|
| Strains | ||
|
| Belongs to serum typing group 9 of | Laboratory collection |
|
| Belongs to serum typing group 6 of | Laboratory collection |
|
| Host for the construction of shotgun library clones | Promega, WI, USA |
|
| The | This study |
|
| The | This study |
| Plasmids | ||
| pKD3 | Template plasmid for Red system; Ampr, Cmr | [ |
| pKD46 | Red expression plasmid; Ampr | [ |
| pCP20 | Flp expression plasmid; Ampr, Cmr | [ |
| pUC18 | Vector for the construction of shotgun library clones; Ampr | TaKaRa, Shiga, Japan |
aAmpr, ampicillin resistant; Cmr, chloramphenicol resistant.
Primers and other DNA sequences used in this study
| Primers or other DNA sequences a | Sequence (5′-3′) | Target genes or locations |
|---|---|---|
| Construction of the | ||
| Cm-F | GTGTAGGCTGGAGCTGCTTC | Chloromycetin-resistant gene of pKD3 |
| Cm-R | CATATGAATATCCTCCTTAG | |
| Nfi-F | TGTGCCGCCAGAACATGC |
|
| Nfi-R | GACGCAGATGAATTGGGT | |
| H1 | CGTGGAGGCAGTGCATCGACTGTCTGAACAGTATCACCGCTAAGGAGTGATTATG | Upstream of the |
| H2 | TTTGTAACATGTTGAGTTCTCAAATACGGAAATTATCCGCAGTTTACCTGAATTA | Downstream of the |
aPrimers and other DNA sequences were synthesised by BGI-Shenzhen (Shenzhen, China).
Figure 1Shotgun sequencing failed to determine the whole PaP1 genome. (A) PFGE map of the PaP1 genomic DNA, showing that the PaP1 genome should be approximately 90 kb. (B) Length of the 20 contigs obtained using the shotgun method. Contig20 is the longest contig (approximately 26.1 kb) and all of the contigs could not be assembled in an integral genome sequence. The total length of the 20 contigs is approximately 47.7 kb, almost half of 90 kb.
Figure 2Distribution of 20 contigs in the PaP1 genome sequence. The exact location of each contig is shown with a brownish red box. The longest contig (21,553–47,704 position) is approximately 26 kb. The total length of the contigs is approximately 47.7 kb, almost half of 91.7 kb.
Figure 3Trace of IPD ratio variations showing three instances of modified sequence regions in the PaP1 genome. (A) IPD ratios of an m6A and its surrounding bases. (B) IPD ratios of an m4C and its surrounding bases. (C) IPD ratios of an unknown modified base and its surrounding bases.
Figure 4Distribution of modified bases in the PaP1 genome. The positive IPD ratios represent the bases on the positive strand and the negative IPD ratios represent the bases on the negative strand. (A) The IPD ratios and positions of m4Cs and m6As in the PaP1 genome. (B) The IPD ratios and positions of unknown modified bases in the PaP1 genome. See also Additional file 2: Excel S1.
Figure 5Modified motif prediction of the PaP1 genome. (A) Modification scores by cognate base. We selected the top 1,400 context sequences (with a score cutoff of 37) for analysis. A file of 1,400 sequences is suitable as the input for the online motif finding server (MEME). (B) Putative modified motifs determined from the PaP1 genome. A left “+” means forward and the corresponding right “–” means reverse complement.
Comparison of PaP1 ORF48 against putative methyltransferases using BlastP
| # | Species | Subject ID | Alignment length | Score | Identity (%) | E-value |
|---|---|---|---|---|---|---|
| 1 | Pseudomonas phage JG004 | gi|418488276| | 156 | 317 | 99 | 2e-108 |
| 2 |
| gi|262194136| | 137 | 66 | 32 | 3e-10 |
| 3 |
| gi|489222890| | 141 | 66 | 32 | 5e-10 |
| 4 |
| gi|556501605| | 128 | 66 | 34 | 6e-10 |
| 5 |
| gi|489228496| | 130 | 66 | 32 | 6e-10 |
| 6 |
| gi|554465598| | 141 | 65 | 32 | 8e-10 |
| 7 |
| gi|262193963| | 137 | 64 | 32 | 3e-09 |
| 8 |
| gi|491614495| | 128 | 62 | 31 | 9e-09 |
| 9 |
| gi|108757550| | 117 | 61 | 31 | 2e-08 |
| 10 |
| gi|521967607| | 117 | 61 | 31 | 2e-08 |
| 11 |
| gi|310819901| | 117 | 61 | 32 | 3e-08 |
| 12 |
| gi|338536945| | 117 | 61 | 32 | 3e-08 |
| 13 |
| gi|521967482| | 117 | 61 | 32 | 3e-08 |
| 14 |
| gi|521967591| | 117 | 61 | 31 | 4e-08 |
| 15 |
| gi|310819798| | 117 | 60 | 32 | 5e-08 |
Figure 6Relationship of PaP1 ORF48 and related putative methyltransferases. (A) Multiple sequence alignments of PaP1 ORF48 and related putative methyltransferases (listed in Table 3). (B) Phylogenetic analysis of the PaP1 ORF48. This diagram was constructed on the basis of the PaP1 ORF48 and related putative methyltransferases (Table 3). The relative distances of each main branch are also shown in this figure.
Figure 7Agarose gel electropherogram of Endo V digestion. (A) Digestion of the PaP1 genomic DNA by Endo V or EcoRI. Endo V digestion of the PaP1 genomic DNA produced a smear band in the gel. (B) Digestion of the PaP3 genomic DNA by Endo V or EcoRI. The PaP3 genome had been successfully sequenced using the shotgun method before. Unlike the PaP1 genomic DNA, Endo V digestion of the PaP3 genomic DNA gave no smear band in the gel.
Figure 8Construction of the mutant and its use in shotgun sequencing. (A) Schematic of the nfi gene knockout strategy. The PCR product (donor DNA) containing 55 bp upstream homologous extensions of the nfi gene (H1) and 55 bp downstream homologous extensions of the nfi gene (H2) was prepared using the pKD3 plasmid as template. The nfi gene in the chromosome of E. coli DH5α is replaced with chloramphenicol resistant gene (cat) by Red recombination of H1 and H2. cat is then eliminated by subjecting the FLP recognition target (FRT) sites to FLP recombination; a single FRT site is retained. Nfi-F and Nfi-R are primers indicating the change in the nfi gene locus. The length of the region between Nfi-F and Nfi-R primers is 821 bp (nfi remain), 1,169 bp (nfi is replaced by cat), or 237 bp (cat is eliminated). (B) PCR verification using Nfi-F and Nfi-R primers. Lane 1. Wild-type E. coli DH5α (nfi remain). Lane 2. nfi is replaced with cat. Lane 3. cat is eliminated. (C) Distribution of eight newly obtained contigs in the PaP1 genome. These eight contigs were obtained by shotgun sequencing of the PaP1 genome using E. coli DH5α Δnfi as the host to construct shotgun library clones. The blue rectangular boxes represent contigs. The exact location of each contig is indicated by blue boxes.