| Literature DB >> 35889020 |
Bruno Maucourt1, David Roche2, Pauline Chaignaud1, Stéphane Vuilleumier1, Françoise Bringel1.
Abstract
Dichloromethane (DCM, methylene chloride) is a toxic halogenated volatile organic compound massively used for industrial applications, and consequently often detected in the environment as a major pollutant. DCM biotransformation suggests a sustainable decontamination strategy of polluted sites. Among methylotrophic bacteria able to use DCM as a sole source of carbon and energy for growth, Methylorubrum extorquens DM4 is a longstanding reference strain. Here, the primary 5'-ends of transcripts were obtained using a differential RNA-seq (dRNA-seq) approach to provide the first transcription start site (TSS) genome-wide landscape of a methylotroph using DCM or methanol. In total, 7231 putative TSSs were annotated and classified with respect to their localization to coding sequences (CDSs). TSSs on the opposite strand of CDS (antisense TSS) account for 31% of all identified TSSs. One-third of the detected TSSs were located at a distance to the start codon inferior to 250 nt (average of 84 nt) with 7% of leaderless mRNA. Taken together, the global TSS map for bacterial growth using DCM or methanol will facilitate future studies in which transcriptional regulation is crucial, and efficient DCM removal at polluted sites is limited by regulatory processes.Entities:
Keywords: dRNA-seq; dehalogenation; dichloromethane; gene expression; genome; methanol; methylotrophy; organohalide pollutant; transcriptional start site
Year: 2022 PMID: 35889020 PMCID: PMC9320726 DOI: 10.3390/microorganisms10071301
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Number of aligned reads on M. extorquens DM4 genome.
| Bank | Methanol | Dichloromethane | ||
|---|---|---|---|---|
| Replicate 1 | Replicate 2 | Replicate 1 | Replicate 2 | |
| “+” | 0.6 × 106 | 0.3 × 106 | 0.8 × 106 | 0.6 × 106 |
| (9%) | (5%) 1 | (14%) 1 | (8%) 1 | |
| “−” | 5.7 × 106 | 4.0 × 106 | 2.2 × 106 | 3.0 × 106 |
| (76%) | (74%) | (51%) | (64%) | |
1 For each biological replicate, percentage of aligned reads per total number of read.
Figure 1Distribution of the experimentally detected transcription start sites on the three replicons of M. extorquens DM4.
Figure 2Genome-wide experimentally detected transcription start sites in M. extorquens DM4. TSS categories were defined using TSSAR [34]. Primary TSSs, also called gene TSSs (red circle), locate within 250 nucleotides upstream of an annotated gene. Internal TSSs (green circle) locate within a gene on the sense strand. Antisense TSSs (yellow circle) locate within a gene, or at less than 30 nucleotide distance of a gene, on the antisense strand. This class further splits into Ai and Ad, for internal antisense and downstream antisense, respectively. Orphan TSSs (grey circle) do not locate nearby annotated genes.
Figure 3Length distribution of the 5′ untranslated region (5′UTR). A 0 distance to start codon represents TSS of leaderless transcripts. In this study, the maximum 5′UTR length was 250 nt.
Figure 4Remarkable nucleic acid motifs associated with M. extorquens DM4 gene expression. Transcription features (this work) and translation features [11] are shown. The −35 and −10 consensus sequences were deduced from TSS (P and IP) on the chromosome.
TSSs detected using dRNA-seq compared to previously mapped TSS with other methods.
| Gene | Product | Label in MaGe 1 | Promoter Region 2 | 5-URT (nt) | Approach 3 | Comments | ||
|---|---|---|---|---|---|---|---|---|
| −35 | −10 + 1 | /Name | ||||||
|
| DCM dehalogenase | METDI2656 | TTGACA <16 nt> TATAGAactagccc | 173 | dRNA-seq | |||
|
| transcriptional regulator of DCM dehalogenase | METDI2655 | TTGCGC <17 nt> TAACTAcaagg | 66 | dRNA-seq | |||
|
| serine hydroxyl-methyltransferase | METDI3959 | TTGGCC <18 nt> ACGAATagtgc | 75 | dRNA-seq | |||
|
| lipoprotein signal peptidase | METDI3108 | TTCCCC <17 nt> TAGAAGcgctcca | 150 | dRNA-seq | |||
|
| methanol dehydrogenase alpha SU precursor | METDI5145 | AAGACA <18 nt> TAGAAAatatag | 168 | dRNA-seq | |||
|
| uncharacterized conserved exported protein | METDI5146 | TTGACC <18 nt> ACCGTTgtcgtc | 41 | dRNA-seq | |||
|
| malate thiokinase large SU | METDI2482 | TTCCCG <17 nt> GAAGGTcggcccaa | 31 | dRNA-seq | |||
|
| coenzyme PQQ biosynthesis | METDI2503 | TGGCGC <19 nt> TGATGGcgcc | 119 | dRNA-seq | |||
|
| serine-glyoxylate | METDI2478 | TTGCGC <16 nt> CGGGATcgccccc G/P | 46 | dRNA-seq | |||
1 MaGe platform website: https://mage.genoscope.cns.fr/microscope/mage/index.php?; 2 TSS; 3 PE, primer extension; 4 ileS and lspA genes overlap over 3 nt in DM4 and AM1 strains suggesting a co-transcription.; 5 nd, data not available; 6 Sequence not found in AM1 sequenced genome [6], 7 Sequence of 30 nt was found between the −35 and −10 boxes in AM1 genome sequence [6].