| Literature DB >> 32055444 |
Paul A Dawson1,2, Shalini J Weerasekera1, Ranita J Atcheson1, Sarah A Twomey2, David G Simmons2.
Abstract
Sulfate is essential for healthy fetal growth and development. Cysteine dioxygenase type 1 (CDO1) plays an important role in the catabolism of cysteine to sulfate. Cdo1 knockout mice exhibit severe and lethal fetal phenotypes but the involvement of CDO1 gene variants in human development is unknown. We searched the NCBI and Ensembl gene databases and identified four alternatively spliced CDO1 coding mRNA transcripts, as well as 148 validated CDO1 gene variants, including 138 missense, 6 nonsense, 1 frameshift, 1 in-frame deletion, and 2 splice site variants. In silico analyses predicted 68 of the missense variants to be deleterious to CDO1 protein structure and function. We examined the relative abundance of the four CDO1 coding mRNA transcripts in human term placentas using qRT-PCR. CDO1 mRNA variant 2 was the most abundant transcript, with intermediate levels of variant 4 and lower levels of variants 1 and 3. Using in situ hybridization, we localised CDO1 mRNA expression to the syncytiotrophoblast layer of human term placenta. To investigate the regulation of CDO1 gene expression, we analysed the transcriptional activity of the human CDO1 5'-flanking region in the JEG-3 placental cell line using luciferase reporter assays. Transcriptional activities were identified in the regions -5 to -269 and - 269 to -1200 nucleotides upstream of the CDO1 transcription initiation site. Mutational analyses of a single nucleotide polymorphism -289C > G that is common in the general population (allele frequency = 0.37) and a putative transcription factor binding motif (CCAAT enhancer binding protein beta) did not alter transcriptional activity of the CDO1 5'-flanking region. Collectively, this study provides an overview and analysis of human CDO1 for future investigations of this gene in human health.Entities:
Keywords: Isoform; Mutation; Placenta; Splice variant; Sulfate
Year: 2020 PMID: 32055444 PMCID: PMC7005546 DOI: 10.1016/j.ymgmr.2020.100568
Source DB: PubMed Journal: Mol Genet Metab Rep ISSN: 2214-4269
Primers used for PCR, mutagenesis, DNA sequencing, and in situ probes.
| Primer | Sequence (5′ to 3′) | |||
|---|---|---|---|---|
| P1 | F | −1200 | ||
| P2 | F | −269 | ||
| P3 | F | −5 | ||
| P4 | R | +311 | ||
| P5 | F | TCAGTCCCCGCAGCCAT | −306 | |
| P6 | R | GGCTGCGGGGACTGACGCTGAGTAAAGGAGGAAAA | −292 | |
| P7 | F | GCATTTCTTCCACCTTT | −915 | |
| P8 | R | AGGTGGAAGAAATGCAAGTGTAGATTTCTTCCTTC | −901 | |
| P9 | F | CTGTGGATAACCGTATTAC | ||
| P10 | R | CAGATGTCGATGTTGGGG | ||
| P11 | F | GTTCGACCAGTACAGTCGTG | V1 c.156 | |
| F | GGTGAAGGACATGGCAGCAG | V2 c.232 | ||
| P13 | F | GGGGTGAAGGACATGGCAGT | V3 c.230 | |
| P14 | F | TGCCAGGGGCCTGGGGGTAT | V4 c.-62 | |
| P15 | R | AGCATCTTCAGAAAGCAGTG | V1 c.350, | V4 c.77 |
| P16 | R | GTAAGCCAATGGAATCATTG | V2 c.418, | V3 c.415 |
| P17 | F | aattaaccctcactaaagggACCTGCTGTGAGCCTTCACT | V1 c.815, | V2 c.758 |
| V3 c.755, | V4 c.351 | |||
| P18 | R | taatacgactcactatagggCCATGGGACACAGACATGAG | V1 c.*1605, | V2 c.*1548 |
| V3 c.*1545, | V4 c.*1141 | |||
Underlined sequence homologous to the pMetLuc2 vector to allow recombination cloning.
Underlined nucleotides introduce the -289C > G SNP (P5) and mutate the core sequence of the CEBPB motif (P7).
Primers located in the pMetLuc2 vector upstream (P9) or downstream (P10) of the multiple cloning site.
CDO1 variant mRNA-specific primers enabled amplification of V1 (P11 + P15), V3 (P13 + P16) and V4 (P14 + P15).
Since primer combination P12 + P16 amplified variants 1, 2 and 4, we calculated variant 2 mRNA abundance as [(amplicon abundance using P12 + P16) – (V1 + V4 amplicon abundance)].
Lower case represents T3 (P17) and T7 (P18) RNA polymerase binding sites, and uppercase represents CDO1 sequences in exon 7 (P17) and the 3′-UTR of exon 8 (P18).
F, forward primer; R, reverse primer.
Number denotes the position of the first 5′-nucleotide in the CDO1 5′-flanking region (P1-P8) or CDO1 variant V1, V2, V3 or V4 mRNAs (P11-P18).
Fig. 1Human CDO1 gene structure, mRNA variants and protein isoforms. (A) Exon-intron organization showing exons (vertical lines) and introns (horizontal lines) spread over approximately 12 kb. Reference sequence NC_000005.10. (B) Schematic showing exons 1–8 (boxes) and protein coding sequences (white portions) for CDO1 mRNA variants V1 (NM_001323565.1), V2 (NM_001801.2), V3 (NM_001323566.1) and V4 (NM_001323567.1). (C) Aligned human CDO1 protein isoform 1 (NP_001310494.1), 2 (NP_001792.2), 3 (NP_001310495.1) and 4 (NP_001310496.1) sequences. Alignments were generated using the Clustal W program [11]. *Amino acids present in all 4 isoforms. Sequences accessed from the NCBI from 19 to 22 January 2019. (D) Validated CDO1 missense (letters), nonsense (*) and in-frame single amino acid deletion (Δ) variants (Ensembl database) are shown below the CDO1 isoform 2 protein sequence. Location of each α-helix (H1-H2, waved lines) and β-sheet (S1-S13, horizontal arrows), as well as the amino acids interacting with cysteine substrate (bar above CDO1 sequence) and/or Fe2+, in CDO1 isoform 2 protein (PDB ID: 2IC1) from the NCBI structure database (https://www.ncbi.nlm.nih.gov/structure/). Highlighted missense variants are predicted to be detrimental (red) or moderately disruptive (blue) to the CDO1 protein structure as determined using a combination of tools (SIFT, PolyPhen, CADD, REVEL, MetaLR, MA). Serine 83 (vertical arrow) is absent in the alternatively spliced CDO1 variant 3 mRNA transcript (as indicated by ΔCAG in panel B).
Fig. 2Tissue expression of CDO1 mRNA. (A) Relative CDO1 mRNA levels in 20 human tissues using data obtained from the online NCBI database http://www.ncbi.nlm.nih.gov/gene, which was accessed 19 January 2019. RPKM, Reads Per Kilobase of transcript, per Million mapped reads. (B) Cellular localization of CDO1 mRNA in human term placenta. Low (top panels) and high (bottom panels) magnification images of placental sections probed with antisense and sense (control) CDO1 probes. Images show antisense probe staining of CDO1 mRNA within the syncytiotrophoblast layer. Black scale bar = 100 μM. (C) Relative abundance of each placental CDO1 mRNA variant (V1-V4) abundance. (D) Comparison of each CDO1 variant (V1-V4) mRNA abundance in placentas of male (M) and female (F) babies (p > 0.05 for each variant). Data are mean ± SEM with n = 4–5 males and n = 3 females.
Fig. 3Phylogenetic analysis of CDO1 and its 5′-flanking region. (A) Phylogenetic tree with distances representing the evolutionary divergence of species that carry a CDO1 gene. Species above the dotted line represent the placental types: hemochorial (red), endotheliochorial (orange) and epitheliochorial (green). Image adapted from a tree generated online using the Interactive Tree of Life (ITOL, http://itol.embl.de/). (B) Multiple species alignment of conserved putative transcription factor binding motifs in the CDO1 5′-flanking region. Thirteen placental species share six highly conserved domains with human CDO1: CEBPB, DLX1, ZNF239, ZNF300, MZF1 and KLF1. Scale bar at top is relative to the +1 of the transcription start site.
Fig. 4Sequence and transcriptional activity of the human CDO1 5′-flanking region. (A) CDO1 nucleotide sequence from −1030 to +70, with +1 and +52 (arrows) denoting the transcription initiation sites of CDO1 mRNA variants 1, 2, 3, 5, 6 and 7, and variants 4 and 8, respectively. Putative transcription factor binding motifs are boxed, with core sequences shaded. *Location of the -289C > G single nucleotide polymorphism (rs34869) reported in the NCBI SNP database. (B) Approximate mRNA expression level of each transcription factor in human placenta and the JEG-3 cell line using data obtained from the online NCBI UniGene EST database http://www.ncbi.nlm.nih.gov/est and the Gene Expression Atlas database https://www.ebi.ac.uk/gxa/home, respectively, which were accessed between 30 and 31 January 2019. TPM, Transcripts Per Million. (C) Transcriptional activity of the 5′-truncated CDO1 5′-flanking region assessed by luciferase activity in JEG-3 cells. Data are shown as fold-induction to luciferase activity of the pMetLuc2 reporter, n = 3. *P < 0.05 and ***P < 0.001 when compared to constructs containing the −1200 fragment. (D) Transcriptional activity of the pMetLuc2–1200 vector containing wild-type sequence (filled symbols) and individually mutated CEBPB motif and introduction of the SNP rs34869 (unfilled symbols). Data are shown as fold-induction to luciferase activity of the pMetLuc2 reporter, n = 3.