Literature DB >> 26484274

Piwi proteins and piRNAs in mammalian oocytes and early embryos: From sample to sequence.

David Rosenkranz1, Chung-Ting Han2, Elke F Roovers3, Hans Zischler1, René F Ketting3.   

Abstract

The role of the Piwi/piRNA pathway during mammalian oogenesis has remained enigmatic thus far, especially since experiments with Piwi knockout mice did not reveal any phenotypic defects in female individuals. This is in striking contrast with results obtained from other species including flies and zebrafish. In mouse oocytes, however, only low levels of piRNAs are found and they are not required for their function. We recently demonstrated dynamic expression of PIWIL1, PIWIL2, and PIWIL3 during mammalian oogenesis and early embryogenesis. In addition, small RNA analysis of human, crab-eating macaque and cattle revealed that piRNAs are also expressed in the female germline and closely resemble piRNAs from testis. Here, we thoroughly describe the experimental and computational methods that we applied for the generation, processing and analyses of next generation sequencing (NGS) data associated with our study on Piwi proteins and piRNAs in mammalian oocytes and embryos (Roovers et al., 2015). The complete sequence data is available at NCBI's Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) under the accession GSE64942.

Entities:  

Keywords:  Embryo; Oocytes; Ovary; Piwi; piRNA

Year:  2015        PMID: 26484274      PMCID: PMC4583685          DOI: 10.1016/j.gdata.2015.06.026

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Direct link to deposited data

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE64942.

Experimental design, materials and methods

The Gene Expression Omnibus (GEO) Series GSE64942 comprises 41 high-throughput sequencing datasets (raw and processed) associated with the Roovers et al. study [1]. The following table lists all samples and provides the most relevant data associated with each sample (Table 1).
Table 1

Key features of GEO Series GSE64942 samples (hsap: Homo sapiens, mfas: Macaca fascicularis, and btau: Bos taurus).

GEO idSample nameSpeciesTissue or cell typeNaIO4 treatment
GSM1584495Bovine_GV_abtauOocytes at GV stageNo
GSM1584496Bovine_GV_b1btauOocytes at GV stageNo
GSM1584497Bovine_MI_1btauOocytes at MI stageNo
GSM1584498Bovine_MI_2btauOocytes at MI stageNo
GSM1584499Bovine_MII_a1btauOocytes at MII stageNo
GSM1584500Bovine_MII_a2btauOocytes at MII stageNo
GSM1584501Bovine_MII_b1btauOocytes at MII stageNo
GSM1584502Bovine_MII_b2btauOocytes at MII stageNo
GSM1584503Bovine_ovary_1btauAdult ovaryNo
GSM1584504Bovine_ovary_2btauAdult ovaryNo
GSM1584505Bovine_oxidized_1btauAdult ovaryYes
GSM1584506Bovine_oxidized_2btauAdult ovaryYes
GSM1584507Bovine_testis_1btauAdult testisNo
GSM1584508Bovine_testis_2btauAdult testisNo
GSM1584509Bovine_cumulus_MI_1btauCumulus cells from MI oocytesNo
GSM1584510Bovine_cumulus_MI_2btauCumulus cells from MI oocytesNo
GSM1584511Bovine_cumulus_MII_1btauCumulus cells from MII oocytesNo
GSM1584512Bovine_cumulus_MII_2btauCumulus cells from MII oocytesNo
GSM1584513Bovine_IVF_1btauIVF embryos in 2–4 cell stagesNo
GSM1584514Bovine_IVF_2btauIVF embryos in 2–4 cell stagesNo
GSM1584515Macaque_ovary_1mfasAdult ovaryNo
GSM1584516Macaque_ovary_2mfasAdult ovaryNo
GSM1584517Macaque_oxidized_1mfasAdult ovaryYes
GSM1584518Macaque_oxidized_2mfasAdult ovaryYes
GSM1584519Macaque_testis_1mfasAdult testisNo
GSM1584520Macaque_testis_2mfasAdult testisNo
GSM1584521Human_ovary_1hsapAdult ovaryNo
GSM1584522Human_ovary_2hsapAdult ovaryNo
GSM1584523Human_ovary_1_oxidizedhsapAdult ovaryYes
GSM1584524Human_ovary_2_oxidizedhsapAdult ovaryYes
GSM1584525Fetal_1st_1hsapOvary from 1st trimester fetusNo
GSM1584526Fetal_1st_2hsapOvary from 1st trimester fetusNo
GSM1584527Fetal_1st_1_oxidizedhsapOvary from 1st trimester fetusYes
GSM1584528Fetal_1st_2_oxidizedhsapOvary from 1st trimester fetusYes
GSM1584529Fetal_2nd_1hsapOvary from 2nd trimester fetusNo
GSM1584530Fetal_2nd _2hsapOvary from 2nd trimester fetusNo
GSM1584531Fetal_2nd _1_oxidizedhsapOvary from 2nd trimester fetusYes
GSM1584532Fetal_2nd _2_oxidizedhsapOvary from 2nd trimester fetusYes
GSM1614231Bovine_oxidized_2_repeatbtauAdult ovaryYes
GSM1614232Bovine_cumulus_GV_abtauCumulus cells from oocytes at GV stageNo
GSM1614233Bovine_cumulus_GV_bbtauCumulus cells from oocytes at GV stageNo

RNA isolation and NaIO4 treatment

RNA was extracted from the ovary and testis tissues by adding 750 μl TRIzol LS reagent (Ambion) and the tissues were mashed and sonicated (3 times 30 s, low power). The oocyte samples of all stages (~ 1000 oocytes per sample) and the IVF samples (450 embryos per sample) were treated similarly, but instead of adding 750 μl TRIzol LS, only 325 μl TRIzol LS was used (for following steps half amounts were used as well). First trimester fetal samples were taken up in 325 μl TRIzol LS and sonicated 3 times 30 s, whereas second trimester ovary samples were first grinded under liquid N2 and then taken up in 325 μl TRIzol LS. The RNA extraction was performed according to instructions of the manufacturer with small adjustments: Precipitation was performed overnight at − 80 °C with the addition of 1 μl glycoblue (Ambion). Secondly, samples were spun down for 1 h at 16,000 × g at 4 °C before washing with 70% ethanol. Samples that were sequenced directly without NaIO4 treatment were taken up in 6 μl RNase free MQ followed by library preparation. Samples that were oxidized were either first enriched for > 200 nt RNA molecules (bovine, macaque and human ovary) using the mirVana kit (Ambion), or were oxidized directly (low input samples: GV oocytes and fetal ovary tissues). In the case of GV oocytes, to control for successful oxidation and subsequent RNA isolation, we added 1/10th concentration of macaque testis RNA, since the testis piRNAs are methylated and the methylation state of the piRNAs from GV stage oocytes was still unknown. For the oxidation, we divided the samples into two and performed NaIO4 treatment or mock treatment as follows: prepare 5 × borate buffer (148 mM borax, 148 boric acid, adjust pH to 8.6) and prepare freshly 200 mM NaIO4. Mix 4 μl 5 × borate buffer, 2.5 μl 200 mM NaIO4 or 2.5 μl MQ (mock samples), RNA, MQ to 20 μl. Leave the reaction at RT for 10 min and add 2 μl glycerol and incubate 1 more minute at RT. Continue with RNA precipitation by adding 2.2 μl 3 M NaAc pH 5.5, 25 μl isopropanol and 1 μl glycoblue. Mix and precipitate overnight at − 80 °C. Spin down samples 1 h at 16,000 × g at 4 °C and wash once with 70% ethanol. Spin 20 more minutes, remove supernatant and take up the pellet in 6 μl RNase free MQ and continue with library preparation.

Library construction and high-throughput sequencing

Total RNA was subjected to 15% TBE-urea gel for size selection of 15–35 nt. This excised gel fraction was eluted in 0.3 M NaCl for > 16 h and precipitated with 100% isopropanol and Glycoblue for > 1 h at − 20 °C. The precipitated RNA pellet was washed once with 75% ethanol and dissolved in nuclease-free water. The purified RNA fraction was confirmed by Bioanalyzer Small RNA assay (Agilent). Library preparation was based on the NEBNext® Small RNA Library Prep Set for Illumina® (New England Biolabs) with minor modifications. To counteract ligation bias and to remove PCR duplicates, small RNA was first ligated to the 3′ adapter and then the 5′ adapter, both of which contained four random bases at the 5′ and 3′ end, respectively. Adapters with random bases were chemically synthesized by Bioo Scientific. Adapter-ligated RNA was reverse-transcribed and PCR amplified for 14 cycles using index primers. The PCR amplified cDNA construct was checked on the Bioanalyzer (Agilent) using High Sensitivity DNA assay. We performed a size selection of the small RNA library on LabChip XT instrument (PerkinElmer) using the DNA 300 assay kit. All libraries were pooled to obtain 10 nM, which was denatured to 9 or 10 pmol with 5% PhiX spiked-in and sequenced as single-read for 50 cycles on an Illumina MiSeq or HiSeq 2500 instrument in either rapid or high-output mode.

Computational processing and analysis of sequence data sets

Software deposition

For the processing, filtering and analyses of sequence datasets we applied a set of Perl scripts that are available using the following download link: http://www.smallrnagroup-mainz.de/software/scripts_Roovers-et-al.zip. Many of these Perl scripts are part of the NGS toolbox, which is subject to constant updating and debugging. The NGS toolbox collection as well as the latest sRNAmapper and proTRAC [2] software version including detailed documentations are available at http://www.smallrnagroup-mainz.de/software.html.

Data filtering, mapping and annotation

The 5′ and 3′ RNA adapters used for library construction were tagged with a short stretch of four random nucleotides at their 3′ or 5′ end, respectively. As a result, all 50 bp raw sequence reads comprise the original small RNA sequence flanked by four random nucleotides, and end with the 3′ adapter sequence (Fig. 1).
Fig. 1

Scheme of sequencing construct. The cloned RNA molecule is flanked by 5′ and 3′ random tags. The read length is 50 nt and therefore, depending on the cloned RNA length, typically ends with an incomplete 3′ adapter sequence.

In order to avoid any PCR-induced bias we clipped adapter sequences and excluded raw sequence reads with identical small RNA sequences and flanking random tags from the sequence data sets as these may constitute cloning products that each represents the same molecule (cliplinker_random.pl). Remaining sequence reads within the size range of 18–34 nt (NGS toolbox: length-filter) were collapsed to non-identical sequences while storing information on read counts for each sequence in the FASTA headers (NGS toolbox: collapse). We then mapped the sequences to the genome of Homo sapiens (assembly: GRCh38, GenBank accession: GCA_000001405.15), Macaca fascicularis (assembly: Macaca_fascicularis_5.0, GenBank accession: GCA_000364345.1) and Bos taurus (assembly: UMD3.1, GenBank accession: 6 GCA_000003055.3) using piRmapper_1.0.pl which is now replaced by the sRNAmapper software that is available at http://www.smallrnagroup-mainz.de/software.html. The mapping process was performed requiring a perfect match for positions 1 to 18 and allowing one internal mismatch in the following part of the sequence. In addition, we allowed up to two non-template 3′ nucleotides in order to map piRNAs that may carry post-transcriptionally modified 3′ ends. Fig. 2 displays the results of the initial processing and mapping procedures.
Fig. 2

Results of initial processing and mapping.

Sequences that were successfully mapped to the genome were also mapped (in sense orientation) to non-coding (nc) RNA datasets from Ensembl database (release 77), miRBase (release 21 [3]), Genomic tRNA database [4], and Silva rRNA database (release 119, [5]) using SeqMap [6] in order to identify miRNAs or fragments of miRNA precursors, lincRNA (no lincRNAs are described for Bos taurus), miscRNA, rRNA, snoRNA, snRNA and tRNA. In addition, we scanned for low complexity reads discarding reads with ≥ 75% of their sequence being composed of the same 1–4 nt motif (kill_simples.pl. = NGS toolbox: duster). In case of multiple annotation for one sequence, we apportioned the read counts accordingly to the different ncRNA types. Fig. 3 displays the results of this annotation step. Exact values for the information provided in the Fig. 2, Fig. 3 are available in supplementary file 1 (Excel sheet). A set of FASTA-formatted sequence files with sequences sorted according to the described annotation are available for each sample in supplementary file 2 (zip-compressed folder).
Fig. 3

Results of ncRNA annotation for successfully mapped reads.

Cloning efficiency of different random tags

Besides the possibility to minimize a putative sequence bias induced by PCR amplification, the usage of randomly tagged RNA adapters also ensures a high cloning efficiency due to providing all possible ligation substrates in order to antagonize putative ligation bias resulting from unknown or obscure ligation preferences of the applied RNA ligase. Indeed, we observed a bias for specific random tags in both 5′ and 3′ adapters across the different sequence datasets (Fig. 4).
Fig. 4

Positional nucleotide composition of 5′ and 3′ random tags.

While all positions of the 5′ random tag are biased towards G, only the first position of the 3′ random tag is biased towards G, whereas the following positions are biased towards T (U). Interestingly, this pattern is essentially the same comparing 3′ uridylated testis and 3′ adenylated oocyte and IVF derived sequences [1]. This suggests equal ligation efficiency and adapter bias for 3′ U and 3′ A RNA populations. However, we cannot exclude a difference in the population composition of cloned RNAs for different random adapters as observed by Jayaprakash and coworkers [7].

Prediction of genomic piRNA clusters

We predicted genomic piRNA clusters with proTRAC version 2.0.4 using the following command line arguments: -clstrand 0.5, -pimin 24, -pimax 31, -distr 1-90, -nr, -nh, and -mmr. The option -mmr (per million mapped reads) is renamed -rpm (reads per million) in proTRAC version 2.0.5 and later. The original proTRAC output is available for download at http://www.smallrnagroup-mainz.de/data/proTRAC_Roovers-et-al.zip as these results are not attached to the original study. Selected datasets from this GEO Series have also been added to the piRNA cluster database that can be accessed under the following URL: http://www.smallrnagroup-mainz.de/piRNAclusterDB.html.
Specifications
Organism/cell line/tissueHuman (adult and fetal ovary), marmoset (adult testis and ovary), cattle (adult ovary, oocytes, cumulus and in vitro fertilized 2–4 cell stage embryos)
SexMale and female
Sequencer or array typeIllumina HiSeq 2500
Data formatRaw and analyzed
Experimental factorsNormal and untreated tissues/cells
Experimental featuresSequencing of small RNA transcriptomes. Comparison of small RNA libraries with and without a NaIO4 oxidation step (for selected samples).
ConsentHuman adult ovary samples were from cancer patients that underwent unilateral oophorectomy for fertility preservation and have signed informed consent. The human fetal material used was from elective abortions and donated for research with informed consent. The research on human material was approved by the Medical Ethical Committee of the Leiden University Medical Center (CME P08.087 and CME 05/03 K/YR).
Sample source locationLeiden, Netherlands (human samples), Göttingen, Germany (macaque samples), Utrecht, Netherlands (bovine samples)
  7 in total

1.  The microRNA Registry.

Authors:  Sam Griffiths-Jones
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  SeqMap: mapping massive amount of oligonucleotides to the genome.

Authors:  Hui Jiang; Wing Hung Wong
Journal:  Bioinformatics       Date:  2008-08-12       Impact factor: 6.937

3.  Piwi proteins and piRNAs in mammalian oocytes and early embryos.

Authors:  Elke F Roovers; David Rosenkranz; Mahdi Mahdipour; Chung-Ting Han; Nannan He; Susana M Chuva de Sousa Lopes; Lucette A J van der Westerlaken; Hans Zischler; Falk Butter; Bernard A J Roelen; René F Ketting
Journal:  Cell Rep       Date:  2015-03-26       Impact factor: 9.423

4.  proTRAC--a software for probabilistic piRNA cluster detection, visualization and analysis.

Authors:  David Rosenkranz; Hans Zischler
Journal:  BMC Bioinformatics       Date:  2012-01-10       Impact factor: 3.169

5.  Identification and remediation of biases in the activity of RNA ligases in small-RNA deep sequencing.

Authors:  Anitha D Jayaprakash; Omar Jabado; Brian D Brown; Ravi Sachidanandam
Journal:  Nucleic Acids Res       Date:  2011-09-02       Impact factor: 16.971

6.  GtRNAdb: a database of transfer RNA genes detected in genomic sequence.

Authors:  Patricia P Chan; Todd M Lowe
Journal:  Nucleic Acids Res       Date:  2008-11-04       Impact factor: 16.971

7.  The SILVA ribosomal RNA gene database project: improved data processing and web-based tools.

Authors:  Christian Quast; Elmar Pruesse; Pelin Yilmaz; Jan Gerken; Timmy Schweer; Pablo Yarza; Jörg Peplies; Frank Oliver Glöckner
Journal:  Nucleic Acids Res       Date:  2012-11-28       Impact factor: 16.971

  7 in total
  26 in total

1.  Co-chaperone Hsp70/Hsp90-organizing protein (Hop) is required for transposon silencing and Piwi-interacting RNA (piRNA) biogenesis.

Authors:  Joseph A Karam; Rasesh Y Parikh; Dhananjaya Nayak; David Rosenkranz; Vamsi K Gangaraju
Journal:  J Biol Chem       Date:  2017-02-13       Impact factor: 5.157

2.  Epigenetic effects of environmental chemicals: insights from zebrafish.

Authors:  Neelakanteswar Aluru
Journal:  Curr Opin Toxicol       Date:  2017-07-14

3.  HIV-1 Tat and cocaine coexposure impacts piRNAs to affect astrocyte energy metabolism.

Authors:  Mayur Doke; Fatah Kashanchi; Mansoor A Khan; Thangavel Samikkannu
Journal:  Epigenomics       Date:  2022-02-16       Impact factor: 4.778

4.  piRNA-like small RNAs target transposable elements in a Clade IV parasitic nematode.

Authors:  Mona Suleiman; Asuka Kounosu; Ben Murcott; Mehmet Dayi; Rebecca Pawluk; Akemi Yoshida; Mark Viney; Taisei Kikuchi; Vicky L Hunt
Journal:  Sci Rep       Date:  2022-06-16       Impact factor: 4.996

5.  Small RNA sequencing of cryopreserved semen from single bull revealed altered miRNAs and piRNAs expression between High- and Low-motile sperm populations.

Authors:  E Capra; F Turri; B Lazzari; P Cremonesi; T M Gliozzi; I Fojadelli; A Stella; F Pizzi
Journal:  BMC Genomics       Date:  2017-01-04       Impact factor: 3.969

6.  unitas: the universal tool for annotation of small RNAs.

Authors:  Daniel Gebert; Charlotte Hewel; David Rosenkranz
Journal:  BMC Genomics       Date:  2017-08-22       Impact factor: 3.969

7.  Two modes of targeting transposable elements by piRNA pathway in human testis.

Authors:  Ildar Gainetdinov; Yulia Skvortsova; Sofia Kondratieva; Sergey Funikov; Tatyana Azhikina
Journal:  RNA       Date:  2017-08-25       Impact factor: 4.942

8.  Of rodents and ruminants: a comparison of small noncoding RNA requirements in mouse and bovine reproduction.

Authors:  Lauren G Chukrallah; Aditi Badrinath; Kelly Seltzer; Elizabeth M Snyder
Journal:  J Anim Sci       Date:  2021-03-01       Impact factor: 3.159

9.  piRNA cluster database: a web resource for piRNA producing loci.

Authors:  David Rosenkranz
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

10.  Hypomethylation and Genetic Instability in Monosomy Blastocysts May Contribute to Decreased Implantation Potential.

Authors:  Blair R McCallie; Jason C Parks; Alyssa L Patton; Darren K Griffin; William B Schoolcraft; Mandy G Katz-Jaffe
Journal:  PLoS One       Date:  2016-07-19       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.