Literature DB >> 28338730

Genetic variability of the activity of bidirectional promoters: a pilot study in bovine muscle.

Cédric Meersseman1,2, Rabia Letaief1, Véronique Léjard1, Emmanuelle Rebours1, Gabriel Guillocheau1, Diane Esquerré3, Anis Djari4, Amanda Chamberlain5,6, Christy Vander Jagt5,6, Christophe Klopp4, Mekki Boussaha1, Gilles Renand1, Abderrahman Maftah2, Daniel Petit2, Dominique Rocha1.   

Abstract

Bidirectional promoters are regulatory regions co-regulating the expression of two neighbouring genes organized in a head-to-head orientation. In recent years, these regulatory regions have been studied in many organisms; however, no investigation to date has been done to analyse the genetic variation of the activity of this type of promoter regions. In our study, we conducted an investigation to first identify bidirectional promoters sharing genes expressed in bovine Longissimus thoracis and then to find genetic variants affecting the activity of some of these bidirectional promoters. Combining bovine gene information and expression data obtained using RNA-Seq, we identified 120 putative bidirectional promoters active in bovine muscle. We experimentally validated in vitro 16 of these bidirectional promoters. Finally, using gene expression and whole-genome genotyping data, we explored the variability of the activity in muscle of the identified bidirectional promoters and discovered genetic variants affecting their activity. We found that the expression level of 77 genes is correlated with the activity of 12 bidirectional promoters. We also identified 57 single nucleotide polymorphisms associated with the activity of 5 bidirectional promoters. To our knowledge, our study is the first analysis in any species of the genetic variability of the activity of bidirectional promoters.
© The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

Entities:  

Keywords:  bidirectional promoter; cattle genome; genetic variability; muscle

Mesh:

Substances:

Year:  2017        PMID: 28338730      PMCID: PMC5499805          DOI: 10.1093/dnares/dsx004

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


1. Introduction

Whole-genome analyses have identified a class of regulatory regions that contain elements that initiate transcription of two different genes positioned with a head-to-head (5′–5′) orientation on opposite strands of the DNA. These regions termed ‘bidirectional promoters’ (BiPs) often have fewer than 1,000 base pairs separating the transcription start sites (TSSs) of the two genes. Bidirectional promoters have been known for years; however, recent genome-scale studies have shown that the regulation of the expression of up to 10% of the genes is controlled by bidirectional promoters. Bidirectional promoters are a common feature within not only the human genome but are also present in many other genomes, including yeast, plants, Invertebrates and Vertebrates., It has been shown that some bidirectional promoters, such as those of the histone genes, regulate the transcription of pair of genes that need coordinated expression to maintain stoichiometric relationships, or regulate the coexpression of genes that function in the same biological pathway. Detailed studies on bidirectional promoters have also shown that some are widely conserved among Vertebrates. The presence of such genomic architecture in many metazoans and the high inter-species conservation seen for some bidirectional promoters suggest that they are functionally important. Despite substantial interest in bidirectional promoters, the functional mechanisms underlying the activation of bidirectional promoters are currently not well characterized and, to our knowledge, no analysis of the variability of the activity of bidirectional promoters has been done so far. The aim of our study was to explore bidirectional promoter activity variation and to identify if genetic variants contribute to this variability. We performed this study in bovine Longissimus thoracis (LT). First, we identified bovine bidirectional promoters active in muscle, combining bovine gene information and RNA-Seq data. We then experimentally validated in vitro a subset of these bidirectional promoters. Finally, using gene expression and whole-genome genotyping data generated for 20 samples, we explore the variability of the activity in muscle of the identified bidirectional promoters and discovered genetic variants affecting this activity. We identified 120 bovine bidirectional promoters active in LT. A total of 16 bidirectional promoters were selected and all were validated experimentally in vitro. As we had RNA-Seq data for 20 different samples from the same tissue, it is possible to analyse the inter-individual variability of the activity of these 120 bidirectional promoters. As previously described, expression levels of gene pairs sharing a BiP are more correlated than those of neighbouring gene pairs in a unidirectional or convergent configuration. In addition, we found that the expression level of 77 genes is highly correlated with the activity of 12 bidirectional promoters active in muscle. We also identified 57 SNPs associated significantly with the activity of 5 bidirectional promoters. To our knowledge, our study is the first analysis in any species of the genetic variability of the activity of bidirectional promoters.

2. Materials and methods

2.1. Animals and tissue samples

The study was conducted with 11 Limousin bull calves from a large study on the genetic determinism of beef and meat quality traits. These eleven bull calves were not closely related to one another (for at least four generations). They were fattened in a single feedlot and fed ad libidum with wet corn silage. They were humanely slaughtered in an accredited commercial slaughterhouse when they reached 16 months. LT muscle samples were dissected immediately after death and tissue samples were snap frozen in liquid nitrogen and stored at −80 °C until analysis.

2.2. RNA sequencing and data analysis

RNA extraction and sequencing were performed as previously described., Briefly, after transfer to ice-cold RNeasy RLT lysis buffer (Qiagen), LT tissue samples were homogenized using a Precellys tissue homogeniser (Bertin Technologie). Total RNA was isolated using RNeasy Midi columns (Qiagen) and then treated with RNAse-free DNase I (Qiagen) for 15 min at room temperature according to the manufacturer’s protocols. The concentration of total RNA was measured with a Nanodrop ND-100 instrument (Thermo Scientific) and the quality was assessed with an RNA 6000 Nano Labchip kit using an Agilent 2100 Bioanalyzer (Agilent Technologies). All 11 samples had an RNA integrity number (RIN) value greater than eight. The mRNA-Seq libraries were prepared using the TruSeq RNA Sample Preparation Kit (Illumina) according to the manufacturer's instructions. Briefly, Poly-A containing mRNA molecules were purified from 4 μg total RNA of each sample using oligo(dT) magnetic beads and fragmented into 150–400 bp pieces using divalent cations at 94 °C for 8 min. The cleaved mRNA fragments were converted to double-stranded cDNA using SuperScript II reverse transcriptase (Life Technologies) and primed by random primers. The resulting cDNA was purified using Agencourt AMPure® XP beads (Beckman Coulter). Then, cDNA was subjected to end-repair and phosphorylation and subsequent purification was performed using Agencourt AMPure® XP beads (Beckman Coulter). These repaired cDNA fragments were 3′-adenylated producing cDNA fragments with a single ‘A’ base overhung at their 3′-ends for subsequent adapter-ligation. Illumina adapters containing indexing tags were ligated to the ends of these 3′-adenylated cDNA fragments followed by two purification steps using Agencourt AMPure® XP beads (Beckman Coulter). Ten rounds of PCR amplification were performed to enrich the adapter-modified cDNA library using primers complementary to the ends of the adapters. The PCR products were purified using Agencourt AMPure® XP beads (Beckman Coulter) and size-selected (200 ± 25 bp) on a 2% agarose Invitrogen E-Gel (Thermo Scientific). Libraries were then checked on an Agilent Technologies 2100 Bioanalyzer using the Agilent High Sensitivity DNA Kit and quantified by quantitative PCR with the QPCR NGS Library Quantification kit (Agilent Technologies). After quantification, three different tagged cDNA libraries were pooled in equal ratios and a final qPCR check was performed post-pooling. Each library pool was used for 2 × 100 bp paired-end sequencing on one lane of the Illumina HiSeq2000 with a TruSeq SBS v3-HS Kit (Illumina). After sequencing, the samples were demultiplexed and the indexed adapter sequences were trimmed using the CASAVA v1.8.2 software (Illumina). The Bos taurus reference transcriptome was downloaded from Ensembl (version 63, Bos_taurus.Btau_4.0.63.cdna.all.fa). To align the reads back to the assembled reference transcriptome the BWA programme (version 0.5.9-r16) was used. Reads were mapped for each sample separately to the assembled transcriptome. The BWA default values were used for mapping. Properly paired reads with a mapping quality of at least 30 (−q = 30) were extracted from the resulting BAM file using SAMtools for further analyses. Properly paired is defined as both left and right reads mapped in opposite directions on the same transcript at a distance compatible with the expected mean size of the fragments (<500 bp). Custom scripts were developed to identify paired-reads mapping to single locations and with the expected distance. Read pairs mapping to separate chromosomes were discarded for the present study. The number of paired-reads uniquely aligning to transcribed regions of each transcript was calculated for all genes in the annotated transcriptome. The transcript paired-read count was calculated as the number of unique paired-reads that aligned within the exons of each transcript, based on the coordinates of mapped reads.

2.3. SNP identification

BWA was also used to map reads onto the bovine genome assembly UMD3.1 version. Only reliable properly paired BWA mapped reads were considered for SNP calling. Indels were not considered because alternative splicing impedes reliable indel discovery. SNPs were called using the SAMtools software package. Genotype likelihoods were computed using the SAMtools utilities and variable positions in the aligned reads compared with the reference were called with the BCFtools utilities. SNPs were called only for positions with a minimal mapping quality (−Q) of 30, a minimum coverage (−d) of 4 and a maximum read depth (−D) of 10,000,000.

2.4. Data and statistical analyses

Information on bovine genes were obtained from the Ensembl Genome Browser (version 84) using the BioMart tool. Functional annotation analysis of genes sharing putative bidirectional promoters was done using the FATIGO tool of the online software suite Babelomics. Genes were assigned their Ensembl identities as input for Babelomics. Only one copy of each gene was used. Default parameter settings were used for the analysis. Statistical assessment of annotation differences between the two sets of sequences (SNP-containing genes versus all the other bovine genes) was carried out for each FATIGO analysis, using the Fisher Exact Test with correction for multiple testing. Prediction of putative TFBSs was performed using the bioinformatics tools MATCH. It uses a library of positional weight matrices from TRANSFAC 6.0 public version. A cut-off selection of 0.7 and 0.75 was applied for matrix and core similarities, respectively. Spearman's rank correlation coefficients were calculated for the correlation studies using the statistical R package.

2.5. Bidirectional promoter validation

Each putative bidirectional promoter region was PCR amplified from bovine genomic DNA using a pair of primers. The genomic DNA sequences were retrieved from the UCSC Genome Database. Sequence repeats were masked using RepeatMasker and Primer3 was then used to design primer pairs to amplify each putative bidirectional promoter region. PCR primers were synthesized by Eurofins MWG Operon. Restriction enzyme cutting sites for EcoRI (GAATTC) and BamHI (GGATCC) were artificially added into the PCR primers to facilitate directional cloning. Primer sequences are presented in Supplementary Table S1. Polymerase chain reactions were performed in 50 μl using 120 ng genomic DNA, 1 × PCR buffer, 1.5 mM MgCl2, 0.2 mM of each dNTP, 0.3 μM of each primer and 1U AccuPrime GC-rich Taq DNA polymerase (Invitrogen). The following cycling protocol was used: 95 °C for 15 min, followed by 35 cycles of 95 °C for 1 min, 52 °C for 1 min and 72 °C for 1 min, and a final extension step at 72 °C for 10 min. To check the quality of the amplification, 5 μl of PCR products were then analysed by gel electrophoresis with a 1% agarose gel. The PCR products were purified using the Qiagen MinElute Gel Extraction kit (Qiagen), digested with EcoRI and BamHI (New England Biolabs), purified with Qiagen Reaction Cleanup kit, quantified and then ligated using T4 DNA ligase (New England Biolabs) to the pBiP0 vector at a 3:1 ratio. The vector was previously digested with EcoRI/BamHI and dephosphorylated with alkaline phosphatase (New England Biolabs). The ligation products were transformed into Escherichia coli DH5α competent cells (Invitrogen). Ten clones were then amplified and plasmids were purified with the Pureyield Plasmid Miniprep System DNA purification kit (Promega). Positive clones were identified by digesting plasmid DNA with EcoRI and BamHI. Digestion products were then visualized after gel electrophoresis on a 1% agarose gel. One clone carrying the right plasmid construct was then amplified and plasmids were purified with the Macherey-Nagel Midi Endotoxin-free plasmid DNA purification kit. The plasmids were then sequenced bidirectionally using BigDye Terminator v3.1 Cycle Sequencing kit (Applied Biosystems) and the primers used for the PCR amplification. After purification on Sephadex G50 superfine column (GE Healthcare), the sequencing reaction products were analyzed using 3130 Genetic Analyzer sequencer (Applied Biosystems). One clone containing for each cloning orientation the right plasmid construct was then chosen. Murine C2C12 myoblastic cells (ATCC CRL-1772) were grown in Dulbecco’s Modified Eagle’s Medium with Glutamax-I (4.5 g/l glucose, Invitrogen) supplemented with 1% penicillin/streptomycin and 20% heat-inactivated fetal bovine serum. Cells were cultured at 37 °C in a humidified atmosphere of 5% CO2 in air. C2C12 (∼25,000 cells/well) were seeded into 24-well plates 1 day before transfection. Transfections were performed on 40% confluent C2C12 cells with TurboFect transfection reagent (Fermentas) using 2 μl of reagent per μg of DNA plasmid, during 4 h according to the manufacturer’s protocol. The cells were transfected with 0.8 μg of the vectors carrying unidirectional pCMV1/pCMV2 promoters or putative bidirectional promoters or with pBI-CMV2, pBiP-DsRed, pBiP0 control vectors. All transient transfection experiments were done in triplicate and repeated three times. Thirty-six hours after transfection cells were washed two times with PBS 1×, and then viewed with an Axio Observer Z1 microscope (Zeiss) and images were acquired using a CoolSNAP HQ2 camera (Photometrics) driven by the Axiovision imaging system software. Analyses of micrographies were performed using the AxioVision 4.7.2 software (Zeiss). Filter sets 38HE [excitation BP470/40(HE) and emission BP525/50(HE)] and 43HE [excitation BP545/30(HE) and emission BP620/60(HE)] were used to visualize AcGFP1 and DsRed monomer signals, respectively. Each reporter gene assay was performed in triplicate and with three independent transfection experiments.

2.6. Whole-genome SNP genotyping

Muscle samples were obtained and genomic DNA extracted using the DNA Midi kit (Qiagen). Quality of DNA was checked using a Nanodrop ND-100 spectrophotometer (Thermo Scientific) and quantity was estimated with Quant-iT Picogreen dsDNA kit (Life Technologies) on an ABI 7900HT (Life Technologies). All DNA samples were standardized to 50 ng/μL. DNA samples were genotyped on the BovineSNP50 Genotyping Beadchips (Illumina) at LABOGENA (Jouy-en-Josas, France) using the standard operating procedures recommended by the manufacturer.

3. Results and discussion

3.1. Bovine muscle whole-transcriptome analysis

To identify bidirectional promoters active in bovine muscle, we first obtained a global view of the bovine LT transcriptome using paired-end RNA sequencing (RNA-Seq) from twenty Limousin bull calves. We used already-published data from nine Limousin animals, and poly(A)-enriched mRNA from 11 new Limousin bull calves were retrotranscribed and subjected to high-throughput sequencing. The 11 RNA-Seq libraries were barcode-tagged and sequenced on five lanes (3 libraries per lane) of an Illumina HiSeq2000 sequencer. Sequencing of cDNA libraries generated a total of 1,303 million raw paired-end reads with a length of 100 bases, resulting in a total of 130 gigabases. The reads were then de-multiplexed to assign reads to each sequenced sample according to its barcode index. Approximately 17–48 million paired-end reads were obtained for each library. Reads from each sample were then mapped back to the bovine reference transcriptome. We used the set of Bos taurus Ensembl transcripts v63 RefSeq genes as the reference transcriptome. This set contains transcripts for 22,915 known or novel genes but also pseudogenes. Based on mappings done using the Burrows–Wheeler Aligner (BWA) programme, 67–81% of the mapped reads were aligned properly paired (Supplementary Table S2). A total of 22,025 transcripts (18,055 genes) were identified, with at least one paired-end read in all samples analysed. Similar RNA-Seq read mapping rate and the number of genes identified were obtained in other RNA-Seq bovine studies. For example, Chitwood et al. found that ∼69% of the RNA-Seq reads they generated while sequencing the transcriptomes of single bovine blastocysts mapped uniquely onto the bovine genome. Raw gene expression levels were estimated by measuring the normalized count number for each transcript (number of reads per transcript divided by the total number of mapped reads, for each sample). The five most frequent transcripts are shown in Table 1. These five genes represented nearly 17% of all sequencing reads mapped to the bovine genome and are associated with muscle cell metabolism or structure. These results were consistent with the physiological role of genes expected in the surveyed tissue. Not all genes were expressed among all samples. A total of 14,700 transcripts corresponding to 12,314 different genes were detected in all 20 samples, while 840 different genes (∼4%) were only expressed in one sample at a very low level (with on average less than 2 reads).
Table 1

Top five transcripts with the most assigned reads

Ensembl gene IDEnsembl transcript IDDescriptionGene symbol% total number of reads
ENSBTAG00000026986ENSBTAT00000061449TitinTTN4.46
ENSBTAG00000018204ENSBTAT00000009327Myosin heavy chain 1MYH14.30
ENSBTAG00000043561ENSBTAT00000060569Cytochrome c oxidase subunit ICOX13.61
ENSBTAG00000007090ENSBTAT00000012797Myosin heavy chain 2MYH22.36
ENSBTAG00000004965ENSBTAT00000006534Nucleoporin 133 kDaNUP1331.99
Top five transcripts with the most assigned reads

3.2. Identification of putative bidirectional promoters shared by genes co-expressed in bovine LT

In order to identify putative bidirectional promoters active in bovine muscle, the chromosomal locations of all the bovine genes were retreived from the Ensembl database (version 84). The Ensembl gene start sites were used as the TSS. Pair of genes in a head-to-head arrangement and separated by less than 1 kb between their TSSs were identified and the region between paired-genes were considered as putative bidirectional promoters. Among the 24,616 bovine genes annotated on the UMD3.1 genome assembly, we found 563 putative bidirectional promoters (Supplementary Table S3). The number of bovine bidirectional promoters we predicted is very similar to the one calculated by Xu et al., who found 574 putative bidirectional promoters in cattle. To overcome confusion during the subsequent analyses, we then used four filters (Supplementary Fig. S1). First, we removed all gene pairs for which at least one gene had paralogous sequences identified in the Ensembl database and predicted 192 bidirectional promoters. Second, we retained only gene pairs associated with these putative BiPs and for which we could detect the expression for both paired-genes in our RNA-Seq dataset, leaving 171 cases. Third, to avoid quantification problems relative to transcript isoforms, genes with known alternative transcripts were excluded from the study. Finally, for validation purposes, we only selected bidirectional promoters between protein-coding gene pairs. In total, 120 BiPs remained, each with both genes expressed in muscle (Supplementary Table S4).

3.3. Characteristics of genes shared by bidirectional promoters active in bovine LT

To characterise functions that are significantly enriched in genes sharing bidirectional promoters active in bovine muscle, a gene ontology (GO) analysis was performed. All the 120 co-expressed gene pairs could be analysed for functional classifications and were assigned to one or more GO annotations. GO term analysis showed a significant enrichment of specific GO terms when comparing the annotations of gene pairs associated with bidirectional promoters active in muscle against all transcripts from the bovine reference transcriptome. A summary of the classification of these genes into major biological process, cell component and molecular function categories is presented in Supplementary Table S5. Genes encoding proteins found in mitochondria, ribonucleoprotein complexes or organelle inner membrane or involved in chaperone binding and cellular response to DNA damage stimulus are significantly over-represented. No significant enrichment in KEGG terms/pathways was found. These results are in agreement with previous studies.,,, The chromosomal location of all bovine genes, of all the predicted bidirectional promoters and of the 120 BiPs active in muscle is presented in Fig. 1. Interestingly, the chromosomal distribution of these 120 bidirectional promoters does not reflect the gene content of the chromosomes (Mann–Whitney U-test, P-value <0.05). For example, we found only one (0.83%) predicted BiP active in muscle on BTA4, whereas this chromosome harbours 855 (3.48%) bovine genes. BTA11 and BTA19 have the highest number of bidirectional promoters active in muscle (both 11.67%) whereas these chromosomes have only ∼4 and ∼5%, respectively, of all bovine genes. The chromosomal location of the 120 BiPs active in muscle rather follows the chromosomal distribution of all bidirectional promoters predicted in cattle (Pearson correlation coefficient ρ = 0.802). For example, we found 0.89 and 0.83% predicted bidirectional promoters and BiPs active in muscle, respectively, on chromosome 4.
Figure 1

Chromosomal distribution of the 120 putative bovine bidirectional promoters active in muscle.

Chromosomal distribution of the 120 putative bovine bidirectional promoters active in muscle.

3.4. Experimental validation of some predicted bidirectional promoters

To assess the promoter activity of the predicted bidirectional promoters, some bidirectional promoters were selected and cloned into the promoterless pBiP0 reporter vector we recently constructed. This plasmid is a dual reporter vector with the Green Fluorescent Protein gene from the Aequorea coerulescens jellyfish (AcGFP1) inserted in a head-to-head configuration relative to a gene expressing a monomer mutant form of the Red Fluorescent Protein gene from the Discosoma striata reef coral (DsRed monomer). Both fluorescent protein genes are devoid of promoter and separated by a short multiple cloning region. The sequences of predicted bidirectional promoters were downloaded from the UCSC database. To ensure that all regulatory elements were included each sequence was extended until the first ATG (ATG not included) of both paired genes. Among the 120 promoter sequences 18 contain cutting sites for EcoRI or BamHI and were eliminated, since these two restriction enzymes are used for cloning into the dual reporter vector. We previously validated one of these bidirectional promoters. Sixteen new bidirectional promoters were then randomly selected for validation. Information relative to these 16 predicted bidirectional promoters is indicated in Table 2. The selected promoter regions were amplified by PCR and then cloned into the pBiP0 vector. Inserts were checked by sequencing.
Table 2

Information on the 16 tested predicted bovine bidirectional promoters

BiP #Ensembl gene IDBTAGene startGene endStrandDescriptionGene nameDistance
2ENSBTAG00000016337233283559632849025−1Acyl-CoA thioesterase 13THEM2293
ENSBTAG000000003652332849318328614451tyrosyl-DNA phosphodiesterase 2TDP2
12ENSBTAG00000014646177439283174397072−1Mitotic spindle organizing protein 2BMZT2143
ENSBTAG000000021301774397215744128071sphingomyelin phosphodiesterase 4, neutral membrane (neutral sphingomyelinase-3)SMPD4
19ENSBTAG000000035502910321681056464−1Chromosome 11 open reading frame 54C11orf54163
ENSBTAG0000000354529105662710625801TATA box binding protein (TBP)-associated factor, RNA polymerase I, D, 41kDaTAF1D
24ENSBTAG00000004295119301181593029730−1NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 8, 19kDaNDUFA8144
ENSBTAG000000042961193029874930643571MORN repeat containing 5MORN5
27ENSBTAG000000049918100163708100212829−1Inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase complex-associated proteinIKBKAP324
ENSBTAG0000000499681002131531002193461family with sequence similarity 206, member AFAM206A
31ENSBTAG00000006195294469168244693901−1Chromosome 29 open reading frame, human C11orf68C29H11orf68270
ENSBTAG000000061992944694171446962951DR1-associated protein 1 (negative cofactor 2 alpha)DRAP1
61ENSBTAG00000023018284428304144397693−1Poly(ADP-ribose) glycohydrolasePARG121
ENSBTAG000000116942844397814444185821translocase of inner mitochondrial membrane 23 homolog (yeast)TIMM23
62ENSBTAG00000011930176344884163466323−1DEAD (Asp-Glu-Ala-Asp) box polypeptide 54DDX54327
ENSBTAG000000119371763466650634721651chromosome 17 open reading frame, human C12orf52 (C17H12orf52)RITA
66ENSBTAG0000001258628643897986449372−1Heat shock 60kDa protein 1 (chaperonin)HSPD1153
ENSBTAG00000012589286449525864515641heat shock 10kDa protein 1 (chaperonin 10)HSPE1
81ENSBTAG000000156671123242932335178−1Transmembrane protein 127TMEM127204
ENSBTAG0000001565911233538223423081cytosolic iron-sulfur protein assembly 1CIAO1
85ENSBTAG00000016559294314685643149460−1Nudix (nucleoside diphosphate linked moiety X)-type motif 22NUDT22150
ENSBTAG000000165552943149610431521431tRNA phosphotransferase 1TRPT1
86ENSBTAG000000165582513545531360122−1splA/ryanodine receptor domain and SOCS box containing 3SPSB3457
ENSBTAG0000001656125136057913658021nucleotide binding protein 2NUBP2
100ENSBTAG00000025028124776804947781910−1Mitotic spindle organizing protein 1MZT1205
ENSBTAG000000198861247782115478098791bora, aurora kinase A activatorBORA
104ENSBTAG000000215473103353700103422206−1WD repeat domain 65WDR65399
ENSBTAG0000002154431034226051034295201EBNA1 binding protein 2EBNA1BP2
115ENSBTAG00000016679174126275441307126−1Electron-transferring-flavoprotein dehydrogenaseETFDH252
ENSBTAG000000334861741307378413106011chromosome 17 open reading frame, human C4orf46C17H4orf46
118ENSBTAG00000000489111018610210189962−1WD repeat domain 54WDR5490
ENSBTAG000000402261110190052101959321chromosome 11 open reading frame, human C2orf81C11H2orf81

Distance, distance between transcription start sites, in bp.

Information on the 16 tested predicted bovine bidirectional promoters Distance, distance between transcription start sites, in bp. The promoter activities of these sequences were then assayed after transient transfections in murine myoblastic cells (C2C12) as no bovine muscle cell line was available. The pBI-CMV2 and pBiP-DsRed vectors were used as positive controls, whereas the empty pBiP0 vector was used as a negative control to verify that no reporter gene expression could be observed without promoter. The expression of the reporter genes was assessed by fluorescence microscopy 36 h after transfection to evaluate transfection efficiencies and check reporter activities. No significant reporter gene expression was observed with the empty pBiP0 transfected cells and positive signals were therefore attributed to the inserted sequences. Fluorescence microscopy analyses showed that all the 16 selected predicted bidirectional promoters seem to drive bidirectional expression in C2C12 cells, although variability in the reporter signals was observed (Fig. 2; Supplementary Fig. S2). These good results suggest that the procedure we developed to detect bovine bidirectional promoters active in muscle is accurate and has a low false-positive rate.
Figure 2

Fluorescence microscopy analysis of bidirectional promoter BiP100 in C2C12 cells. Images were taken at x20 magnification, 36h after transfection.

Fluorescence microscopy analysis of bidirectional promoter BiP100 in C2C12 cells. Images were taken at x20 magnification, 36h after transfection.

3.5. Co-expression analysis of genes shared by bidirectional promoters active in bovine muscle

We have in our study the expression levels of genes expressed in LT for twenty samples. It is therefore possible to analyse the co-expression of each pair of genes sharing a bidirectional promoter. Using the normalized expression levels, we calculated the Spearman's rank correlation coefficient for each gene pair. We found that 34% (41/120) of the genes shared by bidirectional promoters are co-expressed (P-value < 0.05%) (Supplementary Table S6). In order to evaluate whether gene pairs sharing bidirectional promoters were significantly enriched for co-expressed genes, we then performed a similar co-expression analysis for pairs of non-overlapping neighbouring genes in a tail–tail orientation (convergent configuration) or in the same orientation (unidirectional configuration). We identified these two different types of gene pairs using the chromosomal locations of all the bovine genes from the Ensembl database and using the same filters as for the identification of gene pairs sharing BiPs (Supplementary Fig. S1). We found 218 and 163 gene pairs in the convergent and unidirectional configurations (Supplementary Tables S7 and S8), respectively. The Spearman's rank correlation coefficient for each gene pair was then calculated using the normalized expressed levels measured in the 20 samples. Thirty-five percent (76/218) and 30.1% (49/163) of the gene pairs in the convergent and unidirectional configurations, respectively, were co-expressed (Supplementary Tables S9 and S10). The percentage of co-expressed genes is comparable for the bidirectional, convergent and unidirectional configurations, suggesting that co-expression is most likely mediated by shared chromatin environment rather than only by specific regulatory regions. Similar findings were previously found in other species, including yeast. In addition, on average the expression levels in muscle of gene pairs sharing a BiP were more correlated compared with pairs of neighbouring genes in a unidirectional or convergent configuration (Z-score test, P-value <0.05%). Using normalized gene expression counts previously measured in 18 bovine tissues and for three different cows,, we calculated the Pearson correlation coefficient between the expression levels of the 120 gene pairs sharing a BiP, the 163 gene pairs in an unidirectional configuration and the 218 gene pairs in a convergent configuration. We then compared the average expression correlation between these three types of gene pairs and found no significant differences between gene pairs in unidirectional and convergent configurations, but identified significant differences between gene pairs sharing a BiP and the other two types of gene pairs (Z-score test, P-value <0.05%), The higher correlation between expression levels of genes sharing BiPs has previously been described and is a common feature of bidirectional promoters.

3.6. The activity of the bidirectional promoters is highly variable

Not all the 240 genes sharing the 120 active bidirectional promoters were expressed in all the twenty samples. Among these genes, 227 genes (corresponding to 114 bidirectional promoters) were expressed in all 20 samples; whereas 13 genes (from 13 different bidirectional promoters) were expressed in all but one sample. In order to evaluate inter-individual variability of the activity of the bidirectional promoters active in muscle, we calculated for each sample the expression ratio of each gene pair (using normalized gene expression levels). We considered that the value of this ratio is a measure of the activity of the bidirectional promoter. We then calculated the coefficient of variation for the activity of each bidirectional promoter. We could calculate the activity (paired-gene ratio) only for 107 bidirectional promoters, as some genes were not expressed in some of the 20 samples. Interestingly there were significant differences when we compared the activity variability of these 107 bidirectional promoters. Thirteen bidirectional promoters had a thightly controled activity (CV <50%) whereas 41 had a very variable activity (CV >100%; Supplementary Table S11). These results suggest a strong or a loosened gene expression regulation of the gene pairs depending on the shared bidirectional promoter. This highly variable activity could be due to polymorphisms within the regulatory elements located within the bidirectional promoter regions.

3.7. The expression level of some genes is highly correlated with the activity of some bidirectional promoters active in muscle

Thanks to the availability of the expression levels of genes expressed in LT for 20 samples, it is also possible to analyse the effect of these genes on the activity of the identified bidirectional promoters. Using the normalized expression levels, we calculated the Spearman's rank correlation coefficient for each bidirectional promoter ratio with each gene expressed within the 20 muscle samples. We found after correction for multiple testing 77 significant correlations (P < 1.9 × 10−8), between 77 different genes and the activity of 12 different bidirectional promoters (Supplementary Table S12). We found 74 positive correlations and only 3 negative correlations (anti-correlations). Among these 77 correlations, we could find 9 significant correlations between the activity of a bidirectional promoter and the expression level of one of the two paired-genes sharing the bidirectional promoter. To characterize the functions of the genes showing expression levels highly correlated with the activity of these 12 bidirectional promoters, we performed a GO annotation. No significant enrichment in GO terms and KEGG terms/pathways was found. Interestingly, with the exception of the paired-genes sharing a bidirectional promoter, no highly correlated genes were in close vicinity of the bidirectional promoter, indicating that the relationship between these genes and the bidirectional promoters are trans-effects. In addition, no genes are associated with more than a single bidirectional promoter, suggesting that the regulation of bidirectional promoters is highly variable and no master regulator genes may be at play. We have to be cautious with our results, as a significant correlation does not relate necessarily to an effect of the gene showing a high correlation on the activity of the bidirectional promoter. Indeed, the paired-genes sharing the bidirectional promoter could themselves have an impact on the regulation of the expression of the gene, for which the expression level shows a strong correlation. However, some of the genes showing a strong correlation with the activity of some of the bidirectional promoters encode for proteins involved in gene regulation, such as histones H2a.1 and H3.1 and transcription factors Hnf1a and Zfn133 and therefore might have direct effect on the activity of these bidirectional promoters. However, no binding sites for Hfn1a or Zfn133 were predicted within the sequence of bidirectional promoter #118 or #81, respectively. More work is needed to establish the directionality of the relationships detected with the correlation and the true involvement of identifed genes with regulatory functions.

3.8. Some genetic variants might alter the activity of the bidirectional promoters active in muscle

After assessing if some genes expressed in muscle influence the activity of the identified bidirectional promoters, we performed a similar analysis to this time identify DNA polymorphisms altering the activity of these bidirectional promoters. DNA samples from the twenty animals were genotyped with the Illumina BovineSNP50 Genotyping Beadchips. Among the 54,001 SNPs included in the chip, only polymorphic single nucleotide polymorphisms (SNPs) mapping to bovine autosomes on the bovine genome assembly UMD3.1 version, genotyped on ≥90% of the individuals, without a low minor allele frequency (MAF ≥0.05) and in Hardy–Weinberg Equilibrium (P-value <0.001) were retained for further analysis. 21,470 informative SNPs were left after all the filtering steps. We calculated the Spearman's rank correlation coefficient for each bidirectional promoter ratio with the genotypes of each SNP. We found after correction for multiple testing 27 significant correlations (11 and 16 positive and negative correlations respectively, P-value <1.94 × 10−8), between 27 different SNPs and the activity of 4 different bidirectional promoters. Annotation of these variants indicates that 20 SNPs are intergenic while the remaining 7 SNPs are within genes but none introduce missense changes in the corresponding proteins or are located in putative regulatory regions (Table 3). GO annotation of these seven genes did not reveal any significant enrichment. None of the intergenic or intronic SNPs were located within known genes encoding bovine long non-coding RNAs., The 27 SNPs were neither in close vicinity of the bidirectional promoter and they are all associated with only one bidirectional promoter. This suggest, as seen with the genes whose expression were significantly correlated with the activity of some bidirectional promoters that the effect of these SNPs are trans-effects and there might not be master regulator genes. In addition, annotation of these variants indicates that most SNPs are intergenic and it is unlikely that these 27 SNPs have any direct effect on the activity of the bidirectional promoter.
Table 3

List of SNPs from the Illumina BovineSNP50 Beadchip with genotypes highly correlated to the activity of some bidirectional promoters

SNP IDBTAPositionAllelesConsequenceEnsembl gene IDEnsembl transcript IDGene symbolBiP #RhoP-value
Hapmap42177-BTA-31679161,125,554A/Gintergenic_variant1180.991.78E-24
Hapmap42933-BTA-40454185,731,453A/Gintergenic_variant101−0.991.95E-20
ARS-BFGL-NGS-1085142 105,521,127A/Gintergenic_variant118−0.991.78E-24
Hapmap54229-rs29017613430,200,987A/Gintergenic_variant1180.991.78E-24
ARS-BFGL-NGS-35869521,707,809A/Gintergenic_variant118−0.991.78E-24
ARS-BFGL-NGS-93953522,737,219A/Gintergenic_variant1180.991.78E-24
Hapmap49852-BTA-107572629,790,005A/Gintergenic_variant104−0.991.78E-24
Hapmap25168-BTC-033275633,713,818A/Gintergenic_variant1040.991.78E-24
Hapmap36813- SCAFFOLD50174_9004633,768,128A/Gintergenic_variant1040.991.78E-24
Hapmap23923-BTC-066021639,721,727A/Cintergenic_variant104−0.991.78E-24
Hapmap36286- SCAFFOLD260285_242657111,161,115A/Cintergenic_variant104−0.991.78E-24
ARS-BFGL-NGS-990318101,251,865A/Cintergenic_variant1180.991.78E-24
ARS-BFGL-NGS-60678985,454,475A/Gintergenic_variant1040.991.78E-24
ARS-BFGL-NGS-4488106,476,252A/Gintron_variantENSBTAG00000024878ENSBTAT00000048409ANKRD31104−10
ARS-BFGL-NGS-58191010,428,184A/Gintron_variantENSBTAG00000025853ENSBTAT00000017647HOMER11180.991.78E-24
ARS-BFGL-NGS-273411191,438,914A/Gintergenic_variant118−0,991.78E-24
ARS-BFGL-NGS-1158891279,814,959A/Cintron_variantENSBTAG00000010395ENSBTAT00000013726DOCK9640.991.95E-20
ARS-BFGL-NGS-235091470,636,087A/Gintergenic_variant118−0.991.78E-24
ARS-BFGL-NGS-8591473,919,098A/Gintergenic_variant104−0.991.78E-24
ARS-BFGL-NGS-998021674,999,809A/C5_prime_UTR_variantENSBTAG00000010850ENSBTAT00000014402SERTAD41040.991.78E-24
Hapmap52466-rs29015577193,617,183A/Cintergenic_variant118−0.991.78E-24
Hapmap47625-BTA-447261921,878,635A/Gintergenic_variant104−0.991.78E-24
ARS-BFGL-NGS-141871925,165,920A/Gintron_variantENSBTAG00000014806ENSBTAT00000019703ATP2A3118−0.991.78E-24
ARS-BFGL-NGS-1092912313,517,193T/Aintron_variantENSBTAG00000027197ENSBTAT00000064399KIF664−0.991.95E-20
ARS-BFGL-NGS-579582623,000,155A/Gintergenic_variant104−0.991.78E-24
ARS-BFGL-NGS-420332814,243,381C/Gintergenic_variant118−0.991.78E-24
ARS-BFGL-NGS-770282826,689,199A/Gintron_variantENSBTAG00000005666ENSBTAT00000007444LRRC201180.991.78E-24
List of SNPs from the Illumina BovineSNP50 Beadchip with genotypes highly correlated to the activity of some bidirectional promoters In order to identify SNPs altering directly the activity of some of the bidirectional promoters active in muscle, we performed another correlation analysis between the activity of the bidirectional promoters and this time the genotypes from coding SNPs identified with the RNA-Seq data. Mapped paired-end reads from each sample were used with the SAMtools package for variant calling. Using stringent parameters (e.g. minimum coverage of 8 reads and mapping quality of 20) we detected 8,121 different billalelic SNPs. Among the SNPs identified, similarly to the SNPs from the Illumina BovineSNP50 Genotyping Beadchip, we used only polymorphisms mapping to bovine autosomes, genotyped on ≥90% of the individuals, without a low minor allele frequency (MAF ≥0.05) and in Hardy–Weinberg Equilibrium (P-value <0.001). A total of 8,116 informative SNPs were left after these filtering steps. We calculated the Spearman's rank correlation coefficient for each bidirectional promoter ratio with the genotypes of each SNP. We found after correction for multiple testing 30 significant correlations (P-value  <5.13 × 10−8), between 30 different SNPs (in 27 different genes) and the activity of 4 different bidirectional promoters (Table 4). Annotation of these variants indicates that all these SNPs are within protein-coding genes. GO annotation of the 27 genes containing the 30 coding SNPs (cSNPs) did not reveal any significant enrichment. None of the 30 cSNPs introduce missense changes in the corresponding proteins; however, 11 cSNPs are located in the upstream or downstream regions.
Table 4

List of coding SNPs with genotypes highly correlated to the activity of some bidirectional promoters

cSNP IDBTAPositiondbSNP IDAllelesConsequenceEnsembl gene IDEnsembl transcript IDGene symbolBiP #RhoP-value
ENSBTAT00000002157_515961,291,956G/Aintron_variantENSBTAG00000001644ENSBTAT00000002157MDN1104−0.991.78E-24
ENSBTAT00000003078_1061582,337,162G/Aintron_variantENSBTAG00000002381ENSBTAT00000003078ZDHHC51180.991.78E-24
ENSBTAT00000003826_4792122,218,555C/Tupstream_gene_variantENSBTAG00000002939ENSBTAT00000003826FURIN104−0.991.78E-24
ENSBTAT00000005923_585252,977,744T/Cupstream_gene_variantENSBTAG00000004509ENSBTAT00000005923SLX4118−0.991.78E-24
ENSBTAT00000008129_2401619,500,611G/Cintron_variantENSBTAG00000006186ENSBTAT00000008129KCTD31180.991.78E-24
ENSBTAT00000008507_1231582,265,194T/Cintron_variantENSBTAG00000006493ENSBTAT00000008507CLP1118−0.991.78E-24
ENSBTAT00000008728_7471218,197,212C/Tintron_variantENSBTAG00000006640ENSBTAT00000008728RB11180.991.78E-24
ENSBTAT00000018163_6652166,866,347T/Gupstream_gene_variantENSBTAG00000013666ENSBTAT00000018163SLC25A29104−0.991.78E-24
ENSBTAT00000020493_130762,925,278A/Gintron_variantENSBTAG00000015419ENSBTAT00000020493ARHGEF3764−0.991.95E-20
ENSBTAT00000025492_4021955,675,666T/Cintron_variantENSBTAG00000019153ENSBTAT00000025492JMJD624−16.38E-137
ENSBTAT00000028060_5202116,720,681C/Tintron_variantENSBTAG00000037383ENSBTAT00000028060AKAP13118−0.991.78E-24
ENSBTAT00000028277_3651576,757,181T/Gintron_variantENSBTAG00000021223ENSBTAT00000028277CRY2118−0.991.78E-24
ENSBTAT00000028656_5691475,708,914T/Gintron_variantENSBTAG00000039968ENSBTAT00000028656TMEM55A1040.991.78E-24
ENSBTAT00000028865_1032539,432,034G/Aintron_variantENSBTAG00000037400ENSBTAT00000028865TNRC181180.991.78E-24
ENSBTAT00000029400_1951773,658,214C/Tupstream_gene_variantENSBTAG00000021656ENSBTAT00000029400SPECC1L1040.991.78E-24
ENSBTAT00000029403_2031862,789,937A/Gintron_variantENSBTAG00000030393ENSBTAT00000029403RDH13104−0.991.78E-24
ENSBTAT00000033704_2336113,701,647rs442770236G/Tupstream_gene_variantENSBTAG00000004316ENSBTAT00000033704BOD1L1040.991.78E-24
ENSBTAT00000035362_4662010,305,279G/Adownstream_gene_variantENSBTAG00000027980ENSBTAT00000035362TAF9118−0.991.78E-24
ENSBTAT00000037465_1382510,053,612T/Cintron_variantENSBTAG00000026375ENSBTAT00000037465RMI2118−0.991.78E-24
ENSBTAT00000043122_7021086,384,804A/Gupstream_gene_variantENSBTAG00000020379ENSBTAT00000043122AREL11180.991.78E-24
ENSBTAT00000043260_9001080,246,761A/Gupstream_gene_variantENSBTAG00000014334ENSBTAT00000043260ZFYVE261180.991.78E-24
ENSBTAT00000043778_1651582,285,586T/Cintron_variantENSBTAG00000002411ENSBTAT00000043778CTNND11180.991.78E-24
ENSBTAT00000043778_4921582,284,426G/Aintron_variantENSBTAG00000002411ENSBTAT00000043778CTNND1118−0.991.78E-24
ENSBTAT00000043778_7601582,284,694C/Tintron_variantENSBTAG00000002411ENSBTAT00000043778CTNND1118−0.991.78E-24
ENSBTAT00000054096_5098103,426,705G/Aupstream_gene_variantENSBTAG00000038335ENSBTAT00000054096IGBP11180.991.78E-24
ENSBTAT00000056520_3522327,378,739C/Tupstream_gene_variantENSBTAG00000039620ENSBTAT00000056520MGC1515861040.991.78E-24
ENSBTAT00000061199_1851420,989,316A/Gupstream_gene_variantENSBTAG00000044106ENSBTAT00000061199SPIDR118−0.991.78E-24
ENSBTAT00000061451_269233,434,098G/Aintron_variantENSBTAG00000021237ENSBTAT00000061451DST118−0.991.78E-24
ENSBTAT00000061451_704233,432,111G/Tintron_variantENSBTAG00000021237ENSBTAT00000061451DST1180.991.78E-24
ENSBTAT00000034195_104853,736,890G/Aintron_variantENSBTAG00000017734ENSBTAT00000034195VPS13A1180.991.78E-24
List of coding SNPs with genotypes highly correlated to the activity of some bidirectional promoters We found among these correlations a significant negative correlation between a cSNP within Taf9, a gene encoding the RNA polymerase II, TATA box binding protein-associated factor (TAF) 32kD (also known as TafII32 or Taf2g), and the activity of bidirectional promoter #118. TAF9 is involved in the initiation of the transcription by RNA polymerase II. Interestingly, Taf1, another TATA box binding factor has motifs over-represented in human bidirectional promoters and two-hybrid experiments performed in yeast have shown that Taf1 is able to bind to DNA but also to different TATA box binding factors, including Taf9. Many bidirectional promoters lack a TATA box;, however, it has been shown that several TATA-less promoters require TAFs for transcription. The cSNP found in Taf9 is therefore an interesting variant and might point out an effect of Taf9 on the activity of bidirectional promoter #118. The 11 cSNPs might have an impact on the regulation of the corresponding gene, which in turn might affect the activity of some bidirectional promoters. Spearman's rank correlation coefficients were calculated for the genotypes of these 11 cSNPs and the normalized expression level of the corresponding gene (Table 5). Significant correlations were found at P-value <0.05 for 4 cSNPs, suggesting that these cSNPs have an effect on the regulation of the expression of their corresponding gene. The seven other cSNPs, including the variant found in Taf9, might have most probably an effect on the post-transcriptional regulation of the corresponding gene, which consequently alter the amount of proteins which then affect the activity of the bidirectional promoter.
Table 5

Correlation scores between cSNPs and the expression level of the corresponding gene

cSNP IDBTAPositiondbSNP IDAllelesConsequenceEnsembl transcript IDGene symbolRhoP-value
ENSBTAT00000003826_47942122,218,555C/Tupstream_gene_variantENSBTAT00000003826FURIN−0.520.02
ENSBTAT00000018163_6652166,866,347T/Gupstream_gene_variantENSBTAT00000018163SLC25A29−0.490.03
ENSBTAT00000043122_7021086,384,804A/Gupstream_gene_variantENSBTAT00000043122AREL10.490.03
ENSBTAT00000043260_9001080,246,761A/Gupstream_gene_variantENSBTAT00000043260ZFYVE260.460.04
ENSBTAT00000005923_585252,977,744T/Cupstream_gene_variantENSBTAT00000005923SLX4−0.430.06
ENSBTAT00000035362_46672010,305,279G/Adownstream_gene_variantENSBTAT00000035362TAF9−0.430.06
ENSBTAT00000029400_1951773,658,214C/Tupstream_gene_variantENSBTAT00000029400SPECC1L0.350.14
ENSBTAT00000061199_18511420,989,316A/Gupstream_gene_variantENSBTAT00000061199SPIDR−0.290.22
ENSBTAT00000033704_2336113,701,647rs442770236G/Tupstream_gene_variantENSBTAT00000033704BOD1L0.170,46
ENSBTAT00000054096_5098103,426,705G/Aupstream_gene_variantENSBTAT00000054096IGBP10.140.54
ENSBTAT00000056520_35282327,378,739C/Tupstream_gene_variantENSBTAT00000056520MGC151586−0.120.63
Correlation scores between cSNPs and the expression level of the corresponding gene

3.9. Some bidirectional promoter regions are located within quantitative trait loci for meat-related phenotypes

The positions of the 120 bidirectional promoters sharing genes expressed in bovine LT were compared with the position of known quantitative trait loci (QTLs) deposited in the public database AnimalQTLdb. Fifty-four bidirectional promoters were located in 244 different QTL regions (Supplementary Table S14). For example, 7 different bidirectional promoters are found in 8 QTL regions for intramuscular fat; whereas 18 bidirectional promoters are within 18 QTLs for shear force. Karim et al. have shown that a genetic variant located within a bidirectional promoter shared by PLAG1 and CHCHD7 influences bovine stature. It will be interesting to investigate if variants within some of the bidirectional promoter regions we identified affect quantitative traits, including meat quality traits.

4. Conclusion

In this study, we identified 120 bidirectional promoters active in bovine muscle and validated in vitro 16 of them. Using RNA-seq data from 20 muscle samples, we found that the activity of these bidirectional promoters is highly variable and that the expression level of 77 genes is highly correlated with the activity of 12 of the bidirectional promoters. We also identified 57 SNPs associated significantly with the activity of 5 bidirectional promoters. Interestingly, we found that a coding SNP within Taf9 has an effect on the activity of bidirectional promoter #118. To our knowledge, our study is the first analysis in any species of the genetic variability of the activity of bidirectional promoters.

Data availability

The sequencing data have been submitted to the European Nucleotide Archive (accession numbers ERP002220, E-MTAB-2646 and E-MTAB-4625). Click here for additional data file.
  54 in total

1.  The UCSC Genome Browser Database.

Authors:  D Karolchik; R Baertsch; M Diekhans; T S Furey; A Hinrichs; Y T Lu; K M Roskin; M Schwartz; C W Sugnet; D J Thomas; R J Weber; D Haussler; W J Kent
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

2.  Variants modulating the expression of a chromosome domain encompassing PLAG1 influence bovine stature.

Authors:  Latifa Karim; Haruko Takeda; Li Lin; Tom Druet; Juan A C Arias; Denis Baurain; Nadine Cambisano; Stephen R Davis; Frédéric Farnir; Bernard Grisart; Bevin L Harris; Mike D Keehan; Mathew D Littlejohn; Richard J Spelman; Michel Georges; Wouter Coppieters
Journal:  Nat Genet       Date:  2011-04-24       Impact factor: 38.330

3.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-09-08       Impact factor: 6.937

4.  Two genes, COL4A3 and COL4A4 coding for the human alpha3(IV) and alpha4(IV) collagen chains are arranged head-to-head on chromosome 2q36.

Authors:  R Momota; M Sugimoto; T Oohashi; K Kigasawa; H Yoshioka; Y Ninomiya
Journal:  FEBS Lett       Date:  1998-03-06       Impact factor: 4.124

5.  Comparison of the promoters of the mouse (APEX) and human (APE) apurinic endonuclease genes.

Authors:  L Harrison; A G Ascione; Y Takiguchi; D M Wilson; D J Chen; B Demple
Journal:  Mutat Res       Date:  1997-12       Impact factor: 2.433

6.  Gene-based single nucleotide polymorphism discovery in bovine muscle using next-generation transcriptomic sequencing.

Authors:  Anis Djari; Diane Esquerré; Bernard Weiss; Frédéric Martins; Cédric Meersseman; Mekki Boussaha; Christophe Klopp; Dominique Rocha
Journal:  BMC Genomics       Date:  2013-05-07       Impact factor: 3.969

7.  Conservation, rearrangement, and deletion of gene pairs during the evolution of four grass genomes.

Authors:  Nicholas Krom; Wusirika Ramakrishna
Journal:  DNA Res       Date:  2010-09-23       Impact factor: 4.458

8.  BABELOMICS: a suite of web tools for functional annotation and analysis of groups of genes in high-throughput experiments.

Authors:  Fátima Al-Shahrour; Pablo Minguez; Juan M Vaquerizas; Lucía Conde; Joaquín Dopazo
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

9.  Prediction-based approaches to characterize bidirectional promoters in the mammalian genome.

Authors:  Mary Qu Yang; Laura L Elnitski
Journal:  BMC Genomics       Date:  2008       Impact factor: 3.969

10.  Identification of large intergenic non-coding RNAs in bovine muscle using next-generation transcriptomic sequencing.

Authors:  Coline Billerey; Mekki Boussaha; Diane Esquerré; Emmanuelle Rebours; Anis Djari; Cédric Meersseman; Christophe Klopp; Daniel Gautheret; Dominique Rocha
Journal:  BMC Genomics       Date:  2014-06-19       Impact factor: 3.969

View more
  1 in total

1.  Survey of allele specific expression in bovine muscle.

Authors:  Gabriel M Guillocheau; Abdelmajid El Hou; Cédric Meersseman; Diane Esquerré; Emmanuelle Rebours; Rabia Letaief; Morgane Simao; Nicolas Hypolite; Emmanuelle Bourneuf; Nicolas Bruneau; Anne Vaiman; Christy J Vander Jagt; Amanda J Chamberlain; Dominique Rocha
Journal:  Sci Rep       Date:  2019-03-12       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.