| Literature DB >> 35401673 |
Elizabeth M Ross1, Hari Sanjana1, Loan T Nguyen1, YuanYuan Cheng2, Stephen S Moore1, Ben J Hayes1.
Abstract
Fertility is a key driver of economic profitability in cattle production. A number of studies have identified genes associated with fertility using genome wide association studies and differential gene expression analysis; however, the genes themselves are poorly characterized in cattle. Here, we selected 13 genes from the literature which have previously been shown to have strong evidence for an association with fertility in Brahman cattle (Bos taurus indicus) or closely related breeds. We examine the expression variation of the 13 genes that are associated with cattle fertility using RNA-seq, CAGE-seq, and ISO-seq data from 11 different tissue samples from an adult Brahman cow and a Brahman fetus. Tissues examined include blood, liver, lung, kidney, muscle, spleen, ovary, and uterus from the cow and liver and lung from the fetus. The analysis revealed several novel isoforms, including seven from SERPINA7. The use of three expression characterization methodologies (5' cap selected ISO-seq, CAGE-seq, and RNA-seq) allowed the identification of isoforms that varied in their length of 5' and 3' untranslated regions, variation otherwise undetectable (collapsed as degraded RNA) in generic isoform identification pipelines. The combinations of different sequencing technologies allowed us to overcome the limitations of relatively low sequence depth in the ISO-seq data. The lower sequence depth of the ISO-seq data was also reflected in the lack of observed expression of some genes that were observed in the CAGE-seq and RNA-seq data from the same tissue. We identified allele specific expression that was tissue-specific in AR, IGF1, SOX9, STAT3, and TAF9B. Finally, we characterized an exon of TAF9B as partially nested within the neighboring gene phosphoglycerate kinase 1. As this study only examined two animals, even more transcriptional variation may be present in a genetically diverse population. This analysis reveals the large amount of transcriptional variation within mammalian fertility genes and illuminates the fact that the transcriptional landscape cannot be fully characterized using a single technology alone.Entities:
Keywords: RNA sequencing; androgen receptor; cattle; fertility; gene expression; isoforms; transcriptomics; variation
Year: 2022 PMID: 35401673 PMCID: PMC8990236 DOI: 10.3389/fgene.2022.784663
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Genes (identified from literature) as involved in Brahman fertility.
| Full Gene Name (Abbreviate gene name) | Function | Evidence for association with fertility in Brahman |
|---|---|---|
| Androgen receptor ( | Member of nuclear receptor superfamily ( | Positional candidate in GWAS affecting male and female traits ( |
| Insulin-like growth factor 1 ( | Aids in cell growth, differentiation, and transformation ( | Positional candidate affecting male and female traits in GWAS including related cattle breeds ( |
| Inhibin subunit alpha ( | Codes for α subunit of inhibin protein ( | Candidate gene for GWAS ( |
| Proenkephalin ( | Codes for the neurotransmitters methionine enkephalin and leucine enkephalin | Biological candidate for GWAS affecting male and female traits ( |
| Pleiomorphic adenoma gene 1 ( | Family of zinc finger transcription factors ( | Haplotype analysis ( |
| Ribosomal protein S20 ( | Family of S10P ribosomal proteins ( | Functional candidate for GWAS of female traits ( |
| Rhotekin ( | Family of rhotekin ( | Functional candidate for GWAS of male traits ( |
| Serine peptidase inhibitor, clade A member 7 ( | Family of SERPIN ( | Candidate gene for GWAS in male traits ( |
| SRY-transcription factor 9 ( | Family of the SOX gene ( | Candidate gene for GWAS in male traits ( |
| Signal transducer and activator of transcription 3 ( | Family of STAT protein involved in the growth hormone receptor (GHR) signaling pathway regulating growth hormone or somatotropin, involved in estrogen receptor pathway, and functional disruption causes infertility, obesity, hyperphagia, and thermal dysregulation ( | Candidate gene for GWAS ( |
| Serine/threonine kinase 11 interacting protein ( | Affects male fertility ( | Functional candidate for GWAS of male traits ( |
| TATA-box binding protein associated factor 1 ( | Family of TBP-associated factors, TAFs ( | Positional and functional candidate for GWAS of male traits ( |
| TATA-box binding protein associated factor 9b ( | Family of TBP-associated factors, TAFs ( | Positional and functional candidate for GWAS of male traits ( |
List of genes with their respective positions within Bos taurus genome and Brahman genome with their alignment length and identity.
| Gene | Isoforms | Bos taurus | Bos indicus | Length of gene | Chromosome | Strand direction | Homology (%) | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Start Position | Stop Position | Start Position | Stop Position | |||||||
| AR | X1 | 51674157 | 51881942 | 84786064 | 84957236 | 171172 | X | Negative | 99.96 | |
| IGF | X1 | 66206081 | 66263849 | 66145894 | 66203733 | 57839 | 5 | Negative | 100 | |
| X2 | 66206081 | 66263849 | 66145894 | 66203733 | 57839 | 99.65 | ||||
| X3 | 66206081 | 66261980 | 66145894 | 66199198 | 53304 | 100 | ||||
| X4 | 66206081 | 66261980 | 66145894 | 66199195 | 53301 | 100 | ||||
| X5 | 66192424 | 66263849 | 66132237 | 66203733 | 71496 | 99.37 | ||||
| X6 | 66192595 | 66263849 | 66132408 | 66147706 | 15298 | 99.33 | ||||
| X7 | 66192424 | 66261980 | 66132237 | 66147706 | 15469 | 99.76 | ||||
| X8 | 66192424 | 66261980 | 66132237 | 66147706 | 15469 | 99.75 | ||||
| Preprotein | 66192424 | 66263849 | 66132237 | 66147706 | 15469 | 99.79 | ||||
| INHA | X1 | 107501844 | 107504762 | 107614722 | 107617642 | 2920 | 2 | Positive | 99.82 | |
| PENK | X1 | 23542677 | 23546157 | 23420291 | 23423774 | 3483 | 14 | Negative | 100 | |
| PLAG1 | X1 | 23330541 | 23332546 | 23192090 | 23194095 | 2005 | 14 | Negative | 100 | |
| X2 | 23330541 | 23331794 | 23192090 | 23193343 | 1253 | |||||
| RTKN2 | X1 | 18284769 | 18393694 | 17858619 | 17967072 | 108453 | 28 | Negative | 99.3 | |
| X2 | 18284769 | 18385527 | 17858619 | 17958855 | 100236 | 99.26 | ||||
| X3 | 18284769 | 18348138 | 17858619 | 17921964 | 63345 | 99.33 | ||||
| SERPINA7 | X1 | 54824445 | 54829800 | 53237946 | 53241468 | 3522 | X | Positive | 99.69 | |
| Precursor | 54824445 | 54827949 | 53237963 | 53241468 | 3505 | 99.68 | ||||
| SOX9 | X1 | 58919579 | 58922699 | 60166722 | 60169843 | 3121 | 19 | Negative | 100 | |
| STAT3 | X1 | 42419849 | 42450618 | 43645371 | 43676141 | 30770 | 19 | Negative | 99.66 | |
| X2 | 42419849 | 42450618 | 43645371 | 43676141 | 30770 | 99.66 | ||||
| X3 | 42421282 | 42450618 | 43647176 | 43676141 | 28965 | 99.64 | ||||
| STK11IP | X1 | 107521189 | 107535476 | 107634090 | 107648385 | 14295 | 2 | Positive | 99.45 | |
| X2 | 107521189 | 107535476 | 107634090 | 107648385 | 11551 | 99.5 | ||||
| X3 | 107521189 | 107533477 | 107634090 | 107646987 | 12297 | 99.46 | ||||
| TAF1 | X1 | 79206804 | 79276760 | 80853773 | 80923837 | 70064 | X | Negative | 99.86 | |
| X2 | 79206804 | 79276760 | 80853773 | 80923837 | 70064 | 99.86 | ||||
| X3 | 79206804 | 79276760 | 80853773 | 80923837 | 70064 | 99.79 | ||||
| X4 | 79206804 | 79276760 | 80853773 | 80923837 | 70064 | 99.84 | ||||
| X5 | 79206804 | 79276760 | 80853773 | 80923837 | 70064 | 99.84 | ||||
| X6 | 79197983 | 79276760 | 80857229 | 80923837 | 66608 | 99.87 | ||||
| X7 | 79225602 | 79276760 | 80872738 | 80923837 | 51099 | 99.9 | ||||
| X8 | 79229398 | 79276760 | 80885675 | 80923837 | 38162 | 99.9 | ||||
| X9 | 79227390 | 79276760 | 80885675 | 80923837 | 38162 | 99.9 | ||||
| TAF9B | X1 | 74232003 | 74255254 | 74070058 | 74093312 | 23254 | X | Positive | 100 | |
| X2 | 74232003 | 74255924 | 74070058 | 74093982 | 23924 | 100 | ||||
| Subunit | 74232003 | 74239263 | 74070058 | 74077335 | 7277 | 100 | ||||
Length from the first base of the first exon to the last base of the last exon.
Negative = 3′ to 5′ direction; positive = 5′ to 3′ direction.
Genome-wide SNPs compared to Bos taurus genome.
| Mother | Foetus | |
|---|---|---|
| Homozygous SNP | 7762707 | 7734830 |
| Heterozygous SNP | 11408849 | 11537221 |
| Haplotype level Homology | 99.501% | 99.500% |
Homozygous for alternate alleles from reference.
Position and length of 5′ untranslated region.
| Gene | 5′ gene position | CAGE-seq TSS peak | UTR length | |
|---|---|---|---|---|
| Start position | Stop position | |||
| AR | 84957236 | 84958298 | 84958477 | 1,241 |
| IGF1 | 66203733 | 66203715 | 66204034 | 301 |
| INHA | 107614722 | 107614985 | 107614491 | 231 |
| PENK | 23423774 | 23424308 | 23424459 | 685 |
| RPS20 | 23140852 | 23141036 | 23141219 | 367 |
| RTKN2 | 17967072 | 17966877 | 17967512 | 440 |
| SERPINA7 | 53237946 | 53236117 | 53236027 | 1,919 |
| SOX9 | 60169843 | 60169524 | 60170244 | 401 |
| STAT3 | 43676141 | 43676023 | 43676320 | 179 |
| STK11IP | 107634090 | 107634194 | 107633817 | 273 |
| TAF1 | 80923837 | 80923816 | 80923981 | 144 |
| TAF9B | 74070058 | 74070080 | 74069977 | 81 |
Distance between the start of the coding region within the first exon and the center of the TSS peak from the CAGE-seq data.
FIGURE 1Tissue-specific expression of genes in RNA-seq, ISO-seq, and CAGE-seq data. RPM, reads per million.
FIGURE 2Allele-specific expression for each tissue with significance. Significance at p < .0001 and p < .01 indicated for reference (dashed lines). The ratio of alleles observed in whole genome sequencing (WGS) of the same animals is indicated as the background of each panel. Only loci that were heterozygous in the WGS data were tested for allele-specific expression. In the position, 1, 2, and 3 refer to exons 1, 2, and 3, respectively.
FIGURE 3Representation of isoforms with length of exons (red boxes) and introns (blue lines). Positions are displayed relative to the first expressed base pair (x axis) of the gene.