| Literature DB >> 36176302 |
Karim Hasanpur1, Sevda Hosseinzadeh1, Atiye Mirzaaghayi1, Sadegh Alijani1.
Abstract
Accurate normalization of the gene expression assays, using housekeeping genes (HKGs), is critically necessary. To do so, selection of a proper set of HKGs for a specific experiment is of great importance. Despite many studies, there is no consensus about the suitable set of HKGs for implementing in the quantitative real-time PCR analyses of chicken tissues. A limited number of HKGs have been widely used. However, wide utilization of a little number of HKGs for all tissues is challenging. The emergence of high-throughput gene expression RNA-seq data has enabled the simultaneous comparison of the stability of multiple HKGs. Therefore, employing the average coefficient of variations of at least three datasets per tissue, we sorted all reliably expressed genes (REGs; with FPKM ≥ 1 in at least one sample) and introduced the top 10 most suitable and stable reference genes for each of the 16 chicken tissues. We evaluated the consistency of the results of five tissues using the same methodology on other datasets. Furthermore, we assessed 96 previously widely used HKGs (WU-HKGs) in order to challenge the accuracy of the previous studies. The New Tuxedo software suite was used for the main analyses. The results revealed novel, different sets of reference genes for each of the tissues with 17 common genes among the top 10 genes lists of 16 tissues. The results did disprove the suitability of WU-HKGs such as Actb, Ldha, Scd, B2m, and Hprt1 for any of the tissues examined. On the contrary, a total of 6, 13, 14, 23, and 32 validated housekeeping genes (V-HKGs) were discovered as the most stable and suitable reference genes for muscle, spleen, liver, heart, and kidney tissues, respectively. Although we identified a few new HKGs usable for multiple tissues, the selection of suitable HKGs is required to be tissue specific. The newly introduced reference genes from the present study, despite lacking experimental validation, will be able to contribute to the more accurate normalization for future expression analysis of chicken genes.Entities:
Keywords: New Tuxedo; RNA-sequencing; chicken; coefficient of variation; housekeeping genes
Year: 2022 PMID: 36176302 PMCID: PMC9514876 DOI: 10.3389/fgene.2022.827538
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Accession numbers of the used datasets for discovering the most stably expressed genes for 16 chicken tissues.
| Training datasets | Adipose | SRP143406 | SRP343295 | SRP042257 | SRP212250 | ||
| Blood | SRP200185 | SRP200118 | SRP200118 | SRP310357 | SRP310357 | ||
| Brain | SRP102082 | SRP081121 | SRP233052 | SRP233052 | |||
| Burs | ERP122030 | ERP122030 | SRP163233 | SRP098825 | |||
| Duodenum | SRP348148 | SRP299602 | SRP173587 | SRP055561 | |||
| Heart | SRP265642 | SRP097223 | SRP153755 | ||||
| Ileum | SRP149780 | SRP200118 | SRP300399 | SRP126304 | |||
| Jejunum | SRP280208 | SRP140601 | ERP121879 | ERP121879 | |||
| Kidney | SRP097223 | SRP092600 | SRP338989 | ||||
| Liver | SRP143406 | SRP097223 | SRP321387 | SRP294224 | SRP161836 | SRP133195 | |
| Lung | SRP097223 | SRP265640 | SRP233531 | SRP238721 | SRP081121 | ||
| Muscle | SRP217060 | SRP217060 | SRP217060 | SRP217060 | SRP159467 | SRP321387 | |
| Ovary | SRR12315154 | SRP143406 | SRP256253 | SRP256253 | |||
| Skin | SRP343295 | SRP142597 | SRP126033 | SRP112878 | |||
| Spleen | SRP097223 | SRP225741 | SRP174144 | SRP280208 | SRP158365 | SRP174144 | |
| Trachea | SRP338989 | SRP247563 | SRP226600 | SRP126851 | |||
| Evaluation datasets | Heart | SRP152925 | SRP266037 | SRP159467 | |||
| Kidney | SRP338989 | SRP338989 | SRP338989 | ||||
| Liver | SRP111815 | SRP104528 | SRP233052 | SRP233052 | SRP081121 | SRP100368 | |
| Muscle | SRP255211 | SRP104528 | SRP327337 | SRP327185 | SRP313854 | SRP226900 | |
| Spleen | SRP254842 | SRP254842 | SRP223412 | SRP173965 | SRP174144 | SRP174144 |
FIGURE 1Flowchart of detection of the most suitable housekeeping genes for a specific tissue. This workflow was repeated for all tissues separately. At least three datasets were analyzed per tissue. The comparisons within each experiment were performed between the treated and control groups, each with at least three replicates. All the used datasets are publicly available data generated using Illumina paired-end RNA-Sequencing method. Software programs within the New Tuxedo suite were hisat2 (for mapping of reads onto reference genome), Stringtie (for assembly and read counting), and Cuffdiff (for differentiall expression analysis). Expression stability was monitored per gene based on the coefficient of variation (CV), and ranking of most stably expressed genes were performed based on the average CV criterion.
FIGURE 2Scatterplot of the reliably expressed genes of one dataset of adipose tissue. The relationship of mean (in log10 scale) and variation of expression of genes were assessed in order to understand whether the variability of expression is increased with the mean of expression. The expression values are given as fragments per kilobase of transcript per million reads in log10 scale.
Top 10 most suitable reference genes for chicken tissues based on average coefficient of variations (CVs) across at least three experiments.
| Adipose | Blood | Brain | Bursa | Duodenum | Heart | Ileum | Jejunum | Kidney | Liver | Lung | Muscle | Ovary | Skin | Spleen | Trachea |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Abcb6 | Psma1 | Serbp1 | Rap2c | Amot | Mrpl33 | Dhx30 |
| Itgb1bp3 | Fam120a | Atl1 | Adam17 |
|
| Wdr81 |
|
| Prrc2c |
| Nr1h3 | Tmem259 |
|
| Dhx38 |
| Uck1 | Xpo6 | Tfip11 | Slc39a3 | Pcif1 | Ddb1 | Hdac1 |
|
| Ubr7 |
| Bet1l | Cnot9 |
| Rufy3 | Arnt |
|
|
| Mvb12a |
| Tomm22 | Lonp1 | Scyl3 |
|
| Eif3a | Rpl39l | Arhgef9 | Wasf2 | Zyx |
| Ckap2l |
| Col4a1 | Mrps25 | Slc35a1 | Xpo7 | Erlin1 | Tcf25 | Zc3h11b |
|
| Pabpc1 | Tmed10 |
|
| Usp5 | Nfyc |
| Lasp1 |
|
| Pisd | Fem1b | Poll |
| Dnajc5 | Cfap92 |
| Tmem57 |
| Nono | Ascc2 | Cnot1 | Cuedc2 | Nup188 | Mif4gd |
| Pcbd1 | Phc1 | Ipo9 |
| Mtmr3 | Nek9 | Parp9 |
| Thrap3 | Cox7b | Sumo3 | Rpl7l1 | Psmd13 |
| Stx10 |
| Hbp1 | Rnf130 | Spout1 |
| Kbtbd4 | Baz1b | Gzf1 | Hspd1 |
|
| Rpl7a | Rps3a | Man2c1 | Fhl3 | Slc7a3 |
|
| Herc2 | Adat1 |
| Copg2 | Brd8 |
| Cdk12 | Cd80 |
|
| Rpsap58 | Uso1 | Tpm3 | Med1 |
| Map3k14 | Mkrn2 | Ndufb9 | Myh9 | Cog5 | Arcn1 | Gsr |
|
| Mettl21c |
| Kcnh4 | Eif4ebp1 | Cd99l2 | Chmp1a |
|
| Nol9 | Aamp |
| Arpc2 |
| Eya3 | Clpx | Yipf3 | Mpp1 | Mrpl40 |
Highlighted (bold) genes are in common for at least two tissues. Underlined genes are among the widely used housekeeping genes.
Number of genes that are in common in the top 100 genes list of the training datasets and the top 100 to top 500 genes lists of the evaluation datasets.
| Heart | Kidney | Liver | Muscle | Spleen | |
|---|---|---|---|---|---|
| Top-100 genes of evaluation datasets | 23 | 32 | 14 | 6 | 13 |
| Top-200 genes of evaluation datasets | 38 | 51 | 26 | 11 | 21 |
| Top-300 genes of evaluation datasets | 48 | 68 | 36 | 16 | 31 |
| Top-400 genes of evaluation datasets | 59 | 76 | 46 | 19 | 35 |
| Top-500 genes of evaluation datasets | 63 | 80 | 49 | 23 | 43 |
Most stably expressed genes of five chicken tissues that are in common between the top 100 genes of the training dataset and the top 100 genes of the evaluation dataset.
| Heart | Kidney | Liver | Muscle | Spleen |
|---|---|---|---|---|
| Mob1a | Ikbkb | Rp11-529k1.3 | Srpra | Nfyc |
|
| Wnk1 | Dpagt1 | Cops7a | Hdac1 |
| Ticam1 | Aplp2 | Rbm7 | Hnrnpd | Cnp |
| Cep68 | Ilf2 | Npepps | Dhx38 | Adam17 |
|
|
| Eif2b5 | Gtpbp1 | Spata5 |
|
| Ugp2 |
| Puf60 | Grk2 |
| Fancm | Bpnt1 | Amfr | Rpn1 | |
| Abi2 |
| Psmd7 | Hnrnpab | |
| Aamp | Etfdh | Fam120a | Mtmr3 | |
| Tmem41a | Rufy3 | Ctnna1 | Nup188 | |
| Ak3 | Oraov1 | Xpo6 | Tor1b | |
| Hbp1 | Slirp | Lig3 | Znrf2 | |
| Casc4 |
|
|
| |
| Rbl2 | Hspd1 | Prpf6 | ||
| Pepd | Tanc1 | |||
|
|
| |||
| Tfip11 | Cp | |||
| Pisd | Gpr18 | |||
| Spout1 | Cog5 | |||
|
| Vwa9 | |||
| Mif4gd |
| |||
|
| Hvcn1 | |||
| Tasor2 |
| |||
| Gsn | ||||
| Fam104a | ||||
| Mrps7 | ||||
|
| ||||
| Rps6kb1 | ||||
| Myo19 | ||||
| Tubb2a | ||||
| Stx17 | ||||
| Mrps16 |
Highlighted (bold) genes are in common for at least two tissues. Underlined genes are among the widely used housekeeping genes.
FIGURE 3Box plots of the expression of most stable, validated housekeeping genes in comparison with that of five randomly selected widely used housekeeping genes of five chicken tissues. For each tissue, the expression variation of newly introduced housekeeping genes (left side of the vertical line) was compared with that of five old and widely used housekeeping genes (right side of the vertical line). The expression values are given as fragments per kilobase of transcript per million reads in log10 scale (whiskers: min to max).