
Whole-transcriptome analyses of the Sapsaree, a Korean natural monument, before and after exercise-induced stress.

Ji-Eun Kim1, Junkyung Choe1, Jeong Hee Lee1, Woong Bom Kim1, Whan Cho1, Ji Hong Ha2, Ki Jin Kwon2, Kook Il Han3, Sung-Hwan Jo1.   

Abstract

BACKGROUND: The Sapsaree (Canis familiaris) is a Korean native dog that is very friendly, protective, and loyal to its owner, and is registered as a natural monument in Korea (number: 368). To investigate large-scale gene expression profiles and identify the genes related to exercise-induced stress in the Sapsaree, we performed whole-transcriptome RNA sequencing and analyzed gene expression patterns before and after exercise performance.
RESULTS: We identified 525 differentially expressed genes in ten dogs before and after exercise. Gene Ontology classification and KEGG pathway analysis revealed that the genes were mainly involved in metabolic processes, such as programmed cell death, protein metabolic process, phosphatidylinositol signaling system, and cation binding in cytoplasm. The ten Sapsarees could be divided into two groups based on the gene expression patterns before and after exercise. The two groups were significantly different in terms of their basic body type (p ≤ 0.05). Seven representative genes with significantly different expression patterns before and after exercise between the two groups were chosen and characterized.
CONCLUSIONS: Body type had a significant effect on the patterns of differential gene expression induced by exercise. Whole-transcriptome sequencing is a useful method for investigating the biological characteristics of the Sapsaree and the large-scale genomic differences of canines in general.

Keywords:  Bioinformatics; Exercise; NGS; Physical stress; RNA-Seq; Sapsaree; Transcriptome

Year:  2016        PMID: 27087983      PMCID: PMC4832554          DOI: 10.1186/s40781-016-0097-1

Source DB:  PubMed          Journal:  J Anim Sci Technol        ISSN: 2055-0391


Background

The Sapsaree (Canis familiaris) is a native Korean dog that is distributed throughout the Korean peninsula and is very friendly, protective, and loyal to its owner. The Sapsaree population decreased dramatically during the Korean War in the 1950s, and the breed was considered endangered. A program of systematic mating and reproduction, set up to preserve the Sapsaree from extinction and maintain a pure pedigree, generated the current population of about 4,000 individuals, including the 500 dogs now living at the Sapsaree Breeding Research Institute in Gyeongsan, Gyeongbuk province [1]. The Sapsaree was registered as a Korean Natural Monument (number: 368) in 1992. Although recent studies have shed light on the origin and various morphological and behavioral traits of the Sapsaree, its genetic background, genetic polymorphisms, and novel genes are still not completely understood [2].
RNA sequencing (RNA-Seq) is one of the most useful next-generation sequencing tools for investigating the landscape of a whole transcriptome. Using RNA-Seq, researchers have identified differentially expressed and novel genes, unraveled the expression profiles underlying phenotypic changes, and discovered unannotated, transcriptionally active regions that cannot be detected by conventional gene prediction [3]. Furthermore, the recent use of RNA-Seq has captured the scale and complexity of organ-specific and tissue-specific transcriptomes, making RNA-Seq the technique of choice for investigating gene expression during complex phenomena such as stress [4].
Exercise is usually recognized as a stress factor, as is any environmental change that activates or pressures cells and tissues. The level of response to physical exercise also differs from individual to individual, so the biological responses to physical stress need to be studied. Several RNA-Seq studies of exercise-induced stress have been performed in equines [3, 4]. There has been little progress, however, in canine-based RNA-Seq studies, which have been limited to certain diseases. Therefore, we performed a large-scale analysis of whole-transcriptome data to investigate the gene expression levels before and after exercise in the Sapsaree. This study provides a basic approach for identifying biological characteristics and for breeding working dogs.

Methods

Morphological traits and blood sample collection

The dogs were handled in accordance with Article 23 ‘Experiments with Animal’ of Korea’s Animal Protection Law, 2015, and the Korean Sapsaree Foundation cooperated and approved all animal care procedures prior to the initiation of the experiment. Ten Sapsarees housed in controlled environmental conditions were used in the experiment. The basic morphological traits of each Sapsaree including body height (BH), body length (BL), depth of chest (DC), body weight (W), and hair color were measured before the experiment. Then, the dogs were exercised one-on-one by trainers for 1 h in 5 min intervals, with the trainers leading the dogs in quick step 10 times around a course on a 20 m × 20 m square field located at the Korean Sapsaree Foundation facility in Gyeongsan. The exercise course included hurdling (40 cm, 70 cm, and 80 cm), high jumping (50 cm), A-shape hurdling, bridge-shape hurdling, and seesaw hurdling. Blood samples were drawn under a veterinarian’s supervision from a cephalic vein before and immediately after exercise, resulting in a total of 20 samples. Immediately after collection, portions of the blood samples were dispensed into serum separator tubes (SST) (1.5 ml in each tube [Greiner Bio One, Kremsmuenster, Austria]) to measure hormone levels, and into PAXgene blood RNA tubes (2.5 ml in each tube, [PreAnalytiX, Hombrechtikon, Switzerland]) for RNA sequencing according to the respective manufacturers’ protocols [5, 6]. The blood for serum was allowed to clot at room temperature before centrifugation. The serum was then separated and stored frozen at -70 °C until completion of the case enrollment and sample collection.

Measurement of stress indicators in blood

Four substances related to physical stress were chosen based on previous studies: cortisol, aspartate aminotransferase (AST), creatine kinase (CK), and creatinine. Serum CK, AST, and creatinine levels are commonly used as indicators of muscle damage and renal function [7-10]. A recent study also indicated that physical stress resulted in an immediate increase in the plasma concentration of cortisol [11]. The levels of the target substances in serum taken from the upper layer of the SST tubes were measured using a BS-400 chemistry analyzer (Mindray, Shenzhen, China) and an Immulite 1000 Immunoassay System (Siemens, New York, USA) [12, 13].

RNA sequencing and bioinformatic analysis

Total RNA was extracted from the PAXgene tubes according to the manufacturer’s instructions [6]. Starting with the total RNA, mRNA was purified using poly(A) selection or rRNA depletion, converted into double-stranded cDNA, and amplified by PCR. To check the RNA quality, all RNA samples were examined for RNA Integrity Number and 28S-to-18S rRNA ratio using a Bioanalyzer. Next, the libraries were constructed with the Illumina TruSeq RNA Sample Preparation Kit v2 (catalog #RS-122-2001, Illumina, San Diego, CA) following the manufacturer’s instructions [14]. The libraries were quantified using the KAPA library quantification kit (Kapa Biosystems KK4854) following the manufacturer’s instructions [15]. The final individual libraries were sequenced on the Illumina HiSeq 2000 platform, which produced 100 bp paired-end (PE) RNA-sequencing reads. To collect high-quality transcriptome data, we filtered the sequencing data by Phred score (Q ≥ 20) and minimum length (≥ 25 bp) using the SolexaQA software [16]. The filtered reads were mapped to 48,370 reference mRNAs from Canis lupus familiaris using the bowtie2 software (mismatches ≤ 2) [17, 18]. The number of mapped reads for each mRNA was counted and then normalized using the DESeq package in R [19]. Differentially expressed genes (DEGs) were first selected by requiring more than 100 mapped read counts, a ≥ twofold change in read coverage, and a binomial test with a false discovery rate (FDR) ≤ 0.01. The final DEGs were then identified by a ≥ twofold change in read coverage, a binomial test with an FDR ≤ 0.01, and a read count ≥ 1,000 either before or after exercise. The FDR was applied to identify the threshold p-value for multiple tests and was calculated using DESeq. Correlation analysis and hierarchical clustering were performed to group the genes according to patterns of expression using the AMAP library in R [20]. We also tested the stability of the expression levels of housekeeping genes, such as HNRNPH1, GAPDH, RPL8, TAF4B, and TAF1, before and after exercise, using a t-test to assess statistical significance [21].
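The final DEG criteria described above can be sketched in a few lines of Python. This is only an illustration of the three thresholds stated in the text, not the pipeline used in the study: the `GeneResult` record, its field names, the pseudocount of 1 in the fold-change calculation, and the example genes are all assumptions made for this sketch, and the FDR values are taken as given inputs rather than computed.

```python
import math
from dataclasses import dataclass


@dataclass
class GeneResult:
    """One gene's before/after comparison (field names are illustrative)."""
    gene_id: str
    count_before: float  # normalized read count before exercise
    count_after: float   # normalized read count after exercise
    fdr: float           # multiple-testing-adjusted p-value, supplied as input


def log2_fold_change(before: float, after: float, pseudocount: float = 1.0) -> float:
    """log2(after / before); the pseudocount is an assumption that avoids division by zero."""
    return math.log2((after + pseudocount) / (before + pseudocount))


def is_final_deg(g: GeneResult, min_fold: float = 2.0,
                 max_fdr: float = 0.01, min_count: float = 1000.0) -> bool:
    """Apply the three final criteria: fold change, FDR, and expression level."""
    lfc = log2_fold_change(g.count_before, g.count_after)
    big_change = abs(lfc) >= math.log2(min_fold)          # at least twofold, up or down
    significant = g.fdr <= max_fdr                          # FDR <= 0.01
    well_expressed = max(g.count_before, g.count_after) >= min_count  # >= 1,000 reads
    return big_change and significant and well_expressed


# Hypothetical genes, invented for illustration only.
genes = [
    GeneResult("geneA", 1200, 3000, 0.001),  # passes all three criteria
    GeneResult("geneB", 150, 400, 0.001),    # twofold and significant, but low counts
    GeneResult("geneC", 2000, 2100, 0.5),    # well expressed, but barely changes
]
final_degs = [g.gene_id for g in genes if is_final_deg(g)]
print(final_degs)
```

Taking the maximum of the two counts mirrors the "either before or after exercise" wording, so a gene that is switched on or off by exercise is not discarded for being low in one condition.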

Functional enrichment analysis

Functional enrichment analyses were carried out using the Gene Ontology (GO) database, including all three GO categories (biological processes, cellular components, and molecular functions), which provides a structured and controlled vocabulary to describe gene products [22]. We also used the KEGG database to identify the biological mechanisms and metabolic pathways associated with the differentially expressed genes according to their enzyme commission numbers [23]. DAVID is a web-accessible annotation system (https://david.ncifcrf.gov/home.jsp) that provides a comprehensive set of functional annotation tools to help investigators understand the biological meaning behind large lists of genes [24]. We used DAVID to analyze the clusters of differentially expressed genes, annotated by their Entrez gene IDs, and retained GO and KEGG terms with gene counts ≥ 2 and FDR ≤ 0.1.
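As a rough illustration of the term filter (gene count ≥ 2 and FDR ≤ 0.1), the sketch below adjusts raw p-values with the Benjamini-Hochberg procedure and keeps the terms that pass both cutoffs. DAVID reports its own corrected values, so the Benjamini-Hochberg step is a stand-in assumed for this example, and the term names, counts, and p-values are invented.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original input order."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])  # indices by ascending p-value
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that adjusted values
    # never increase as the raw p-value decreases.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


def keep_terms(terms, min_count=2, max_fdr=0.1):
    """Filter (name, gene_count, raw_p) tuples by gene count and adjusted p-value."""
    fdrs = benjamini_hochberg([p for _, _, p in terms])
    return [name for (name, count, _), fdr in zip(terms, fdrs)
            if count >= min_count and fdr <= max_fdr]


# Hypothetical enrichment results, invented for illustration only.
terms = [
    ("term_a", 5, 0.005),  # enough genes and a small adjusted p-value
    ("term_b", 1, 0.010),  # fails the gene-count cutoff
    ("term_c", 3, 0.500),  # fails the FDR cutoff
]
kept = keep_terms(terms)
print(kept)
```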

Statistical analysis

To identify correlations between the gene expression pattern and body weight, we carried out t-tests comparing W/BH, W/BL, and W/DC values between two groups of dogs that showed different gene expression patterns. To control for group differences in baseline weight, the weight was divided by the BH, BL, and DC. A p-value ≤ 0.05 was used as the cutoff for significance in all analyses using the t-test function of R.
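The comparison can be sketched as follows for W/BH, using the weights and body heights listed in Table 1 and the grouping described in the Results. R's `t.test` performs Welch's unequal-variance test by default, so that variant is assumed here; the source does not state which variant was used. The sketch shows the computation only and is not a reproduction of the reported statistics, so its output need not match Table 3.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Welch's unequal-variance t statistic and Welch-Satterthwaite degrees of freedom."""
    va = variance(a) / len(a)  # squared standard error of group a
    vb = variance(b) / len(b)  # squared standard error of group b
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


# (body weight in kg, body height in cm) for each dog, as listed in Table 1.
group_1 = [(22, 58), (24.5, 62), (15.4, 50), (20, 60), (18.7, 49)]
group_2 = [(31, 58), (24, 58), (25.5, 54), (30, 62), (19, 53)]

# Normalize weight by body height to control for differences in frame size.
w_bh_1 = [w / bh for w, bh in group_1]
w_bh_2 = [w / bh for w, bh in group_2]

t_stat, dof = welch_t(w_bh_1, w_bh_2)
print(f"mean W/BH: Group I {mean(w_bh_1):.3f}, Group II {mean(w_bh_2):.3f}")
print(f"t = {t_stat:.3f}, df = {dof:.3f}")
```

A p-value would then come from the two-sided tail of Student's t distribution at these degrees of freedom, for example with `scipy.stats.t.sf`, which is omitted here to keep the sketch dependency-free.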

Results and discussion

The basic morphological traits of the 10 Sapsarees measured before the experiment are shown in Table 1. All the dogs were males, 13 to 60 months of age. The BH, measured from the ground to the top of the withers, ranged from 49 cm to 62 cm. The BL, measured from the point of the shoulder to the rear point of the croup, ranged from 58 cm to 70 cm. The DC, measured from the elbow to the top of the withers, ranged from 21 cm to 28 cm. The W, measured with a weighing balance, ranged from 18.7 kg to 30 kg [25].
Table 1

Summary of the morphological traits of 10 Sapsarees

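| Group | Name | Birth | Age (months) a | Sex | Hair color b | BH c (cm) | BL d (cm) | DC e (cm) | W f (kg) | W/BH | W/BL | W/DC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| I | Cheongbaek | 2011.02.15 | 46 | Male | BT | 58 | 65 | 27 | 22 | 0.38 | 0.34 | 0.81 |
| I | Rookie | 2010.03.23 | 57 | Male | W | 62 | 67 | 25 | 24.5 | 0.40 | 0.37 | 0.98 |
| I | Chaeum | 2010.10.24 | 50 | Male | Y | 50 | 60 | 21 | 15.4 | 0.31 | 0.26 | 0.73 |
| I | Hwangryong | 2011.11.08 | 37 | Male | DY | 60 | 69 | 25 | 20 | 0.33 | 0.29 | 0.80 |
| I | Pyeonggang | 2013.11.19 | 13 | Male | BT | 49 | 59 | 21 | 18.7 | 0.38 | 0.32 | 0.89 |
| I | Average | - | - | - | - | 55.8 | 64 | 23.8 | 20.12 | 0.36 | 0.316 | 0.842 |
| II | Tong | 2010.08.15 | 52 | Male | BT | 58 | 66 | 26 | 31 | 0.53 | 0.47 | 1.19 |
| II | Huimang | 2009.12.11 | 60 | Male | W | 58 | 68 | 28 | 24 | 0.41 | 0.35 | 0.86 |
| II | Pyeongtan | 2013.11.17 | 13 | Male | Y | 54 | 63 | 22 | 25.5 | 0.47 | 0.40 | 1.16 |
| II | Hwangdol | 2011.09.29 | 39 | Male | SY | 62 | 70 | 27 | 30 | 0.48 | 0.43 | 1.11 |
| II | Bongsik | 2010.07.22 | 53 | Male | DY | 53 | 58 | 23 | 19 | 0.36 | 0.33 | 0.83 |
| II | Average | - | - | - | - | 57 | 65 | 25.2 | 25.9 | 0.45 | 0.396 | 1.03 |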

aMonths of age measured from birth to the date of exercise

bHair color : BT (Black & Tan), W (White), Y (Yellow), DY (Dark Yellow), SY (Strong Yellow)

c BH Body Height, d BL Body Length, e DC Depth of Chest, f W Body Weight


Physical stress indicators in serum

Compared with those before exercise, the concentrations of AST, CK, and creatinine were slightly increased after exercise, but the increases were not significant (Additional file 1: Tables S1 and S2). Cortisol, a key hormone from the adrenal glands, was significantly elevated after exercise in all individuals except for Cheongbaek (Additional file 2: Figure S1, Additional file 1: Table S2). Hence, it seems that cortisol can be used as a marker of exercise-induced stress.

RNA sequencing and bioinformatic analysis

After the RNA quality check, the RNA sequencing generated over 75 Gbp (about 3.8 Gbp per sample) of data consisting of 100 bp paired-end reads (Table 2). Trimming resulted in reads with a mean length of 86.58 bp across all samples and a total combined length of about 57 Gbp, which was 74.9 % of the raw sequence.
Table 2

Summary of raw sequencing reads

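| Name | Exercise | Num. of reads (ea) | Avg. length (bp) | Total length (bp) |
|---|---|---|---|---|
| Cheongbaek | before | 31533452 | 100 | 3153345200 |
| Cheongbaek | after | 38271980 | 100 | 3827198000 |
| Rookie | before | 41724150 | 100 | 4172415000 |
| Rookie | after | 38515768 | 100 | 3851576800 |
| Chaeum | before | 34658490 | 100 | 3465849000 |
| Chaeum | after | 33784972 | 100 | 3378497200 |
| Hwangryong | before | 36358784 | 100 | 3635878400 |
| Hwangryong | after | 41631454 | 100 | 4163145400 |
| Pyeonggang | before | 38708814 | 100 | 3870881400 |
| Pyeonggang | after | 43781622 | 100 | 4378162200 |
| Tong | before | 40314536 | 100 | 4031453600 |
| Tong | after | 39786602 | 100 | 3978660200 |
| Huimang | before | 35384558 | 100 | 3538455800 |
| Huimang | after | 32013742 | 100 | 3201374200 |
| Pyeongtan | before | 41878502 | 100 | 4187850200 |
| Pyeongtan | after | 40729858 | 100 | 4072985800 |
| Hwangdol | before | 39336568 | 100 | 3933656800 |
| Hwangdol | after | 38413046 | 100 | 3841304600 |
| Bongsik | before | 33239988 | 100 | 3323998800 |
| Bongsik | after | 37272096 | 100 | 3727209600 |
| Total | - | 757338982 | 100 | 75733898200 (100 %) |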
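The throughput figures quoted in the text follow directly from the read counts in Table 2. The short sketch below redoes that arithmetic; the variable names are chosen for this example, and the 74.9 % retained fraction is taken from the text rather than recomputed from trimmed reads.

```python
# Read counts per dog as listed in Table 2: (before exercise, after exercise).
READS = {
    "Cheongbaek": (31533452, 38271980),
    "Rookie": (41724150, 38515768),
    "Chaeum": (34658490, 33784972),
    "Hwangryong": (36358784, 41631454),
    "Pyeonggang": (38708814, 43781622),
    "Tong": (40314536, 39786602),
    "Huimang": (35384558, 32013742),
    "Pyeongtan": (41878502, 40729858),
    "Hwangdol": (39336568, 38413046),
    "Bongsik": (33239988, 37272096),
}
READ_LENGTH_BP = 100       # raw read length stated in the text
RETAINED_FRACTION = 0.749  # share of bases kept after trimming, from the text

total_reads = sum(before + after for before, after in READS.values())
total_bases = total_reads * READ_LENGTH_BP
n_samples = 2 * len(READS)  # one sample before and one after exercise per dog

gbp_total = total_bases / 1e9
gbp_per_sample = gbp_total / n_samples
gbp_after_trimming = gbp_total * RETAINED_FRACTION

print(f"{total_reads} reads, {gbp_total:.1f} Gbp raw")
# -> 757338982 reads, 75.7 Gbp raw
print(f"{gbp_per_sample:.2f} Gbp per sample, {gbp_after_trimming:.1f} Gbp after trimming")
# -> 3.79 Gbp per sample, 56.7 Gbp after trimming
```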
Using bowtie2, 80.76 % of the filtered reads were successfully mapped to the current dog reference genes (Canis lupus familiaris, 48,370 mRNAs) [18, 26]. A novel bioinformatics pipeline for processing large amounts of transcriptome sequences was built. We calculated the expression levels of all the genes with mapped reads from the 10 individuals before and after exercise. By comparing the coverage before and after exercise, we identified 2,549 DEGs. The numbers of up-regulated and down-regulated genes were different in each individual. Pyeonggang, Huimang, and Pyeongtan each had fewer than 10 DEGs, but each of the other dogs had more than 100 DEGs (Additional file 3: Figure S2). After filtering out genes with low levels of expression, 525 genes were grouped into two clusters (C1 and C2) of 276 and 249 genes, respectively, depending on their expression pattern (Fig. 1 and Additional file 4).
Fig. 1

HeatMap showing hierarchical clustering of differentially expressed genes regulated by exercise. The log2Ratio for each significantly differentially expressed gene was used. Each column represents a Sapsaree individual, and each row represents a differentially expressed gene. Expression differences are shown in different colors; red indicates up-regulation after exercise, and green indicates down-regulation after exercise. The 525 genes were grouped into two clusters (C1 and C2). The dogs were divided into two groups. Group I included Cheongbaek, Rookie, Chaeum, Hwangryong, and Pyeonggang. Group II included Tong, Huimang, Pyeongtan, Hwangdol, and Bongsik

The reliability of the RNA-Seq-based gene expression analysis was supported by t-tests showing that the expression of housekeeping genes was stable before and after exercise (Additional file 1: Table S3). A total of 26 unique genes among the 525 differentially expressed genes were assigned to 11 functional groups based on GO assignments (Additional file 1: Table S4). The genes were involved in biological processes such as programmed cell death, negative regulation of cell motion, regulation of cellular component biogenesis, protein metabolic process, and others. The cellular components linked to the genes were intracellular parts, such as the cytoplasm. The molecular function assignments were mainly to catalytic and binding activities, such as cation binding. A further functional classification of all the differentially expressed genes was performed using the KEGG database. A total of 84 unique genes among the 525 differentially expressed genes were assigned to 30 metabolic pathway terms, including phosphatidylinositol signaling system, ribosome, proteasome, oxidative phosphorylation, and others (Additional file 1: Table S5). The functional annotation analyses indicated that the genes regulated under exercise-induced stress were mainly related to metabolic pathways.

Correlation between gene expression pattern and body type

Based on the gene expression patterns induced by exercise, the 10 Sapsaree individuals were divided into two groups (Groups I and II; Fig. 1). Group I included Cheongbaek, Rookie, Chaeum, Hwangryong, and Pyeonggang, while Group II included Tong, Huimang, Pyeongtan, Hwangdol, and Bongsik. The differentially expressed genes formed two clusters. Cluster 1 (C1) was down-regulated after exercise in Group I but up-regulated after exercise in Group II. Cluster 2 (C2) was up-regulated after exercise in Group I but down-regulated after exercise in Group II. We examined the cortisol change after exercise, age, and body type for differences between the two groups. Only the body type showed a significant difference between Group I and Group II. Using W/BH, W/BL, and W/DC to characterize the overall body type, we found that Group II had a heavier body type than Group I (Table 1). In addition, W/BH, W/BL, and W/DC were each significantly different (p ≤ 0.05) between the two groups based on independent t-tests (Table 3). Those findings suggest differences in gene regulation between heavier dogs and lighter dogs under exercise stress.
Table 3

Differences in body type between groups I and II

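| Measure a | Mean of Group I b | Mean of Group II c | t d | df e | p-value f |
|---|---|---|---|---|---|
| W/BH | 0.357 | 0.452 | -3.386 | 8.124 | 0.0093 |
| W/BL | 0.312 | 0.397 | -3.211 | 9.413 | 0.0100 |
| W/DC | 0.823 | 1.018 | -2.598 | 8.364 | 0.0306 |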

a W Body Weight, BH Body Height, BL Body Length, DC Depth of Chest

bMean of Group I: average value of Group I

cMean of Group II: average value of Group II

dt : t-value, edf : the degree of freedom value, f p-value : significance


Identification of candidate marker genes for different response patterns to exercise-induced stress

We identified 525 genes with differential expression before and after exercise. We manually curated seven genes (PTPRC, LOC102157092, RPL18, S100A8, LOC612054, LOC102151356, EIF1) that had distinctly different expression patterns before and after exercise in Groups I and II (Fig. 2). For example, PTPRC (Entrez ID : 490255) is described as ‘protein tyrosine phosphatase, receptor type, C’. Some genes had uncharacterized functions. Figure 2 shows the log2 fold changes of the seven selected genes after exercise for each of the dogs. There were clear differences in the changes in expression levels of the seven selected genes after exercise between the two groups of dogs. The seven selected genes could be used as markers of exercise stress and for grouping dogs according to body type.
Fig. 2

Representative genes that were differentially expressed between the two groups of dogs. The x-axis represents the seven selected genes, and the y-axis represents the log2 fold change (2FC) value of each gene. The 2FC values for each individual are represented by a colored bar. a shows two genes with negative 2FC values in Group I and positive 2FC values in Group II. b shows five genes with positive 2FC values in Group I and negative 2FC values in Group II. The gene identifiers are shown at the top of each histogram: PTPRC (490255, protein tyrosine phosphatase, receptor type c), LOC102157092 (102157092, complement receptor type 1-like), RPL18 (476422, ribosomal protein L18), S100A8 (490461, S100 calcium binding protein A8), LOC612054 (612054, uncharacterized LOC612054), LOC102151356 (102151356, uncharacterized LOC102151356), and EIF1 (403674, eukaryotic translation initiation factor 1)


Conclusions

This study provides the Sapsaree whole-transcriptome sequence data and identifies genes that are differentially expressed before and after exercise in the Sapsaree, which could be used as markers of exercise stress. The pattern of changes in global gene expression induced by exercise was different depending on the body type. RNA sequencing and gene expression analysis can be useful for grouping the Sapsarees by body type and response to exercise. This study provides a basis for future research investigating the biologic characteristics of the Sapsaree and the large-scale genomic differences of canines in general.

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The Gene Ontology (GO) database and informatics resource.

Authors:  M A Harris; J Clark; A Ireland; J Lomax; M Ashburner; R Foulger; K Eilbeck; S Lewis; B Marshall; C Mungall; J Richter; G M Rubin; J A Blake; C Bult; M Dolan; H Drabkin; J T Eppig; D P Hill; L Ni; M Ringwald; R Balakrishnan; J M Cherry; K R Christie; M C Costanzo; S S Dwight; S Engel; D G Fisk; J E Hirschman; E L Hong; R S Nash; A Sethuraman; C L Theesfeld; D Botstein; K Dolinski; B Feierbach; T Berardini; S Mundodi; S Y Rhee; R Apweiler; D Barrell; E Camon; E Dimmer; V Lee; R Chisholm; P Gaudet; W Kibbe; R Kishore; E M Schwarz; P Sternberg; M Gwinn; L Hannick; J Wortman; M Berriman; V Wood; N de la Cruz; P Tonellato; P Jaiswal; T Seigfried; R White
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

4.  Serum alanine aminotransferase in skeletal muscle diseases.

Authors:  Rahul A Nathwani; Shireen Pais; Telfer B Reynolds; Neil Kaplowitz
Journal:  Hepatology       Date:  2005-02       Impact factor: 17.425

5.  Serum creatine kinase levels and renal function measures in exertional muscle damage.

Authors:  Priscilla M Clarkson; Amy K Kearns; Pierre Rouzier; Richard Rubin; Paul D Thompson
Journal:  Med Sci Sports Exerc       Date:  2006-04       Impact factor: 5.411

6.  Development and evaluation of canine reference genes for accurate quantification of gene expression.

Authors:  Bas Brinkhof; Bart Spee; Jan Rothuizen; Louis C Penning
Journal:  Anal Biochem       Date:  2006-06-15       Impact factor: 3.365

7.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

8.  Creatine-kinase- and exercise-related muscle damage implications for muscle performance and recovery.

Authors:  Marianne F Baird; Scott M Graham; Julien S Baker; Gordon F Bickerstaff
Journal:  J Nutr Metab       Date:  2012-01-11

9.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

10.  Whole Genome Association Study to Detect Single Nucleotide Polymorphisms for Behavior in Sapsaree Dog (Canis familiaris).

Authors:  J H Ha; M Alam; D H Lee; J-J Kim
Journal:  Asian-Australas J Anim Sci       Date:  2015-07       Impact factor: 2.509
