Literature DB >> 34009337

Reorganization of chromatin architecture during prenatal development of porcine skeletal muscle.

Renqiang Yuan^1,2, Jiaman Zhang², Yujie Wang², Xingxing Zhu², Silu Hu², Jianhua Zeng³, Feng Liang¹, Qianzi Tang², Yaosheng Chen¹, Luxi Chen^1,4, Wei Zhu², Mingzhou Li², Delin Mo¹.

Abstract

Myofibres (primary and secondary myofibre) are the basic structure of muscle and the determinant of muscle mass. To explore the skeletal muscle developmental processes from primary myofibres to secondary myofibres in pigs, we conducted an integrative three-dimensional structure of genome and transcriptomic characterization of longissimus dorsi muscle of pig from primary myofibre formation stage [embryonic Day 35 (E35)] to secondary myofibre formation stage (E80). In the hierarchical genomic structure, we found that 11.43% of genome switched compartment A/B status, 14.53% of topologically associating domains are changed intradomain interactions (D-scores) and 2,730 genes with differential promoter-enhancer interactions and (or) enhancer activity from E35 to E80. The alterations of genome architecture were found to correlate with expression of genes that play significant roles in neuromuscular junction, embryonic morphogenesis, skeletal muscle development or metabolism, typically, NEFL, MuSK, SLN, Mef2D and GCK. Significantly, Sox6 and MATN2 play important roles in the process of primary to secondary myofibres formation and increase the regulatory potential score and genes expression in it. In brief, we reveal the genomic reorganization from E35 to E80 and construct genome-wide high-resolution interaction maps that provide a resource for studying long-range control of gene expression from E35 to E80.

Entities: Chemical Disease Gene Mutation Species

Keywords: Hi-C; PEI; TAD; pig; skeletal muscle development

Mesh：

Substances：
Chromatin

Year: 2021 PMID： 34009337 PMCID： PMC8154859 DOI： 10.1093/dnares/dsab003

Source DB: PubMed Journal: DNA Res ISSN： 1340-2838 Impact factor: 4.458

1. Introduction

Skeletal muscle is one of the most dynamic tissues that comprises ∼40% of total body weight and is able to break down energy materials (glucose, amino acid and fatty acid) to produce chemical energy for maintaining posture and movement. Skeletal muscles of agricultural animals (typically, pig, beef cattle and broiler) is a significant dietary protein sources for human consumption. For muscle development, including embryonic muscle development, postnatal growth and regeneration, its disruption is implicated in various muscle dysfunctions, including muscular atrophy and muscular dystrophy. Given the importance of skeletal muscle in many ways, a comprehensive understanding skeletal muscle development is crucial for human biomedical research related to muscle and improving meat production of livestock. Skeletal muscle is a heterogeneous and highly complex tissue that composes of fibres with different morphological, contractile and metabolic characteristics including oxidative (Types I and IIa) and glycolytic (Types IIb and IIx) fibres. The number and size of muscle fibres determined the muscle mass. Nevertheless, the total number of muscle fibre remains stable after birth, therefore, the embryonic stage is critical for muscle development. Muscle fibres development from progenitor cells in the paraxial mesoderm that is segmented into somites that lie on either side of the neural tube. Myogenesis in embryonic muscle development involved two waves of fibre formation, including primary fibre formation and secondary fibres formation. In pigs, primary fibres generate from ∼35 to 55 days of gestation, followed by the secondary myofibres forming based on the template of the primary myofibre around 50–90 days of gestation. Myogenic regulatory factors (Myf5, MyoD1, myogenin and MRF4), the basic helix–loop–helix family of transcription factors, are well-known that control the determination and differentiation of skeletal muscle cells during embryogenesis and postnatal myogenesis. For example, MyoD1 is a crucial master regulator in controlling muscle-specific gene transcription. Previous report had found three regulatory regions to regulate MyoD1 expression: proximal regulatory region, distal regulatory region and core enhancer region. Although the transcriptional regulation mechanism of MyoD1 is well established, nevertheless, it committed years and manpower for this work. Furthermore, there are a great many of other genes that need to be resolved their expression regulatory mechanisms in myogenesis. A growing body of evidences have shown that the eukaryotic genome forms a highly ordered, hierarchical structure in the nucleus that closely correlates and may even be causally linked with transcriptional machinery to allow appropriate gene expression., Deciphering the reorganization of chromatin architecture during skeletal muscle development is benefit for explain the complex molecular regulatory network that functions in myogenic differentiation. Here, we conducted an integrative three-dimensional (3D) structure of genome and transcriptomic characterization of porcine longissimus dorsi muscle (LDM) from E35 to E80 for the first time. We aimed to construct the 3D atlas of pig skeletal muscle genome, to explore dynamic changes of genomic spatial structure during development and to analyse the regulatory relationship between genomic structure and gene expression during the stage of secondary myofibres formation.

2. Materials and methods

2.1 Ethics statement

All the animals used in this study were collected according to the guidelines for the care and use of experimental animals that established by the China Council on Animal Care and the Ministry of Agriculture of China. All experiments approved by the Animal Care and Use Committee of Guangdong Province, China. Approval ID and permit numbers are SCXK (Guangdong) 2011-0029 and SYXK (Guangdong) 2011-0112.

2.2 Animals and sample collection

LDM were collected from embryonic Day 35 (E35) and 80 (E80) of purebred Luchuan pigs (a Chinese indigenous breed). E35 and E80 embryos were from three and one sows, respectively. Muscle tissue of E35 for RNA-seq are from three female embryos that from three sows while for Hi-C and chromatin immunoprecipitation (ChIP)-seq are pooled all female embryos. Muscle tissue of E80 for RNA-seq, Hi-C and ChIP-seq are from three female embryos. As the sexual characteristics of pig foetuses were not visible before Day 49 of gestation period, therefore, we using a PCR-based method to determine the gender of the embryos of E35 as previous. The primer SRYB-5: 5′-TGAACGCTTTCATTGTGTGGTC-3′ and primer SRYB-3: 5′-GCCAGTAGTCTCTGTGCCTCCT-3′ used to amplify a 163-bp region of Sry that located on the Y chromosome. The primer STS-Bo1-5: 5′-GAGCAGACTCCAACTACTCTCACTCCAC-3′ and primer STS-Bo1-3: 5′-TCAGAAGGATGATTTAGAGTGTCTGTTCAG-3′ used to amplify a 326-bp fragment of STS-Bo1 serving as an internal control.

2.3 Hi-C library preparation

In situ high-throughput chromatin conformation capture (Hi-C) was generated according previous described with some modifications. Briefly, muscle tissue was homogenized and fixed with freshly made 1% formaldehyde solution at room temperature for 30 min. Chromatin were digested with 200 U DpnII enzyme (R0543S, NEB, USA) at 37°C for 90 min, 65°C for 20 min and 25°C for 5 min. Nucleotide fill-in with 0.4 mM Biotin-14-dATP (19524-016, Invitrogen), 10 mM dCTP, 10 mM dGTP, 10 mM dTTP and 5 U/μL Klenow Fragment (M0210L, NEB) at 37°C for 45 min. Ligation was performed by a T4 DNA ligase (L6030-HC-L, Enzymatics, USA) at 20°C for 30 min. DNA was sheared to the length of 300–400 bp and washed by M280 beads at 20°C for 20 min. Hi-C libraries were amplified 10 PCR amplification cycles using mixed universal PCR primer and index primer. The resulting libraries were sequenced on BGISEQ-500 as 100 bp paired-end.

2.4 Hi-C data analysis

Hi-C reads were mapped pig genome (Sscrofa11.1, sex chromosomes are excluded) by BWA software (v.0.7.15) with default parameters. Unaligned read pairs, PCR duplicate paired reads and low-quality alignments defined as one or both reads failing to meet the threshold [Mapping Quality (MAPQ) ≥ 30] were removed by using Juicer (Supplementary Fig. S1A). The contact matrices used for further analysis were normalized by Knight–Ruiz algorithm and quantile algorithm. Normalized interaction matrices were separately generated at resolutions of 100, 20 and 5 kb.

2.5 Identification of compartments A and B

Compartments A/B were identified as previously described with minor modifications. Primarily, defined the compartments A and B at 100 kb low resolution as previous. Then constructed the observed/expected matrix at resolution of 20 kb. We defined O/E scores of bins that located in compartment A or B at low resolution as A score or B score, respectively. An A–B index was then created by subtracting the mean A and B scores. A–B indexes are positive values (more association with A) that were called as A compartments, while negative values (more association with B) were called as B compartments. For the compartment switching analysis, we considered only regions where more than two out of three biological replicates showed changes in A–B index from positive to negative or vice versa.

2.6 Compartment strength

To evaluate the compartmentalization of each chromosome of the genome, we adopted the method as previous described with minor modification. For saddle plot, we rearrange the A–B index (20 kb bin) from the lowest to the highest value for each chromosome. We then reshuffle the O/E map of the chromosome and divided the resulting map into a 50 × 50 matrix. The compartment strength was defined as (AA + BB)/2AB, where AA and BB are the mean level of interaction in regions belonging to the same compartment, while AB is mean level of interaction in regions belonging to different compartments. We chose the top 20% of strong A compartment and the top 20% of strong B compartment for measure.

2.7 Identification of topologically associating domains and specific topologically associating domain boundaries

Topological domains are called by combining directionality index (DI) score and InsulationScore (IS) at 20 kb resolution. First, we identified topological domains based on the DI score and a Hidden Markov Model as previously described, then using IS to identify and divide the large topologically associating domains (TADs) into more small TADs. The TAD boundaries were <400 kb, and unorganized chromatin regions were longer than 400 kb. For specific boundary, we merged reads of three replicates of each stage for TAD calling (E35: 4,050 TADs; E80: 4,028 TADs). Boundaries were identified as ‘specific’ if the boundary regions were called in only one stage and lacked a significant correlation in DI between the two stages compared. The correlation coefficient of DIs between stages was performed as previously described. We made a boundary as a centre and calculated spearman correlation of the DIs of ±10 bins for each centre between E35 and E80. Similarly, 20 bins randomly selected from the two stages and calculated the spearman correlation between the two vector as random correlation. The randomization war repeated 10,000 times to achieve the random distribution.

2.8 Domain score

To compute the change in interaction frequency between stages, we determined a consensus TAD as previous described, which are conserved in at least 50% samples (3,654 TADs, Supplementary Fig. S4A). We calculated an intradomain contact score., The D-score is the ratio of the number of contacts that occur between regions within the same TAD (intra-TAD contacts) over the total number of intrachromosomal contacts for a TAD (intra-TAD + inter-TAD). To identify TADs that have differential D-scores, we used two-tailed t-test to evaluate D-scores. TADs with an false discovery rate (FDR) <0.05 were selected for subsequent analysis.

2.9 Identification of promoter–enhancer interactions

To identify the promoter–enhancer interactions (PEIs), we merged replicates and called by the PSYCHIC at a resolution of 5 kb. Which use the hierarchical TAD-specific bi-linear model as background model and identify over-represented DNA–DNA interactions according to a piecewise power law regression for each TAD. Then, we filtered promoter–promoter interactions and other interactions, retaining PEIs. We reserved the high-confidence PEIs (FDR ≤ 0.0001, length of PEI ≥ 20 kb) for later analysis. The promoter distal interactions would be defined as poised enhancers (PEs), regular enhancers (REs) and super-enhancers (SEs) if they overlapped without H3K27ac peak, at least 1 bp H3K27ac regular peaks or at least 2.5 kb (50%) H3K27ac SE peaks, respectively.

2.10 Regulatory potential score

To comprehensively consider the influence of multiple enhancers on the expression of a gene, so as to accurately elucidate dynamic PEIs architecture contribute to the functional transcriptomic divergence between E35 and E80, we introduce a regulatory potential score (RPS) for each gene that an enhancer’s quantitative effect on a gene should depend on its spatial proximity. The RPS is calculated as ∑n (log10I), of which, I indicate the normalized interaction intensity (observed value − expected value). If a promoter without enhancer interaction, the RPS = 0. We divided the differential PEIs in two categories: (i) different RPS: the RPS of gene were subtract between E35 and E80, then made a Z-score for delta values to gain normally distributed data, the Z-values >|mean ± 2 s.d.| were defined as significant difference and (ii) RPS that was not significantly different but with ≥4 differential active enhancers (RE and SE).

2.11 ChIP-seq and data analysis

ChIP assays were performed by Wuhan IGENEBOOK Biotechnology Co., Ltd. Briefly, muscle tissue samples were homogenized and cross-linked with 1% formaldehyde for 10 min at room temperature and then quenched by 125 mM glycine. After samples lysed on ice, chromatins were obtained and sheared to length of 200–500 bp by sonication. Five micrograms H3K27AC antibodies (ab4729, Abcam) were used in the immunoprecipitation reactions at 4°C overnight. The next day, 30 μL of protein beads was added and further incubated for 3 h. Then the beads were washed with 20 mM Tris/HCL (pH 8.1), 50 mM NaCl, 2 mM EDTA, 1% Triton X-100, 0.1% SDS for once; 10 mM Tris/HCL (pH 8.1), 250 mM LiCl, 1 mM EDTA, 1% NP-40, 1% deoxycholic acid for twice; and TE buffer 1× (10 mM Tris-Cl at pH 7.5. 1 mM EDTA) for twice. The DNA fragments were eluted with 100 mM NaHCO3, 1% SDS. Purified DNA was used to construct libraries by performing the I NEXTFLEX® ChIP-Seq Library Prep Kit according to the manufacturer’s instructions (NOVA-5143-02, Bioo Scientific) and sequenced on Illumina Hiseq X ten as 150 bp paired-end. Raw reads were filtered by quality score and aligned to the pig genome (Sscrofa11.1). Unique aligned and deduplicated reads were used for peak calling. MACS2 was used to determine H3K27ac peak with FDR cut-off 0.05 (Supplementary Fig. S1C). To identify the SE, we used the ROSE (Ranked Order of Super Enhancers) software with default parameters. This algorithm stitches constituent enhancers together if they are within a certain distance (within 12.5 kb) and ranks the enhancers by all H3K27ac ChIP signal. It then separates SEs from REs by identifying an inflection point of H3K27ac signal versus enhancer rank.

2.12 RNA-seq and expression analysis

Muscle tissue was homogenized with the Trizol reagent, and total RNA was extracted according to the manufacture’s protocols. Purified RNA was eluted with RNase-free water and quantified by Nanodrop. Strand-specific RNA libraries were constructed with Ribo-Zero kit (RZH1046, Epicentre) and sequenced on Illumina HiSeq X Ten as 150 bp paired-end. Raw read pairs were aligned to the pig reference genome (Sscrofa11.1) using Hisat2 (v.2.0.4). Stringtie (v.1.3.3) was applied to quantify mRNA expression. Normalized read counts are reported as transcripts per million (TPM) (Supplementary Fig. S1B).

2.13 Functional enrichment analysis

For functional enrichment analysis, we mapped pig genes to their human (Homo sapiens) orthologs and choose human as the target analysis specie. The selected genes (TPM > 0.5 at least in a sample) were analysed using Metascape (http://metascape.org May 2, 2021, date last accessed) with default parameters. Enrichment analysis were performed against all genes in the genome as background set with gene ontology-biological processes as ontology sources. The most statistically significant 10 terms in each cluster were depicted using −log10 (P-value) bar plots.

2.14 Statistical analysis

All statistical analyses were performed by two-tailed Student’s t-test or Wilcoxon rank-sum test using GraphPad Prism8 and RStudio. All data were expressed as means ± s.d.

3. Result

3.1 Dynamic chromatin compartment in skeletal muscle development

The spatial organization of chromosomes is intimately linked to cell physiology. To study chromatin organization in mammalian muscle cells, we constructed in situ Hi-C libraries for porcine LDM of E35 and E80, separately (Fig. 1A). A total of six libraries generated ∼5,394.27 million (M) validly aligned contacts with a depth of ∼889.04 M per library that reach a maximum resolution of 5 kb (∼90.30% bins have at least 1,000 reads) (Supplementary Fig. S1A and D). After normalization, the contact maps of both stages showed highly reproducible (QuASAR-Rep score > 0.92) (Supplementary Fig. S1E), which consistent with RNA-seq data (Pearson’s r > 0.97) (Supplementary Fig. S1F and G). The contact possibility is declined with the genome distance increases (Supplementary Fig. S1H) and most of contacts are occur in 20 Mb (∼81.41%) (Supplementary Fig. S1I).

Figure 1

Changes of chromatinic compartments A/B. (A) Schematic overview of the study. (B) Correlation heatmaps show Pearson correlations for their intrachromosomal interaction frequency patterns (100 kb bin; chromosome 6). (C) Boxplots show the compartment strength for average interaction frequency between the same classes of compartments (AA and BB) compare to those between different classes of compartments (AB) for each euchromosome (see Materials and methods). P-values were calculated by Student’s t-test (two-tailed). (D) The proportion of the genome that changed compartments A/B status between E35 and E80. (E) The expression changes for genes that compartments were switched. P-values were calculated by Wilcoxon rank-sum test. (F and G) Function enrichment for genes (TPM > 0.5) with compartment status were switched. (H–J) Genome browser shots show the compartments status and gene expressions at locales of Chr.14: 8.5–9.5 Mb (NEFL, H), Chr.9: 36–37 Mb (SLN, I) and Chr.15: 112.5–113.5 Mb (MYL1, J). Each chromosome be decomposed into compartment regions at a resolution of 20 kb, including accessible compartment A (1,099.21 Mb, ∼48.51% of the genome, which are gene and GC-rich, actively transcribed) and densely packed compartment B (1,166.75 Mb, ∼51.49% of the genome, which are gene and GC-poor, inactively transcribed) (Supplementary Fig. S2A and B) that presented as plaid patterns in chromatin interaction (Supplementary Fig. S2C) or the derived correlation heatmaps (Fig. 1B). Compared to E80, E35 showed blurred plaid patterns (Fig. 1B and Supplementary Fig. S2C) and a less well-segregated compartment (Supplementary Fig. S2D), which as well supported by the intensive compartment strength in E80 than that in E35 (Fig. 1C and Supplementary Fig. S2E). As expected, during development, compartment statues of 4.39% genomes were switched from A to B, concomitant with decreased gene expression (n = 329); while 7.04% of compartments were switched from B to A, concomitant with increased gene expression (n = 1,284) (Fig. 1D and E). Functional enrichment showed that genes with compartments change from A to B were mainly associated with development and growth, such as the pathways of ‘embryonic limb morphogenesis’, ‘regulation of neuron differentiation’ and ‘negative regulation of cell proliferation’ (Fig. 1F). NEFL (Fig. 1H) encodes the core subunits of neurofilament, whose mutations are associated with Charcot–Marie–Tooth neuropathy, and depletion is sufficient to cause mild sensorimotor dysfunctions and spatial deficits. Genes with compartments switched from B to A were involved in metabolism relative pathways, including ‘carboxylic acid biosynthetic process’ and ‘proteolysis involved in cellular protein catabolic process’, suggest skeletal muscles have the function of energy metabolism at their foetal development (Fig. 1G). Typically, SLN (Fig. 1I), which encodes a protein sarcolipin that is necessary for muscle-based non-shivering thermogenesis by regulating the sarco/endoplasmic reticulum Ca2+-ATPase pump in skeletal muscle. In addition, SLN promotes oxidative metabolism in skeletal muscle in response to increased metabolic demand. Brown adipose tissue is the significantly important heat production tissue that also dependent on non-shivering thermogenesis via the activation of uncoupling protein 1 (UCP1) in human and mouse. Whereas brown adipose tissue and UCP1 gene both are lacked in pig., The compartment switching from B to A and increased expression of SLN in skeletal muscle may indicate that skeletal muscle-dependent non-shivering thermogenesis probably is the main way for piglets to adapt to low temperature environment. In addition, the pathways related to development were enriched, including ‘muscle structure development’ and ‘osteoblast differentiation’ (Fig. 1G). In which, MYL1 (Fig. 1J) encodes the skeletal muscle fast-twitch specific myosin, and its mutations could result in conspicuously reduced in skeletal muscle; knockdown of MYL1 in zebrafish disrupted muscle structure and impaired touch-evoked escape responses.

3.2 Dynamic intra-TAD interactions are associate with transcription and compartmentalization

TAD is the structural unit of chromosome that involved in transcription, which displays high frequencies of physical interaction within a given domain but lower frequencies outside of these domains. We called TADs (E35: ∼4,074 TADs with size of 460 kb, E80: ∼3,965 TADs with size of 440 kb) in resolution of 20 kb (Supplementary Fig. S3A). The accuracy of TAD boundaries was evidenced by the aggregate analysis of DI and IS (Supplementary Fig. S3B and C), and CTCF binding regions are high enrichment in TAD boundaries (Supplementary Fig. S3D). We found that TAD boundaries are largely invariable [Pearson’s r (DI) > 0.96, Pearson’s r (IS) > 0.94, Jaccard index > 0.98] between E35 and E80 (Supplementary Fig. S3 E–G). We only identified ∼2% differential TAD boundaries between E35 and E80 (Supplementary Fig. S3H), indicating that TAD is a conserved chromatin structure, which is consistent with previous reports., Although TAD boundaries are stable, the contacts within TADs are dynamic. D-score was used to explain interactions within a given TAD, defined by the ratio of intra-TAD contacts over all intrachromosomal contacts for a TAD., It is positively correlative (Pearson’s r > 0.99, P < 2.2 × 10−16) to A–B index and with increase of D-score the gene expression showed the tendency of increase (Supplementary Fig. S4B and C). There are 531 significantly differential D-score TADs (P ≤ 0.001, two-tailed t-test) between E35 and E80 (Fig. 2A). Of which, 483 TADs (1,417 genes) with higher D-score showed more active compartment status (higher A–B index and gene expression) in E35 than that in E80 (Fig. 2B and C). Compared to E35, E80 has 48 TADs (133 genes) with higher D-score indicating more active compartment status (Fig. 2B and C). Genes that located in TADs with higher D-score in E35 were enriched in morphogenesis and synapse-associated pathways (e.g. ‘chemical synaptic transmission’, ‘synapse organization’ and ‘embryonic skeletal system development’) (Fig. 2D). While located in TADs with higher D-score in E80, these genes were enriched in skeletal muscle development and metabolism (e.g. ‘muscle tissue development’) (Fig. 2E).

Figure 2

D-score dynamics within topological domain during development. (A) Number of significantly differential TADs (two-tailed t-test) between E35 and E80. (B) A–B indexes of significantly differential TADs between E35 and E80. P-values calculated by Wilcoxon rank-sum test. (C) The average expression changes for genes within TADs with significantly differential D-score. The number of genes in TAD are showed. P-values calculated by Wilcoxon rank-sum test. (D and E) Function enrichment for genes of significantly differential D-score.

3.3 Rewiring of PEI in skeletal muscle development

Promoter interactions are considered closely associated with gene expression regulation., To uncover an extensive genome-wide catalogue of PEIs in skeletal muscle and define how they response to muscle development, we found 48,909 and 54,350 high-confidence PEIs (FDR ≤ 0.0001) overlapping 13,548 and 12,993 genes by PSYCHIC in E35 and E80 at resolution of 5 kb (three libraries were merged and reached 99.54% of bins have at least 1,000 reads), respectively (Supplementary Fig. S5). Gene expression is positively correlated with the number of promoter interactions., We observed that highly expressed genes interacted with more enhancers (Fig. 3A), suggesting additive enhancer effects on target-gene transcription level. To explore the regulatory effects of multi-enhances on a gene, we introduced RPS, which is a measure for the sum of interactions of a gene regulated by multi-enhancers depending on spatial proximity. Indeed, the increase of gene expression is concomitant with enhanced RPS (Fig. 3B). Furthermore, the activity of enhancer is another crucially regulative factor for gene expression., H3K27ac are used to annotate the activated enhancers dividing into RE and SE, while enhancer without H3K27ac is defined to PE. As expected, RPS is positively correlated with H3K27ac enrichment (r > 0.8, Supplementary Fig. S6A and B). The promoter of gene that interact with SEs showed larger RPS and higher expression than promoters interacting with RE followed by promoters interacting with PE (Fig. 3C and Supplementary Fig. S6C and D).

Figure 3

PEIs regulate gene expression. (A) Expression of genes with different contacted enhancer numbers. Enhancer numbers were divided into four groups: number = 0, 1–2, 3–5 and >5. (B) Expression of genes with different RPS, genes with different RPS were divided into three groups: 0< RPS ≤2, 2< RPS ≤5 and RPS >5. (C) Expression of genes with different RPS with PE, RE or SE. There are 951 genes (RPS > |mean ± 2 s.d|) were identified that are rewired PEIs for responding to muscle development. Of which, 307 genes lessened interactions accompanied by gene expression decrease, whereas 644 genes gain more interactions correlate with increasing gene expression during muscle development (Fig. 4A). Because RPS does not include the effect of enhancer activity on gene expression, we examined H3K27ac enrichment on the distal enhancer that interacted with gene promoter that RPS are not differential, and identified 1,533 genes that were H3K27ac high enrichment at distal enhancers in E35, while 246 genes were H3K27ac more enrichment at distal enhancers in E80 [|active enhancer(E80) − active enhancer(E35) | ≥ 4] (Fig. 4B). The activation of enhancers that interacted with promoter is significantly positively correlated with gene expression (Fig. 4B).

Figure 4

PEIs rewiring between E35 and E80. (A) The expression changes for genes with changed RSP. (B) The expression changes for genes with stable RPS, but changed distal enhancer activation [|active enhancer(E80) − active enhancer(E35) | ≥ 4]. (C) Gene ontology enrichment for genes that changed RPS and (or) distal enhancer activation. (D–F) Genome browser view of contact frequency, PEIs, H3K27ac and transcript abundance at the MuSK (D), Sox6 (E) and MANT2 (F) locus. The RPS and TPM of represent genes are showed. Next, we merged that RPS and enhancer activity are increased genes and RPS and activity are reduced genes from E35 to E80, respectively. Function enrichment analysis showed that categories are consistent with that of enrichment in genes with changed compartment A/B and D-score. Genes with high RPS and enhancer activity in E35 were mainly enriched in pathways related to morphology and neuron differentiation, including ‘embryonic morphogenesis’, ‘tissue morphogenesis’ and ‘regulation of neuron differentiation’ (Fig. 4C). MuSK is a key gene for neuromuscular junction (NMJ) development; its knockout disrupts the formation of NMJ., It is highly expressed in primary myotubes while is dramatically down-regulated in skeletal muscle. Compared with E35, E80 presents decreased expression level for MuSK, which accompanied by reducing enhancer activation (Fig. 4D). As well as MuSK, MAP1B,,NNAT,,NCALD,ZIC1, and ZIC4, are associated with neuron development. Their promoters are interacting with more spatially closer enhancers in E35 than E80 (Supplementary Fig. S7A–D). Genes with high RPS and enhancer activity in E80 were enriched in ‘skeletal system development’, ‘extracellular structure organization’, ‘muscle structure development’ and ‘actin filament-based process’ (Fig. 4C), such as DAG1,TOMD1,MYH8,SYNM,LAMA2, that encode cellular structural proteins in muscle cells (Supplementary Fig. S8A–E); CAPN3,,PLN,,STIM1, that regulate muscle function by Ca2+ transport mechanism (Supplementary Fig. S8F–H); GCK,ACO2, that encode metabolic enzymes in tricarboxylic acid cycle (Supplementary Fig. S8I and J); IGF2 and Mef2D, that are associated with growth and development of skeletal muscle (Supplementary Fig. S8K and L). These genes are interacting with more enhancers in E80 than E35. Interestingly, of which, we observed Sox6, and MATN2 (Fig. 4E and F). They play a crucial role in the timely induction of the global switching in myogenesis, which indicated that more interacting enhancers, enhancers activity and higher gene expression on Sox6 and MATN2 in E80 than E35 correlate with in myoblasts differentiated to secondary to myofibres.

4. Discussion

Previous report had shown that the total number of myofibres is constant after birth, suggesting hyperplasia of myofibre is of prime importance for the muscle mass. Myofibres formation could be divide into two stages in pigs, including primary and secondary fibres formations. We first perform Hi-C to investigate the change of genome architecture from primary to secondary fibre formations in this study. The researches of early mammalian development had shown that chromatin compartments are gradually established., In contrast with previous reports, we found the chromatin compartment almost established before E35, although we observed the blurred plaid in E35 than E80 (Fig. 1B and Supplementary Fig. S2C), which may consistent with the more active progenitors in E35 than in E80. It was found that ∼11.43% of compartments were switched between E35 and E80 (Fig. 1D), which is comparable with C2C12 induced differential differentiation of 3 days (8%) and 8 days. Compartment changes are related to gene transcription, but not all genes whose compartment are changed their transcription levels, or vice versa. This is due to compartment is the large genomic structure that hard to fine regulation of gene expression. As well as compartment A/B, TADs are gradually established in the mammalian embryogenesis., However, after the establishment and consolidation of TAD, its stable and conserved across different cell types and species. Of course, the number and size of TAD are significantly different if performed various tools or resolutions. We identified that the number and size of TADs are comparable between E35 and E80 (Supplementary Fig. S3A). Furthermore, over 98% of TADs boundaries were stable (Supplementary Fig. S3H), consistent with the stability of TADs. Although TADs are highly stable, however, the interaction frequencies of intra-TADs are dynamic., Increase or decrease of intra-TADs interaction frequencies is associated with up-regulation or down-regulation of gene expressions, respectively, indicating there exist a precise architecture (such as PEI) for gene regulation. PEIs are the direct architecture to finely regulate gene expression. Nevertheless, it requires a large amount of data to reach an extremely high resolution. We merged our data of replicates and identified ∼50,000 PEIs with an extremely rigid cut-off (FDR ≤ 0.0001) under resolution of 5 kb (Supplementary Fig. S5A and B). In spite of the rigid criterion, over half of genes are interacting with enhancer, indicating enhancer is a universal and important element in gene regulation. Usually, the enhancer number is positively correlate with expression level, showing the enhancer additivity,, such as Sox6 interacted with more enhancers in E80 compared with that in E35 (Fig. 4E). SVEP1 and LMNA have comparable PEIs (or RPS) and their mRNA expression are not differential (Supplementary Fig. S9). However, some gene interacted with different amounts of enhancer while their expression were not changed, showing the enhancer redundancy. For example, the promoter of MyoD1 separately interacted with 10 (RPS: 12.24) and 17 (RPS: 19.06) enhancers in E35 and E80, whereas the gene expressions (mean TPM, E35: 70.17, E80: 89.83) were not significantly differential. In addition, enhancer activation is another significant factor that affect gene expression. As enhancer number, spatial proximity and enhancer activation are always co-regulate gene expression, however, there lack an algorithm that can integrate all these factors. Therefore, we ignored other factors when analyse one of them. We identified mounts of genes that may play significant role in skeletal development, that were regulated by differentially active enhancer, such as MuSK (Fig. 4D). Primary myoblasts express high levels of MyHC-I and MyHC-emb, while secondary myoblasts express low levels of MyHC-I and high levels of MyHC-neo. Nfix is highly expressed in foetal myoblasts and acts as a transcriptional switch from embryonic to foetal myogenesis. We observed that the promoter of Nfix interacting with SEs in E80 but not in E35. However, the Nfix’s mRNA expression are comparable between E35 and E80, which may result from the heterogeneity of muscle tissue (data not shown). Sox6 is another gene that required for foetal fibre type differentiation., It cooperates with Nfix directly inhibit the expression of MyHC-I in foetal muscle and activate more genes that expressed specifical in secondary myofibres (such as CKM, β-enolase). In addition, MATN2 is an extracellular matrix protein in controlling the early stages of myogenic differentiation. Nfix directly regulates MATN2 expression and MATN2 positive-feedback regulates Nfix expression. In this study, we observed that chromatin interactions and mRNA expression of Sox6 and MATN2 both are increased from E35 to E80 (Fig. 4E and F). These results indicated that MATN2 as extracellular regulatory signals and Sox6-Nfix as intracellular transcription factor complex drive the PEIs rewiring of genes in myoblasts and determine myoblasts differentiated into secondary myofibres. In summary, we revealed the changes of genome architecture from compartment A/B, TAD and PEI three scales in early muscle development and confirmed dynamic spatial reorganization is consistent with gene expression modulation. The most important is that we constructed the high-resolution interaction maps (5 kb resolution), which could provide genome-wide atlases of promoter interaction for studies on the regulation mechanism of skeletal muscle development.

Supplementary data

Supplementary data are available at DNARES online.

Funding

This work was supported by grants from the National Key R & D Program of China (2020YFA0509500), the National Natural Science Foundation of China (32072697, 31902134, 31772576 and U19A2036), the Science and Technology Projects of Zhanjiang (2019A01004), and the Sichuan Science and Technology Program (2021YFYZ0009 and 2021YFYZ0030). We appreciate High-performance Computing Platform of Sichuan Agricultural University for providing the powerful support for data analysis.

Data availability

All the sequencing raw data of Hi-C, RNA-seq and ChIP-seq have been deposited to Sequence Read Archive (SRA) under accession number: SRP305313 and the processed data files have been deposited to GEO under accession number: GSE166346.

Conflict of interest

All authors declare that they have no conflict of interest. Click here for additional data file.

83 in total

Review 1. Gene regulatory networks and transcriptional mechanisms that control myogenesis.

Authors: Margaret Buckingham; Peter W J Rigby
Journal: Dev Cell Date: 2014-02-10 Impact factor: 12.270

2. A functional screen for sonic hedgehog regulatory elements across a 1 Mb interval identifies long-range ventral forebrain enhancers.

Authors: Yongsu Jeong; Kenia El-Jaick; Erich Roessler; Maximilian Muenke; Douglas J Epstein
Journal: Development Date: 2006-01-11 Impact factor: 6.868

3. The MAP1B Binding Domain of Na_v1.6 Is Required for Stable Expression at the Axon Initial Segment.

Authors: Laura Solé; Jacy L Wagnon; Elizabeth J Akin; Miriam H Meisler; Michael M Tamkun
Journal: J Neurosci Date: 2019-03-26 Impact factor: 6.167

4. MicroRNAome of porcine pre- and postnatal development.

Authors: Mingzhou Li; Youlin Xia; Yiren Gu; Kai Zhang; Qiulei Lang; Lei Chen; Jiuqiang Guan; Zonggang Luo; Haosi Chen; Yang Li; Qinghai Li; Xiang Li; An-an Jiang; Surong Shuai; Jinyong Wang; Qi Zhu; Xiaochuan Zhou; Xiaolian Gao; Xuewei Li
Journal: PLoS One Date: 2010-07-12 Impact factor: 3.240

5. Sarcolipin is a newly identified regulator of muscle-based thermogenesis in mammals.

Authors: Naresh C Bal; Santosh K Maurya; Danesh H Sopariwala; Sanjaya K Sahoo; Subash C Gupta; Sana A Shaikh; Meghna Pant; Leslie A Rowland; Eric Bombardier; Sanjeewa A Goonasekera; A Russell Tupling; Jeffery D Molkentin; Muthu Periasamy
Journal: Nat Med Date: 2012-09-09 Impact factor: 53.440

6. A regulatory mutation in IGF2 causes a major QTL effect on muscle growth in the pig.

Authors: Anne-Sophie Van Laere; Minh Nguyen; Martin Braunschweig; Carine Nezer; Catherine Collette; Laurence Moreau; Alan L Archibald; Chris S Haley; Nadine Buys; Michael Tally; Göran Andersson; Michel Georges; Leif Andersson
Journal: Nature Date: 2003-10-23 Impact factor: 49.962

7. STIM1 signalling controls store-operated calcium entry required for development and contractile function in skeletal muscle.

Authors: Jonathan Stiber; April Hawkins; Zhu-Shan Zhang; Sunny Wang; Jarrett Burch; Victoria Graham; Cary C Ward; Malini Seth; Elizabeth Finch; Nadia Malouf; R Sanders Williams; Jerry P Eu; Paul Rosenberg
Journal: Nat Cell Biol Date: 2008-05-18 Impact factor: 28.824

8. STIM1 as a key regulator for Ca2+ homeostasis in skeletal-muscle development and function.

Authors: Santeri Kiviluoto; Jean-Paul Decuypere; Humbert De Smedt; Ludwig Missiaen; Jan B Parys; Geert Bultynck
Journal: Skelet Muscle Date: 2011-04-04 Impact factor: 4.912

9. Condensin-driven remodelling of X chromosome topology during dosage compensation.

Authors: Emily Crane; Qian Bian; Rachel Patton McCord; Bryan R Lajoie; Bayly S Wheeler; Edward J Ralston; Satoru Uzawa; Job Dekker; Barbara J Meyer
Journal: Nature Date: 2015-06-01 Impact factor: 49.962

10. Linkages between changes in the 3D organization of the genome and transcription during myotube differentiation in vitro.

Authors: Malina D Doynova; James F Markworth; David Cameron-Smith; Mark H Vickers; Justin M O'Sullivan
Journal: Skelet Muscle Date: 2017-04-05 Impact factor: 4.912

2 in total

1. Loss of Monoallelic Expression of IGF2 in the Adult Liver Via Alternative Promoter Usage and Chromatin Reorganization.

Authors: Jinsoo Ahn; Joonbum Lee; Dong-Hwan Kim; In-Sul Hwang; Mi-Ryung Park; In-Cheol Cho; Seongsoo Hwang; Kichoon Lee
Journal: Front Genet Date: 2022-07-22 Impact factor: 4.772

2. Reorganization of 3D genome architecture across wild boar and Bama pig adipose tissues.

Authors: Jiaman Zhang; Pengliang Liu; Mengnan He; Yujie Wang; Hua Kui; Long Jin; Diyan Li; Mingzhou Li
Journal: J Anim Sci Biotechnol Date: 2022-03-12

2 in total