| Literature DB >> 20659319 |
Yan Xu1, Minlu Zhang, Yanhua Wang, Pooja Kadambi, Vrushank Dave, Long J Lu, Jeffrey A Whitsett.
Abstract
BACKGROUND: Pulmonary surfactant is required for lung function at birth and throughout life. Lung lipid and surfactant homeostasis requires regulation among multi-tiered processes, coordinating the synthesis of surfactant proteins and lipids, their assembly, trafficking, and storage in type II cells of the lung. The mechanisms regulating these interrelated processes are largely unknown.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20659319 PMCID: PMC3091648 DOI: 10.1186/1471-2164-11-451
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Microarray Data Used In This Study
| Array Name | Investigator | Mouse Model | Reference |
|---|---|---|---|
| CEBPA_KO | Ikegami | Martis, et al. 2006 | |
| CNB | Dave | Dave, et al. 2006 | |
| CTNNB1_ACT | Mucenski | CatnbΔ (ex3) mice: | Mucenski, et al. 2005 |
| CTNNB1_KO | Mucenski | Mucenski, et al. 2003 | |
| Cyclopamine_ | Shannon | Lung explant culture treated with Cyclopamine for 1-3 days | |
| D508 | Whitsett | Xu, et al. 2006 | |
| FGF18_OE | Whitsett | Whitsett, et al. 2002 | |
| Fgfr2IIIb | Perl | Perl, et al. 2003 | |
| FoxA2_KO | Whitsett | Wan, et al. 2004 | |
| FoxaDKO | Whitsett | Wan, et al. 2005 | |
| FoxM1_KO | Whitsett | Wang, et al. | |
| HIF1KO | Shannon | ||
| LAL | Yan | Lian, et al. 2004 | |
| MIA | Shannon | Lin, et al. 2008 | |
| SHH12.5 | Shannon | ||
| SHH13.5 | Whitsett | Miller, et al. 2004 | |
| SHH18.5 | Whitsett | Miller, et al. 2004 | |
| SPA_KO | Levine | alveolar macrophage from | |
| SPC_2M | Glasser | Glasser, et al. 2003 | |
| SPC_PND1 | Glasser | Glasser, et al. 2008 | |
| SPC_typeII | Glasser | Isolated typeII cells from | Glasser, et al. 2003 |
| SPD_AM | Whitsett | Isolated alveolar macrophage from | Zhang, et al. 2006 |
| SPD_typeII | Ikegami | isolated typeII cell from | Korfhagen, et al. 1998 |
| Stat3_tyII | Ikegami | Type II cells from | Xu, et al. 2007 |
| SU5402 | Shannon | Lung explant culture treated with 0.1% DMSO or SU5402, E12.5 | Metzger, et al. 2007 |
| TTF1_Lung | Whitsett | DeFelice, et al. 2003 | |
| TTF1_Thyroid | Whitsett | ||
Functional Classification of Gene Clusters
| BioProcess | Clusters |
|---|---|
| Carbohydrate/organic acid metabolism | 23, 24 |
| cell adhesion | 1,3,23,24 |
| cell cycle | 14,15,16 |
| cell differentiation | 1,10 |
| cell migration/motility | 3 |
| defense response | 10,20,21,22 |
| development | 1,3,10,19,23,27,28 |
| DNA metabolism/replication | 14,15 |
| localization/transport | 1, 3, 10,28 |
| lipid metabolism | 1, 2, 23,28 |
| metabolism | 5,6,8,9,12,13,14,15,16,17,23,24,28 |
| morphogenesis | 1,23,24,27,28 |
| negative regulation of biological process | 13,23,24 |
| Regulation (Transcription/signaling) | 5,6,9,11,14,15,16,18, 19, 23, 24 |
| protein modification | 12 |
| regulation of cell size | 24 |
| RNA splicing | 7 |
| cytoskeleton organization and biogenesis | 19, 24 |
| blood vessel development | 3,19 |
Clusters listed in Additional file 1 were subject to Gene Ontology analysis http://david.abcc.ncifcrf.gov/ to determine the extent of enrichment of biological function among genes in each cluster. Clusters sharing biological functions were grouped together according to the function. Shown in the table are enriched functional classes with enrichment p-value < 0.01 and shared by more than 20% of the genes in the cluster.
Clusters Feature Comparison
| Cluster name | Gene Number | Function and Process | Mouse Phenotype | Cell Components |
|---|---|---|---|---|
| C1 | 313 | Lipid biosynthesis; Morphogenesis; Differentiation; Proliferation; Lung and respiratory tube development; Vascular development | Abnormal vasculature development; Abnormal cardiovascular physiology; Abnormal alveolar morphology; Abnormal respiratory mechanics | Insoluble fraction; Membrane fraction |
| C2 | 54 | Lipid Metabolism; Lipid Transport | Abnormal respiratory alveolar morphology; Abnormal lipid homeostasis | Endoplasmic reticulum |
| C28 | 205 | Response to external stimulus; Lipid metabolic process | Abnormal blood chemistry | Insoluble fraction; Integral to plasma membrane |
Genes from each cluster were subject to gene set enrichment analysis to identify enriched functions and processes, mouse phenotypes and cell components http://toppgene.cchmc.org/.
Figure 1Identification of over-represented TFBSs in each gene cluster. Upstream genomic sequence (3 kb) was searched for TFBS in evolutionarily conserved regions (ECR) that are over-represented in a gene cluster. Proximal promoter regions (1.2 kb) were searched for over-represented TFBS in the cluster. We also determined the over-represented TFBS frequency in the proximal promoter region for each gene in the cluster. The relative importance of a TFBS was determined by the average ranking order of ECR, prompter and frequency analysis and normalized to -2.5 to 2.5. A heatmap was generated based on the normalized relative importance of TFBSs. ND: Frequency was not determined if the TFBS was not enriched in the promoter region of the gene cluster compared to all promoters in the mouse genome used as the background set (p-value > 0.05).
TF-TG functional similarity and expression correlation (CEBPA and SREBP)
| CEBPA | SREBP | ||||||
|---|---|---|---|---|---|---|---|
| Cebpa | Kyrmizi et al. 2006 | Cebpa | Kyrmizi et al. 2006 | Srebf1 | She et al. 2005 | Srebf1 | She et al. 2005 |
| Foxf1a | Kim et al, 2005 | S100 g | Mtdh | Lipg | Reed et al. 2008 | ||
| Foxa1 | Lpcat1 | Supt16h | Wdr81 | ||||
| Ets1 | Lefterova et al. 2008 | Sftpb | Martis et al. 2006 | Id2 | 6330416G13Rik | ||
| Sox7 | Dlk1 | Shimomura et al. 1998 | Ebf1 | Abca3 | |||
| Foxa2 | Martis et al 2006 | Serpinb6b | Elf5 | Wars | Reed et al. 2008 | ||
| Wwtr1 | Timp3 | Lefterova et al. 2008 | Ankib1 | Lyzs | |||
| Elf5 | Edil3 | Fah | Serpinf1 | ||||
| Smad5 | Abca3 | Fli1 | Dhcr7 | Reed et al. 2008 | |||
| Tbx4 | 3110001I20Rik | Soat1 | Farrell, et al. 2005 | Siva1 | |||
| Fli1 | Bex2 | Ahr | Iwano et al. 2005 | Ndst1 | |||
| Ahr | Tspan11 | Cdkn2b | Cds2 | ||||
| Etv5 | Vsnl1 | Foxo3 | Bcl6b | ||||
| Id2 | Tavor et al. 2003 | Cd38 | Sox7 | Matn4 | |||
| Runx1t1 | Rochford et al 2004 | 1190002N15Rik | Mid1ip1 | Ier3 | |||
| Mef2c | Pard6b | Cbfa2t3 | Scd1 | Horton et al. 2002 | |||
| Ebf1 | Jimenez, et al. 2007 | Emp2 | Mef2c | Hck | |||
| Klf7 | Id2 | Tavor et al. 2003 | Myb | Dag1 | |||
| Prdm1 | Kit | Zfx | Bcl2a1a | ||||
| Jun | Rangatia et al. 2002 | Mme | Acsl4 | Sox7 | |||
| Utp11l | B3gnt2 | Cebpa | Pedersen et al. 2007 | Ahr | Iwano et al. 2005 | ||
| Tcfcp2l1 | Ndst1 | Dhcr7 | Rab6b | ||||
| Cbfa2t3 | Lyz1 | Lefterova et al. 2008 | Etv5 | Slc1a5 | |||
| Fos | Cammenga et al. 2003 | Lyz | Lefterova et al. 2008 | Foxa1 | Slc34a2 | ||
| Stat3 | Numata et al. 2005 | Syne2 | Foxa2 | Enpp2 | |||
| Sox2 | Tgoln1 | Rab2 | Hdc | Ai et al. 2006 | |||
| Myb | Verbeek, et al. 1999 | Klf7 | Runx1t1 | Sftpb | |||
| Srebf1 | Le et al. 2002 | Atp6v1b2 | Tbx4 | Kdr | |||
| Klf9 | Me1 | Lefterova et al. 2008 | Tcfcp2l1 | Tsn | |||
| Foxo3a | Tcfcp2l1 | Upk3b | Rtkn2 | ||||
| Cdkn2b | Rtkn2 | Foxf1a | Zdhhc14 | ||||
| Ankib1 | Krt19 | Sox2 | Dtna | ||||
| Fah | Slc34a2 | Stat3 | Lphn3 | ||||
| Mapk14 | Kumar et al. 2003 | Prdx6 | Aytl2 | Lpcat1 | |||
| Cyp4v3 | Fabp5 | Ets1 | Scd2 | Tabor et al. 1999 | |||
| Elovl1 | Ier3 | Exosc7 | Emp2 | ||||
| Qk | Lefterova et al. 2008 | Scd1 | Christy et al. 1989 | Elovl1 | Hc | ||
| Rcan1 | Cd55 | Fos | Cyp4v3 | ||||
| Exosc7 | Exosc7 | Ggcx | Mid1ip1 | ||||
| Gadd45g | Kdr | Klf9 | Lyz | ||||
Correlation: the expression profile similarities between TF and genes in the same cluster were calculated using Pearson Correlation and ranked accordingly from high to low based on the correlation coefficient. The top 40 genes with the highest correlation with Cebpa and Srebf1 are listed in Table 4.
Kappa similarity was calculated to estimate functional similarity between TF and genes based on the shared annotation terms. TF-TG functional association were ranked from high to low based on the Kappa value. The top 40 genes sharing the highest functional annotations with Cebpa and Srebf1 are listed in Table 4.
We collected the positive TF-TG relationships from Ingenuity knowledge base (Ingenuity), Transfac 11.3 (Biobase), Eldorado (Genomatix) and PubMed. References for the known TF-TG relationships are listed in the table.
Summary of TF connectivity
| TFBS | Total Connectivity | C1 | C2 | C28 | TF in Lung |
|---|---|---|---|---|---|
| CEBP | 447 | 238 | 51 | 158 | Cebpa, Cebpb, Cebpd, Cebpg |
| IRFF | 404 | 239 | 0 | 165 | Irf1, Irf2, Irf3, Irf5, Irf7 |
| HNF3 | 359 | 228 | 51 | 80 | Foxa1, Foxa2 |
| GATA | 358 | 218 | 44 | 96 | Gata6, Gata1 |
| ETSF | 344 | 172 | 14 | 158 | Ets1, Ets2, Etv5, Nfe2l2, Elf2 |
| SREB | 312 | 162 | 43 | 107 | Srebf1 |
| FOXO | 268 | 151 | 0 | 117 | Foxo1, Foxo4, Foxo3a |
| FKHD | 213 | 95 | 25 | 93 | Foxf2, Foxc1 |
| HAND | 201 | 94 | 0 | 107 | Lmo2 |
| STAT | 182 | 0 | 0 | 182 | Stat6, Stat3 |
| MEF2 | 176 | 110 | 0 | 66 | Mef2a |
| NFAT | 169 | 0 | 0 | 169 | Ilf3, Nfatc3 |
| CP2F | 168 | 83 | 0 | 85 | Atf4, Tcfcp2, Atf3, Atf1 |
| NFKB | 166 | 78 | 0 | 88 | Nfkb1 |
| EREF | 165 | 113 | 0 | 52 | Esrra |
| LEFF | 150 | 97 | 0 | 53 | Lef1 |
| HFH | 134 | 75 | 19 | 40 | Foxf1a, Foxi1 |
| PARF | 134 | 134 | 0 | 0 | Tef, Tead1 |
| AP1R | 129 | 60 | 0 | 69 | Nfe2 |
| LEFF | 121 | 87 | 34 | 0 | Tcf4 |
| CIZF | 117 | 117 | 0 | 0 | Znf384 |
| HAND | 113 | 43 | 0 | 70 | Tcf12 |
| BARBIE | 111 | 98 | 13 | 0 | Unknown |
| NKXH | 106 | 68 | 38 | 0 | Nkx2-1 |
| SORY | 104 | 93 | 11 | 0 | Sox5, Sox9 |
| NR2F | 98 | 37 | 20 | 41 | Hnf4a, Nr2f1, Nr2f2 |
| OCT | 92 | 92 | 0 | 0 | Pou2f1, Pou6f2 |
| CREB | 79 | 0 | 10 | 69 | Creb1 |
| MYOD | 74 | 0 | 0 | 74 | Myog |
| NKX7 | 62 | 39 | 0 | 23 | Nkx6-2 |
| EBOX | 59 | 0 | 21 | 38 | Tcf4, Max |
| P53F | 57 | 0 | 0 | 57 | Trp53 |
| RORA | 57 | 0 | 0 | 57 | Rora |
| HAML | 54 | 0 | 0 | 54 | Runx2, Pebp1 |
| RXRF | 54 | 0 | 0 | 54 | Nr1h2 |
| GREF | 48 | 48 | 0 | 0 | Nr3c1 |
| BRN5 | 46 | 25 | 0 | 21 | Pou6f1 |
| HESF | 46 | 0 | 20 | 26 | Hes1 |
| EGRF | 45 | 0 | 45 | 0 | Egr1, Wt1 |
| HOXH | 44 | 0 | 0 | 44 | Meis1 |
| SPIF | 39 | 0 | 39 | 0 | Klf11 |
| HNF1 | 38 | 0 | 38 | 0 | Hnf1a, Hnf1b, Hmbox1 |
| E2FF | 33 | 0 | 33 | 0 | E2f1, E2f2, E2f3, E2f4, E2f5, E2f7 |
| SMAD | 23 | 0 | 23 | 0 | Smad4 |
| ZBPF | 23 | 0 | 23 | 0 | Zfp219 |
| NKX6 | 22 | 22 | 0 | 0 | Nkx6-1 |
| LXHF | 21 | 21 | 0 | 0 | Lxh3 |
| AP2F | 19 | 0 | 19 | 0 | Tcfap2c |
| PTBP | 19 | 0 | 19 | 0 | Tbp |
| GLIF | 16 | 0 | 16 | 0 | Zic2 |
| BCDF | 15 | 0 | 15 | 0 | Crx |
| SPZ1 | 11 | 0 | 11 | 0 | Spz1 |
| PAX2 | 10 | 0 | 10 | 0 | Pax2 |
| MTF1 | 9 | 0 | 9 | 0 | Mtf1 |
| ZF5F | 8 | 0 | 8 | 0 | Zfp161 |
We calculated the confidence score based on the integrative evidence of TF-TG relationship. Using the cut off confidence score of 0.5, the overall connectivity of each TF to its potential TGs within three clusters were calculated and summarized in Table 5. The corresponding TFs expressed in lung were also listed.
Selected Genes and their potential regulators in rank order
| Gene | ELOVL1 | SLC34A2 | SOAT1 | ZDHHC3 | LPCAT1 | STARD4 | DLK1 | PRDX6 | ABCA3 |
|---|---|---|---|---|---|---|---|---|---|
| SREBP (Srebf1/2) | CEBP (Cebpa/b/g) | SREBP1 (Srebf1/2) | SREBP1 (Srebf1) | CEBP (Cebpa/b/g) | SREBP1 (Srebf1) | CEBP (Cebpa/b/g) | CEBP (Cebpa/b/g) | SREBP1 (Srebf1) | |
| 2 | CEBP (Cebpa/b/g) | ETS1 (Ets1) | ZIC2 (Zic2) | GATA (Gata6) | SREBP1 (Srebf1) | CEBP (Cebpa/b/g) | HNF3 (Foxa1/2) | SREBP (Srebf1/2) | HNF3 (Foxa1/2) |
| 3 | HNF3 (Foxa1/2) | STAT6 (Stat6) | CEBP (Cebpa/b/g) | HNF3 (Foxa1/2) | NFAT (Ilf3, Nfatc3) | ETS1 (Ets1) | KROX (Egr1) | NFKB (Nfkb1) | TTF1 (Nkx2-1) |
| 4 | TTF1 (Nkx2-1) | SREBP1 (Srebf1/2) | KROX (Egr1) | XFD1 (NP) | ETS1 (Ets1) | FOXP3 (NP) | SREBP1 (Srebf1) | ETS1 (Ets1) | EGR (Egr1) |
| 5 | TCF4 (Tcf4) | ERR1 (Esrra) | UF1H3B (Foxa1/2) | IRF1 (Irf1) | EGR (Egr1) | HNF3 (Foxa1/2) | HNF1 (Hnf1a/1b) | HNF3 (Foxa1/2) | KROX (Egr1) |
| 6 | NFKB (Nfkb1) | LMO2COM (Lmo2) | IRF1 (Irf1) | FREAC7 (NP) | STAT6 (Stat6) | ETS2 (Ets2) | TAXCREB (Creb1) | TTF1 (Nkx2-1) | E2F (E2f1-5) |
| 7 | ZIC2 (Zic2) | HEB (Tcf12) | LXR (Nr1h2) | TTF1 (Nkx2-1) | E2F (E2f1-5) | ATF1 (Atf1) | GATA1 (Gata1) | TCF4 (Tcf4) | CEBP (Cebpa/b/g) |
| 8 | SMAD4 (Smad4) | HNF3 (Foxa1/2) | LMO2COM (Lmo2) | EGR (Egr1) | GATA1 (Gata1) | HNF4A (Hnf4a) | ETS1 (Ets1) | NRF2 (Gabpa) | GATA (Gata6) |
| 9 | BARBIE (NP) | TATA (Tbp) | WT1 (Wt1) | ZF5 (Zfp161) | IRF1 (Irf1) | GATA3 (Gata6) | WT1 (Wt1) | SMAD4 (Smad4) | ZNF219 (Zfp219) |
| 10 | ETS2 (Ets2) | HNF4A (Hnf4a) | SMAD4 (Smad4) | AP2G (Tcfap2c) | AP2G (Tcfap2c) | ERR1 (Esrra) | STAT (Stat3) | GRE (Nr3c1) | WT1 (Wt1) |
Genes were selected based on their functional relevance to surfactant biosynthesis/transport. For each gene of interest, its potential TF regulators were predicted in rank order of TF-TG confidence score and TF-TG relationships were supported by promoter assay. Top 10 TFBS and their corresponding TF in lung are listed under the selected genes.
Figure 2Graphic representation of a subnetwork consisting of predicted TF-TG pairs with the highest connectivity. The graphic representation of a subnetwork consisting of predicted TF-TG pairs with confidence cutoff as 0.60 and the top 6 TFs with the highest connectivity. SREBP, HNF3, ETSF, CEBP, GATA and IRFF were identified as regulatory hubs in this network. The network has 183 nodes and 386 links. Round nodes represent TGs, red diamond nodes represent TFs. Blue edges indicate the TF-TG predictions from C1, red edges for C2, green for C28, yellow for both C1 and C2, brown for both C1 and C28, light blue for both C2 and C28, and pink edges for TF-TG predication from C1, C2, and C28. The thickness of the edge corresponds to the frequency of the TF-TG prediction from all three clusters.
Figure 3Graphic representation of a CEBPA-SREBP centered sub-network. The graphic representation of a CEBPA-SREBP centered sub-network, showing the potential connections between SREBP, CEBPA and their predicted gene targets. 3A represents top ranked common gene targets for CEBP and SREBP and 3B represents top ranked unique gene targets for CEBP or SREBP. Solid line represented literature-validated relationships and dotted lines represent predicted relationships. Known markers of lung maturation and function are highlighted in purple.
Figure 4Promoter reporter assay of predicted C/EBPA and SREBP targets in transient transfection of MLE-15 cells. Schematic representation of the ≥1 kb Slc34a2, Elovl1 and Zdhhc3 promoter-luciferase constructs made in pGL3 reporter plasmids are depicted above the graphs. C/EBPα (green) and SREBP1c (red) represent consensus motifs on each mouse gene promoter. Transcription start sites are shown at +1 bp. The dose response effects of C/EBPα and SREBP1c expression after co-transfection with fixed amounts of the promoter-reporter constructs were assessed in MLE-15 cells, an immortalized mouse lung epithelial cell line, as measured by luciferase activity in 6-well plates. Values represent two independent experiments carried out in duplicate with means ± S.D. (n = 6).
Functionality and subcellular location of selected genes
| Symbol | Description | Expression & Subcellular Location | Function | Disease |
|---|---|---|---|---|
| Elongation of very long chain fatty acid protein1 | Expressed in lung type II cells. Endoplasmic Reticulum Membrane | Tissue-specific synthesis of very long fatty acids and sphingolipids. May catalyze the conversion of beta-ketoacyl CoA to beta-hydroxyacyl CoA or Reduction of trans-2-enoyl CoA to the saturated acyl CoA derivative. | Parkinson's disease | |
| Solute carrier family 34 (sodium phosphate), Member 2 | Apical Membrane of Type II cells | Actively transporting phosphate into cells via Na+ cotransport. May have a role in the synthesis of surfactant in lungs' alveoli. | pulmonary alveolar microlithiasis, ovarian cancer | |
| Sterol O-acyltransferase 1 | Expressed in lung type II cells. Endoplasmic Reticulum Membrane | Catylyzes the formation of fatty acid-cholesterol esters. Plays a role in lipoprotein assembly and dietary cholesterol absorption. | atherosclerosis | |
| Palmitoyltransferase Zinc finger DHHC domain-containing protein 3 | Expressed in lung type II cells. Golgi apparatus | Palmitoyltransferase with broad specificity; membrane protein trafficking | ||
| Acyltransferase-like 2 Phosphonoformate immuno-associated protein 3 | Expressed in lung type II cells. Endoplasmic Reticulum and Golgi Apparatus Membrane | Mediates the conversion of 1-acyl-sn-glycero-3-phosphocholine (LPC) into phosphatidylcholine (PC). May synthesize phosphatidylcholine in pulmonary surfactant, play a pivotal role in respiratory physiology. | migraines | |
| START domain-containing protein 4 | Expressed in lung type II cells. Mitochondria. | May be involved in the intracellular transport of sterols or other lipids. May bind cholesterol or other sterols | Huntington's disease | |
| Protein delta homolog 1 | Expressed in lung type II cells. Membrane | May function in adipocyte differentiation and in neuroendocrine differentiation | lung cancer, bronchiolo-alveolar adenocarcinoma, blepharophimosis, obesity, neoplasia, hypertriglyceridemia | |
| Peroxiredoxin 6 | Expressed in lung type II cells. Cytoplasm, Lysosome, lung secretory organelles. | Involved in redox regulation of the cell. Can reduce H(2)O(2) and short chain organic, fatty acid, and phospholipid hydroperoxides. May play a role in the regulation of phospholipid turnover as well as in protection against oxidative injury | acute allergic pulmonary eosinophilia, asthma, follicular adenoma, Huntington's disease, neoplasia | |
| ATP-binding cassette, sub-family A (ABC1), member 3 | Expressed in lung type II cells. Membrane | Plays an important role in the formation of pulmonary surfactant, probably by transporting lipids such as cholesterol | surfactant metabolism dysfunction, inflation, respiratory failure, atelectasis | |
Type II cell expression information is obtained from PBGE DB. Subcellular location is based on Gene Ontology http://amigo.geneontology.org/ DB and GeneCard http://www.genecards.org/. Protein function is based on Uniprot Knowledgebase http://www.uniprot.org/uniprot/. Disease information is based on the Ingenuity knowledgebase (Ingenuity) and Genecard http://www.genecards.org/.