| Literature DB >> 16171524 |
Carlos Pérez-Plasencia1, Gregory Riggins, Guelaguetza Vázquez-Ortiz, José Moreno, Hugo Arreola, Alfredo Hidalgo, Patricia Piña-Sanchez, Mauricio Salcedo.
Abstract
BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a new technique that allows a detailed and profound quantitative and qualitative knowledge of gene expression profile, without previous knowledge of sequence of analyzed genes. We carried out a modification of SAGE methodology (microSAGE), useful for the analysis of limited quantities of tissue samples, on normal human cervical tissue obtained from a donor without histopathological lesions. Cervical epithelium is constituted mainly by cervical keratinocytes which are the targets of human papilloma virus (HPV), where persistent HPV infection of cervical epithelium is associated with an increase risk for developing cervical carcinomas (CC).Entities:
Mesh:
Substances:
Year: 2005 PMID: 16171524 PMCID: PMC1261262 DOI: 10.1186/1471-2164-6-130
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
SAGE data general statistics.
| Frequency distribution | Tags | Individual Genes |
| >200 tags | 1,348 (4.4%) | 5 (0.11%) |
| 100–200 tags | 2,371 (7.8%) | 21 (0.5%) |
| 20–100 tags | 5,536 (18.19%) | 80 (1.8%) |
| 5–19 tags | 6,349 (20.8%) | 575 (13.2%) |
| 2 – 4 tags | 13,316 (43.7%) | 3,050 (70.4%) |
| 1 tag | 1,498 (4.9%) | 609 (14.1%) |
| Total | 30,418 (100%) | 4,340 (100%) |
| No of unique tags | 8,062 | |
| Tags with SAGE database matches | 5,255 | |
| No. of different transcripts matched | 4,340 | |
| No of poorly characterized transcripts | 1,215 | |
| Genes with known function | 2,453 | |
Calculation of the frequency distributon of a given tag was based on total tags sequenced in library.
Some genes have more than one tag, hence we search for individual genes in a database created in SQL language.
Percentage of tags or genes in frequency group.
based on FatiGO Datamining website [31]
Figure 1A) Comparison of most expressed tags among different SAGE libraries. Normalized expression levels (TPM) are similar between libraries with different total sequenced tags, indicating comparable messenger abundance among top expressed genes. Expression levels were obtained from SAGEmap website . TPM: Tags per million. Normalization to compare libraries with different numbers of sequenced tags. TPM is obtained by the following formula [(Tag frequency)(1000,000)/Total No. of sequenced tags]. DN: digital northern, indicating gene expression level for a specific gene in a library. B. Graphical representation of expression levels (TPM) for three constitutive genes in several normal (N) and tumoral (T) tissues. Brain tissue libraries: SAGE BB542 whitematter (N) and SAGE Brain medulloblastoma B 98 04 P117 (T). Breast: SAGE Breast normal organoid (N) and B SAGE Breast carcinoma epithelium AP DCIS6 (T). Gastric: SAGE normal gastric body epithelial (N) and SAGE Hiroshima GC W246T (T). Liver: SAGE normal liver (N) and SAGE Liver cholangiocarcinoma B K2D (T). Kidney: SAGE Duke Kidney (N) and SAGE_Kidney_carcinoma_B_D2 (T). Colon: SAGE NC2 (N) and SAGE Tu98 (T). Prostate: SAGE PR317 normal prostate (N) SAGE PR317 prostate tumor (T). Lung: SAGE normal lung (N) and SAGE Lung adenocarcinoma MD L10 (T). Expression levels are indicated as tags per million.
Top 20 expressed genes in normal cervical tissue.
| Tag sequence | Tags | TPM | UniGene ID | Gene | Cluster name | Biological Function |
| TACCTGCAGA | 515 | 16930 | Hs.416073 | S100 calgranulin A | Regulation of cell cycle progression and differentiation. | |
| TAGGTTGTCT | 356 | 11703 | Hs.374596 | Tumor protein, translationally-controlled 1 | Unknown | |
| TTTCCTGCTC | 276 | 9073 | Hs.139322 | Small proline-rich protein 3 | Cross-linked envelope protein of keratinocytes | |
| GAGGGAGTTT | 201 | 6607 | Hs.523463 | Ribosomal protein L27a | Component of the ribosomal 60S subunit | |
| GTGACCACGG | 188 | 6180 | Hs.436980 | Glutamate receptor, N-methyl D-aspartate 2C | Ionotropic glutamate receptor | |
| GTGGCCACGG | 184 | 6049 | Hs.112405 | S100 calcium binding protein A9 (calgranulin B) | Regulation of cell cycle progression and differentiation. | |
| GGGCTGGGGT | 173 | 5687 | Hs.425125; Hs.90436 | Ribosomal protein L29; Sperm associated antigen 7 | Component of the ribosomal 60S subunit. | |
| GCATAATAGG | 168 | 5523 | Hs.381123 | Ribosomal protein L21 | Component of the ribosomal 60S subunit. | |
| TCAGATCTTT | 161 | 5292 | Hs.446628 | Ribosomal protein S4, X-linked | Component of the ribosomal 40S subunit. | |
| GTTGTGGTTA | 155 | 5095 | Hs.99785 | FLJ21245 | Unknown | |
| GGATTTGGCC | 151 | 4964 | Hs.437594 | Ribosomal protein, large P2 | Component of the ribosomal 60S subunit. | |
| TTGGGGTTTC | 143 | 4701 | Hs.448738 | Ferritin, heavy polypeptide 1 | Important for iron homeostasis, stores iron in a soluble, nontoxic, readily available form. | |
| TTGGTCCTCT | 138 | 4536 | Hs.381172 | Rribosomal protein L41 | Component of the ribosomal 60S subunit. | |
| TGCACGTTTT | 130 | 4273 | Hs.265174 | Ribosomal protein L32 | Component of the ribosomal 60S subunit. | |
| ACAAAGCATT | 128 | 4208 | Hs.369982 | Insulin-like growth factor binding protein 5 | IGF-binding proteins prolong the half-life of the IGFs and have been shown to either inhibit or stimulate the growth promoting effects of the IGF on cell culture. | |
| AGGGCTTCCA | 122 | 4010 | Hs.401929 | Ribosomal protein L10 | Component of the ribosomal 60S subunit. | |
| CCACTGCACT | 114 | 3747 | Hs.107003 | Cyclin B1 interacting protein 1 | Functions in progression of the cell cycle through G(2)/M. | |
| GGCAAGCCCC | 107 | 3517 | Hs.148340 | ribosomal protein L10a | Component of the ribosomal 60S subunit. | |
| CTGGGTTAAT | 105 | 3451 | Hs.334534 | Glucosamine (N-acetyl)-6-sulfatase | Lysosomal enzyme found in all cells. It is involved in the catabolism of heparin, heparan sulphate, and keratan sulphate. | |
| TGCACTTCAA | 103 | 3386 | Hs.62886 | SPARC-like 1 (mast9, hevin) | Calcium ion binding |
TPM: Tags per million; [(Tag frequency)(1000,000)/Total No of sequenced tags].
Biological function obtained from SOURCE, at
Figure 2Functional categories assigned to individual genes identified in normal cervical SAGE library. Genes can be assigned in different functional categories. The percentage was calculated with 3,764 initial genes from which 2,720 genes had Gene Ontology classification.
Genes belonging to 1q21 epidermal differentiation complex (EDC) expressed in cervical tissue
| TAG Sequence | TAGS | TPM | UniGene ID | Gene ID | Gene name |
| TACCTGCAGA | 515 | 16930 | Hs.416073 | S100 calcium binding protein A8 | |
| TTTCCTGCTC | 276 | 9073 | Hs.139322 | Small proline-rich protein 3 | |
| GTGGCCACGG | 184 | 6049 | Hs.112405 | S100 calcium binding protein A9 | |
| GATCAGGCCA | 18 | 591 | Hs.275243 | S100 calcium binding protein A12 | |
| GATCTCTTGG | 17 | 558 | Hs.38991 | S100 calcium binding protein A2 | |
| AGCAGATCAG | 15 | 493 | Hs.400250 | S100 calcium binding protein A10 | |
| CGTGGGACAC | 12 | 394 | Hs.110196 | Chromosome 1 open reading frame 42 | |
| CAGGCCCCAC | 12 | 394 | Hs.417004 | S100 calcium binding protein A11 | |
| ATGTGTAACG | 8 | 263 | Hs.81256 | S100 calcium binding protein A4 | |
| GAGCAGCGCC | 7 | 230 | Hs.112408 | S100 calcium binding protein A7 | |
| ATGATCCCTG | 7 | 230 | Hs.355542 | Small proline-rich protein 2A | |
| TTGTGATGTA | 7 | 230 | Hs.85844 | Tropomyosin 3 | |
| TTCCCTTACC | 6 | 197 | Hs.244349 | Late cornified envelope 3D | |
| GTCAGGGGAT | 5 | 164 | Hs. 12341 | ADAR Adenosine deaminase, RNA-specific | |
| CCCTTGAGGA | 5 | 164 | Hs.1076 | Small proline-rich protein 1B (cornifin) | |
| CCCAGATGAT | 4 | 131 | Hs.7854 | Solute carrier family 39 (zinc transporter), member 1 | |
| AACCCTAAAA | 2 | 65 | Hs.75117 | Interleukin enhancer binding factor 2, 45 kDa | |
| GCAAATTTGA | 2 | 65 | Hs.6396 | Jumping translocation breakpoint | |
| CAAGGATCTA | 2 | 65 | Hs.355906 | Chromosome 1 open reading frame 43 | |
| CAAGGATCTA | 2 | 65 | Hs.490551 | Ubiquitin associated protein 2-like | |
| AGCCACTGCA | 2 | 65 | Hs.516439 | Involucrin |
aTPM: Tags per million.
Figure 3Expression of genes clustered in 1q21, in normal cervical tissues. One hundred nanograms of total RNA purified of each sample was used in one RT-PCR reaction with gene specific primers; then one tenth of each RT-PCR reaction was subjected to agarose gel electrophoresis. MW: molecular weight marker; C1–C6 six different normal cervical samples
Oligonucleotides sequences used in this work.
| Gene | Sense (5'→3') | Antisense (5'→3') | Annealing temperature (°C) | Product size (bp) | Reference |
| HPV* | GCMCAGGGWCATAAYAATGG | CGTCCMARRGGAWACTGATC | 55 | 450 | [36] |
| S100 A8 | ATGCCGTCTACAGGGATGAC | ACGCCCATCTTTATCACCAG | 58 | 160 | This paper |
| S100 A9 | TCAGCTGGAACGCAACATAGA | TCAGCTGCTTGTCTGCATT | 56 | 205 | This Paper |
| SPRR3 | TTCCACAACCTGGAAACACA | TTCAGGGACCTTGGTGTAGC | 55 | 174 | This paper |
| NICE-3 | ACGGCTATGAAACAGCCCGCTA | GCACATTGCAACTGACTGGCTT | 57 | 330 | This paper |
| NICE-4 | ACGGAATCCAATGAGGAAGGCA | TCAGTATTGGCTGGCTCTGCAT | 57 | 294 | This paper |
| GAPDH | CATCTCTGCCCCCTCTGCTGA | GGATGACCTTGCCCACAGCCT | 60 | 205 | [38] |
Tm was calculated using primerquest program from [39]; however, it was necessary to adjust Tm in some cases.