| Literature DB >> 22747986 |
Nalini Raghavachari1, Jennifer Barb, Yanqin Yang, Poching Liu, Kimberly Woodhouse, Daniel Levy, Christopher J O'Donnell, Peter J Munson, Gregory J Kato.
Abstract
BACKGROUND: Transcriptomic studies in clinical research are essential tools for deciphering the functional elements of the genome and unraveling underlying disease mechanisms. Various technologies have been developed to deduce and quantify the transcriptome including hybridization and sequencing-based approaches. Recently, high density exon microarrays have been successfully employed for detecting differentially expressed genes and alternative splicing events for biomarker discovery and disease diagnostics. The field of transcriptomics is currently being revolutionized by high throughput DNA sequencing methodologies to map, characterize, and quantify the transcriptome.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22747986 PMCID: PMC3428653 DOI: 10.1186/1755-8794-5-28
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1 Principal component analysis and hierarchical clustering (A) RNA-seq data. Principal Component 1 (PC1, x-axis) represents 31.4% and PC2 (y-axis) represents 22.1% of total variation in the data. Hierarchical cluster of 9 samples with heatmap representing all 9 PCs in left-to-right order. (B). Exon array data. PC1 vs. PC2 on Exon array data together representing 33.2% and 21.8% of the data variability. Hierarchical clustering shows similarities to that in A, e.g. sample C4 departs strongly from remaining data, samples C3, C5 appear to be neighbors in both data sets.
Figure 2 Comparison of Gene Expression Measurements by Two Methods. Gene Microarray expression level (RMA) vs. RNA-seq expression level (log2 RPKM) for subject C3. Both axes are expressed in base 2 logarithmic scale. The dynamic range (ratio of largest observable value to apparent background value) of the RNA-seq data is clearly larger than that of Exon array data. Bivariate density contours indicate a strong but nonlinear correlation between the two measurements. The two methods yield nearly proportional results above the median expression levels (Blue line). Solid Black lines are detection limits for microarray (RMA = 4.5) and RNA-seq (log2RPKM =0). Refer Methods for the description of detection limits.
Figure 3 Coefficient of Variation (CV) versus expression level for microarray and RNA-seq. RNA-seq expression level is grouped inot 4 bins according to RNA-seq average number of reads per gene lesser than1, 1–25, 26–158, 159 or higher. CV is calculated as sample standard deviation of expression level within group (SCD and control), pooled and dvidied by mean expression level for RNA-seq (Red). For microarray (Blue), the expression values (RMA units) are first divided by ln (2) = 0.693 to convert them to a natural log scale. Then the CV is calculated as the pooled standard deviation of natural log of the expression levels.
Figure 4 Volcano plots for RNA-seq and Exon array. (A). p-value vs. log10 fold change (SCD vs. control) for RNA-seq data.(B). p-value vs. log10 fold change for Affymetrix human Exon array data. Points in the lower right and lower left hand corners of the plots represent transcript clusters that are significant and differentially expressed. ●Blue circles represent transcripts with a FC greater or equal to 4 in RNA-seq and Δ Red triangles represent FC greater or equal to 2 in Exon arrays. *Green asterisks represent transcripts with a FC greater than or equal to 4 on RNA-seq only.
Figure 5 Fold change for RNA-seq vs. fold change for Exon array (SCD vs Control). A total of 331 transcript clusters are highlighted in the figure. The 96 blue circles ● represent transcripts with a FC greater than or equal to 4 in RNA-seq and a FC greater than or equal to 2 in microarray. The 151 red triangles ▴ represent transcripts with a fold change greater than or equal to 2 on microarray only. The 84 green asterisks * represent transcripts with a fold change greater than or equal to 4 in RNA-seq only. Correlation coefficient, R = 0.64. Genes showing greater than 4 fold change in expression levels were selected as differentially expressed in RNA-seq and genes showing greater than 2 fold change in expression levels were selected as differentially expressed in microarrays.
Selected Differentially Expressed Genes Grouped by Pathways of Interest
| NM_005581 | BCAM | basal cell adhesion molecule | 12.27 | 1.73 |
| NM_138578 | BCL2L1 | BCL2-like 1 | 5.44 | 1.45 |
| NM_198892 | BMP2K | BMP2 inducible kinase | 4.09 | 1.93 |
| NM_004331 | BNIP3L | BCL2 | 4.98 | 1.37 |
| NM_014326 | DAPK2 | death-associated protein kinase 2 | 0.33 | 0.48 |
| NM_001923 | DDB1 | damage-specific DNA binding protein 1, 127 kDa | 2.82 | 2.23 |
| NM_001122665 | DDX3Y | DEAD (Asp-Glu-Ala-Asp) box polypeptide 3, Y | 1.29 | 3.00 |
| NM_017631 | DDX60 | DEAD (Asp-Glu-Ala-Asp) box polypeptide 60 | 1.51 | 2.25 |
| NM_019030 | DHX29 | DEAH (Asp-Glu-Ala-His) box polypeptide 29 | 1.29 | 2.07 |
| NM_024940 | DOCK5 | dedicator of cytokinesis 5 | 0.43 | 0.50 |
| NM_004864 | GDF15 | growth differentiation factor 15 | 4.77 | 0.99 |
| NM_002094 | GSPT1 | G1 to S phase transition 1 | 4.03 | 1.34 |
| NM_017436 | A4GALT | alpha 1,4-galactosyltransferase | 4.70 | 0.98 |
| NR_024080 | ACP1 | acid phosphatase 1, soluble | 2.79 | 2.41 |
| NM_015256 | ACSL6 | acyl-CoA synthetase long-chain family member 6 | 6.96 | 2.24 |
| NM_005622 | ACSM3 | acyl-CoA synthetase medium-chain family member 3 | 3.90 | 2.72 |
| NM_001629 | ALOX5AP | arachidonate 5-lipoxygenase-activating protein | 0.39 | 0.48 |
| NM_020980 | AQP9 | aquaporin 9 | 0.48 | 0.37 |
| NM_015313 | ARHGEF12 | Rho guanine nucleotide exchange factor (GEF) 12 | 3.42 | 2.49 |
| NM_015994 | ATP6V1D | ATPase, H + transporting, V1 subunit D | 1.54 | 2.21 |
| NM_199186 | BPGM | 2,3-bisphosphoglycerate mutase | 11.03 | 2.89 |
| NM_020850 | RANBP10 | RAN binding protein 10 | 5.70 | 2.31 |
| NM_001145657 | RAP1GAP | RAP1 GTPase activating protein | 42.05 | 2.58 |
| NM_014737 | RASSF2 | Ras association (RalGDS | 0.45 | 0.47 |
| NM_002923 | RGS2 | regulator of G-protein signaling 2, 24 kDa | 0.32 | 0.44 |
| NM_000062 | SERPING1 | serpin peptidase inhibitor, clade G (C1 inhibitor) | 3.58 | 2.03 |
| NM_001122752 | SERPINI1 | serpin peptidase inhibitor, clade I (neuroserpin), 1 | 1.47 | 3.07 |
| NM_007111 | TFDP1 | transcription factor Dp-1 | 5.69 | 2.45 |
| NM_003227 | TFR2 | transferrin receptor 2 | 5.67 | 1.42 |
| NM_173804 | TMEM86B | transmembrane protein 86B | 5.22 | 1.39 |
| NM_015049 | TRAK2 | trafficking protein, kinesin binding 2 | 5.51 | 3.24 |
| NM_022066 | UBE2O | ubiquitin-conjugating enzyme E2O | 7.13 | 2.83 |
| NM_000032 | ALAS2 | aminolevulinate, delta-, synthase 2 | 4.36 | 1.66 |
| NM_001128829 | CA1 | carbonic anhydrase I | 10.68 | 2.07 |
| NM_000067 | CA2 | carbonic anhydrase II | 5.03 | 3.45 |
| NM_016417 | GLRX5 | glutaredoxin 5 | 5.24 | 1.59 |
| NM_201397 | GPX1 | glutathione peroxidase 1 | 8.02 | 2.16 |
| NM_001145260 | NCOA4 | nuclear receptor coactivator 4 | 4.77 | 2.12 |
| NM_002501 | NFIX | nuclear factor I | 5.73 | 1.73 |
| NM_000274 | OAT | ornithine aminotransferase (gyrate atrophy) | 1.30 | 2.19 |
| NM_030758 | OSBP2 | oxysterol binding protein 2 | 11.80 | 2.57 |
| NM_032523 | OSBPL6 | oxysterol binding protein-like 6 | 4.70 | 2.45 |
| NM_005809 | PRDX2 | peroxiredoxin 2 | 4.10 | 1.12 |
| NM_006745 | SC4MOL | sterol-C4-methyl oxidase-like | 1.31 | 2.47 |
| NM_138432 | SDSL | serine dehydratase-like | 6.58 | 1.21 |
| NM_003944 | SELENBP1 | selenium binding protein 1 | 5.76 | 1.92 |
| NM_175839 | SMOX | spermine oxidase | 5.09 | 2.78 |
| NM_003794 | SNX4 | sorting nexin 4 | 1.43 | 2.29 |
| NM_003105 | SORL1 | sortilin-related receptor, L(DLR class) A repeats | 0.48 | 0.41 |
| NM_003227 | TFR2 | transferrin receptor 2 | 5.67 | 1.42 |
| NM_022648 | TNS1 | tensin 1 | 4.21 | 1.05 |
| NM_005581 | BCAM | basal cell adhesion molecule (Lutheran blood group) | 12.27 | 1.73 |
| NM_003965 | CCRL2 | chemokine (C-C motif) receptor-like 2 | 2.00 | 2.28 |
| NM_012120 | CD2AP | CD2-associated protein | 2.30 | 2.20 |
| NM_001001548 | CD36 | CD36 molecule (thrombospondin receptor) | 1.68 | 2.11 |
| NM_152243 | CDC42EP1 | CDC42 effector protein (Rho GTPase binding) 1 | 8.43 | 1.04 |
| NM_004344 | CETN2 | centrin, EF-hand protein, 2 | 2.85 | 2.46 |
| NM_001008388 | CISD2 | CDGSH iron sulfur domain 2 | 7.29 | 3.91 |
| NM_033554 | HLA-DPA1 | major histocompatibility complex, DP alpha 1 | 1.32 | 2.11 |
| NM_002125 | HLA-DRB5 | major histocompatibility complex, class II, DR beta 5 | 1.96 | 2.09 |
| NM_001130080 | IFI27 | interferon, alpha-inducible protein 27 | 51.57 | 6.64 |
| NM_006417 | IFI44 | interferon-induced protein 44 | 3.14 | 3.21 |
| NM_006820 | IFI44L | interferon-induced protein 44-like | 3.86 | 5.37 |
| NM_001548 | IFIT1 | interferon-induced protein with tetratricopeptide 1 | 3.04 | 2.62 |
| NM_000576 | IL1B | interleukin 1, beta | 0.43 | 0.41 |
| NM_004633 | IL1R2 | interleukin 1 receptor, type II | 0.17 | 0.27 |
| NM_002182 | IL1RAP | interleukin 1 receptor accessory protein | 0.29 | 0.40 |
| NM_002183 | IL3RA | interleukin 3 receptor, alpha (low affinity) | 0.52 | 0.49 |
| NM_003024 | ITSN1 | intersectin 1 (SH3 domain protein) | 6.28 | 1.87 |
| NM_003189 | TAL1 | T-cell acute lymphocytic leukemia 1 | 5.37 | 1.76 |
| NM_017772 | TBC1D22B | TBC1 domain family, member 22B | 3.78 | 2.82 |
| NM_152772 | TCP11L2 | t-complex 11 (mouse)-like 2 | 3.32 | 2.64 |
| NM_020476 | ANK1 | ankyrin 1, erythrocytic | 4.04 | 1.62 |
| NM_152326 | ANKRD9 | ankyrin repeat domain 9 | 9.23 | 1.08 |
| NM_001728 | BSG | basigin (Ok blood group) | 5.01 | 1.50 |
| NM_016633 | ERAF | erythroid associated factor | 9.64 | 3.91 |
| NM_001017922 | ERMAP | erythroblast membrane-associated protein | 3.86 | 2.05 |
| NM_001012515 | FECH | ferrochelatase (protoporphyria) | 8.10 | 2.56 |
| NM_002099 | GYPA | glycophorin A (MNS blood group) | 11.49 | 4.40 |
| NM_002100 | GYPB | glycophorin B (MNS blood group) | 7.43 | 2.11 |
| NM_198682 | GYPE | glycophorin E | 6.67 | 2.70 |
| NM_000558 | HBA1 | hemoglobin, alpha 1 | 51.86 | 5.54 |
| NM_000518 | HBB | hemoglobin, beta | 28.47 | 1.12 |
| NM_000519 | HBD | hemoglobin, delta | 44.50 | 2.88 |
| NM_005330 | HBE1 | hemoglobin, epsilon 1 | 24.48 | 2.34 |
| NM_000559 | HBG1 | hemoglobin, gamma A | 43.18 | 2.17 |
| NM_001003938 | HBM | hemoglobin, mu | 3.75 | 2.04 |
| NM_005332 | HBZ | hemoglobin, zeta | 8.72 | 2.34 |
| NM_018437 | HEMGN | hemogen | 5.83 | 2.18 |
| NM_000420 | KEL | Kell blood group, metallo-endopeptidase | 4.59 | 1.67 |
| NM_006563 | KLF1 | Kruppel-like factor 1 (erythroid) | 6.76 | 1.41 |
| NM_007021 | C10orf10 | chromosome 10 open reading frame 10 | 10.26 | 1.35 |
| NM_001009894 | C12orf29 | chromosome 12 open reading frame 29 | 1.39 | 2.71 |
| NM_014059 | C13orf15 | chromosome 13 open reading frame 15 | 2.22 | 2.03 |
| NM_025057 | C14orf45 | chromosome 14 open reading frame 45 | 4.54 | 4.02 |
| AK303128 | C17orf99 | chromosome 17 open reading frame 99 | 5.43 | 1.24 |
| BC038410 | C1orf105 | chromosome 1 open reading frame 105 | 6.00 | 1.16 |
| NM_020362 | C1orf128 | chromosome 1 open reading frame 128 | 5.23 | 2.58 |
| NM_015680 | C2orf24 | chromosome 2 open reading frame 24 | 4.06 | 1.75 |
| NM_001042521 | C2orf88 | chromosome 2 open reading frame 88 | 5.41 | 2.19 |
| NM_001002029 | C4B | complement component 4B | 4.22 | 1.37 |
| NM_000715 | C4BPA | complement component 4 binding protein, alpha | 0.04 | 0.23 |
| NM_001128424 | C4orf18 | chromosome 4 open reading frame 18 | 1.35 | 2.60 |
| NM_032412 | C5orf32 | chromosome 5 open reading frame 32 | 3.04 | 2.12 |
| NM_032385 | C5orf4 | chromosome 5 open reading frame 4 | 7.20 | 2.18 |
| NM_052831 | C6orf192 | chromosome 6 open reading frame 192 | 2.08 | 2.36 |
| NR_027330 | C7orf54 | chromosome 7 open reading frame 54 | 0.67 | 0.49 |
* Differentially expressed genes were selected if they showed greater than 4-fold change (SCD v Control) in RNAseq or greater than 2-fold change in Microarray.
Figure 6 Venn diagram showing the 331 differentially expressed genes between SCD and Healthy Controls for RNA-seq and microarray. Gene selection Criteria for RNA-Seq -FC greater than or equal to 4; Exon array -FC greater than or equal to 2.
Figure 7 Validation by QPCR - Log2 expression fold change (SCD vs. Control) measured by microarray (Red) or RNA-seq (Blue) vs. qPCR on selected genes. Closed symbols represent significant changes, open signals are not significant. The green line is the line of identity. Symbols closer to the line of identity are in better agreement with QPCR. ▴Significantly differentially expressed genes by microarray, ■Significantly differentially expressed genes by RNA-seq; Δ No significance in microarray; □No significance in RNA-seq.
Highly Significant Alternatively Spliced Genes
| | | | ||
| ATF6B* | activating transcription factor 6 beta | Cell Death/Immune Response | 0.49* | 0.17 |
| BCL6* | B-cell CLL | Apoptosis/Immune Response | 0.68* | 0.07 |
| CARM* | coactivator-associated arginine methyltransferase 1 | Cell Development/Differentiation | 0.40* | 0.12 |
| CCND* | cyclin D-type binding-protein 1 | Cell Cycle | 0.63* | 0.13 |
| COX4I* | cytochrome c oxidase subunit IV iso1 | Cellular Development/Compromise | 0.89* | 0.04 |
| DCTN* | dynactin 2 (p50) | Cellular Movement | 0.47* | 0.08 |
| HPS1* | Hermansky-Pudlak syndrome 1 | Inflammatory Response; Cell Signaling | 0.39* | 0.15 |
| INPP5* | inositol polyphosphate-5-phosphatase | Cell-Cell Signaling | 0.49* | 0.07 |
| INSIG1* | insulin induced gene 1 | Lipid metabolism, Molecular Transport | 0.52* | 0.09 |
| NUDT* | nudix type motif 4 | Cell signaling; Hematopoiesis | 0.37* | 0.16 |
| NUDT* | nudix motif 4 pseudogene | Cell signaling; Hematopoiesis | 0.37* | 0.16 |
| RHCE* | Rh blood group, CcEe antigens | Agglutination of red blood cells | 0.54* | 0.18 |
| RHD* | Rh blood group, D antigen | Agglutination of red blood cells | 0.54* | 0.18 |
| TNXA* | tenascin XA (pseudogene) | Cellular Assembly | 0.49* | 0.17 |
| TNXB* | tenascin XB | Cellular Assembly | 0.49* | 0.17 |
| UNC13* | unc-13 homolog D | Cell Signaling | 0.76* | 0.10 |
* Selected genes from RNA-Seq with P < 0.00000001.
** Selected Genes from Microarray with P < 0.00000001.
Highly Significant Novel Differentially Expressed 200 bp Regions
| chr1 | 203465800 | 203465999 | 18.69 | 0.00005 |
| chr1 | 246094400 | 246094599 | 12.84 | 0.00018 |
| chr1 | 144990000 | 144990199 | 4.96 | 0.00016 |
| chr1 | 209819000 | 209819199 | 4.93 | 0.00022 |
| chr1 | 554400 | 554599 | 0.44 | 0.00050 |
| chr1 | 91770600 | 91770799 | 0.43 | 0.00030 |
| chr1 | 157083400 | 157083599 | 0.33 | 0.00034 |
| chr1 | 159838000 | 159838199 | 0.26 | 0.00025 |
| chr2 | 177785200 | 177785399 | 9.45 | 0.00049 |
| chr2 | 128992200 | 128992399 | 5.46 | 0.00038 |
| chr2 | 91411400 | 91411599 | 3.38 | 0.00037 |
| chr2 | 175293200 | 175293399 | 0.50 | 0.00037 |
| chr2 | 207652800 | 207652999 | 0.35 | 0.00012 |
| chr3 | 76567600 | 76567799 | 0.49 | 0.00029 |
| chr4 | 154075400 | 154075599 | 10.11 | 0.00029 |
| chr4 | 38368600 | 38368799 | 9.62 | 0.00004 |
| chr4 | 154075800 | 154075999 | 7.75 | 0.00005 |
| chr4 | 146763600 | 146763799 | 4.91 | 0.00035 |
| chr4 | 146765600 | 146765799 | 4.82 | 0.00018 |
| chr4 | 111338400 | 111338599 | 4.66 | 0.00028 |
| chr4 | 146516600 | 146516799 | 0.39 | 0.00014 |
| chr4 | 26043400 | 26043599 | 0.38 | 0.00045 |
| chr5 | 176439200 | 176439399 | 5.84 | 0.00048 |
| chr5 | 138854200 | 138854399 | 5.30 | 0.00014 |
| chr5 | 138856400 | 138856599 | 3.34 | 0.00006 |
| chr5 | 177142200 | 177142399 | 3.10 | 0.00046 |
| chr5 | 43623000 | 43623199 | 0.46 | 0.00018 |
| chr5 | 99410000 | 99410199 | 0.26 | 0.00010 |
| chr6 | 53038000 | 53038199 | 10.59 | 0.00015 |
| chr6 | 151297600 | 151297799 | 5.46 | 0.00021 |
| chr6 | 28212400 | 28212599 | 2.43 | 0.00004 |
| chr7 | 55681200 | 55681399 | 3.73 | 0.00047 |
| chr7 | 5567000 | 5567199 | 0.48 | 0.00041 |
| chr7 | 139365000 | 139365199 | 0.35 | 0.00034 |
| chr8 | 41761600 | 41761799 | 8.49 | 0.00001 |
| chr8 | 41762200 | 41762399 | 6.42 | 0.00046 |
| chr8 | 130922200 | 130922399 | 0.34 | 0.00033 |
| chr9 | 35101200 | 35101399 | 22.26 | 0.00009 |
| chr9 | 35101400 | 35101599 | 18.75 | 0.00019 |
| chr9 | 5101000 | 5101199 | 3.04 | 0.00020 |
| chr9 | 79521600 | 79521799 | 0.38 | 0.00020 |
| chr9 | 79523600 | 79523799 | 0.36 | 0.00014 |
| chr10 | 13766600 | 13766799 | 12.54 | 0.00002 |
| chr10 | 91112400 | 91112599 | 11.27 | 0.00042 |
| chr10 | 75248600 | 75248799 | 0.37 | 0.00000 |
| chr10 | 81901200 | 81901399 | 0.35 | 0.00036 |
| chr11 | 5225800 | 5225999 | 24.75 | 0.00001 |
| chr11 | 94542200 | 94542399 | 2.35 | 0.00002 |
| chr11 | 117571400 | 117571599 | 0.37 | 0.00034 |
| chr11 | 6193200 | 6193399 | 0.34 | 0.00012 |
| chr12 | 111304400 | 111304599 | 9.16 | 0.00022 |
| chr12 | 105247200 | 105247399 | 7.71 | 0.00033 |
| chr12 | 88443200 | 88443399 | 3.83 | 0.00014 |
| chr12 | 62331800 | 62331999 | 0.47 | 0.00032 |
| chr13 | 74302000 | 74302199 | 7.60 | 0.00014 |
| chr13 | 18137800 | 18137999 | 3.88 | 0.00013 |
| chr14 | 65417400 | 65417599 | 6.12 | 0.00029 |
| chr14 | 65417000 | 65417199 | 5.15 | 0.00033 |
| chr14 | 52176600 | 52176799 | 0.36 | 0.00046 |
| chr15 | 72689200 | 72689399 | 11.91 | 0.00011 |
| chr15 | 72678200 | 72678399 | 6.26 | 0.00018 |
| chr15 | 72688400 | 72688599 | 6.17 | 0.00014 |
| chr15 | 72687200 | 72687399 | 5.89 | 0.00028 |
| chr15 | 79393000 | 79393199 | 0.43 | 0.00002 |
| chr17 | 39736800 | 39736999 | 9.65 | 0.00025 |
| chr17 | 39632800 | 39632999 | 3.77 | 0.00047 |
| chr17 | 35692200 | 35692399 | 0.43 | 0.00047 |
| chr18 | 14383400 | 14383599 | 3.15 | 0.00016 |
| chr19 | 51317000 | 51317199 | 7.49 | 0.00019 |
| chr19 | 56938000 | 56938199 | 0.31 | 0.00035 |
| chr20 | 55405400 | 55405599 | 13.85 | 0.00012 |
| chr20 | 30909600 | 30909799 | 0.44 | 0.00014 |
| chr20 | 1403000 | 1403199 | 0.29 | 0.00036 |
| chr21 | 14058800 | 14058999 | 4.92 | 0.00006 |
| chr22 | 29607600 | 29607799 | 12.14 | 0.00048 |
| chr22 | 35263200 | 35263399 | 6.35 | 0.00047 |
| chr22 | 43511000 | 43511199 | 2.97 | 0.00018 |
| chr22 | 20569800 | 20569999 | 0.45 | 0.00029 |
| chr22 | 22898800 | 22898999 | 0.34 | 0.00010 |
| chrM | 12600 | 12799 | 0.44 | 0.00026 |
| chrX | 55067400 | 55067599 | 5.80 | 0.00033 |
| chrX | 125434800 | 125434999 | 0.50 | 0.00005 |
| chrX | 143984400 | 143984599 | 0.41 | 0.00005 |
| chrX | 1389000 | 1389199 | 0.34 | 0.00037 |
| chrY | 3611200 | 3611399 | 8.93 | 0.00011 |
| chrY | 1389000 | 1389199 | 0.34 | 0.00037 |
Figure 8 Coverage Plot of RNA-seq data for2 gene RNA-seq reads for2 gene are shown in genomic context (chrX:55,051,744-55,074,222). An apparently novel exon dubbed 4a, between exons 4 and 5 is expressed significantly more in SCD compared with controls (p = 0.0003). This exon has been previously observed as human EST BX367133, in a clone derived from T cells. The inset shows the region bounded by exons 4 and 5 with the coverage range expanded and truncated to 20 for each track.
Figure 9 Analysis of sequence variants in the expressed hemoglobin transcript in a Healthy Control – (C1), and Homozygous (S3-HbSS) and Heterozygous (S1-HbSC) Sickle Cell Patients Observed sequences of HBB (hemoglobin B) gene in the region including the known sickle cell mutation, which causes a substituion of valine (coded by CAC) for glutamic acid (coded by CTC). The box for the reads from sample C1 - control, show the observed sequences (on the coding strand, but in reversed order) and are consistently T at the mutation position. The box for sample S3 - HbSS shows the consistent substitution of A at this same position. The box for sample S1 - HbSC show approximately 50% substitution of A for T at this position, and an additional mutation at the neighboring postion C- > T. This sample was revealed to be from a compound heterozygous hemoglobin SC patient.