| Literature DB >> 18466500 |
Jeanette N McClintick1,2, Yunlong Liu3,2, Howard J Edenberg1,2.
Abstract
To explore the mapping of factors regulating gene expression, we have carried out linkage studies using expression data from individual transcripts (from Affymetrix microarrays; Genetic Analysis Workshop 15 Problem 1) and composite data on correlated groups of transcripts. Quality measures for the arrays were used to remove outliers, and arrays with sex mismatches were also removed. Data likely to represent noise were removed by setting a minimum threshold of present calls among the non-redundant set of 190 arrays. SOLAR was used for genetic analysis, with MAS5 signal as the measure of expression. Probe sets with larger CVs generated more linkages (LOD > 2.0). While trans linkages predominated, linkages with the largest LOD scores (>4) were mostly cis. Hierarchical clustering was used to generate correlated groups of genes. We tested four composite measures of expression for the clusters. The average signal, average normalized signal, and the first principal component of the data behaved similarly; in 8/19 clusters tested, the composite measures linked to a region to which some individual probe sets within the cluster also linked. The second principal component only produced one linkage with LOD > 2. One cluster based upon chromosomal location, containing histone genes, linked to two trans regions. This work demonstrates that composite measures for genes with correlated expression can be used to identify loci that affect multiple co-expressed genes.Entities:
Year: 2007 PMID: 18466500 PMCID: PMC2367593 DOI: 10.1186/1753-6561-1-s1-s155
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Distribution of fraction present for all probe sets on the arrays.
Probe sets with LOD > 3.0
| Probe seta | LOD | Chrb | cMc | Linkage | Gene location | ENTREZ gene | UniGene ID | Gene symbol | Groupd |
| 65588_at | 9.27 | 20 | 62 | chr20q11.23 | 388796 | Hs.400876 | LOC388796 | CV40 | |
| 219759_at | 7.6 | 5 | 109 | chr5q15 | 64167 | Hs.482910 | LRAP | CV70 | |
| 320_at | 5.12 | 6 | 59 | chr6p21.1 | 5190 | Hs.567243 | PEX6 | CV40 | |
| 212509_s_at | 4.49 | 17 | 113 | chr17q25.1 | 439921 | Hs.250723 | MXRA7 | CV70 | |
| 205018_s_at | 4.49 | 21 | 20 | chr13q32.1 | 10150 | Hs.125715 | MBNL2 | CV70 | |
| 204418_x_at | 4.25 | 1 | 145 | chr1p13.3 | 2946 | Hs.279837 | GSTM2 | CV40 | |
| 205027_s_at | 4.03 | 9 | 98 | chr10p11.23 | 1326 | Hs.432453 | MAP3K8 | CV70 | |
| 203868_s_at | 3.79 | 21 | 30 | chr1p32-p31 | 7412 | Hs.109225 | VCAM1 | CV70 | |
| 204073_s_at | 3.77 | 11 | 56 | chr11q12-q13.1 | 745 | Hs.473109 | C11orf9 | CV70 | |
| 208121_s_at | 3.57 | 9 | 97 | chr12p13-p12 | 5800 | Hs.160871 | PTPRO | CV70 | |
| 211317_s_at | 3.44 | 8 | 74 | chr2q33-q34 | 8837 | Hs.390736 | CFLAR | random | |
| 204015_s_at | 3.29 | 12 | 122 | chr8p12-p11 | 1846 | Hs.417962 | DUSP4 | random | |
| 204908_s_at | 3.23 | 21 | 29 | chr19q13.1-q13.2 | 602 | Hs.31210 | BCL3 | random |
aProbe sets among the 300 selected to test selection by CV. Annotations were from Netaffx, .
bChromosome of linkage peak
cCentimorgan distance of linkage peak
dGroup indicates which CV group this probe set was in (see Methods).
Figure 2Fraction of probe sets with large LOD scores for each group of selected probe sets.
Figure 3Characterization of the clusters.
Characteristics of clusters used for linkage analysis
| Probe setsa | Average correlationb | PC1 contentc | Mean CVd | PC1 LODe | Average LODf | Highest probe set LODg | Multi-linked regionsh |
| 6 | 0.83 | 0.53 | 0.56 | 2.2 | 2.5 | 2.4 | 1 |
| 42 | 0.82 | 0.29 | 0.20 | -i | - | 0.9 | 0 |
| 10 | 0.79 | 0.42 | 0.17 | - | - | 0.8 | 0 |
| 6 | 0.77 | 0.49 | 0.44 | 2.4 | 2.4 | 3.4 | 2 |
| 19 | 0.77 | 0.32 | 0.46 | 2.6 | 2.6 | 3.6 | 4 |
| 20 | 0.76 | 0.31 | 0.37 | - | - | 1.6 | 0 |
| 6 | 0.75 | 0.47 | 0.43 | 3.0 | 3.0 | 3.4 | 1 |
| 7 | 0.75 | 0.45 | 0.41 | 1.1 | 1.4 | 1.6 | 0 |
| 10 | 0.75 | 0.39 | 0.52 | 2.3 | 2.9 | 3.2 | 1 |
| 9 | 0.74 | 0.40 | 0.39 | 0.3 | 0.8 | 0.9 | 0 |
| 6 | 0.74 | 0.46 | 0.32 | 0.4 | 0.4 | 0.7 | 0 |
| 8 | 0.74 | 0.41 | 0.47 | 2.9 | 2.9 | 4.0 | 3 |
| 6 | 0.73 | 0.46 | 0.39 | 1.8 | 2.1 | 1.9 | 0 |
| 8 | 0.73 | 0.41 | 0.29 | 1.5 | 1.6 | 1.9 | 0 |
| 6 | 0.73 | 0.45 | 0.29 | 2.1 | 2.4 | 2.4 | 0 |
| 7 | 0.73 | 0.43 | 0.37 | 1.4 | 1.5 | 1.9 | 0 |
| 16 | 0.72 | 0.31 | 0.34 | 1.6 | 1.7 | 2.4 | 1 |
| 13 | 0.72 | 0.33 | 0.28 | - | 0.8 | 1.6 | 0 |
| 10 | 0.71 | 0.36 | 0.27 | - | 1.4 | 2.9 | 1 |
aNumber of probe sets within the cluster
bThe average correlation between probe sets in the cluster
cThe amount of variation captured by the first principal component
dMean CV for all probe sets in the cluster
eLOD scores for first PC
fHighest LOD score for composite generated from average signal or average normalized signal
gHighest LOD score for an individual probe set
hThe number of chromosomal regions linked to by multiple probe sets or composite measures with at least one with LOD > 2
i- indicates no linkage