Book a Demo!
CoCalc Logo Icon
StoreFeaturesDocsShareSupportNewsAboutPoliciesSign UpSign In
Download
29547 views
1
2
3
4
5
Background
6
Pathologic classification of tumors has been
7
traditionally based on microscopic appearance. Although
8
morphology often correlates with natural history of
9
disease, tumors of a given pattern may have a broad
10
prognostic range and different responses to treatment.
11
Molecular methods, such as the evaluation of hormone
12
receptors in breast carcinoma, have been effectively
13
employed to further characterize tumors [ 1 ] . Nucleic
14
acid array-based technologies extend molecular
15
characterization by providing a biochemical snapshot, or
16
profile, of cellular activity that encompasses thousands of
17
gene products [ 2 ] . Potential applications beyond
18
diagnosis and prognosis are diverse, and include treatment
19
response stratification of patients in clinical trials,
20
assessment of relevance to human safety of drug-associated
21
tumors in animal carcinogenicity studies, and the
22
development of more pertinent animal xenograft models of
23
cancer therapy. Successful application of array-based tools
24
depends on establishing robust laboratory and computational
25
methods that effectively and reliably discriminate between
26
tumor types. Recent reports have demonstrated the power of
27
such tools to distinguish between clinically meaningful
28
subsets of cancer [ 3 4 ] .
29
Renal cell carcinoma (RCCa) represents approximately 3%
30
of all human malignancies with an incidence of 7 per
31
100,000 individuals. Of these individuals about 40% present
32
with metastatic disease and a further third will develop
33
distant metastases during the postoperative course. The
34
most effective therapy for RCCa localized to the kidney is
35
surgery and a metastatic tumor is practically incurable.
36
There is a low response to biological modifiers and the
37
treatment is generally only palliative [ 5 ] .
38
We evaluated the RNA expression profiles of renal cell
39
carcinomas (RCCa) using the Affymetrix GeneChip platform,
40
comparing mRNA expression profiles from a total of 21 human
41
tissue samples and two pooled samples. The 21 samples
42
consisted of eight normal kidneys, nine clear cell
43
carcinomas (CC), two chromophobe carcinomas (Chr), one
44
urothelial carcinoma and one metanephric adenoma.
45
Expression profiles from a pilot data set of ten samples
46
were analyzed using multiple clustering algorithms. Genes
47
were then selected from the pilot data set using a
48
fold-change criteria or from all of the normal and CC
49
samples using a p-value. Genes that were identified from
50
both the pilot and the complete data set were categorized
51
according to a hierarchical classification scheme based on
52
functional attributes of encoded proteins.
53
54
55
Results
56
57
Clustering of the pilot data set
58
The gene expression data from a pilot data set that
59
consisted of ten samples (patients 1 - 4, consisting of
60
two CC, two Chr, four normals, two pooled samples) were
61
analyzed using hierarchical clustering to identify
62
structure within the data set. Pooled samples were
63
included to determine whether a combined sample yielded
64
an expression profile representative of the individual
65
samples. To determine the relatedness of the samples,
66
clustering analysis was performed. Ten different
67
clustering algorithms using four methods of
68
pre-processing the data sets were applied to identify the
69
most consistent sample-clustering pattern. The rationale
70
behind this approach was to avoid the bias inherent in
71
any single clustering method and to determine the most
72
appropriate clustering method for this data type. Genes
73
that were considered "present" above background by the
74
Affymetrix software in at least one of the samples were
75
included in the analyses. This reduced the data set to
76
4571 genes per sample. The 40 sample clusters were
77
evaluated to see if their dendrograms fitted the expected
78
sample biology (Table 2). The most consistent cluster
79
dendrogram, present in 18 of 40 analyses, did indeed
80
match the sample biology (Figure 2A) and for 16 of these
81
analyses a logarithmic transformation was used. In the
82
consensus dendrogram, the two CC, two Chr and four normal
83
samples each clustered in separate groups. Interestingly,
84
the dendrogram suggested that the expression pattern of
85
the normal samples was more similar to the CC than to the
86
Chr samples. As expected, the pooled normal sample
87
clustered with the normal kidney samples. The pooled
88
tumor sample clustered more closely to the CC than to the
89
Chr, possibly due to the greater similarity between the
90
two CC, and consequent weighting of the pooled sample
91
toward the more uniform CC expression profile.
92
93
94
Clustering of the complete data set
95
To expand our data set we obtained a further 13 human
96
tissue samples (patients 12-20), including four normal
97
kidneys and nine kidney tumors. These were profiled in
98
the same manner as the first 10 samples. From this
99
combined data set (23 samples) we selected genes
100
classified as Present (see Methods) at least once (5372
101
genes) and then clustered the log-transformed data with
102
average linkage analysis. This method had previously
103
produced a dendrogram that matched the expected sample
104
biology in the pilot data set. The sample dendrogram
105
(Figure 2B) showed that the relatedness of the samples
106
was similar to that observed with the pilot data set. As
107
seen previously in the pilot data set the normal samples
108
clustered together off a single node. However the Chr
109
also clustered off this node and now appeared more
110
similar to normal samples than CC. The two CC included in
111
the pilot data set (patients 2 and 4) now clustered
112
within a larger CC group that include the additional CC
113
samples. From this node there also appeared to be two
114
outlier samples (patients 18 and 20). Pathology reports
115
on these samples revealed that these were not RCCa
116
samples but instead patient 20 was a papillary urothelial
117
carcinoma and the sample from patient 18 was not a
118
carcinoma but a benign metanephric adenoma.
119
There were distinct patterns visible in the gene
120
cluster that were conserved in the CC samples (Figure 2B,
121
zone C), or the Chr samples (Figure 2B, zone B), or the
122
normal kidney samples (Figure 2B, zone A). These patterns
123
indicated that each subtype of tumor expressed a common
124
set of genes that could be selected and further
125
characterized. The urothelial carcinoma and metanephric
126
adenoma appeared to share few of these genes commonly
127
regulated between the other tissue types.
128
129
130
Functional taxonomy: Genes differentially regulated
131
in CC and Chr
132
To select genes that were changed between CC and Chr
133
in our pilot data set we used an arbitrary cutoff of 2
134
fold-change units in combination with the Affymetrix
135
difference call (see Methods). Genes were selected
136
according to criteria described in Table 3, which lists
137
the number of selected genes in each of eight categories.
138
A total of 456 genes were selected by these criteria.
139
In order to understand the molecular differences
140
between Chr and CC RCCa on a broader scale, we developed
141
a gene categorization system ("Functional Taxonomy," see
142
Methods) in which genes were hierarchically grouped
143
according to the cellular function of their protein
144
products. The number of genes within each primary
145
category was tallied and plotted to generate a
146
first-level "signature" (Figure 3A). The functional
147
taxonomy signature provided a graphical method to rapidly
148
visualize broad gene expression characteristics of the
149
tumors. The cellular function categories that contained
150
the largest number of the 456 genes were Signal
151
Transduction (97 genes), Cellular and Matrix Organization
152
and Adhesion (56 genes), Metabolism (54 genes), and
153
Unclassified (65 genes). The CC and Chr samples
154
demonstrated differences in the numbers or patterns of
155
increased and decreased genes in these categories and
156
their subcategories. Almost twice as many genes in Signal
157
Transduction were increased in CC compared to Chr; a
158
similar number were decreased in both types. Within
159
Cellular and Matrix Organization and Adhesion, the
160
expression of nearly four times as many genes was
161
increased in CC compared to Chr, which decreased
162
expression of about three times as many genes as did CC.
163
Subcategorization (Figure 3C) revealed a predominance of
164
genes coding for extracellular matrix proteins (16 genes)
165
and cellular adhesion molecules (10 genes) that were
166
increased in expression in CC.
167
168
169
Functional taxonomy: Genes differentially regulated
170
in CC
171
To determine the difference between normal kidney
172
samples and CC samples we used a rigorous statistical
173
approach based upon a calculated p-value (see methods)
174
and the combined data set consisting of eight normal
175
kidney samples and nine CC samples. We identified 142
176
genes that were significantly upregulated in CC compared
177
to normal kidneys and 213 that were significantly down
178
regulated. To determine the biological
179
significance/function of these genes we used Functional
180
Taxonomy to classify them into 16 cellular groupings
181
(Figure 4A). The first level signature showed similar
182
trends to the CC and Chr comparison (Figure 3A). The four
183
categories that contained the most genes were Signal
184
Transduction (78 genes), Cellular and Matrix Organization
185
and Adhesion (47 genes), Metabolism (40 genes), and
186
Unclassified (51 genes). These were the same categories
187
that showed the most gene changes in the pilot data set.
188
Subcategorization of both categories showed trends within
189
CC samples that were similar to those observed in the
190
pilot data set. Within the Signal Transduction category
191
there were many genes associated with ligands, receptors
192
and cytosolic factors (Figure 4Band Table 4). Gene
193
expression changes within the Cellular Matrix
194
Organization and Adhesion category were focused on
195
extracellular matrix genes and cellular adhesion
196
molecules (Figure 4Cand Table 5).
197
198
199
Immunohistochemistry
200
The selected gene lists were reviewed for genes that
201
corresponded to proteins that could be evaluated with
202
immunohistochemistry. The data sets revealed that mRNA
203
transcripts for CD 31 (PECAM) and the T-cell receptor
204
beta chain were increased in CC but not Chr. Since
205
antibodies to CD 31 and CD 3 (which forms a complex with
206
the T cell receptor) are reactive in fixed tissues, we
207
used them to stain the tumors. CD 31 was present in CC in
208
a prominent dense branching network of fine vessels
209
surrounding the tumor cells (Figure 1E). In contrast, Chr
210
had few CD 31-positive vessels present (Figure 1F). CD 3
211
stained numerous T lymphocytes scattered throughout the
212
CC tumors (Figure 1G). These were not initially apparent
213
in the hematoxylin and eosin sections, probably due to
214
the similarity in size and appearance of the tumor cell
215
nuclei and the lymphocytes. In contrast, there were
216
almost no T cells in the Chr tumors (Figure 1H). These
217
results indicated concordance between the mRNA expression
218
profiles and the pathobiology of the RCCa tumor
219
sub-types.
220
221
222
223
Discussion
224
Expression profiling of kidney tumors using the
225
Affymetrix GeneChip distinctly separated four different
226
tumors from each other, as well as from normal kidney
227
cortex. This finding is consistent with the morphologic,
228
karyotypic and clinical outcome differences between these
229
tumor types [ 6 7 ] . There are many sample-clustering
230
methods that may be applied to expression microarray data,
231
none of which can be conclusively called "correct", since
232
each algorithm makes different assumptions regarding the
233
nature of the data. We used ten clustering methods combined
234
with four ways of pre-processing the data sets to
235
eliminate, or at least reduce bias in a pilot data set. The
236
smaller pilot data set was used to simplify the
237
interpretation of the results. A common cluster dendrogram
238
was produced by 18 of 40 methods; 16 of these were from the
239
20 that employed logarithmic transformation of the data
240
sets. The pattern was consistent with the biology of the
241
sample with normal kidney, CC and Chr samples each grouping
242
together (Figure 2A). That a logarithmic transformation
243
gave the most meaningful cluster dendrograms is consistent
244
with the distribution of the untransformed expression data
245
being skewed to the left because the majority of genes have
246
low expression levels. Standardization of the data assigns
247
equal weight to each gene and, hence, increases the
248
contribution of unreliable low expression genes. The use of
249
logarithmic transformation, on the other hand, improves the
250
spread of the data so the distribution is close to normal.
251
It also re-adjusts the weight for each gene. For example,
252
genes with high expression levels, which might be
253
unreliable or biased due to saturation, will have lower
254
weights in distance calculation. Therefore, the logarithmic
255
transformation improves the calculation of distance for the
256
subsequential clustering algorithms and leads to uncovering
257
the biological meaningful pattern within the data.
258
A comparison of the dendrograms from the pilot data set
259
and complete data set reveals some surprising changes. In
260
general the major structure of the dendrogram remained the
261
same, CC, Chr and normal kidney all grouped separately.
262
However, in the pilot data set the CC were more similar to
263
normal kidney than Chr, while in the complete data set Chr
264
were more similar to normal kidney. It is unclear why this
265
larger data set changed the dendrogram and suggests that
266
the subtle structure in the dendrogram was not as robust as
267
it appeared. With fewer Chr compared to CC it is not
268
possible to draw any strong conclusions about relatedness
269
of the Chr samples.
270
In order to visualize the functional patterns associated
271
with a particular set of selected genes we used a simple,
272
semi-hierarchical system to categorize genes according to
273
the function of the proteins they encode, that we call
274
Functional Taxonomy. There are challenges associated with
275
the partly subjective nature of categorization of gene
276
function, such as where to place a single gene product that
277
is involved in several cellular tasks. Ideally, the
278
categorization should consider multiple attributes of a
279
protein. To this end, we propose three complementary
280
classification schemes: (1) biochemical function, which
281
categorizes according to molecular activity; (2) cellular
282
function, which categorizes according to biological role at
283
a cellular level; (3) tissue function, which categorizes
284
according to anatomic or organ system location. In this
285
paper we have visualized profiling results using the second
286
of these schemes (cellular function) at three levels:
287
primary categories, secondary categories, and individual
288
genes (see Figure 3and 4, Table 4and 5). We have found
289
Functional Taxonomy to be a useful visualization tool for
290
understanding the differences in gene expression patterns
291
between CC and Chr tumors. This system is similar in
292
concept to what is currently being developed by the Gene
293
Ontology Consortium [ 10 ] .
294
The cellular function signatures of nine CC and two Chr
295
revealed that the greatest number of gene expression
296
changes for both tumor types occurred in the categories of
297
Signal Transduction, Cellular and Matrix Organization and
298
Adhesion, and Metabolism. This is consistent with current
299
theories of neoplasia, which hypothesize that tumor cells
300
modify their signaling pathways, establish new contacts
301
with an altered extracellular matrix, and refashion their
302
metabolic machinery.
303
There exists considerable literature on the expressed
304
genes and gene products associated with RCCa. Using the
305
selected gene sets from Table 3and the p-values and
306
fold-change values calculated from the eight normal kidneys
307
and nine CC, we looked for concordance between our results
308
and published reports. The genes
309
CA 9 (
310
carbonic anhydrase IX ),
311
CCND1 (
312
cyclin D1 ),
313
CDH2 (
314
N-Cadherin ),
315
EGFR (
316
epidermal growth factor receptor )
317
and
318
TGFA (
319
tranforming growth factor alpha ) all
320
showed increases in CC expression that matched the
321
literature and had p-values ≤ 0.0061 [ 11 12 13 14 15 ] .
322
The observed decrease in
323
CDH1 (
324
E-cadherin ) in CC (p-value = 0.0045)
325
also matched previously published reports, as did the
326
decrease in
327
VIM (
328
vimentin ) expression in Chr RCCa [
329
13 16 ] .
330
VIM was also found to be increased in
331
CC with a p-value = 0.0045, which was consistent with the
332
literature. We detected a small increase in expression of
333
ICAM1 (
334
intercellular adhesion molecule 1 )
335
in CC (Fold-change = 1.9, p-value = 0.0081), which was also
336
consistent with the literature [ 17 ] .
337
The expression results for the genes
338
JUN (
339
c-jun ) and
340
VHL (
341
von Hippel-Lindau ) did not match the
342
literature [ 18 19 ] . Nor did the result for
343
KRT7 (
344
cytokeratin 7 ), which has been shown
345
to be overexpressed in Chr [ 20 ] . Instead we found
346
KRT7 to be strongly repressed in CC
347
(fold-change = -5.1, p-value = 0.0009). Yet, expression
348
profiling using nucleic acid microarrays does not
349
necessarily correlate with other forms of analysis for all
350
genes [ 21 22 ] . This may be especially true when altered
351
expression of a gene is reported to be present in a subset
352
of a population of tumors, since a small sample number (as
353
are the Chr samples in this study) may not include the
354
alteration.
355
In the case of CD 31 and the T cell receptor beta chain,
356
expression profiling results were concordant with
357
immunohistochemical analysis of the tumors. The prevalence
358
of the scattered T cells within the CC tumors was somewhat
359
surprising, but entirely consistent with the biology of
360
response to a treatment for RCCa, interleukin 2 (IL-2),
361
since IL-2 activates lymphocytes against the tumor [ 23 ]
362
.
363
During preparation of this manuscript, an expression
364
profiling study of seven renal neoplasms (four CC, 2
365
oncocytomas, and one Chr) was reported [ 24 ] . This study
366
employed a different platform (Incyte glass slide cDNA
367
microarray) and hybridization method (competitive
368
tumor/normal binding), and used related but not identical
369
gene selection criteria (two fold-change in expression
370
versus normal kidney in at least two of the seven tumors).
371
The study identified 189 genes that were differentially
372
expressed in at least two tumors, and this gene set was
373
also able to distinguish between CC and Chr tumor types. We
374
suspect that a greater number of Chr-associated genes would
375
have been selected in their study had there been at least
376
two Chr samples, since a gene altered in expression only in
377
the single Chr, but not in any of the oncocytomas or the
378
CC, would not have been identified by the selection
379
criteria.
380
381
382
Conclusion
383
The results of the present study demonstrate the power
384
of Affymetrix GeneChip expression profiling to
385
differentiate between morphologically distinct tissues that
386
are descended from a common organ. In addition, they
387
demonstrate the value of functional cataloging selected
388
genes and visualizing the result in a graphical format.
389
390
391
Methods
392
393
Sample isolation, histology and
394
immunohistochemistry
395
Renal cell carcinoma (RCCa) samples were collected
396
from patients undergoing radical nephrectomy at the
397
University of Michigan Medical Center. All samples and
398
associated clinical data were obtained with Institutional
399
Review Board approval. A total of 21 tissue samples
400
(eight normal kidneys and 13 tumors) were obtained from
401
13 patients. Patients ranged in age from 40 to 83 years
402
with four male patients and nine female patients. Nine
403
patients were diagnosed with clear cell carcinomas (CC),
404
two with chromophobe cell carcinomas (Chr), one with a
405
papillary urothelial carcinoma, and one with a
406
metanephric adenoma. For eight of the patients the
407
resection specimens included unaffected kidney (termed
408
"normal kidney") as well as tumor tissue. Tissue samples
409
were fixed in formalin, or were immediately frozen at
410
-80°C in optimal cutting temperature embedding medium
411
(OCT, Sakura). Cryostat sections were prepared to confirm
412
suitability for profiling (Figure 1Aand 1B); only
413
viable-appearing tissues were processed. Care was taken
414
to ensure that normal tissue was not contaminated with
415
tumor tissue and vice versa. Normal kidney samples were
416
uniformly taken from renal cortex. Paraffin-embedded
417
tissues were stained with hematoxylin and eosin for
418
diagnostic evaluation. Representative morphologies are
419
shown in Figure 1Cand 1D. Immunohistochemical stains were
420
performed on fixed tissues. Sections were deparaffinized,
421
rehydrated and treated with 3% hydrogen peroxide. Stains
422
were performed using an automated avidin-biotin complex
423
method according to the manufacturer's protocol (Nexes
424
IHC Staining System, Ventana Medical Systems), with
425
details as indicated in Table 1.
426
427
428
RNA isolation
429
Total RNA was prepared from each sample (eight normal
430
kidneys, 13 tumors). In addition, two pooled samples were
431
made by mixing equal quantities of RNA from the kidney
432
samples of patients 1 - 4 together, and by mixing equal
433
quantities of RNA from the four RCCa samples also from
434
patients 1 - 4. The frozen tissues were warmed briefly,
435
allowing the OCT compound to soften slightly so that it
436
could be rapidly dissected away without the tissues
437
thawing. The tissues were then pulverized using a frozen
438
steel block and hammer. RNA was extracted using Trizol
439
reagent (Life Technologies) according to the
440
manufacturer's protocol.
441
442
443
RNA labeling and GeneChip hybridization
444
Biotinylated target RNA was prepared from 15 μg of
445
total RNA using the Affymetrix protocol. Labeled cRNA was
446
hybridized on the HuGeneFL Affymetrix GeneChip
447
®containing probes for approximately 5600 mRNAs
448
corresponding to genes of known sequence. Each
449
hybridization included control RNA transcripts. The
450
hybridization reactions were processed and scanned
451
according to standard Affymetrix protocols.
452
453
454
Data Analysis
455
The Affymetrix Microarray Suite and Data Mining Tool
456
were used to calculate average difference (gene
457
expression) values, fold-change values and difference
458
calls from the GeneChip fluorescent intensity data. All
459
GeneChips were normalized to avoid differences in
460
fluorescent intensity as described in the
461
Affymetrix GeneChip Expression Analysis
462
Manual (version 3.1) . The average intensity of each
463
gene chip was scaled to 600, where average intensity was
464
calculated by averaging all the gene expression values of
465
every probe set on the array excluding the highest and
466
lowest 2% of the values. If the scaling factor did not
467
fall between 0.3 and 2, the GeneChip data was not used.
468
Fold-change values were calculated as the ratio of gene
469
expression values between each tumor-normal sample pair
470
from the same patient, with the normal sample values used
471
as the baseline. The difference call was calculated using
472
four variables: the number of probes pairs that have
473
changed in a certain direction, the ratio of increased
474
probe pairs over decreased probe pairs, the log average
475
ratio change, and the difference in the number of probe
476
pairs changed (either positive or negative). The default
477
values (explained in the
478
Affymetrix Manual ) were used in
479
the difference call decision matrix. The resulting data
480
sets were manipulated and filtered in Microsoft Access
481
according to various criteria (see results) for selection
482
of genes that showed altered expression in the
483
tumors.
484
For the pilot data set, sample clusters were prepared
485
using the software program SAS (SAS Institute, Cary, NC)
486
on the gene expression values. The following clustering
487
algorithms were used: seven hierarchical clustering
488
methods (single linkage, average linkage, complete
489
linkage, Ward's minimum variance, density linkage,
490
centroid, and flexible-beta), a k-mean method, and an
491
oblique principle component method employing either (1)
492
Pearson correlation or (2) Spearman correlation based on
493
rank. Negative values were set equal to one. Data sets
494
were processed as follows for the clustering analysis:
495
unchanged (raw data), log
496
10 transformation, standardization to
497
N(0,1), and a combination of both log
498
10 transformation and standardization
499
to N(0,1). This yielded a total of 40 sample cluster
500
dendrograms. For the complete data set (21 samples plus
501
two pooled samples) gene and sample clusters were
502
prepared using Cluster and visualized with TreeView [ 25
503
] , using log
504
10 -transformed and median-centered
505
average difference values.
506
To select genes for further analysis from the pilot
507
data set, a two fold-change threshold was used in
508
combination with the Affymetrix difference call of either
509
increase (I), marginal increase (MI), decrease (D) and
510
marginal decrease (MD). For genes selected as either
511
increased or decreased in a particular RCCa subtype both
512
samples had to show concordance in their fold-change and
513
difference call, and both samples from the other subtype
514
had to have a difference call of no change (NC) or a call
515
that was opposite to that of the first subtype.
516
To select genes that were significantly changed in CC
517
(nine samples) verses normal kidney (eight samples) a
518
non-parametric Wilcoxon test was used to calculate a
519
p-value for each gene. Genes were then selected that
520
conformed to all of the following criteria: p-value ≤
521
0.001; a mean average difference value ≥ 200; fold-change
522
≥ 1.1 for genes increased in CC relative to normal kidney
523
or fold-change ≤ -1.1 for genes decreased in CC relative
524
to normal kidney.
525
All selected genes were then annotated using online
526
resources, such as Medline, OMIM and GeneCards. The
527
information was used to build a database that included
528
HUGO (Human Genome Organization) designated names,
529
function and expression information, and any tumor
530
associations.
531
Given the absence of available tools for visualization
532
of tumor gene expression patterns on a broad scale, we
533
devised a classification system called "Functional
534
Taxonomy" that categorizes genes according to various
535
functional attributes of the proteins they encode. One
536
such functional attribute is a protein's biological role
537
at a cellular level. Using a mammalian modification of
538
the MIPS
539
Saccharomyces cerevisiae functional
540
categories [ 26 ] , (Munich Information Center for
541
Protein Sequences, www.mips.biochem.mpg.de) the genes
542
were placed into a hierarchical classification scheme
543
containing 16 primary categories of cellular function.
544
The number of genes that fell within each primary
545
category was counted and plotted graphically to generate
546
a first-level "signature" (see Figure 3Aand 4A). Each of
547
the primary categories was further divided into
548
subcategories for a more detailed second-level
549
visualization of the data (see Figures 3B, 3C, 4Band
550
4C).
551
552
553
554
Authors' Contributions
555
Author 1, MAG, carried out sample preparation, RNA
556
isolation, data analysis, functional taxonomy analysis and
557
drafted the manuscript. Author 2, TC, carried out the
558
immunohistochemistry. Author 3, MZM, performed the
559
clustering analysis. Author 4, SJM, directed the team who
560
carried out the Affymetrix GeneChip hybridizations and
561
initial data processing. Author 4 MAR provided the samples
562
and reviewed the manuscript. Author 6 EPK conceived the
563
study and participated it its design and assisted in
564
writing the manuscript.
565
566
567
568
569