Data driven derivation of cutoffs from a pool of 3,030 Affymetrix arrays to stratify distinct clinical types of breast cancer

T Karn, D Metzler, E Ruckhäberle, L Hanker, R Gätje, C Solbach, A Ahr, M Schmidt
Breast Cancer Research and Treatment, 2010, Springer
Abstract
Pooling microarray datasets appears to be a reasonable way to increase sample size when studying a heterogeneous disease such as breast cancer. Several methods for adapting datasets to one another have been used in the literature. We analyzed the influence of these strategies using a pool of 3,030 Affymetrix U133A microarrays from breast cancer samples. We present data on the resulting concordance with biochemical assays of well-known parameters and highlight critical pitfalls. We further propose a method for inferring cutoff values directly from the data without prior knowledge of the true result. The cutoffs derived by this method displayed high specificity and sensitivity. Markers with a bimodal distribution, such as ER, PgR, and HER2, discriminate different biological subtypes of disease with distinct clinical courses. In contrast, markers with a continuous distribution, such as the proliferation marker Ki67, instead describe the composition of the mixture of cells in the tumor.
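The abstract does not say how the cutoffs are inferred, so the following is only an illustrative sketch of one common way to derive a cutoff from a bimodal marker without knowing the true status: fit a two-component Gaussian mixture by expectation-maximization and take the point between the two modes where both components are equally likely. The mixture model, the function names (`fit_two_gaussians`, `cutoff_between_modes`, `demo`), and the simulated log-scale expression values are all assumptions made for illustration; none of them come from the paper, and the simulated numbers are not results.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def fit_two_gaussians(values, n_iter=200):
    """Fit a two-component 1-D Gaussian mixture with plain EM (assumed model)."""
    xs = sorted(values)
    n = len(xs)
    # Initialise the two means from the lower and upper halves of the data.
    lower, upper = xs[: n // 2], xs[n // 2:]
    means = [sum(lower) / len(lower), sum(upper) / len(upper)]
    overall = sum(xs) / n
    sd0 = math.sqrt(sum((x - overall) ** 2 for x in xs) / n)
    sds = [sd0, sd0]
    weights = [0.5, 0.5]
    for _ in range(n_iter):
        # E-step: probability that each value belongs to component 0.
        resp = []
        for x in xs:
            p0 = weights[0] * normal_pdf(x, means[0], sds[0])
            p1 = weights[1] * normal_pdf(x, means[1], sds[1])
            total = p0 + p1
            resp.append(p0 / total if total > 0 else 0.5)
        # M-step: update weight, mean, and standard deviation per component.
        for k in (0, 1):
            r = resp if k == 0 else [1.0 - v for v in resp]
            rk = max(sum(r), 1e-12)
            means[k] = sum(ri * x for ri, x in zip(r, xs)) / rk
            var = sum(ri * (x - means[k]) ** 2 for ri, x in zip(r, xs)) / rk
            sds[k] = max(math.sqrt(var), 1e-6)
            weights[k] = rk / n
    return weights, means, sds


def cutoff_between_modes(weights, means, sds, tol=1e-9):
    """Bisect between the two means for the point of equal weighted density."""
    lo_k, hi_k = (0, 1) if means[0] <= means[1] else (1, 0)

    def diff(x):
        low = weights[lo_k] * normal_pdf(x, means[lo_k], sds[lo_k])
        high = weights[hi_k] * normal_pdf(x, means[hi_k], sds[hi_k])
        return low - high

    a, b = means[lo_k], means[hi_k]
    while b - a > tol:
        mid = (a + b) / 2.0
        if diff(mid) > 0:
            a = mid  # still on the side dominated by the low component
        else:
            b = mid
    return (a + b) / 2.0


def demo(seed=0):
    """Simulated bimodal marker; the labels are used only to score the cutoff."""
    rng = random.Random(seed)
    negatives = [rng.gauss(6.0, 0.5) for _ in range(300)]   # hypothetical values
    positives = [rng.gauss(11.0, 1.0) for _ in range(700)]  # hypothetical values
    weights, means, sds = fit_two_gaussians(negatives + positives)
    cutoff = cutoff_between_modes(weights, means, sds)
    sensitivity = sum(x > cutoff for x in positives) / len(positives)
    specificity = sum(x <= cutoff for x in negatives) / len(negatives)
    return {"means": sorted(means), "cutoff": cutoff,
            "sensitivity": sensitivity, "specificity": specificity}


if __name__ == "__main__":
    print(demo())
```

The fit never sees the simulated labels; they are used only afterwards to compute sensitivity and specificity, which mirrors the idea of deriving a cutoff without prior knowledge of the true result. A marker with a continuous, unimodal distribution would not give two well-separated components, so a cutoff of this kind would be far less meaningful there.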