Kosuke Yoshihara 1 , 2 , Maria Shahmoradgoli 3 , Emmanuel Martínez 1 , 4 , Rahulsimham Vegesna 1 , Hoon Kim 1 , Wandaliz Torres-Garcia 1 , Victor Treviño 4 , Hui Shen 5 , Peter W. Laird 5 , Douglas A. Levine 6 , Scott L. Carter 7 , Gad Getz 7 , Katherine Stemke-Hale 3 , Gordon B. Mills 3 , Roel G.W. Verhaak a , 1
11 October 2013
Infiltrating stromal and immune cells form the major fraction of normal cells in tumour tissue and not only perturb the tumour signal in molecular studies but also have an important role in cancer biology. Here we describe ‘Estimation of STromal and Immune cells in MAlignant Tumours using Expression data’ (ESTIMATE)—a method that uses gene expression signatures to infer the fraction of stromal and immune cells in tumour samples. ESTIMATE scores correlate with DNA copy number-based tumour purity across samples from 11 different tumour types, profiled on Agilent, Affymetrix platforms or based on RNA sequencing and available through The Cancer Genome Atlas. The prediction accuracy is further corroborated using 3,809 transcriptional profiles available elsewhere in the public domain. The ESTIMATE method allows consideration of tumour-associated normal cells in genomic and transcriptomic studies. An R-library is available on https://sourceforge.net/projects/estimateproject/.
Tumour biopsies contain contaminating normal cells and these can influence the analysis of tumour samples. In this study, Yoshihara et al. develop an algorithm based on gene expression profiles from The Cancer Genome Atlas to estimate the number of contaminating normal cells in tumour samples.