Lung cancer is the leading cause of cancer-related death in the United States. Nearly 50% of patients with stages I and II non-small cell lung cancer (NSCLC) will die from recurrent disease despite surgical resection. No reliable clinical or molecular predictors are currently available for identifying those at high risk for developing recurrent disease. As a consequence, it is not possible to select those high-risk patients for more aggressive therapies and assign less aggressive treatments to patients at low risk for recurrence.
In this study, we applied a meta-analysis of datasets from seven different microarray studies on NSCLC for differentially expressed genes related to survival time (under 2 y and over 5 y). A consensus set of 4,905 genes from these studies was selected, and systematic bias adjustment in the datasets was performed by distance-weighted discrimination (DWD). We identified a gene expression signature consisting of 64 genes that is highly predictive of which stage I lung cancer patients may benefit from more aggressive therapy. Kaplan-Meier analysis of the overall survival of stage I NSCLC patients with the 64-gene expression signature demonstrated that the high- and low-risk groups are significantly different in their overall survival. Of the 64 genes, 11 are related to cancer metastasis (APC, CDH8, IL8RB, LY6D, PCDHGA12, DSP, NID, ENPP2, CCR2, CASP8, and CASP10) and eight are involved in apoptosis (CASP8, CASP10, PIK3R1, BCL2, SON, INHA, PSEN1, and BIK).
Meta-analysis of several lung cancer gene expression studies yields a set of 64 genes whose expression profile is useful in predicting survival of patients with early-stage lung cancer and possibly informing treatment decisions.
Lung cancer is the commonest cause of cancer-related death worldwide. Most cases are of a type called non-small cell lung cancer (NSCLC) and are mainly caused by smoking. Like other cancers, how NSCLC is treated depends on the “stage” at which it is detected. Stage IA NSCLCs are small and confined to the lung and can be removed surgically; patients with slightly larger stage IB tumors often receive chemotherapy after surgery. In stage II NSCLC, cancer cells may be present in lymph nodes near the tumor. Surgery plus chemotherapy is the usual treatment for this stage and for some stage III NSCLCs. However, in this stage, the tumor can be present throughout the chest and surgery is not always possible. For such cases and in stage IV NSCLC, where the tumor has spread throughout the body, patients are treated with chemotherapy alone. The stage at which NSCLC is detected also determines how well patients respond to treatment. Those who can be treated surgically do much better than those who can't. So, whereas only 2% of patients with stage IV lung cancer survive for 5 years after diagnosis, about 70% of patients with stage I or II lung cancer live at least this long.
Even stage I and II lung cancers often recur and there is no accurate way to identify the patients in which this will happen. If there was, these patients could be given aggressive chemotherapy, so the search is on for a “molecular signature” to help identify which NSCLCs are likely to recur. Unlike normal cells, cancer cells divide uncontrollably and can move around the body. These behavioral differences are caused by changes in their genetic material that alter their patterns of RNA transcription and protein expression. In this study, the researchers have investigated whether data from several microarray studies (a technique used to catalog the genes expressed in cells) can be pooled to construct a gene expression signature that predicts the survival of patients with stage I NSCLC.
The researchers took the data from seven independent microarray studies (including a new study of their own) that recorded gene expression profiles related to survival time (less than 2 years and greater than 5 years) for stage I NSCLC. Because these studies had been done in different places with slightly different techniques, the researchers applied a statistical tool called distance-weighted discrimination to smooth out any systematic differences among the studies before identifying 64 genes whose expression was associated with survival. Most of these genes are involved in cell adhesion, cell motility, cell proliferation, and cell death, all processes that are altered in cancer cells. The researchers then developed a statistical model that allowed them to use the gene expression and survival data to calculate risk scores for nearly 200 patients in five of the datasets. When they separated the patients into high and low risk groups on the basis of these scores, the two groups were significantly different in terms of survival time. Indeed, the gene expression signature was better at predicting outcome than routine staging. Finally, the researchers validated the gene expression signature by showing that it predicted survival with more than 85% accuracy in two independent datasets.
The 64 gene expression signature identified here could help clinicians prepare treatment plans for patients with stage I NSCLC. Because it accurately predicts survival in patients with adenocarcinoma or squamous cell cancer (the two major subtypes of NSCLC), it potentially indicates which of these patients should receive aggressive chemotherapy and which can be spared this unpleasant treatment. Previous attempts to establish gene expression signatures to predict outcome have used data from small groups of patients and have failed when tested in additional patients. In contrast, this new signature seems to be generalizable. Nevertheless, its ability to predict outcomes must be confirmed in further studies before it is routinely adopted by oncologists for treatment planning.
Please access these Web sites via the online version of this summary at http://dx.doi.org/10.1371/journal.pmed.0030467.
US National Cancer Institute information on lung cancer for patients and health professionals.
MedlinePlus encyclopedia entries on small-cell and non-small-cell lung cancer.
Cancer Research UK, information on patients about all aspects of lung cancer.
Wikipedia pages on DNA microarrays and expression profiling (note that Wikipedia is a free online encyclopedia that anyone can edit).