The cancer proteome

The cancer proteome can be defined as the entire set of proteins expressed in cancer. A large number of genes encoding proteins with diverse normal functions are involved in human cancer. More than 500 genes have been identified as strongly implicated in the process of transforming normal cells to cancer cells. The expression of these genes in normal cells contributes to normal growth, survival and function, whereas dysregulated expression, including overexpression, loss of expression or expression of a defect protein, in cancer cells contributes to ungoverned tumor growth. Altered gene expression can be caused by coarse structural and numerical chromosomal rearrangements, specific gene amplifications, silencing of transcription through methylation and mutations, e.g. point mutations with single base substitutions and small inserts or deletions, that lead to loss or gain of function of the corresponding protein.

Genes and proteins altered in cancer

Currently, more than 1% of all human genes are implicated via mutation in cancer. Cancer genes include genes encoding cell surface receptors, oncogenes, tumour suppressor genes, kinases, phosphatases, and transcription factors. Of these cancer genes, approximately 90% have somatic mutations in cancer, 20% bear germline mutations that predispose to cancer and 10% show both somatic and germline mutations. A list of genes strongly implicated in cancer, i.e.genes with a documented activity relevant to cancer, along with evidence of mutations in cancer which change the activity of the gene product in a way that promotes oncogenic transformation has been defined through the cancer Gene Census, catalogue of somatic mutations in cancer (COSMIC).

Table 1. List of different genetic alterations in 572 protein coding genes that are strongly implicated in cancer.

Protein class Number of genes
Amplifications 24
Frameshift Mutations 159
Germline Mutations 102
Large Deletions 43
Missense Mutations 256
Nonsense Mutations 157
Other Mutations 38
Somatic Mutations 532
Splicing Mutations 73
Translocations 309
Mutated cancer genes 250
Mutational cancer driver genes 283
Total number of mutated genes 572

It is clear that we are far from having a complete catalogue of cancer genes and results from efforts to identify genetic alterations implicated in cancer are continuously being published. In a study from the Broad Institute (Lawrence MS et al., 2014), a list of 260 significantly mutated cancer genes across 21 cancer types is presented, of which 250 are present in the current version of the Human Protein Atlas. In another study aiming to identify cancer driver genes (Tamborero D et al., 2013), a list of 291 high-confidence cancer driver genes across 12 cancer types is presented, of which 283 are present in the current version of the Human Protein Atlas. Moreover, the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigates tumors from a proteogenomic perspective, to create multi-omics datasets connecting genomic aberrations to cancer phenotypes. The harmonized data includes genomic, transcriptomic, proteomic, and clinical data for >1000 tumors in 10 cohorts (Li Y et al. (2023)).

Protein expression of cancer genes

Protein profiling using immunohistochemistry allows for visualization of the distribution and relative abundance of proteins encoded by cancer genes in various tumor tissues. Certain proteins show differential expression between different forms of cancer or individual tumors within the same form of cancer whereas other proteins show variable expression within different compartments of the tumor or in tumor cells of different states. Moreover, differences in expression can be found between tumors and corresponding normal tissue counterparts. Below are a few examples of protein expression patterns in different forms of cancer and normal tissues.


TP53 - colon

TP53 - colorectal cancer

TP53 - colorectal cancer


TP53 - lung

TP53 - lung cancer

TP53 - lung cancer


ERBB2 - breast

ERBB2 - breast cancer

ERBB2 - breast cancer


EGFR - cerebral cortex

EGFR - glioma

EGFR - glioma


BRCA1 - breast cancer

MITF - melanoma

MYC - ovarian cancer


KRAS - colorectal cancer

PIK3CA - pancreatic cancer

PTEN - prostate cancer


JUN - cervical cancer

APC - colorectal cancer

SOX2 - lung cancer

Figure 1. Immunohistochemical staining patterns in normal and cancer tissue corresponding to a few examples of genes implicated in cancer.

Cancer

Cancer can be defined as a common heterogeneous and potentially deadly disease that affects approximately a third of the human population. The common denominator for cancer is the development of an uncontrolled growth of cells that eventually leads to multiple organ failure and death. Cancer can be classified into several different diseases depending on the organ where the primary cancer develops and on the type of cancer (microscopical and molecular features of the tumor). Lung cancer, breast cancer, colorectal cancer, prostate cancer, stomach cancer and liver cancer are the most common forms of cancer worldwide (IARC/WHO). Epidemiologic research can provide cancer statistics regarding the incidence, prevalence and mortality of cancer. In 2022, there were an estimated 20 million new cancer cases and 9.7 million deaths. About 1 in 5 people develop cancer in their lifetime, and approximately 1 in 9 men and 1 in 12 women die from the disease. How advanced the cancer is at time of diagnosis, together with the origin and type of cancer are important determinants for the outcome and choice of best treatment regime for a cancer patient.




Figure 2. Examples showing the appearance of breast cancers in radiology (mammography), excised tumor specimen, and pathology (microscopy).

Cancer - clinical aspects

The symptoms for cancer are highly variable and depend on extent and localization of tumor growth. Clinical examination, radiologic investigations and laboratory tests can give clues to a cancer diagnosis, however, the firm diagnosis of cancer is ascertained by microscopical evaluation of tumor tissue. The information retrieved from microscopical examination leads to a cancer diagnose and forms the basis for classification of the tumor type. The choice of treatment for patients with cancer depends on the tumor type, malignancy grade and tumor stage at the time of diagnosis. Surgery with the ambition to remove the tumor is often the first choice of treatment when possible. Radiation and cytotoxic agents (chemotherapy) form the basis for treatment of tumors not eligible for surgery, such as leukemia and lymphoma, and can also be used to supplement surgery in patients at high risk for recurrence (adjuvant therapy) or replace surgery in cases where surgery is no longer a possibility due to the extent of tumor growth. Chemotherapy and radiation used for adjuvant treatment, aims to eradicate non-detected residual tumor cells after surgery to improve the chance of cure or prolong survival of the patient. Cancer research has led to a deeper understanding of the molecular landscape in cancer that has allowed for the development of precision medicine including targeted therapies and immunoncology approaches to treat cancer patients. Targeted cancer therapies are drugs designed to interfere with specific molecules necessary for tumor growth and progression, whereas traditional cytotoxic chemotherapies usually kill rapidly dividing cells in the body by interfering with cell division.

Pathology

Several systems for classification of cancer exist. Firstly, cancer is classified based on the site of origin and tumor type with respect to differentiation and morphological features. Most common are epithelial tumors, mainly adenocarcinoma followed by squamous cell carcinoma. Non-epithelial tumors include sarcomas (derived from mesenchymal cells), lymphoma and leukemia (derived from hematopoietic and lymphoid cells), melanoma (derived from cells of melanocytic lineage) and glioma (derived from neural glial cells). Cancer stage is important to predict the prognosis of a given cancer type. Staging of cancer is performed using the TNM-classification system where the extent of tumor growth at the time of diagnosis is assessed for the primary tumor (T), eventual metastasis in lymph nodes (N) and eventual metastasis in distant organs (M). Independent of tumor stage, the malignancy grade provides prognostic information and different cancer grading systems have been defined. Examples of such include Gleason grading of prostate cancer and Elston-Ellis grading of breast cancer. These systems estimate the degree of differentiation, proliferation and cellular atypia in a cancer based on the microscopical appearance. For several forms of cancer, entities that reflect the stage of development from precursor states to invasive cancer have been defined and can be diagnosed by microscopical examination. Examples include low grade adenoma-to-high grade adenoma-to-colon cancer, dysplastic melanocytic nevus-to-melanoma in situ-to-invasive melanoma, prostate intraepithelial neoplasia (PIN)-to-invasive prostate cancer and ductal breast cancer in situ (DCIS)-to-invasive ductal breast cancer. In addition to microscopical examination, molecular tests such as the analysis of gene mutations, gene expression and protein expression can add important diagnostic, prognostic and treatment predictive information needed for targeted therapy approaches.


Lung cancer

Breast cancer

Colorectal cancer


Gastric cancer

Liver cancer

Cervical cancer


Prostate cancer

Endometrial cancer

Ovarian cancer


Pancreatic cancer

Renal cell cancer

Urinary bladder cancer


Glioma

Lymphoma

Testis cancer


Melanoma

Skin cancer (SCC)

Skin cancer (BCC)


Head and neck cancer

Thyroid cancer

Carcinoid

Figure 3. Examples of HE-stained tissue sections representing the most common forms of human cancer.

Tumor biology

Cancer is a cellular disease that originates from one malignantly transformed normal human cell. Although cancer is common in humans, cancer is a rare cellular event with a risk of 1/10000 billion (1/1016) for a normal cell to become a cancer cell. Cancer is a genetic disease caused by mutations of genes that transform the normal growth control and behaviour of cells to abnormal cancer cells that lack growth control. Cancer is believed to develop through a multi-step process, originating from the malignant transformation of one cell leading to subsequent steps of growth advantage compared to normal cells. The impaired growth control allows for the selection of cell clones with the highest rate of cell division and survival. Eventually the primary tumor acquires the ability to grow invasively into normal surrounding tissue followed by spread of cancer cells through the lymphatics and/or blood system to draining lymph nodes and distant other organs.

In addition to cancer cells themselves, solid forms of cancer also consists of surrounding cancer stroma that supports the tumor growth. The cancer stroma includes an extracellular component (matrix proteins) and various populations of normal cells such as fibroblasts, vascular cells and inflammatory cells. The host response evoked by tumor cells includes the development of stroma but also other forms of immunologic response. Signals from normal cells in the cancer stroma, including inflammatory cells can either support or inhibit growth of cancer cells.

Certain traits define cancer cells and distinguish the difference between cancer cells and normal cells. Despite the heterogeneity of cancer, general characteristic hallmarks of cancer have been identified (Figure 4) and underlying cellular signaling pathways that are dysregulated in cancer have in part been elucidated (Figure 5).


Figure 4. Schematic displaying characteristic hallmarks of cancer. Image from Hanahan & Weinberg, 2011. Elsevier user licence.


Figure 5. Example of signaling pathways involved in colon cancer. Image from Slaby et al., 2009. Creative Commons licence.

Relevant links and publications

Uhlen M et al., A pathology atlas of the human cancer transcriptome. Science. (2017)
PubMed: 28818916 DOI: 10.1126/science.aan2507

Uhlén M et al., Tissue-based map of the human proteome. Science (2015)
PubMed: 25613900 DOI: 10.1126/science.1260419

Lawrence MS et al., Discovery and saturation analysis of cancer genes across 21 tumour types. Nature. (2014)
PubMed: 24390350 DOI: 10.1038/nature12912

Tamborero D et al., Comprehensive identification of mutational cancer driver genes across 12 tumor types. Sci Rep. (2013)
PubMed: 24084849 DOI: 10.1038/srep02650

Li Y et al., Proteogenomic data and resources for pan-cancer analysis. Cancer Cell. (2023)
PubMed: 37582339 DOI: 10.1016/j.ccell.2023.06.009

Catalogue of somatic mutations in cancer - COSMIC (Sanger Institute)

Data Portal Cancer Proteomics Research (NIH)

Cancer genome data from TCGA - The Cancer Genome Atlas (NIH)

Genes, Cancer, and DNA mutations - Tumor Portal (Broad Institute)

Incidence, mortality, and prevalence of cancer - Cancer Today (IARC/WHO)

Microscopical images of cancer - Cancer Dictionary (Human Protein Atlas)