The multilocalizing proteome

The immunofluorescence (IF)-based approach used in the subcellular resource enables analysis of protein distribution in all organelles and subcellular structures simultaneously. This allows for the study of spatial distribution of proteins in their cellular context and identification of proteins that localize to more than one compartment, referred to as "multilocalizing proteins" (MLPs).

Figure 1 shows example images of MLPs representing common combinations of locations and gives an idea of the cellular roles of MLPs. In most cases, MLPs are found at multiple sites at the same time within the same cell, but some MLPs show cell line-specific variation. For example, ZNF554 is a nuclear protein in RT4 and SH-SY5Y cells, but is an MLP in U2 OS cells because of its additional prominent location in the nucleoli.

Figure 1 panels: IPO7 (A-431), RPL19 (A-431), CCDC51 (U2OS), KIAA1522 (HaCaT), ITM2B (RT-4), ENO1 (U2OS).

Figure 1. Examples of MLPs identified in the subcellular resource. IPO7 mediates the import of proteins from the cytosol to the nucleus and can cross the nuclear membrane rapidly in both directions (detected in A-431 cells). RPL19 is a component of the ribosomal 60S subunit and was identified in nucleoli, where ribosomes are assembled, and in the cytosol and endoplasmic reticulum, where protein synthesis takes place (detected in A-431 cells). CCDC51 encodes an uncharacterized protein located in the mitochondria and nucleoplasm (detected in U2 OS cells). KIAA1522 encodes an uncharacterized protein identified in the plasma membrane and nucleoplasm (detected in HaCaT cells). ITM2B is a transmembrane protein processed in the Golgi apparatus and vesicles; the resulting small peptide is secreted (detected in RT4 cells). ENO1 is a well-described moonlighting protein. It has several functions in different compartments, including a role in glycolysis in the cytosol and a role as a surface protein in the plasma membrane (detected in U2 OS cells).

MLPs in the subcellular resource

More than half of the proteins localized in the subcellular resource (61%, n=8327) are MLPs (Figure 2). Of these, around 39% (n=3268) are found in three or more locations. The distribution of single and multilocalizing proteins for each major organelle is shown in Figure 3 and Table 1. The percentage of MLPs in the individual organelle proteomes varies, but often exceeds half because each MLP is counted in every organelle where it is detected. Organelles such as the plasma membrane, cytosol, nucleus, and nucleoli share the majority of their proteins with other subcellular structures. This may reflect a need for proteins that operate across the borders of these organelles in order to regulate metabolic reactions, control gene expression, and/or transmit information from the surrounding environment. In contrast, the mitochondrial proteome contains a larger fraction of single localizing proteins, suggesting that this compartment is more self-contained with regard to its biological function.

Figure 2. Bar plot showing the number of protein-coding genes for single or multilocalizing proteins.

Figure 3. Bar plot showing the distribution of proteins localized to one or multiple organelles. Note that proteins localized to different substructures of organelles (e.g. nuclear bodies and nucleoplasm) are considered multilocalizing.

Table 1. Detailed information about single and multilocalizing proteins in the proteome of organelles and substructures.

	Number of additional protein locations
Location	0 (n)	0 (%)	1 (n)	1 (%)	2 (n)	2 (%)	3 or more (n)	3 or more (%)
Actin filaments	24	9	82	32	82	32	60	24
Aggresome	0	0	11	58	6	32	2	11
Centriolar satellite	24	11	61	27	51	22	65	29
Centrosome	51	10	110	21	157	31	158	31
Cleavage furrow	0	0	0	0	0	0	2	100
Cytokinetic bridge	2	1	36	15	49	20	106	43
Cytoplasmic bodies	14	19	34	46	18	24	7	9
Cytosol	889	17	2234	42	1362	26	679	13
Focal adhesion sites	26	18	43	29	44	30	32	22
Intermediate filaments	44	30	59	40	32	22	9	6
Microtubule ends	0	0	0	0	2	29	5	71
Microtubules	36	10	72	20	67	19	118	33
Midbody	5	9	16	29	14	25	19	34
Midbody ring	2	8	2	8	13	52	6	24
Mitochondria	532	47	376	33	161	14	59	5
Mitotic spindle	0	0	11	7	32	20	74	46
Rods & Rings	3	16	8	42	7	37	1	5
Cell Junctions	46	14	123	37	106	32	44	13
Endoplasmic reticulum	232	40	202	35	103	18	37	6
Endosomes	3	19	9	56	2	13	2	13
Golgi apparatus	270	21	488	38	351	27	139	11
Lipid droplets	11	28	21	53	5	13	3	8
Lysosomes	1	5	11	55	5	25	3	15
Peroxisomes	15	63	7	29	1	4	1	4
Plasma membrane	325	14	857	38	677	30	350	15
Vesicles	679	27	920	37	554	22	285	11
Kinetochore	0	0	1	13	3	38	3	38
Mitotic chromosome	0	0	19	24	33	42	26	33
Nuclear bodies	63	10	247	39	203	32	101	16
Nuclear membrane	32	11	116	39	91	31	55	19
Nuclear speckles	176	35	183	36	95	19	50	10
Nucleoli	105	9	457	41	360	32	173	16
Nucleoli fibrillar center	40	12	143	44	91	28	46	14
Nucleoli rim	7	5	27	18	66	43	50	33
Nucleoplasm	1567	25	2606	41	1432	23	613	10
Basal body	0	0	33	7	75	17	251	56
Primary cilium	0	0	35	7	63	13	271	58
Primary cilium tip	0	0	4	3	7	5	78	60
Primary cilium transition zone	0	0	7	7	15	16	59	61
Acrosome	10	9	23	21	13	12	43	38
Annulus	1	2	5	10	9	18	26	52
Calyx	3	4	5	7	10	14	34	49
Connecting piece	0	0	5	5	14	15	48	52
End piece	1	1	9	5	27	14	80	41
Equatorial segment	0	0	14	17	14	17	39	48
Flagellar centriole	2	2	1	1	18	16	58	50
Mid piece	18	5	28	8	57	16	171	49
Perinuclear theca	2	3	3	4	2	3	42	60
Principal piece	15	4	38	10	61	16	154	42
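
The share of multilocalizing proteins per organelle can be recomputed from the counts in Table 1. The following is a minimal sketch, not the resource's own pipeline. It assumes that the percentage columns are relative to the sum of the four count columns, which holds for the two rows used here (for example, 532 of 1128 mitochondrial proteins is about 47%) but not for every row of the table. The names `TABLE_ROWS` and `mlp_share` are hypothetical.

```python
# Counts copied from two rows of Table 1: proteins with 0, 1, 2, and 3 or more
# additional locations. Only rows whose percentages sum to about 100 are used.
TABLE_ROWS = {
    "Mitochondria": (532, 376, 161, 59),
    "Cytosol": (889, 2234, 1362, 679),
}


def mlp_share(counts):
    """Return the fraction of proteins with at least one additional location.

    Assumption: the denominator is the sum of the four count columns.
    """
    total = sum(counts)
    if total == 0:
        raise ValueError("row has no proteins")
    single = counts[0]  # proteins with 0 additional locations
    return (total - single) / total


for location, counts in TABLE_ROWS.items():
    print(f"{location}: {mlp_share(counts):.1%} multilocalizing")
```

Under this assumption, roughly 53% of mitochondrial proteins and 83% of cytosolic proteins are multilocalizing, in line with the contrast described above.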

To get a better overview of the multilocalizing proteome, organelles can be grouped into three meta-compartments, and genes encoding MLPs can be aligned on a circular plot (Figure 4). The meta-compartments are the nucleus (nuclear and nucleolar structures, shown in red), the cytoplasm (cytosol, mitochondria, and the different cytoskeletal structures, shown in blue), and the secretory pathway (endoplasmic reticulum, Golgi apparatus, vesicles, and plasma membrane, shown in yellow). This reveals subordinate organization patterns of the MLPs. For instance, within the cytoplasm and nucleus meta-compartments, a common pattern is multilocalization that involves the predominant organelle, the cytosol and the nucleoplasm, respectively. There are also many proteins that localize to more than one of the fine substructures within each of these meta-compartments. The MLPs in the secretory pathway exhibit a more sequential pattern, likely reflecting directional protein trafficking. In addition, the secretory pathway shares a strikingly high number of MLPs with the nucleus, even though the two are not in direct physical contact with each other. In agreement, Cytoscape plots of each organelle (Figure 5, at the end of the page) show that dual locations to the nucleoplasm together with the Golgi apparatus or vesicles are indeed overrepresented. This suggests that the proteomes of organelles in the secretory pathway are more versatile and should not be reduced to their role in protein secretion.

Figure 4. Circular plot with the identified proteins of each compartment presented and sorted by meta-compartments (red: nucleus, blue: cytoplasm, yellow: secretory pathway). Multilocalizing proteins appearing more than once in the plot are connected by a line.
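
The grouping into meta-compartments can be illustrated with a small lookup table. This is a sketch under stated assumptions: the dictionary covers only a subset of organelles, the names `META_COMPARTMENT`, `EXAMPLES`, and `meta_compartments` are hypothetical, and the example locations are taken from the Figure 1 caption rather than from the full dataset.

```python
# Organelle-to-meta-compartment mapping, following the grouping described in the text.
# Illustrative subset only; the real resource covers many more substructures.
META_COMPARTMENT = {
    "Nucleoplasm": "nucleus",
    "Nucleoli": "nucleus",
    "Cytosol": "cytoplasm",
    "Mitochondria": "cytoplasm",
    "Endoplasmic reticulum": "secretory pathway",
    "Golgi apparatus": "secretory pathway",
    "Vesicles": "secretory pathway",
    "Plasma membrane": "secretory pathway",
}

# Locations as stated in the Figure 1 caption.
EXAMPLES = {
    "RPL19": ["Nucleoli", "Cytosol", "Endoplasmic reticulum"],
    "CCDC51": ["Mitochondria", "Nucleoplasm"],
    "KIAA1522": ["Plasma membrane", "Nucleoplasm"],
    "ITM2B": ["Golgi apparatus", "Vesicles"],
}


def meta_compartments(locations):
    """Return the sorted, de-duplicated meta-compartments for a list of organelles."""
    return sorted({META_COMPARTMENT[loc] for loc in locations})


for gene, locations in EXAMPLES.items():
    metas = meta_compartments(locations)
    scope = "across" if len(metas) > 1 else "within"
    print(f"{gene}: multilocalizing {scope} {', '.join(metas)}")
```

In this toy set, ITM2B stays within the secretory pathway, whereas KIAA1522 links the secretory pathway and the nucleus, the kind of connection that Figure 4 draws as a line between meta-compartments.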

Why does the cell have MLPs?

MLPs present several advantages for the cell, some of which are crucial for cell survival. Shuttling proteins constantly switch their location in order to transport other proteins between organelles, making their multilocalization closely tied to their function. For example, members of the importin family transport proteins from the cytosol to the nucleus and hence are found in both compartments (Lange A et al. (2007), see also Figure 1). Another advantage of multilocalization is the possibility to use the same proteins in similar cellular processes and reactions, even if they occur in different subcellular compartments. For example, it has been shown that mitochondria and peroxisomes share some enzymes for lipid metabolism (Ashmarina LI et al. (1999)). A switch of subcellular location can also be an important way of generating a quick cellular response to external and/or internal cues. For example, receptors such as ERBB2 located in the plasma membrane are known to relocate to the nucleus after stimulation, where they change the expression pattern of target genes. This translocation has a profound impact on the initiation, progression, and prognosis of human cancers (Wang SC et al. (2009)).

Some of the MLPs are not just multilocalizing, but also multifunctional proteins. Multifunctional proteins do not fit the paradigm of "one gene-one protein-one function" and certainly add another dimension to cellular complexity. Multifunctionality may be the result of, e.g., gene fusions, expression of several splice variants, different post-translational modifications, different interaction partners, and/or multilocalization of the protein. An extreme group of multifunctional proteins are the moonlighting proteins. The term "moonlighting" has been used for people who work in different jobs during daylight and moonlight, and like their human counterparts, moonlighting proteins have two or more completely different biochemical functions (Jeffery CJ. (1999)). Moonlighting proteins may provide connections and switches between different cellular reactions, pathways, and processes, making it possible for cells to coordinate responses to a changing environment (Jeffery CJ. (2015)). For example, some biosynthetic enzymes moonlight as transcription factors in order to provide a tightly coupled feedback loop for transcription of genes involved in the pathway. An example of a moonlighting and multilocalizing protein is ENO1 (Figure 1), which acts as a glycolytic enzyme in the cytosol, but also as a plasminogen receptor in the plasma membrane and as a transcriptional repressor in the nucleus (Pancholi V. (2001)).

The Human Protein Atlas does not provide functional studies of proteins and therefore cannot determine whether an MLP is multifunctional. However, the description of proteins at multiple locations is an important step in the discovery of multifunctional and moonlighting proteins, and the spatial information provided in the subcellular resource could be integrated into existing prediction models (Chapple CE et al. (2015)).


Figure 5 plots are available for: Actin Filaments, Centrosome, Cytosol, Endoplasmic Reticulum, Golgi Apparatus, Intermediate Filaments, Microtubules, Mitochondria, Nuclear Membrane, Nucleoli, Nucleoplasm, Plasma Membrane, Primary Cilium, Sperm, Vesicles.

Figure 5. Cytoscape plots showing the distribution of MLPs that are shared between the major organelle proteomes. The black middle node links to all proteins localizing to the selected major organelle proteome, while the gray nodes link to all MLPs shared with each of the other major organelle proteomes. Only gray nodes with at least 1% of the proteins in the compartment are shown. The colored connecting nodes show the number of proteins that are exclusively shared between the compartments, and their circle sizes reflect that number. The cyan nodes show combinations that are significantly overrepresented, while magenta nodes show combinations that are significantly underrepresented compared to the probability of observing that combination based on the frequency of each annotation and a hypergeometric test (p≤0.05). Each node is clickable and links to a list of the corresponding genes.
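
The over- and underrepresentation test mentioned in the Figure 5 caption can be sketched with a hypergeometric distribution. This is a minimal illustration, not the resource's actual implementation: the caption does not specify how the protein population is defined, whether one-sided tails are used, or whether multiple-testing correction is applied, so all of these are assumptions here. The function names and the toy numbers are hypothetical.

```python
from fractions import Fraction
from math import comb


def hypergeom_pmf(k, N, K, n):
    """P(X = k): exactly k proteins are shared between two annotations.

    N = total proteins, K = proteins with annotation A,
    n = proteins with annotation B, k = proteins with both.
    """
    return Fraction(comb(K, k) * comb(N - K, n - k), comb(N, n))


def tail_p_values(k, N, K, n):
    """Return (p_over, p_under) = (P(X >= k), P(X <= k)) as floats."""
    lo = max(0, n - (N - K))  # smallest possible overlap
    hi = min(K, n)            # largest possible overlap
    if not lo <= k <= hi:
        raise ValueError("observed overlap is outside the possible range")
    p_over = sum(hypergeom_pmf(i, N, K, n) for i in range(k, hi + 1))
    p_under = sum(hypergeom_pmf(i, N, K, n) for i in range(lo, k + 1))
    return float(p_over), float(p_under)


# Toy numbers (not from the resource): 20 proteins, 8 in organelle A,
# 6 in organelle B, 5 observed in both.
p_over, p_under = tail_p_values(k=5, N=20, K=8, n=6)
label = "overrepresented" if p_over <= 0.05 else (
    "underrepresented" if p_under <= 0.05 else "not significant")
print(f"P(X >= 5) = {p_over:.4f}, P(X <= 5) = {p_under:.4f} -> {label}")
```

Exact fractions are used so that the tail sums carry no rounding error before the final conversion to a float; for proteome-scale counts, a statistics library would be a more practical choice.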

Relevant links and publications

Lange A et al., Classical nuclear localization signals: definition, function, and interaction with importin alpha. J Biol Chem. (2007)
PubMed: 17170104 DOI: 10.1074/jbc.R600026200

Ashmarina LI et al., 3-Hydroxy-3-methylglutaryl coenzyme A lyase: targeting and processing in peroxisomes and mitochondria. J Lipid Res. (1999)
PubMed: 9869651

Wang SC et al., Nuclear translocation of the epidermal growth factor receptor family membrane tyrosine kinase receptors. Clin Cancer Res. (2009)
PubMed: 19861462 DOI: 10.1158/1078-0432.CCR-08-2813

Jeffery CJ., Moonlighting proteins. Trends Biochem Sci. (1999)
PubMed: 10087914

Jeffery CJ., Why study moonlighting proteins? Front Genet. (2015)
PubMed: 26150826 DOI: 10.3389/fgene.2015.00211

Pancholi V., Multifunctional alpha-enolase: its role in diseases. Cell Mol Life Sci. (2001)
PubMed: 11497239 DOI: 10.1007/pl00000910

Chapple CE et al., Extreme multifunctional proteins identified from a human protein interaction network. Nat Commun. (2015)
PubMed: 26054620 DOI: 10.1038/ncomms8412