Brain tissue expression of THEMIS - Summary

GENERAL INFORMATIONⁱ General description of the gene and the encoded protein(s) using information from HGNC and Ensembl, as well as predictions made by the Human Protein Atlas project.
Human gene nameⁱ Official gene symbol, which is typically a short form of the gene name, according to HGNC.	THEMIS
Mouse gene name	Themis
Human gene descriptionⁱ Full gene name according to HGNC.	Thymocyte selection associated
Predicted locationⁱ All transcripts of all genes have been analyzed regarding the location(s) of corresponding protein based on prediction methods for signal peptides and transmembrane regions. Genes with at least one transcript predicted to encode a secreted protein, according to prediction methods or to UniProt location data, have been further annotated and classified with the aim to determine if the corresponding protein(s) are secreted or actually retained in intracellular locations or membrane-attached. Remaining genes, with no transcript predicted to encode a secreted protein, will be assigned the prediction-based location(s). The annotated location overrules the predicted location, so that a gene encoding a predicted secreted protein that has been annotated as intracellular will have intracellular as the final location.	Intracellular
Mouse gene	ENSMUSG00000049109 (version 109)
Pig gene	ENSSSCG00000024392 (version 109)
Antibodies in assayⁱ The internal antibody ID for the antibody/antibodies available for this specific target, click link for more antibody information.	No antibodies in assay

HUMAN PROTEIN ATLAS INFORMATION
Brain expression cluster (RNA)ⁱ The RNA data was used to cluster genes according to their expression across tissues. Clusters contain genes that have similar expression patterns, and each cluster has been manually annotated to describe common features in terms of function and specificity.	Neurons - Mixed function (mainly)
Tissue specificityⁱ The RNA specificity category is based on mRNA expression levels in the consensus dataset which is calculated from the RNA expression levels in samples from HPA and GTEX. The categories include: tissue enriched, group enriched, tissue enhanced, low tissue specificity and not detected.	Tissue enriched (Lymphoid tissue)
	Human brain	Pig brain	Mouse brain
Regional specificityⁱ The regional specificity category is based on mRNA expression levels in the analysed brain samples, grouped into 13 main brain regions and calculated for the three different species. All brain expression profiles are based on data from HPA. The specificity categories include: regionally enriched, group enriched, regionally enhanced, low regional specificity and not detected. The classification rules are the same used for the tissue specificity category.	Low region specificity	Not detected	Not detected
Tau specificity scoreⁱ Tau specificity score is a numerical indicator of the specificity of the gene expression across cells or tissues. The value ranges from 0 and 1, where 0 indicates identical expression across all cells/tissue types, while 1 indicates expression in a single cell/tissue type.	0.59	0.77	Not detected
Regional distributionⁱ The regional distribution category is based on mRNA expression detected above cut off or not in the analysed brain samples, grouped into 13 main brain regions and calculated for the three different species. Brain expression for all species is based on data from HPA. The distribution categories include: detected in all, detected in many, detected in some, detected in single and not detected. The classification rules are the same used for the tissue distribution category.	Detected in many	Not detected	Not detected

BRAIN RNA EXPRESSIONⁱ

HUMAN

Additional Prefrontal Cortex dataset
Cerebral cortex Hippocampal formation Amygdala Basal ganglia Thalamus Hypothalamus Midbrain Cerebellum Pons Medulla oblongata Spinal cord White matter Choroid plexus Corpus callosum Expression Detection All organs	HPA Human brain datasetⁱ Normalized RNA expression levels (nTPM) shown for the 13 brain regions. Color coding is based on brain region and the bar shows the highest expression among the subregions included. To access sample data, click on region name or bar. Read more about normalized expression levels in Assays & Annotation.
CTX HPF AMY BG TH HY MB CB P M SC WM CP
Cerebral cortex
Hippocampal formation Amygdala Basal ganglia Thalamus Hypothalamus Midbrain Cerebellum Pons Medulla oblongata Spinal cord White matter Choroid plexus

STEREO-SEQⁱ

HPA HUMAN BRAIN

Pending image generation

HPA stereo-seq cerebral cortex

COMPARISON BRAIN RNA EXPRESSIONⁱ

GTEX AND FANTOM HUMAN BRAIN

PIG
GTEx Human brain RNA-Seq datasetⁱ GTEx dataset RNA-seq tissue data generated by the Genotype-Tissue Expression (GTEx) project is reported as mean nTPM, corresponding to mean values of the different individual samples for respective subregion. Highest expression among the subregions represents the brain region. To access sample data, click on region name or bar. The GTEx RNA-seq assay is described in detail in Assays & Annotation.	FANTOM5 Human brain CAGE datasetⁱ FANTOM5 dataset Tissue data for RNA expression obtained through Cap Analysis of Gene Expression (CAGE) generated by the FANTOM5 project are reported as Scaled Tags Per Million. To access sample data, click on region name or bar. The CAGE method is described in detail in Assays & Annotation.
Cerebral cortex Olfactory bulb Hippocampal formation Amygdala Basal ganglia Thalamus Hypothalamus Midbrain Cerebellum Pons Medulla oblongata Spinal cord Pituitary gland Retina White matter	HPA Pig brain RNA-Seq datasetⁱ HPA Pig dataset HPA RNA-seq tissue data is reported as mean nTPM (normalized expression) for each of the brain regions analyzed in pig. The detailed pages (reached when clicking a bar or regional name) show nTPM values at the individual sample level. To access sample data, click on region name or bar. The HPA RNA-seq assay is described in detail in Assays & Annotation. The pig brain transcriptomics project is a collaborative project between human protein atlas and the Lars Bolund institute of regenerative Medicine (Dr. Yonglun Luo), BGI-Qingdao, China.
MOUSE
Cerebral cortex Olfactory bulb Hippocampal formation Amygdala Basal ganglia Thalamus Hypothalamus Midbrain Pons and medulla Cerebellum Pituitary gland Retina White matter	HPA Mouse brain RNA-Seq datasetⁱ HPA Mouse dataset HPA RNA-seq tissue data is reported as mean nTPM (normalized expression) for each of the brain regions analyzed in mouse. The detailed pages (reached when clicking a bar or regional name) show nTPM values at the individual sample level. To access sample data, click on region name or bar. The HPA RNA-seq assay is described in detail in Assays & Annotation. Allen Mouse brain ISH datasetⁱ The Allen Mouse Brain ISH The expression values based on in situ hybridization (ISH) available through the ABA API (© 2004 Allen Institute for Brain Science, Allen Mouse Brain Atlas) is imported and show regional expression grouped in the same manner as the other datasets. To access sample data and links to the ABA, click on region name or bar.

HUMAN BRAIN PROTEIN LOCATIONⁱ The Human brain protein data is based on curated and manually selected Tissue Atlas data. The standard brain regions used in the Tissue Atlas are cerebral cortex, caudate nucleus, hippocampus and cerebellum, only selected cases include information on hypothalamus or retina. The score is based on knowledge-based annotation of the protein location in the main cell types. For genes where more than one antibody has been used, a collective score is set displaying the estimated true protein expression.
Non curated brain data available in the Tissue Atlas.

EXPRESSION CLUSTERING & CORRELATIONⁱ

THEMIS is part of cluster 53 Neurons - Mixed function with confidenceⁱ 1
527 genes in cluster

Go to interactive expression cluster page

15 nearest neighbours based on brain RNA expression

Neighbourⁱ Gene name according to HGNC.	Descriptionⁱ Gene description according to HGNC.	Correlationⁱ Correlation between the selected gene and neighboring gene. Correlation is calculated as Spearman correlation in PCA space based on the RNA-seq expression data.	Clusterⁱ ID of the expression cluster of the neighboring gene.
GTDC1	Glycosyltransferase like domain containing 1	0.8982	53
NLK	Nemo like kinase	0.8895	53
PRH1	Proline rich protein HaeIII subfamily 1	0.8895	53
NECAB1	N-terminal EF-hand calcium binding protein 1	0.8860	53
SMYD1	SET and MYND domain containing 1	0.8789	53
LRRC8B	Leucine rich repeat containing 8 VRAC subunit B	0.8772	53
C6orf58	Chromosome 6 open reading frame 58	0.8649	53
NUAK1	NUAK family kinase 1	0.8649	53
PLEKHA1	Pleckstrin homology domain containing A1	0.8596	53
LRRC4	Leucine rich repeat containing 4	0.8579	53
ZNF365	Zinc finger protein 365	0.8474	53
GOLGA8S	Golgin A8 family member S	0.8421	53
TBR1	T-box brain transcription factor 1	0.8404	53
TRAV8-3	T cell receptor alpha variable 8-3	0.8386	53
C3orf80	Chromosome 3 open reading frame 80	0.8368	53