Downloadable data
Programmatic access
If you want to programmatically access a subset of the data more information can be found on the help page
Search results
The data files represented here includes data available in the Human Protein Atlas version 24.0. A subset of this data can also be downloaded from the Search page with the genes corresponding to the current search result in the result in different formats; XML, RDF, TSV & JSON.
Single entry
Data for a single entry can be accessed in XML, RDF (trig), TSV or JSON format by adding the corresponding format extension to the Ensembl id as in the below URLs:
https://www.proteinatlas.org/ENSG00000134057.xml
https://www.proteinatlas.org/ENSG00000134057.trig
https://www.proteinatlas.org/ENSG00000134057.tsv
https://www.proteinatlas.org/ENSG00000134057.json
Archived data
As of version 16 of the Human Protein Atlas, the site can be reached using the url structure "vXX.proteinatlas.org" where XX is the version number. For example, version 16 of the Human Protein Atlas has the url https://v16.proteinatlas.org.
Tissue
Brain
Single cell type
Single nuclei brain
Immune cell
Subcellular
Cancer
Blood disease
Blood protein
Cell line
Interaction
Protein atlas data
Data from the Human Protein Atlas in tab-separated format
This file contains a subset of the data in the Human Protein Atlas version 24.0 corresponding to the data seen in the search result. This data can also be downloaded for a resulting gene set when using the search function (via the TSV link on the result page).
Data from the Human Protein Atlas in json format
This file contains the same subset of the data as the above proteinatlas.tsv but in a different format and potentially more useful for 3rd party web APIs. This data can also be downloaded for a resulting gene set using the search function (via the Download: Custom TSV/JSON link on the result page).
Data from the Human Protein Atlas in XML format
The XML file contains most of the data in the Human Protein Atlas version 24.0, including protein expression data (in normal and tumor tissues and in cell lines), antigen sequences, Western blot data for antibodies, protein array data for antibodies, RNA-seq data, external references such as UniProt identifiers, and more. The data is based on Ensembl version 109. The file structure is presented in the XSD-schema. This data can also be downloaded for a resulting gene set when using the search function (via the xml link on the result page).
The XML file presented here is compressed with gzip due to its size. It can be uncompressed with an archive program like 7‑zip.
Data from the Human Protein Atlas in RDF format
This file contains a subset of the data in the Human Protein Atlas version 24.0 corresponding to the tissue annotations on gene level. This data can also be downloaded for a resulting gene set when using the search function (via the RDF link on the result page). This RDF release is BETA and will be extended and developed in coming releases. We thank Mark Thompson, Rajaram Kaliyaperumal and Eelke van der Horst (LUMC, The Netherlands), and Christine Chichester (SIB, Switzerland) for providing templates for generating the first beta-release of HPA nanopublications. Their contribution was made possible by IMI project Open PHACTS and EU FP7 project RD-Connect. This beta was developed within an ELIXIR collaboration.
Cell graphic
Schematic cell containing all structures annotated within the Human Protein Atlas.