Protein structure - CFI - The Human Protein Atlas

CFI

GENERAL INFORMATIONⁱ General description of the gene and the encoded protein(s) using information from HGNC and Ensembl, as well as predictions made by the Human Protein Atlas project.
Gene nameⁱ Official gene symbol, which is typically a short form of the gene name, according to HGNC.	CFI
Synonyms	C3b-INA, FI, IF, KAF
Gene descriptionⁱ Full gene name according to HGNC.	Complement factor I
Protein classⁱ Assigned HPA protein class(es) for the encoded protein(s).	Disease related genes, Enzymes, Human disease related genes, Plasma proteins, Potential drug targets
Predicted locationⁱ All transcripts of all genes have been analyzed regarding the location(s) of corresponding protein based on prediction methods for signal peptides and transmembrane regions. Genes with at least one transcript predicted to encode a secreted protein, according to prediction methods or to UniProt location data, have been further annotated and classified with the aim to determine if the corresponding protein(s) are secreted or actually retained in intracellular locations or membrane-attached. Remaining genes, with no transcript predicted to encode a secreted protein, will be assigned the prediction-based location(s). The annotated location overrules the predicted location, so that a gene encoding a predicted secreted protein that has been annotated as intracellular will have intracellular as the final location.	Secreted
Protein evidence	Evidence at protein level (all genes)
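
The precedence rule described under "Predicted location" (a manual annotation overrules the prediction) can be sketched as follows. This is only an illustration of the rule as stated above; the function name `final_location` and the string labels are hypothetical and are not part of any Human Protein Atlas API.

```python
from typing import Optional


def final_location(predicted: str, annotated: Optional[str] = None) -> str:
    """Return the annotated location when one exists, else the predicted one.

    Hypothetical helper: it mirrors the rule that the annotated location
    overrules the prediction-based location.
    """
    return annotated if annotated is not None else predicted


# A gene with a predicted secreted transcript and no overriding annotation.
print(final_location("Secreted"))                   # Secreted
# A predicted secreted protein that was annotated as intracellular.
print(final_location("Secreted", "Intracellular"))  # Intracellular
```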

HUMAN PROTEIN ATLAS INFORMATIONⁱ Summary of RNA expression analysis and annotation data generated within the Human Protein Atlas project.
Single cell type expression clusterⁱ The RNA data was used to cluster genes according to their expression across single cell types. Clusters contain genes that have similar expression patterns, and each cluster has been manually annotated to describe common features in terms of function and specificity.	Brain endothelial cells - Blood brain barrier (mainly)
Single cell type specificityⁱ The RNA specificity category is derived from mRNA expression levels in the analyzed cell types, using scRNA-seq data from normal tissues. The categories include: cell type enriched, group enriched, cell type enhanced, low cell type specificity and not detected.	Cell type enhanced (Cholangiocytes, Endometrial secretory cells, Endometrial stromal cells, Epididymal efferent duct absorptive cells, Fallopian secretory cells, Granulosa cells, Hepatocytes, Lymphatic endothelial cells, Mesothelial cells, Salivary duct cells, Vascular endothelial cells)
Tissue expression cluster (RNA)ⁱ The RNA data was used to cluster genes according to their expression across tissues. Clusters contain genes that have similar expression patterns, and each cluster has been manually annotated to describe common features in terms of function and specificity.	Liver & Kidney - Metabolism (mainly)
Tissue specificity (RNA)ⁱ The RNA specificity category is based on mRNA expression levels in the consensus dataset, which is calculated from the RNA expression levels in samples from HPA and GTEx. The categories include: tissue enriched, group enriched, tissue enhanced, low tissue specificity and not detected.	Tissue enriched (Liver)
Subcellular locationⁱ Main subcellular location based on data generated in the subcellular section of the Human Protein Atlas.	Not available
Secretome annotationⁱ All genes with at least one predicted secreted isoform have been annotated and classified with the aim to determine whether the corresponding protein(s) are secreted into blood, locally secreted, attached to a membrane, or retained in intracellular locations such as mitochondria, the endoplasmic reticulum (ER), the Golgi apparatus or lysosomes.	Secreted to blood

GENE INFORMATIONⁱ Gene information from Ensembl and Entrez, as well as links to available gene identifiers are displayed here. Information was retrieved from Ensembl if not indicated otherwise.
Chromosome	4
Cytoband	q25
Chromosome location (bp)	109731008 - 109802150
Number of transcriptsⁱ Number of protein-coding transcripts from the gene as defined by Ensembl.	4
Ensembl	ENSG00000205403 (version 109)
Entrez gene	3426
HGNC	HGNC:5394
UniProt	P05156
GeneCards	CFI
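
As a small worked example, the genomic span of the gene follows from the chromosome location listed above. The sketch assumes the coordinates are 1-based and inclusive at both ends (the usual Ensembl convention); if that assumption does not hold, the result is off by one.

```python
# Chromosome 4 coordinates as listed in the gene information above.
start_bp = 109_731_008
end_bp = 109_802_150

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span_bp = end_bp - start_bp + 1
print(span_bp)  # 71143
```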

PROTEIN BROWSERⁱ The Structure section provides in-house generated structures, predicted using the AlphaFold source code, for the majority of the proteins and their related isoforms.
Displaying protein features on the AlphaFold structures
Individual splice variants can be selected in the top part of the Protein Browser, and transcript-related features such as transmembrane regions, InterPro domains and antigen sequences for antibodies can be displayed in the structure by clicking on the respective features in the Protein Browser. Clinical and population-based amino acid variants, based on data from the Ensembl variation database and AlphaMissense (AM) predictions, can be highlighted using the sliders to the right of the structure. The sliders can also be used to colour the entire structure by residue index or to make the structure autorotate. The structures are displayed using the NGL Viewer and can also be zoomed and rotated manually.
The Protein Browser
The Protein Browser displays the antigen location on the target protein(s) and the features of the target protein. Transcript names and schematic transcript structures, including exons, introns and UTRs for the different isoforms, are shown on top and can be used to switch between the structures for the different splice variants. At the top of the view, the position of the antigen (identified by the corresponding HPA identifier) is shown as a green bar. A yellow triangle on the bar indicates less than 100% sequence identity to the protein target.
Below the antigens, the maximum percent sequence identity of the protein to all other proteins from other human genes is displayed, using a sliding window of 10 aa residues (HsID 10) or 50 aa residues (HsID 50). The region with the lowest possible identity is always selected for antigen design, with a maximum identity of 60% allowed for designing a single-target antigen.
The curve in blue displays the predicted antigenicity, i.e. the tendency for different regions of the protein to generate an immune response, with peak regions being predicted to be more antigenic. The curve shows average values based on a sliding window approach using an in-house propensity scale.
Signal peptides (turquoise) and membrane regions (orange), based on predictions using the majority decision methods MDM and MDSEC, are also displayed. Low complexity regions are shown in yellow and InterPro regions in green. Common (purple) and unique (grey) regions between different splice variants of the gene are also displayed, and at the bottom of the protein view is the protein scale.
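
The sliding-window averaging mentioned for the antigenicity curve can be sketched as below. The in-house propensity scale is not given on this page, so `TOY_SCALE` holds made-up placeholder values, and the function name, the default window size and the handling of unknown residues are all assumptions made for illustration only.

```python
from typing import Dict, List

# Placeholder per-residue scores; NOT the Human Protein Atlas in-house scale.
TOY_SCALE: Dict[str, float] = {"A": 1.0, "K": 3.0, "G": 0.0}


def sliding_window_mean(sequence: str, scale: Dict[str, float],
                        window: int = 10) -> List[float]:
    """Average per-residue scores over every full window along the sequence.

    Residues missing from `scale` score 0.0 (an assumption of this sketch).
    Returns len(sequence) - window + 1 values, one per window start position.
    """
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [scale.get(residue, 0.0) for residue in sequence]
    total = sum(scores[:window])
    means = [total / window]
    # Slide one residue at a time: add the entering score, drop the leaving one.
    for i in range(window, len(scores)):
        total += scores[i] - scores[i - window]
        means.append(total / window)
    return means


print(sliding_window_mean("AKGA", TOY_SCALE, window=2))  # [2.0, 1.5, 0.5]
```

Peaks in the returned list would correspond to the regions that such a curve marks as scoring highest under the chosen scale.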

PROTEIN INFORMATIONⁱ The protein information section displays alternative protein-coding transcripts (splice variants) encoded by this gene according to the Ensembl database. The Splice variant identifier links to the Ensembl website protein summary for the selected splice variant. The data in the Swissprot and TrEMBL columns links to corresponding pages in the UniProt database. The protein classes assigned to this protein are shown if expanding the data in the protein class column. Parent protein classes are in bold font and subclasses are listed under the parent class. The length of the protein (amino acid residues according to Ensembl), molecular mass (kDalton), predicted signal peptide and number of predicted transmembrane region(s) according to in-house majority decision methods based on sets of predictors are also reported.
Splice variant	SwissProt	TrEMBL	Protein class	Length & mass	Signal peptide (predicted)	Transmembrane regions (predicted)
CFI-201	P05156		Enzymes, Predicted secreted proteins, Plasma proteins, Disease related genes, Potential drug targets, Human disease related genes, Mapped to neXtProt, Protein evidence (Ezkurdia et al 2014)	583 aa 65.7 kDa	Yes	0
CFI-202		E7ETH0	Predicted secreted proteins, Human disease related genes, Protein evidence (Ezkurdia et al 2014)	591 aa 66.6 kDa	Yes	0
CFI-204		D6R9Z8	Predicted secreted proteins, Human disease related genes, Protein evidence (Ezkurdia et al 2014)	161 aa 18.3 kDa	Yes	0
CFI-205		G3XAM2	Predicted secreted proteins, Human disease related genes, Protein evidence (Ezkurdia et al 2014)	576 aa 65.1 kDa	Yes	0
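
For readers who want to work with the table programmatically, the "Length & mass" cells can be split into numbers as sketched below. The regular expression and the helper name `parse_length_mass` are assumptions of this example, and the per-residue figure is only a rough plausibility check on the listed values.

```python
import re
from typing import Dict, Tuple

# Matches cells of the form "583 aa 65.7 kDa".
LENGTH_MASS = re.compile(r"(\d+)\s*aa\s+([\d.]+)\s*kDa")


def parse_length_mass(cell: str) -> Tuple[int, float]:
    """Return (length in residues, mass in kDa) from a 'Length & mass' cell."""
    match = LENGTH_MASS.search(cell)
    if match is None:
        raise ValueError(f"unrecognised length/mass cell: {cell!r}")
    return int(match.group(1)), float(match.group(2))


# Values copied from the splice variant table above.
variants: Dict[str, str] = {
    "CFI-201": "583 aa 65.7 kDa",
    "CFI-202": "591 aa 66.6 kDa",
    "CFI-204": "161 aa 18.3 kDa",
    "CFI-205": "576 aa 65.1 kDa",
}

for name, cell in variants.items():
    length_aa, mass_kda = parse_length_mass(cell)
    # Mean mass per residue in Da, as a sanity check on the two columns.
    print(name, length_aa, mass_kda, round(mass_kda * 1000 / length_aa, 1))
```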

The Human Protein Atlas project is funded
by the Knut & Alice Wallenberg Foundation.
