We use cookies to enhance the usability of our website. If you continue, we'll assume that you are happy to receive all cookies. More information. Don't show this again.
BSG
HPA
RESOURCES
  • TISSUE
  • BRAIN
  • SINGLE CELL
  • SUBCELLULAR
  • CANCER
  • BLOOD
  • CELL LINE
  • STRUCTURE & INTERACTION
ABOUT
  • INTRODUCTION
  • HISTORY
  • ORGANIZATION
  • PUBLICATIONS
  • ANTIBODY SUBMISSION
  • ANTIBODY AVAILABILITY
  • ACKNOWLEDGMENTS
  • CONTACT
NEWS
  • NEWS ARTICLES
  • PRESS ROOM
LEARN
  • DICTIONARY
  • PROTEIN CLASSES
  • PROTEIN EVIDENCE
  • METHODS
  • EDUCATIONAL VIDEOS
DATA
  • DOWNLOADABLE DATA
  • PUBLICATION DATA
  • RELEASE HISTORY
HELP
  • ANTIBODY VALIDATION
  • ASSAYS & ANNOTATION
  • DISCLAIMER
  • HELP & FAQ
  • PRIVACY STATEMENT
  • LICENCE & CITATION
Fields »
Search result

Field
Term
Gene name
Class
Subclass
Class
Keyword
Chromosome
External id
Tissue
Cell type
Expression
Antibody panel
Tissue
Main location
Patient ID
Annotation
Tissue
Category
Tau score
Cluster
Reliability
Brain region
Category
Tau score
Brain region
Category
Tau score
Brain region
Category
Tau score
Cluster
Reliability
Tissue
Cell type
Enrichment
Cell type
Category
Tau score
Cell type
Category
Tau score
Cell type
Category
Tau score
Cell lineage
Category
Tau score
Cluster
Cluster
Location
Searches
Location
Cell line
Class
Type
Phase
Reliability
Cancer
Prognosis
Cancer
Category
Cancer
Category
Tau score
Cluster
Variants
Interacting gene (ensg_id)
Type
Number of interactions
Pathway
Category
Score
Score
Score
Validation
Validation
Validation
Validation
Antibodies
Data type
Column


  • SUMMARY

  • TISSUE

  • BRAIN

  • SINGLE CELL

  • SUBCELL

  • CANCER

  • BLOOD

  • CELL LINE

  • STRUCT & INT

  • BSG
STRUCTURE & INTERACTION STRUCTURE INTERACTION
Protein structures
Protein interactions
Human metabolism
GENERAL INFORMATIONi

General description of the gene and the encoded protein(s) using information from HGNC and Ensembl, as well as predictions made by the Human Protein Atlas project.

Gene namei

Official gene symbol, which is typically a short form of the gene name, according to HGNC.

BSG
Synonyms CD147, EMMPRIN, EMPRIN, OK
Gene descriptioni

Full gene name according to HGNC.

Basigin (Ok blood group)
Protein classi

Assigned HPA protein class(es) for the encoded protein(s).

Blood group antigen proteins
CD markers
Transporters
Predicted locationi

All transcripts of all genes have been analyzed regarding the location(s) of corresponding protein based on prediction methods for signal peptides and transmembrane regions.

  • Genes with at least one transcript predicted to encode a secreted protein, according to prediction methods or to UniProt location data, have been further annotated and classified with the aim to determine if the corresponding protein(s) are secreted or actually retained in intracellular locations or membrane-attached.

  • Remaining genes, with no transcript predicted to encode a secreted protein, will be assigned the prediction-based location(s).

The annotated location overrules the predicted location, so that a gene encoding a predicted secreted protein that has been annotated as intracellular will have intracellular as the final location.

Membrane, Intracellular
Protein evidence Evidence at protein level (all genes)

HUMAN PROTEIN ATLAS INFORMATIONi

Summary of RNA expression analysis and annotation data generated within the Human Protein Atlas project.

Single cell type
expression clusteri

The RNA data was used to cluster genes according to their expression across single cell types. Clusters contain genes that have similar expression patterns, and each cluster has been manually annotated to describe common features in terms of function and specificity.

Enterocytes - Absorption (mainly)
Single cell type specificityi

The RNA specificity category is based on mRNA expression levels in the analyzed cell types based on scRNA-seq data from normal tissues. The categories include: cell type enriched, group enriched, cell type enhanced, low cell type specificity and not detected.

Cell type enhanced (Cytotrophoblasts, Distal enterocytes, Syncytiotrophoblasts)
Tissue expression
cluster (RNA)i

The RNA data was used to cluster genes according to their expression across tissues. Clusters contain genes that have similar expression patterns, and each cluster has been manually annotated to describe common features in terms of function and specificity.

Heart muscle - Heart development (mainly)
Tissue specificity (RNA)i

The RNA specificity category is based on mRNA expression levels in the consensus dataset which is calculated from the RNA expression levels in samples from HPA and GTEX. The categories include: tissue enriched, group enriched, tissue enhanced, low tissue specificity and not detected.

Low tissue specificity
Subcellular locationi

Main subcellular location based on data generated in the subcellular section of the Human Protein Atlas.

Localized to the Vesicles In addition localized to the Mid piece, Principal piece, End piece
Secretome annotationi

All genes with at least one predicted secreted isoform have been annotated and classified with the aim to determine if the corresponding protein(s) are:

  • secreted into blood
  • locally secreted
  • or actually being attached to membrane or retained in intracellular locations like mitochondria, endoplasmatic reticulum (ER), Golgi apparatus or lysosomes.

Not available
GENE INFORMATIONi

Gene information from Ensembl and Entrez, as well as links to available gene identifiers are displayed here. Information was retrieved from Ensembl if not indicated otherwise.

Chromosome 19
Cytoband p13.3
Chromosome location (bp) 571277 - 583494
Number of transcriptsi

Number of protein-coding transcripts from the gene as defined by Ensembl.

13
Ensembl ENSG00000172270 (version 109)
Entrez gene 682
HGNC HGNC:1116
UniProt P35613 (UniProt - Evidence at protein level)
neXtProt NX_P35613
GeneCards BSG
ASSAYSi

Links to data of the different assays available in the Structure & Interaction resource of the Human Protein Atlas. Click on the miniature images to directly get to the respective section.

Proteinstructure Interaction
PROTEIN BROWSERi

The Structure section provides in-house generated structures, predicted using the Alphafold source code, for the majority of the proteins and their related isoforms.

Displaying protein features on the AlphaFold structures

Individual splice variants can be selected in the top part of the Protein Browser (see below) and different transcript-related features such as transmembrane regions, InterPro domains and antigen sequences for antibodies can be displayed in the structure by clicking on the respective features in the Protein Browser.

Clinical and population-based amino acid variants based on data from the Ensembl variation database and AlphaMissense (AM) predictions can be highlighted using the sliders to the right of the structure. These can also be used to colour the entire structure by residue index or make the structure autorotate.The structures are displayed using the NGL Viewer and can also be zoomed-in and rotated manually.

The Protein Browser

The ProteinBrowser displays the antigen location on the target protein(s) and the features of the target protein. Transcript names and schematic transcript structures including exons, introns and UTRs for the different isoforms are shown on top, and can be used to switch between the structures for the different splice variants.

At the top of the view, the position of the antigen (identified by the corresponding HPA identifier) is shown as a green bar. A yellow triangle on the bar indicates a <100% sequence identity to the protein target.

Below the antigens, the maximum percent sequence identity of the protein to all other proteins from other human genes is displayed, using a sliding window of 10 aa residues (HsID 10) or 50 aa residues (HsID 50). The region with the lowest possible identity is always selected for antigen design, with a maximum identity of 60% allowed for designing a single-target antigen (read more).

The curve in blue displays the predicted antigenicity i.e. the tendency for different regions of the protein to generate an immune response, with peak regions being predicted to be more antigenic.The curve shows average values based on a sliding window approach using an in-house propensity scale. (read more).

Signal peptides (turquoise) and membrane regions (orange) based on predictions using the majority decision methods MDM and MDSEC are also displayed.

Low complexity regions are shown in yellow and InterPro regions in green. Common (purple) and unique (grey) regions between different splice variants of the gene are also displayed (read more), and at the bottom of the protein view is the protein scale.
«
BSG-201
BSG-202
BSG-203
BSG-204
BSG-207
BSG-208
BSG-211
BSG-214
BSG-215
BSG-217
BSG-218
BSG-219
BSG-221
»

Description:

Color scheme:
Confidence
Residue index
Your selection
Variants:
Off
Population
Clinical
Alphamissense variants:
Off
Benign
Pathogenic
Autorotate:
Off
On
PROTEIN INFORMATIONi

The protein information section displays alternative protein-coding transcripts (splice variants) encoded by this gene according to the Ensembl database.

The Splice variant identifier links to the Ensembl website protein summary for the selected splice variant. The data in the Swissprot and TrEMBL columns links to corresponding pages in the UniProt database.

The protein classes assigned to this protein are shown if expanding the data in the protein class column. Parent protein classes are in bold font and subclasses are listed under the parent class.

The length of the protein (amino acid residues according to Ensembl), molecular mass (kDalton), predicted signal peptide and number of predicted transmembrane region(s) according to in-house majority decision methods based on sets of predictors are also reported.
Splice variant SwissProt TrEMBL Protein class Length & mass Signal peptide
(predicted)
Transmembrane regions
(predicted)
BSG-201
P35613
Show all
CD markers
Blood group antigen proteins
Transporters
Predicted membrane proteins
Mapped to neXtProt
Protein evidence (Kim et al 2014)
Protein evidence (Ezkurdia et al 2014)
Show all
385 aa
42.2 kDa
Yes 1
BSG-202
P35613
Show all
CD markers
Blood group antigen proteins
Transporters
Predicted membrane proteins
Mapped to neXtProt
Protein evidence (Ezkurdia et al 2014)
Show all
176 aa
19.4 kDa
No 1
BSG-203
P35613
Show all
Q54A51
Show all
CD markers
Blood group antigen proteins
Transporters
Predicted membrane proteins
Mapped to neXtProt
Protein evidence (Kim et al 2014)
Protein evidence (Ezkurdia et al 2014)
Show all
269 aa
29.2 kDa
Yes 1
BSG-204
P35613
Show all
CD markers
Blood group antigen proteins
Transporters
Predicted membrane proteins
Mapped to neXtProt
Protein evidence (Kim et al 2014)
Protein evidence (Ezkurdia et al 2014)
Show all
176 aa
19.4 kDa
No 1
BSG-207
I3L192
Show all
Predicted intracellular proteins
Protein evidence (Ezkurdia et al 2014)
Show all
160 aa
17.2 kDa
Yes 0
BSG-208
P35613
Show all
R4GMX5
Show all
CD markers
Blood group antigen proteins
Transporters
Predicted membrane proteins
Mapped to neXtProt
Protein evidence (Ezkurdia et al 2014)
Show all
176 aa
19.4 kDa
No 1
BSG-211
P35613
Show all
R4GN83
Show all
CD markers
Blood group antigen proteins
Transporters
Predicted membrane proteins
Mapped to neXtProt
Protein evidence (Ezkurdia et al 2014)
Show all
176 aa
19.4 kDa
No 1
BSG-214
A0A087X215
Show all
Predicted intracellular proteins
Protein evidence (Ezkurdia et al 2014)
Show all
55 aa
5.5 kDa
Yes 0
BSG-215
A0A087WUV8
Show all
Predicted membrane proteins
Protein evidence (Ezkurdia et al 2014)
Show all
189 aa
20.5 kDa
Yes 1
BSG-217
A0A7P0T8A7
Show all
Predicted membrane proteins
Protein evidence (Ezkurdia et al 2014)
Show all
206 aa
22.6 kDa
No 1
BSG-218
P35613
Show all
CD markers
Blood group antigen proteins
Transporters
Predicted membrane proteins
Mapped to neXtProt
Protein evidence (Ezkurdia et al 2014)
Show all
176 aa
19.4 kDa
No 1
BSG-219
A0A7P0TA85
Show all
Predicted membrane proteins
Protein evidence (Ezkurdia et al 2014)
Show all
266 aa
29 kDa
Yes 1
BSG-221
A0A7P0TB04
Show all
Predicted membrane proteins
Protein evidence (Ezkurdia et al 2014)
Show all
266 aa
28.9 kDa
Yes 1
Show allShow less
INTERACTIONi

In the Interaction part of this page network plots showing the gene's interaction partners according to four different datasets are displayed, including a consensus network plot showing only interactions present in at least two of the datasets. In the network plots the nodes represent genes and edge color represent the number of datasets the interaction belongs to.

The highlight bar in the top of the plot can be used to color the nodes according to subcellular location (based on data in the Subcellular section), predicted location (based on signalpeptide and transmembrane region predictions), tissue specificity (based on RNA tissue expression profiles) or proteinclass. For genes categorised as single cell type or group specific the option to highlight interaction partners expressed in the same cell type will also be available in the highlight bar.

Custom highlighting of nodes is possible using the top left Filter option in which a query of choice can be built to for example label all nodes that are tissue enriched in both human and mouse brain or those that belongs to a certain tissue expression cluster. The expression cluster for the gene is stated in the box Human Protein Atlas Information above the network plot. Click on Filter in the top left in the plot to find the query builder and some example queries.

Interactions included are direct interaction and physical associations with high and medium confidence from IntAct, physical multivalidated interactions from BioGRID, interactions with>75% probability from BioPlex and significant physical interactions from OpenCell.

Filter menu »
Fields »
e.g. Brain enriched, Localized to mitochondria, Heart muscle - Heart development

Field
Term
Gene name
Class
Subclass
Class
Keyword
Chromosome
External id
Tissue
Cell type
Expression
Antibody panel
Tissue
Main location
Patient ID
Annotation
Tissue
Category
Tau score
Cluster
Reliability
Brain region
Category
Tau score
Brain region
Category
Tau score
Brain region
Category
Tau score
Cluster
Reliability
Tissue
Cell type
Enrichment
Cell type
Category
Tau score
Cell type
Category
Tau score
Cell type
Category
Tau score
Cell lineage
Category
Tau score
Cluster
Cluster
Location
Searches
Location
Cell line
Class
Type
Phase
Reliability
Cancer
Prognosis
Cancer
Category
Cancer
Category
Tau score
Cluster
Variants
Interacting gene (ensg_id)
Type
Number of interactions
Pathway
Category
Score
Score
Score
Validation
Validation
Validation
Validation
Antibodies
Data type
Column


Consensus
IntAct
BioGrid
OpenCell
BioPlex
Highlight:
Off
Subcell location
Predicted location
Cell type specificity
Protein class



Number of interactions: 4
Interaction# Consensus# IntAct# BioGrid# OpenCell# BioPlex
ATP2B1401880
CANX126723538124
PPIA3113332
SLC16A11011110
Highlight:
Off
Subcell location
Predicted location
Cell type specificity
Protein class



Number of interactions: 3 View all protein interaction data in IntAct
Interaction# Consensus# IntAct# BioGrid# OpenCell# BioPlex
HOOK15242018
NCSTN46610
PPIA3113332
Highlight:
Off
Subcell location
Predicted location
Cell type specificity
Protein class



Number of interactions: 76
Interaction# Consensus# IntAct# BioGrid# OpenCell# BioPlex
ACE2101317300
AIF1L001260
ALDH7A111340
ANPEP001400
ATP2A1001600
ATP2B1401880
ATP2B413840
CANX126723538124
CDC423033157151
CRBN6103810
DDRGK1636417
DNAAF5014022
DNAJB1110753510
DPP40613600
ECPAS78371324
FAM162A12327
FBXO22222000
FBXO641385052
GARS1042900
GNAI3563119
GNPAT00303
HRAS1287135010
HSP90AA15156349230
HSP90AB1631431642219
HSPA537322602586
IPO519748370
KCNA3229800
KRAS95394160
LAMP181191471
MTX100610
MYC6588126320
NRAS3514200
OS9663020
PDHA14410021
PFKP143800
PGD00600
PGRMC1171747834
PPA1001220
PPIA3113332
PRKN1111140500
RAB11A21112034713
RAB1B95211910
RAB2A221542471
RAB355317511
RAB5C1075507
RAB6A342020
RAB7A2811124361
RAB9A628845
RHOA2121120916
RHOB3230811
RING1222360220
SAE1378113
SLC16A11011110
SLC25A13111808
SLC25A2200708
SLC3A2523750
SLC7A5111450
SPTB14400
STEAP3011230
STX7243757300
TELO23616019
TGOLN20310200
TMPRSS11B5057016
TMPRSS40105900
TOMM223530121
TUBA1A129890121
TUBA1C10333445
TUBA4A13535037
UBE2G233820
UFL111332120
UQCRFS12119034
VDAC191576222
WDR1222150
YARS1011700
YTHDF2002850
YTHDF3222540
Show allShow less
Highlight:
Off
Subcell location
Predicted location
Cell type specificity
Protein class



Number of interactions: 8
Interaction# Consensus# IntAct# BioGrid# OpenCell# BioPlex
ATP2B1401880
CALM1442115710
CALM27021830
CALM31058351620
CANX126723538124
CAPZB9661276660
MAPRE15437931984
SLC16A11011110
BSG has no defined protein interactions in BioPlex.
METABOLIC SUMMARYi

In this summary the pathway/subsystems to which the gene belongs is shown together with the associated cellular compartment for the reactions based on metabolite information. The number of proteins and metabolites in the pathway is shown as well as the number of reactions for the selected gene. By clicking on a pathway in the summary table the associated metabolic network map is shown, together with a gene-tissue heatmap that shows the expression of genes in that pathway in different tissues.

BSG is not a metabolic protein

Contact

  • NEWS ARTICLES
  • PRESS ROOM

The Project

  • INTRODUCTION
  • ORGANIZATION
  • PUBLICATIONS

The Human Protein Atlas

  • DOWNLOADABLE DATA
  • LICENCE & CITATION
  • HELP & FAQ
The Human Protein Atlas project is funded
by the Knut & Alice Wallenberg Foundation.


contact@proteinatlas.org