We use cookies to enhance the usability of our website. If you continue, we'll assume that you are happy to receive all cookies. More information. Don't show this again.
SOX2
HPA
RESOURCES
  • TISSUE
  • BRAIN
  • SINGLE CELL
  • SUBCELLULAR
  • CANCER
  • BLOOD
  • CELL LINE
  • STRUCTURE & INTERACTION
ABOUT
  • INTRODUCTION
  • HISTORY
  • ORGANIZATION
  • PUBLICATIONS
  • ANTIBODY SUBMISSION
  • ANTIBODY AVAILABILITY
  • ACKNOWLEDGMENTS
  • CONTACT
NEWS
  • NEWS ARTICLES
  • PRESS ROOM
LEARN
  • DICTIONARY
  • PROTEIN CLASSES
  • PROTEIN EVIDENCE
  • METHODS
  • EDUCATIONAL VIDEOS
DATA
  • DOWNLOADABLE DATA
  • PUBLICATION DATA
  • RELEASE HISTORY
HELP
  • ANTIBODY VALIDATION
  • ASSAYS & ANNOTATION
  • DISCLAIMER
  • HELP & FAQ
  • PRIVACY STATEMENT
  • LICENCE & CITATION
Fields »
Search result

Field
Term
Gene name
Class
Subclass
Class
Keyword
Chromosome
External id
Tissue
Cell type
Expression
Antibody panel
Tissue
Main location
Patient ID
Annotation
Tissue
Category
Tau score
Cluster
Reliability
Brain region
Category
Tau score
Brain region
Category
Tau score
Brain region
Category
Tau score
Cluster
Reliability
Tissue
Cell type
Enrichment
Cell type
Category
Tau score
Cell type
Category
Tau score
Cell type
Category
Tau score
Cell lineage
Category
Tau score
Cluster
Cluster
Location
Searches
Location
Cell line
Class
Type
Phase
Reliability
Cancer
Prognosis
Cancer
Category
Cancer
Category
Tau score
Cluster
Variants
Interacting gene (ensg_id)
Type
Number of interactions
Pathway
Category
Score
Score
Score
Validation
Validation
Validation
Validation
Antibodies
Data type
Column


  • SUMMARY

  • TISSUE

  • BRAIN

  • SINGLE CELL

  • SUBCELL

  • CANCER

  • BLOOD

  • CELL LINE

  • STRUCT & INT

  • SOX2
STRUCTURE & INTERACTION STRUCTURE INTERACTION
Protein structures
Protein interactions
Human metabolism
GENERAL INFORMATIONi

General description of the gene and the encoded protein(s) using information from HGNC and Ensembl, as well as predictions made by the Human Protein Atlas project.

Gene namei

Official gene symbol, which is typically a short form of the gene name, according to HGNC.

SOX2
Synonyms
Gene descriptioni

Full gene name according to HGNC.

SRY-box transcription factor 2
Protein classi

Assigned HPA protein class(es) for the encoded protein(s).

Cancer-related genes
Disease related genes
Human disease related genes
Transcription factors
Predicted locationi

All transcripts of all genes have been analyzed regarding the location(s) of corresponding protein based on prediction methods for signal peptides and transmembrane regions.

  • Genes with at least one transcript predicted to encode a secreted protein, according to prediction methods or to UniProt location data, have been further annotated and classified with the aim to determine if the corresponding protein(s) are secreted or actually retained in intracellular locations or membrane-attached.

  • Remaining genes, with no transcript predicted to encode a secreted protein, will be assigned the prediction-based location(s).

The annotated location overrules the predicted location, so that a gene encoding a predicted secreted protein that has been annotated as intracellular will have intracellular as the final location.

Intracellular
Protein evidence Evidence at protein level (all genes)

HUMAN PROTEIN ATLAS INFORMATIONi

Summary of RNA expression analysis and annotation data generated within the Human Protein Atlas project.

Single cell type
expression clusteri

The RNA data was used to cluster genes according to their expression across single cell types. Clusters contain genes that have similar expression patterns, and each cluster has been manually annotated to describe common features in terms of function and specificity.

Muller glia cells - Visual perception (mainly)
Single cell type specificityi

The RNA specificity category is based on mRNA expression levels in the analyzed cell types based on scRNA-seq data from normal tissues. The categories include: cell type enriched, group enriched, cell type enhanced, low cell type specificity and not detected.

Cell type enhanced (Basal squamous epithelial cells, Club cells, Muller glia cells, Schwann cells, Squamous epithelial cells)
Tissue expression
cluster (RNA)i

The RNA data was used to cluster genes according to their expression across tissues. Clusters contain genes that have similar expression patterns, and each cluster has been manually annotated to describe common features in terms of function and specificity.

Brain - Neuronal signaling (mainly)
Tissue specificity (RNA)i

The RNA specificity category is based on mRNA expression levels in the consensus dataset which is calculated from the RNA expression levels in samples from HPA and GTEX. The categories include: tissue enriched, group enriched, tissue enhanced, low tissue specificity and not detected.

Tissue enhanced (Brain, Esophagus)
Subcellular locationi

Main subcellular location based on data generated in the subcellular section of the Human Protein Atlas.

Localized to the Nucleoplasm
Secretome annotationi

All genes with at least one predicted secreted isoform have been annotated and classified with the aim to determine if the corresponding protein(s) are:

  • secreted into blood
  • locally secreted
  • or actually being attached to membrane or retained in intracellular locations like mitochondria, endoplasmatic reticulum (ER), Golgi apparatus or lysosomes.

Not available
GENE INFORMATIONi

Gene information from Ensembl and Entrez, as well as links to available gene identifiers are displayed here. Information was retrieved from Ensembl if not indicated otherwise.

Chromosome 3
Cytoband q26.33
Chromosome location (bp) 181711925 - 181714436
Number of transcriptsi

Number of protein-coding transcripts from the gene as defined by Ensembl.

1
Ensembl ENSG00000181449 (version 109)
Entrez gene 6657
HGNC HGNC:11195
UniProt P48431 (UniProt - Evidence at protein level)
neXtProt NX_P48431
GeneCards SOX2
ASSAYSi

Links to data of the different assays available in the Structure & Interaction resource of the Human Protein Atlas. Click on the miniature images to directly get to the respective section.

Proteinstructure Interaction
PROTEIN BROWSERi

The Structure section provides in-house generated structures, predicted using the Alphafold source code, for the majority of the proteins and their related isoforms.

Displaying protein features on the AlphaFold structures

Individual splice variants can be selected in the top part of the Protein Browser (see below) and different transcript-related features such as transmembrane regions, InterPro domains and antigen sequences for antibodies can be displayed in the structure by clicking on the respective features in the Protein Browser.

Clinical and population-based amino acid variants based on data from the Ensembl variation database and AlphaMissense (AM) predictions can be highlighted using the sliders to the right of the structure. These can also be used to colour the entire structure by residue index or make the structure autorotate.The structures are displayed using the NGL Viewer and can also be zoomed-in and rotated manually.

The Protein Browser

The ProteinBrowser displays the antigen location on the target protein(s) and the features of the target protein. Transcript names and schematic transcript structures including exons, introns and UTRs for the different isoforms are shown on top, and can be used to switch between the structures for the different splice variants.

At the top of the view, the position of the antigen (identified by the corresponding HPA identifier) is shown as a green bar. A yellow triangle on the bar indicates a <100% sequence identity to the protein target.

Below the antigens, the maximum percent sequence identity of the protein to all other proteins from other human genes is displayed, using a sliding window of 10 aa residues (HsID 10) or 50 aa residues (HsID 50). The region with the lowest possible identity is always selected for antigen design, with a maximum identity of 60% allowed for designing a single-target antigen (read more).

The curve in blue displays the predicted antigenicity i.e. the tendency for different regions of the protein to generate an immune response, with peak regions being predicted to be more antigenic.The curve shows average values based on a sliding window approach using an in-house propensity scale. (read more).

Signal peptides (turquoise) and membrane regions (orange) based on predictions using the majority decision methods MDM and MDSEC are also displayed.

Low complexity regions are shown in yellow and InterPro regions in green. Common (purple) and unique (grey) regions between different splice variants of the gene are also displayed (read more), and at the bottom of the protein view is the protein scale.
SOX2-201

Description:

Color scheme:
Confidence
Residue index
Your selection
Variants:
Off
Population
Clinical
Alphamissense variants:
Off
Benign
Pathogenic
Autorotate:
Off
On
PROTEIN INFORMATIONi

The protein information section displays alternative protein-coding transcripts (splice variants) encoded by this gene according to the Ensembl database.

The Splice variant identifier links to the Ensembl website protein summary for the selected splice variant. The data in the Swissprot and TrEMBL columns links to corresponding pages in the UniProt database.

The protein classes assigned to this protein are shown if expanding the data in the protein class column. Parent protein classes are in bold font and subclasses are listed under the parent class.

The length of the protein (amino acid residues according to Ensembl), molecular mass (kDalton), predicted signal peptide and number of predicted transmembrane region(s) according to in-house majority decision methods based on sets of predictors are also reported.
Splice variant SwissProt TrEMBL Protein class Length & mass Signal peptide
(predicted)
Transmembrane regions
(predicted)
SOX2-201
P48431
Show all
A0A0U3FYV6
Show all
Predicted intracellular proteins
Transcription factors
Cancer-related genes
Disease related genes
Human disease related genes
Mapped to neXtProt
Protein evidence (Ezkurdia et al 2014)
Show all
317 aa
34.3 kDa
No 0
INTERACTIONi

In the Interaction part of this page network plots showing the gene's interaction partners according to four different datasets are displayed, including a consensus network plot showing only interactions present in at least two of the datasets. In the network plots the nodes represent genes and edge color represent the number of datasets the interaction belongs to.

The highlight bar in the top of the plot can be used to color the nodes according to subcellular location (based on data in the Subcellular section), predicted location (based on signalpeptide and transmembrane region predictions), tissue specificity (based on RNA tissue expression profiles) or proteinclass. For genes categorised as single cell type or group specific the option to highlight interaction partners expressed in the same cell type will also be available in the highlight bar.

Custom highlighting of nodes is possible using the top left Filter option in which a query of choice can be built to for example label all nodes that are tissue enriched in both human and mouse brain or those that belongs to a certain tissue expression cluster. The expression cluster for the gene is stated in the box Human Protein Atlas Information above the network plot. Click on Filter in the top left in the plot to find the query builder and some example queries.

Interactions included are direct interaction and physical associations with high and medium confidence from IntAct, physical multivalidated interactions from BioGRID, interactions with>75% probability from BioPlex and significant physical interactions from OpenCell.

Filter menu »
Fields »
e.g. Brain enriched, Localized to mitochondria, Brain - Neuronal signaling

Field
Term
Gene name
Class
Subclass
Class
Keyword
Chromosome
External id
Tissue
Cell type
Expression
Antibody panel
Tissue
Main location
Patient ID
Annotation
Tissue
Category
Tau score
Cluster
Reliability
Brain region
Category
Tau score
Brain region
Category
Tau score
Brain region
Category
Tau score
Cluster
Reliability
Tissue
Cell type
Enrichment
Cell type
Category
Tau score
Cell type
Category
Tau score
Cell type
Category
Tau score
Cell lineage
Category
Tau score
Cluster
Cluster
Location
Searches
Location
Cell line
Class
Type
Phase
Reliability
Cancer
Prognosis
Cancer
Category
Cancer
Category
Tau score
Cluster
Variants
Interacting gene (ensg_id)
Type
Number of interactions
Pathway
Category
Score
Score
Score
Validation
Validation
Validation
Validation
Antibodies
Data type
Column


Consensus
IntAct
BioGrid
OpenCell
BioPlex
Highlight:
Off
Subcell location
Predicted location
Cell type specificity
Protein class



Number of interactions: 5
Interaction# Consensus# IntAct# BioGrid# OpenCell# BioPlex
KDM6A242000
NFIA13322200
NFIB10251500
NFIC9102010
POU5F1341800
Highlight:
Off
Subcell location
Predicted location
Cell type specificity
Protein class



Number of interactions: 8 View all protein interaction data in IntAct
Interaction# Consensus# IntAct# BioGrid# OpenCell# BioPlex
KDM6A242000
MBD381451110
NFIA13322200
NFIB10251500
NFIC9102010
POU5F1341800
SUMO1266155130
VRK141222117
Highlight:
Off
Subcell location
Predicted location
Cell type specificity
Protein class



Number of interactions: 198
Interaction# Consensus# IntAct# BioGrid# OpenCell# BioPlex
AHCYL14282200
AKT1406019524
ANXA1362021
ARID1A17544180
ARID1B11431100
ARID3A010800
ARID3B021120
ASH2L11184930
ATAD3A033110
BAG2171552769
BCL934700
BCOR643780
C7orf50258638
CCAR2684030
CCT33011892060
CDK1121912660
CHD322149747
CHD4133178170
CHD7221130
CIC553200
COP1775210
CTNNB147852691313
CUL4A91213700
CUX1211240
DAP313235936
DCAF1685770
DCAF5238123
DDAH215306
DDB13922169649
DDX27203030
DDX39A292380
DDX566521137
DHX309638330
DNAJA111366199
DNAJA210564928
DNAJB6131330250
DOCK7682660
EEF2234790
EGFR18527244106
EHMT29285400
EIF4A113740402
EIF6563740
EP300273436841
EP4006737100
FAM98A321190
FARSA241211
FOXC1131901
FOXK17427316
FOXK2343620
FOXP4321103
FUS142814760
FZR111710450
GATA4122200
GATAD2A7540100
GCN11124114
GNL39637440
GSE110152120
GTPBP4134472049
HDAC1102633627642
HDAC3121910370
HNRNPA39260350
HNRNPK3152111210
HNRNPL334112701
HNRNPM121681180
HNRNPU337196460
HOXA504302
HSPD120371251145
ILF2148873224
KDM1A618615290
KDM2B222200
KDM6A242000
KHDRBS1172261150
KMT2D342700
KRR116241186
L3MBTL31241530
LIN54361430
MAPK3111767212
MEF2A111700
MIDEAS411051
MIER121720
MRPL112254510
MRPL1510425035
MRPL4044410
MRPL4112322326
MRPL918444138
MRPS18B111626041
MRPS2214138816
MRPS23163341037
MRPS2634056465
MRPS27133321062
MRPS3411032855
MRPS916234851
MSI215530
MTA113236708
MTA214681179
MYEF200310
NCOA2452910
NCOA3795010
NCOA6784800
NCOR113189030
NCOR29147010
NFIA13322200
NFIB10251500
NFIC9102010
NFIX021100
NIP7141100
NIPBL531990
NOM1236336
NOP2475860
NOP589348172
NOS101700
NR2C1001700
NR2C2223200
OLA101810
OTUD7B3131500
PCGF1872504
PDCD111118335
PHF20L1224703
PIAS18386440
PITX15511201
POGZ75819120
POU2F1143200
POU5F1341800
PRR1200210
PWP1201052
RAD23B7164602
RALY7549160
RBBP44136142400
RBBP51295470
RBM7206321314
RNH1451319
RPL13A3226736117
RPL348324352
RPL7L111218740
RPS11354961215
RPS17121492312
RSL1D1126512016
SALL1111320
SAP130602228
SART32019411212
SATB1253000
SETD77172000
SIX400301
SMARCA2886220
SMARCA43214133362
SMARCA5151875277
SMARCB1213380270
SMARCC1183380201
SMARCC2191084170
SMARCD11710547160
SMARCE1235755280
SMC1A1714662311
SMC310106945
SOX100100
SOX14013100
SOX2100100
SPEN231910
SQSTM1314030520
SRP722423014915
SSB2774913210
SSBP1245906
STAU13213369390
SUPT16H175713124
SUPT5H37868910
TAF159253150
TBX206300
TCF20111300
TCF3453900
TEAD1121510
TFAM383050
TFAP41142001
TLE112163618
TLE3113800
TRIM24634980
TRIM26271800
TRPS1721070
TUBA4A13535037
TWNK01520
UBR5558624
USP3410530
USP73537205711
USP9X229200
WDR2611162360
WDR5344818585
WIZ7022114
WWP2185213911
XRCC110103183
XRCC51418131132
XRCC62122226152
ZFHX400400
ZMYM2764351
ZNF281041100
ZNF38400320
ZNF46211800
ZNF53600230
ZNF60810210
ZNF60931742
ZNF644301682
Show allShow less
SOX2 has no defined protein interactions in OpenCell.
SOX2 has no defined protein interactions in BioPlex.
METABOLIC SUMMARYi

In this summary the pathway/subsystems to which the gene belongs is shown together with the associated cellular compartment for the reactions based on metabolite information. The number of proteins and metabolites in the pathway is shown as well as the number of reactions for the selected gene. By clicking on a pathway in the summary table the associated metabolic network map is shown, together with a gene-tissue heatmap that shows the expression of genes in that pathway in different tissues.

SOX2 is not a metabolic protein

Contact

  • NEWS ARTICLES
  • PRESS ROOM

The Project

  • INTRODUCTION
  • ORGANIZATION
  • PUBLICATIONS

The Human Protein Atlas

  • DOWNLOADABLE DATA
  • LICENCE & CITATION
  • HELP & FAQ
The Human Protein Atlas project is funded
by the Knut & Alice Wallenberg Foundation.


contact@proteinatlas.org