
Protein: ERCC5_HUMAN (P28715)

Summary

This is the summary of UniProt entry ERCC5_HUMAN (P28715).

Description: DNA repair protein complementing XP-G cells EC=3.1.-.-
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 1186 amino acids

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source          Domain        Start  End
Pfam A          XPG_N         1      98
disorder        n/a           94     97
disorder        n/a           100    101
low_complexity  n/a           104    115
disorder        n/a           130    131
disorder        n/a           145    174
low_complexity  n/a           150    167
disorder        n/a           176    181
disorder        n/a           187    188
disorder        n/a           190    193
disorder        n/a           204    207
disorder        n/a           215    220
disorder        n/a           245    269
Pfam B          Pfam-B_29308  261    401
disorder        n/a           273    278
disorder        n/a           301    392
low_complexity  n/a           352    364
disorder        n/a           394    397
disorder        n/a           399    549
disorder        n/a           553    557
low_complexity  n/a           557    565
disorder        n/a           560    595
disorder        n/a           600    606
disorder        n/a           608    732
low_complexity  n/a           642    656
low_complexity  n/a           732    757
coiled_coil     n/a           733    756
disorder        n/a           744    748
disorder        n/a           755    757
Pfam A          XPG_I         777    861
disorder        n/a           903    912
Pfam B          Pfam-B_659    989    1097
disorder        n/a           1008   1009
low_complexity  n/a           1025   1042
disorder        n/a           1051   1052
disorder        n/a           1056   1076
low_complexity  n/a           1064   1079
low_complexity  n/a           1094   1109
disorder        n/a           1095   1186
low_complexity  n/a           1146   1154
low_complexity  n/a           1171   1185
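
The rows above can be read programmatically. The following is a minimal sketch, not part of Pfam's own tooling: the `Region` type and `parse_region` helper are hypothetical names introduced here, and the parser assumes each row is whitespace-separated with the start and end coordinates as the last two fields. Splitting from the right keeps a source that contains a space, such as "Pfam A", in one piece.

```python
from typing import NamedTuple


class Region(NamedTuple):
    """One row of the domain table (hypothetical helper type)."""
    source: str
    domain: str
    start: int
    end: int


def parse_region(line: str) -> Region:
    # Split from the right: the last three fields are domain, start, end;
    # whatever remains on the left is the source (which may contain a space).
    source, domain, start, end = line.strip().rsplit(None, 3)
    return Region(source, domain, int(start), int(end))


# A few rows copied from the table above.
rows = [
    "Pfam A XPG_N 1 98",
    "Pfam B Pfam-B_29308 261 401",
    "Pfam A XPG_I 777 861",
    "disorder n/a 903 912",
]

regions = [parse_region(r) for r in rows]
pfam_a = [r for r in regions if r.source == "Pfam A"]

for r in pfam_a:
    # Coordinates are treated as inclusive, so length is end - start + 1.
    print(r.domain, r.end - r.start + 1)
```

Treating the coordinates as inclusive is an assumption based on the table starting at position 1 and ending at the full sequence length of 1186.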


Sequence annotations

This section shows a graphical representation of this sequence, with Pfam domains shown in the standard Pfam format. Under the Pfam domain image we show various tracks, illustrating features on this sequence that we found in other databases. You can choose which databases to include using the drop-down panel under the image.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P28715. This sequence is stored in the Pfam database and is updated only with each new Pfam release, so the sequence we store may differ from the one currently stored by UniProt.

Sequence:
   1 MGVQGLWKLL ECSGRQVSPE ALEGKILAVD ISIWLNQALK GVRDRHGNSI 50
  51 ENPHLLTLFH RLCKLLFFRI RPIFVFDGDA PLLKKQTLVK RRQRKDLASS 100
 101 DSRKTTEKLL KTFLKRQAIK TAFRSKRDEA LPSLTQVRRE NDLYVLPPLQ 150
 151 EEEKHSSEEE DEKEWQERMN QKQALQEEFF HNPQAIDIES EDFSSLPPEV 200
 201 KHEILTDMKE FTKRRRTLFE AMPEESDDFS QYQLKGLLKK NYLNQHIEHV 250
 251 QKEMNQQHSG HIRRQYEDEG GFLKEVESRR VVSEDTSHYI LIKGIQAKTV 300
 301 AEVDSESLPS SSKMHGMSFD VKSSPCEKLK TEKEPDATPP SPRTLLAMQA 350
 351 ALLGSSSEEE LESENRRQAR GRNAPAAVDE GSISPRTLSA IKRALDDDED 400
 401 VKVCAGDDVQ TGGPGAEEMR INSSTENSDE GLKVRDGKGI PFTATLASSS 450
 451 VNSAEEHVAS TNEGREPTDS VPKEQMSLVH VGTEAFPISD ESMIKDRKDR 500
 501 LPLESAVVRH SDAPGLPNGR ELTPASPTCT NSVSKNETHA EVLEQQNELC 550
 551 PYESKFDSSL LSSDDETKCK PNSASEVIGP VSLQETSSIV SVPSEAVDNV 600
 601 ENVVSFNAKE HENFLETIQE QQTTESAGQD LISIPKAVEP MEIDSEESES 650
 651 DGSFIEVQSV ISDEELQAEF PETSKPPSEQ GEEELVGTRE GEAPAESESL 700
 701 LRDNSERDDV DGEPQEAEKD AEDSLHEWQD INLEELETLE SNLLAQQNSL 750
 751 KAQKQQQERI AATVTGQMFL ESQELLRLFG IPYIQAPMEA EAQCAILDLT 800
 801 DQTSGTITDD SDIWLFGARH VYRNFFNKNK FVEYYQYVDF HNQLGLDRNK 850
 851 LINLAYLLGS DYTEGIPTVG CVTAMEILNE FPGHGLEPLL KFSEWWHEAQ 900
 901 KNPKIRPNPH DTKVKKKLRT LQLTPGFPNP AVAEAYLKPV VDDSKGSFLW 950
 951 GKPDLDKIRE FCQRYFGWNR TKTDESLFPV LKQLDAQQTQ LRIDSFFRLA 1000
1001 QQEKEDAKRI KSQRLNRAVT CMLRKEKEAA ASEIEAVSVA MEKEFELLDK 1050
1051 AKGKTQKRGI TNTLEESSSL KRKRLSDSKG KNTCGGFLGE TCLSESSDGS 1100
1101 SSEDAESSSL MNVQRRTAAK EPKTSASDSQ NSVKEAPVKN GGATTSSSSD 1150
1151 SDDDGGKEKM VLVTARSVFG KKRRKLRRAR GRKRKT 1186

Checksums:
CRC64:B0A844D617C53F2E
MD5:c3bed7beb423e4b264904487e4d21bb8
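
A checksum like the MD5 above can be used to confirm that a locally held copy of the sequence matches the stored one. The sketch below is illustrative only: `clean_sequence` and `sequence_md5` are hypothetical helper names, and it assumes the digest is taken over the uppercase residue letters alone, with position numbers and whitespace removed. This page does not state how the checksum is computed, so that convention should be verified before relying on it. The CRC64 value is not covered here.

```python
import hashlib


def clean_sequence(formatted: str) -> str:
    """Keep only residue letters, dropping position numbers and whitespace."""
    return "".join(ch for ch in formatted if ch.isalpha()).upper()


def sequence_md5(formatted: str) -> str:
    """Hex MD5 of the cleaned sequence (assumed convention, see lead-in)."""
    return hashlib.md5(clean_sequence(formatted).encode("ascii")).hexdigest()


# First 50 residues only, copied from the sequence listing above.
block = """1
MGVQGLWKLL ECSGRQVSPE ALEGKILAVD ISIWLNQALK GVRDRHGNSI
50
"""

seq = clean_sequence(block)
print(len(seq))             # number of residues recovered from the block
print(sequence_md5(block))  # digest of this fragment, not of the full entry
```

To compare against the listed MD5, pass the complete 1186-residue sequence rather than this fragment.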

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.