Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: APC2_HUMAN (O95996)

Summary

This is the summary of UniProt entry APC2_HUMAN (O95996).

Description: Adenomatous polyposis coli protein 2
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
View Pfam proteome data.
Length: 2303 amino acids

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
coiled_coil n/a 8 28
disorder n/a 16 17
disorder n/a 19 45
disorder n/a 47 52
disorder n/a 94 119
Pfam A Suppressor_APC 123 206
coiled_coil n/a 131 151
low_complexity n/a 144 152
disorder n/a 200 202
disorder n/a 213 214
disorder n/a 227 230
disorder n/a 238 270
disorder n/a 291 299
low_complexity n/a 326 340
disorder n/a 328 343
low_complexity n/a 393 409
disorder n/a 398 402
Pfam A Arm 614 654
low_complexity n/a 627 640
disorder n/a 706 707
coiled_coil n/a 725 745
disorder n/a 736 765
Pfam B Pfam-B_12027 771 906
low_complexity n/a 779 801
disorder n/a 782 793
disorder n/a 813 839
low_complexity n/a 837 852
coiled_coil n/a 840 860
low_complexity n/a 867 878
disorder n/a 868 935
disorder n/a 947 993
disorder n/a 1001 1038
disorder n/a 1041 1045
disorder n/a 1050 1051
Pfam A APC_crr 1052 1077
disorder n/a 1053 1057
low_complexity n/a 1065 1078
disorder n/a 1070 1154
low_complexity n/a 1082 1102
Pfam A APC_crr 1144 1169
low_complexity n/a 1157 1186
disorder n/a 1173 1228
low_complexity n/a 1214 1226
Pfam A APC_crr 1257 1282
disorder n/a 1258 1262
low_complexity n/a 1275 1283
disorder n/a 1298 1301
low_complexity n/a 1304 1321
disorder n/a 1307 1339
Pfam A SAMP 1337 1357
disorder n/a 1361 1362
disorder n/a 1371 1498
Pfam A APC_crr 1385 1410
Pfam B Pfam-B_53475 1439 1478
disorder n/a 1504 1688
low_complexity n/a 1518 1526
low_complexity n/a 1548 1562
low_complexity n/a 1578 1596
low_complexity n/a 1609 1623
Pfam A SAMP 1625 1643
low_complexity n/a 1638 1650
low_complexity n/a 1659 1675
disorder n/a 1695 2036
low_complexity n/a 1703 1716
Pfam A APC_basic 1786 2123
low_complexity n/a 1819 1830
low_complexity n/a 1868 1887
low_complexity n/a 1896 1909
low_complexity n/a 1935 1947
low_complexity n/a 1961 1976
low_complexity n/a 1977 1989
low_complexity n/a 2000 2028
disorder n/a 2043 2232
low_complexity n/a 2047 2066
low_complexity n/a 2111 2131
disorder n/a 2235 2241
disorder n/a 2251 2303
low_complexity n/a 2288 2300

Show or hide domain scores.

Sequence annotations

This section shows a graphical representation of this sequence, with Pfam domains shown in the standard Pfam format. Under the Pfam domain image we show various tracks, illustrating features on this sequence that we found in other databases. You can choose which databases to include using the drop-down panel under the image. More...

Note: it can take a few seconds for this image to be generated and loaded.

Loading feature alignment...

Show sources update panel.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession O95996. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MASSVAPYEQ LVRQVEALKA ENSHLRQELR DNSSHLSKLE TETSGMKEVL
50
51
KHLQGKLEQE ARVLVSSGQT EVLEQLKALQ MDITSLYNLK FQPPTLGPEP
100
101
AARTPEGSPV HGSGPSKDSF GELSRATIRL LEELDRERCF LLNEIEKEEK
150
151
EKLWYYSQLQ GLSKRLDELP HVETQFSMQM DLIRQQLEFE AQHIRSLMEE
200
201
RFGTSDEMVQ RAQIRASRLE QIDKELLEAQ DRVQQTEPQA LLAVKSVPVD
250
251
EDPETEVPTH PEDGTPQPGN SKVEVVFWLL SMLATRDQED TARTLLAMSS
300
301
SPESCVAMRR SGCLPLLLQI LHGTEAAAGG RAGAPGAPGA KDARMRANAA
350
351
LHNIVFSQPD QGLARKEMRV LHVLEQIRAY CETCWDWLQA RDGGPEGGGA
400
401
GSAPIPIEPQ ICQATCAVMK LSFDEEYRRA MNELGGLQAV AELLQVDYEM
450
451
HKMTRDPLNL ALRRYAGMTL TNLTFGDVAN KATLCARRGC MEAIVAQLAS
500
501
DSEELHQVVS SILRNLSWRA DINSKKVLRE AGSVTALVQC VLRATKESTL
550
551
KSVLSALWNL SAHSTENKAA ICQVDGALGF LVSTLTYKCQ SNSLAIIESG
600
601
GGILRNVSSL VATREDYRQV LRDHNCLQTL LQHLTSHSLT IVSNACGTLW
650
651
NLSARSARDQ ELLWDLGAVG MLRNLVHSKH KMIAMGSAAA LRNLLAHRPA
700
701
KHQAAATAVS PGSCVPSLYV RKQRALEAEL DARHLAQALE HLEKQGPPAA
750
751
EAATKKPLPP LRHLDGLAQD YASDSGCFDD DDAPSSLAAA AATGEPASPA
800
801
ALSLFLGSPF LQGQALARTP PTRRGGKEAE KDTSGEAAVA AKAKAKLALA
850
851
VARIDQLVED ISALHTSSDD SFSLSSGDPG QEAPREGRAQ SCSPCRGPEG
900
901
GRREAGSRAH PLLRLKAAHA SLSNDSLNSG SASDGYCPRE HMLPCPLAAL
950
951
ASRREDPRCG QPRPSRLDLD LPGCQAEPPA REATSADARV RTIKLSPTYQ
1000
1001
HVPLLEGASR AGAEPLAGPG ISPGARKQAW LPADHLSKVP EKLAAAPLSV
1050
1051
ASKALQKLAA QEGPLSLSRC SSLSSLSSAG RPGPSEGGDL DDSDSSLEGL
1100
1101
EEAGPSEAEL DSTWRAPGAT SLPVAIPAPR RNRGRGLGVE DATPSSSSEN
1150
1151
YVQETPLVLS RCSSVSSLGS FESPSIASSI PSEPCSGQGS GTISPSELPD
1200
1201
SPGQTMPPSR SKTPPLAPAP QGPPEATQFS LQWESYVKRF LDIADCRERC
1250
1251
RLPSELDAGS VRFTVEKPDE NFSCASSLSA LALHEHYVQQ DVELRLLPSA
1300
1301
CPERGGGAGG AGLHFAGHRR REEGPAPTGS RPRGAADQEL ELLRECLGAA
1350
1351
VPARLRKVAS ALVPGRRALP VPVYMLVPAP APAQEDDSCT DSAEGTPVNF
1400
1401
SSAASLSDET LQGPPRDQPG GPAGRQRPTG RPTSARQAMG HRHKAGGAGR
1450
1451
SAEQSRGAGK NRAGLELPLG RPPSAPADKD GSKPGRTRGD GALQSLCLTT
1500
1501
PTEEAVYCFY GNDSDEEPPA AAPTPTHRRT SAIPRAFTRE RPQGRKEAPA
1550
1551
PSKAAPAAPP PARTQPSLIA DETPPCYSLS SSASSLSEPE PSEPPAVHPR
1600
1601
GREPAVTKDP GPGGGRDSSP SPRAAEELLQ RCISSALPRR RPPVSGLRRR
1650
1651
KPRATRLDER PAEGSRERGE EAAGSDRASD LDSVEWRAIQ EGANSIVTWL
1700
1701
HQAAAATREA SSESDSILSF VSGLSVGSTL QPPKHRKGRQ AEGEMGSARR
1750
1751
PEKRGAASVK TSGSPRSPAG PEKPRGTQKT TPGVPAVLRG RTVIYVPSPA
1800
1801
PRAQPKGTPG PRATPRKVAP PCLAQPAAPA KVPSPGQQRS RSLHRPAKTS
1850
1851
ELATLSQPPR SATPPARLAK TPSSSSSQTS PASQPLPRKR PPVTQAAGAL
1900
1901
PGPGASPVPK TPARTLLAKQ HKTQRSPVRI PFMQRPARRG PPPLARAVPE
1950
1951
PGPRGRAGTE AGPGARGGRL GLVRVASALS SGSESSDRSG FRRQLTFIKE
2000
2001
SPGLRRRRSE LSSAESAASA PQGASPRRGR PALPAVFLCS SRCEELRAAP
2050
2051
RQGPAPARQR PPAARPSPGE RPARRTTSES PSRLPVRAPA ARPETVKRYA
2100
2101
SLPHISVARR PDGAVPAAPA SADAARRSSD GEPRPLPRVA APGTTWRRIR
2150
2151
DEDVPHILRS TLPATALPLR GSTPEDAPAG PPPRKTSDAV VQTEEVAAPK
2200
2201
TNSSTSPSLE TREPPGAPAG GQLSLLGSDV DGPSLAKAPI SAPFVHEGLG
2250
2251
VAVGGFPASR HGSPSRSARV PPFNYVPSPM VVAATTDSAA EKAPATASAT
2300
2301
LLE                                                   
2303
 

Show the unformatted sequence.

Checksums:
CRC64:7BF940183ACD643D
MD5:17805df8f88f5227b2a52f48a1fb45ac

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.