Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: YH41B_YEAST (P0C2J7)

Summary

This is the summary of UniProt entry YH41B_YEAST (P0C2J7).

Description: Transposon Ty4-H Gag-Pol polyprotein Capsid protein Ty4 protease Integrase Reverse transcriptase/ribonuclease H EC=3.4.23.- EC=2.7.7.49 EC=2.7.7.7 EC=3.1.26.4
Source organism: Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) (NCBI taxonomy ID 559292)
View Pfam proteome data.
Length: 1802 amino acids

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
disorder n/a 1 9
disorder n/a 12 13
disorder n/a 24 39
disorder n/a 42 43
coiled_coil n/a 48 68
coiled_coil n/a 78 112
disorder n/a 379 382
disorder n/a 385 393
disorder n/a 395 399
low_complexity n/a 507 515
disorder n/a 533 536
disorder n/a 538 543
disorder n/a 551 552
disorder n/a 560 564
disorder n/a 566 573
disorder n/a 607 619
Pfam A rve 619 741
disorder n/a 692 693
disorder n/a 851 852
disorder n/a 862 878
coiled_coil n/a 873 893
disorder n/a 880 888
disorder n/a 898 907
disorder n/a 913 924
disorder n/a 926 927
disorder n/a 930 934
disorder n/a 945 954
disorder n/a 956 959
disorder n/a 962 963
disorder n/a 998 999
disorder n/a 1002 1004
disorder n/a 1027 1031
disorder n/a 1054 1213
disorder n/a 1215 1258
disorder n/a 1282 1284
disorder n/a 1286 1290
Pfam A RVT_2 1311 1547
low_complexity n/a 1489 1504
low_complexity n/a 1736 1762

Show or hide domain scores.

Sequence annotations

This section shows a graphical representation of this sequence, with Pfam domains shown in the standard Pfam format. Under the Pfam domain image we show various tracks, illustrating features on this sequence that we found in other databases. You can choose which databases to include using the drop-down panel under the image. More...

Note: it can take a few seconds for this image to be generated and loaded.

Loading feature alignment...

Show sources update panel.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P0C2J7. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MATPVRDETR NVIDDNISAR IQSKVKTNDT VRQTPSSLRK VSIKDEQVKQ
50
51
YQRNLNRFKT ILNGLKAEEE KLSETDDIQM LAEKLLKLGE TIDKVENRIV
100
101
DLVEKIQLLE TNENNNILHE HIDATGTYYL FDTLTSTNKR FYPKDCVFDY
150
151
RTNNVENIPI LLNNFKKFIK KYQFDDVFEN DIIEIDPREN EILCKIIKEG
200
201
LGESLDIMNT NTTDIFRIID GLKNKYRSLH GRDVRIRAWE KVLVDTTCRN
250
251
SALLMNKLQK LVLMEKWIFS KCCQDCPNLK DYLQEAIMGT LHESLRNSVK
300
301
QRLYNIPHNV GINHEEFLIN TVIETVIDLS PIADDQIENS CMYCKSVFHC
350
351
SINCKKKPNR ELGLTRPISQ KPIIYKVHRD NNNLSPVQNE QKSWNKTQKK
400
401
SNKVYNSKKL VIIDTGSGVN ITNDKTLLHN YEDSNRSTRF FGIGKNSSVS
450
451
VKGYGYIKIK NGHNNTDNKC LLTYYVPEEE STIISCYDLA KKTKMVLSRK
500
501
YTRLGNKIIK IKTKIVNGVI HVKMNELIER PSDDSKINAI KPTSSPGFKL
550
551
NKRSITLEDA HKRMGHTGIQ QIENSIKHNH YEESLDLIKE PNEFWCQTCK
600
601
ISKATKRNHY TGSMNNHSTD HEPGSSWCMD IFGPVSSSNA DTKRYMLIMV
650
651
DNNTRYCMTS THFNKNAETI LAQIRKNIQY VETQFDRKVR EINSDRGTEF
700
701
TNDQIEEYFI SKGIHHILTS TQDHAANGRA ERYIRTIVTD ATTLLRQSNL
750
751
RVKFWEYAVT SATNIRNCLE HKSTGKLPLK AISRQPVTVR LMSFLPFGEK
800
801
GIIWNHNHKK LKPSGLPSII LCKDPNSYGY KFFIPSKNKI VTSDNYTIPN
850
851
YTMDGRVRNT QNIYKSHQFS SHNDNEEDQI ETVTNLCEAL ENYEDDNKPI
900
901
TRLEDLFTEE ELSQIDSNAK YPSPSNNLEG DLDYVFSDVE ESGDYDVESE
950
951
LSTTNTSIST DKNKILSNKD FNSELASTEI SISEIDKKGL INTSHIDEDK
1000
1001
YDEKVHRIPS IIQEKLVGSK NTIKINDENR ISDRIRSKNI GSILNTGLSR
1050
1051
CVDITDESIT NKDESMHNAK PELIQEQFNK TNHETSFPKE GSIGTNVKFR
1100
1101
NTDNEISLKT GDTSLPIKTL ESINNHHSND YSTNKVEKFE KENHHPPPIE
1150
1151
DIVDMSDQTD MESNCQDGNN LKELKVTDKN VPTDNGTNVS PRLEQNIEAS
1200
1201
GSPVQTVNKS AFLNKEFSSL NMKRKRKRHD KNNSLTSYEL ERDKKRSKRN
1250
1251
RVKLIPDNME TVSAQKIRAI YYNEAISKNP DLKEKHEYKQ AYHKELQNLK
1300
1301
DMKVFDVDVK YSRSEIPDNL IVPTNTIFTK KRNGIYKARI VCRGDTQSPD
1350
1351
TYSVITTESL NHNHIKIFLM IANNRNMFMK TLDINHAFLY AKLEEEIYIP
1400
1401
HPHDRRCVVK LNKALYGLKQ SPKEWNDHLR QYLNGIGLKD NSYTPGLYQT
1450
1451
EDKNLMIAVY VDDCVIAASN EQRLDEFINK LKSNFELKIT GTLIDDVLDT
1500
1501
DILGMDLVYN KRLGTIDLTL KSFINRMDKK YNEELKKIRK SSIPHMSTYK
1550
1551
IDPKKDVLQM SEEEFRQGVL KLQQLLGELN YVRHKCRYDI NFAVKKVARL
1600
1601
VNYPHERVFY MIYKIIQYLV RYKDIGIHYD RDCNKDKKVI AITDASVGSE
1650
1651
YDAQSRIGVI LWYGMNIFNV YSNKSTNRCV SSTEAELHAI YEGYADSETL
1700
1701
KVTLKELGEG DNNDIVMITD SKPAIQGLNR SYQQPKEKFT WIKTEIIKEK
1750
1751
IKEKSIKLLK ITGKGNIADL LTKPVSASDF KRFIQVLKNK ITSQDILAST
1800
1801
DY                                                    
1802
 

Show the unformatted sequence.

Checksums:
CRC64:7F6A8A3483966588
MD5:1ea7856c210c3eb8f42632b36788b08c