
Protein: YI31B_YEAST (Q7LHG5)

Summary

This is the summary of UniProt entry YI31B_YEAST (Q7LHG5).

Description: Transposon Ty3-I Gag-Pol polyprotein EC=3.4.23.- EC=2.7.7.49 EC=2.7.7.7 EC=3.1.26.4
Source organism: Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) (NCBI taxonomy ID 559292)
Length: 1498 amino acids

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source          Domain          Start    End
Pfam A          Retrotrans_gag     67    161
disorder        n/a               179    180
low_complexity  n/a               214    224
disorder        n/a               217    220
disorder        n/a               227    229
disorder        n/a               231    256
low_complexity  n/a               247    255
Pfam A          Peptidase_A2B     286    462
disorder        n/a               422    425
disorder        n/a               428    450
disorder        n/a               453    454
disorder        n/a               460    463
disorder        n/a               469    545
disorder        n/a               560    574
disorder        n/a               596    598
disorder        n/a               601    602
disorder        n/a               604    610
disorder        n/a               625    629
low_complexity  n/a               652    668
Pfam A          RVT_1             665    823
Pfam B          Pfam-B_6784       940   1001
Pfam A          rve              1191   1308
disorder        n/a              1283   1299
disorder        n/a              1388   1395
disorder        n/a              1481   1484
disorder        n/a              1486   1487
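
The Start and End columns are 1-based, inclusive residue positions, so a region's length is End - Start + 1. As a rough illustration, the following Python sketch parses a few rows of the table and prints each region's length. The `Region` type and `parse_regions` helper are hypothetical names introduced here, not part of any Pfam tooling, and the parser assumes that the last three whitespace-separated fields of each row are the domain name, start, and end.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    """One row of the domain table (hypothetical helper type)."""
    source: str
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_regions(text: str) -> List[Region]:
    """Parse whitespace-separated rows; the source may be one or two tokens."""
    regions = []
    for line in text.strip().splitlines():
        *source, name, start, end = line.split()
        regions.append(Region(" ".join(source), name, int(start), int(end)))
    return regions


# The Pfam A and Pfam B rows copied from the table above.
TABLE = """
Pfam A Retrotrans_gag 67 161
Pfam A Peptidase_A2B 286 462
Pfam A RVT_1 665 823
Pfam B Pfam-B_6784 940 1001
Pfam A rve 1191 1308
"""

for region in parse_regions(TABLE):
    print(region.source, region.name, region.length)
```

Because the parser splits on any run of whitespace, it should work equally on space-separated or column-aligned rows, including the disorder and low_complexity rows.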


Sequence annotations

This section shows a graphical representation of this sequence, with Pfam domains shown in the standard Pfam format. Under the Pfam domain image we show various tracks, illustrating features on this sequence that we found in other databases. You can choose which databases to include using the drop-down panel under the image.


Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q7LHG5. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
   1  MSFMDQIPGG GNYPKLPVEC LPNFPIQPSL TFRGRNDSHK LKNFISEIML  50
  51  NMSMISWPND ASRIVYCRRH LLNPAAQWAN DFVQEQGILE ITFDTFIQGL  100
 101  YQHFYKPPDI NKIFNAITQL SEAKLGIERL NQRFRKIWDR MPPDFMTEKA  150
 151  AIMTYTRLLT KETYNIVRMH KPETLKDAME EAYQTTALTE RFFPGFELDA  200
 201  DGDTIIGATT HLQEEYDSDY DSEDNLTQNR YVHTVRTRRS YNKPMSNHRN  250
 251  RRNNNASREE CIKNRLCFYC KKEGHRLNEC RARKAVLTDL ELESKDQQTL  300
 301  FIKTLPIVHY IAIPEMDNTA EKTIKIQNTK VKTLFDSGSP TSFIRRDIVE  350
 351  LLKYEIYETP PLRFRGFVAT KSAVTSEAVT IDLKINDLQI TLAAYILDNM  400
 401  DYQLLIGNPI LRRYPKILHT VLNTRESPDS LKPKTYRSET VNNVRTYSAG  450
 451  NRGNPRNIKL SFAPTILEAT DPKSAGNRGN PRNTKLSLAP TILEATDPKS  500
 501  AGNRGDSRTK TLSLATTTPA AIDPLTTLDN PGSTQSTFAQ FPIPEEASIL  550
 551  EEDGKYSNVV STIQSVEPNA TDHSNKDTFC TLPVWLQQKY REIIRNDLPP  600
 601  RPADINNIPV KHDIEIKPGA RLPRLQPYHV TEKNEQEINK IVQKLLDNKF  650
 651  IVPSKSPCSS PVVLVPKKDG TFRLCVDYRT LNKATISDPF PLPRIDNLLS  700
 701  RIGNAQIFTT LDLHSGYHQI PMEPKDRYKT AFVTPSGKYE YTVMPFGLVN  750
 751  APSTFARYMA DTFRDLRFVN VYLDDILIFS ESPEEHWKHL DTVLERLKNE  800
 801  NLIVKKKKCK FASEETEFLG YSIGIQKIAP LQHKCAAIRD FPTPKTVKQA  850
 851  QRFLGMINYY RRFIPNCSKI AQPIQLFICD KSQWTEKQDK AIEKLKAALC  900
 901  NSPVLVPFNN KANYRLTTDA SKDGIGAVLE EVDNKNKLVG VVGYFSKSLE  950
 951  SAQKNYPAGE LELLGIIKAL HHFRYMLHGK HFTLRTDHIS LLSLQNKNEP  1000
1001  ARRVQRWLDD LATYDFTLEY LAGPKNVVAD AISRAIYTIT PETSRPIDTE  1050
1051  SWKSYYKSDP LCSAVLIHMK ELTQHNVTPE DMSAFRSYQK KLELSETFRK  1100
1101  NYSLEDEMIY YQDRLVVPIK QQNAVMRLYH DHTLFGGHFG VTVTLAKISP  1150
1151  IYYWPKLQHS IIQYIRTCVQ CQLIKSHRPR LHGLLQPLPI AEGRWLDISM  1200
1201  DFVTGLPPTS NNLNMILVVV DRFSKRAHFI ATRKTLDATQ LIDLLFRYIF  1250
1251  SYHGFPRTIT SDRDVRMTAD KYQELTKRLG IKSTMSSANH PQTDGQSERT  1300
1301  IQTLNRLLRA YVSTNIQNWH VYLPQIEFVY NSTPTRTLGK SPFEIDLGYL  1350
1351  PNTPAIKSDD EVNARSFTAV ELAKHLKALT IQTKEQLEHA QIEMETNNNQ  1400
1401  RRKPLLLNIG DHVLVHRDAY FKKGAYMKVQ QIYVGPFRVV KKINDNAYEL  1450
1451  DLNSHKKKHR VINVQFLKSL YTVQTRTQRI NQSAPLRELR EHTKLLHS    1498
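
The listing above groups the residues in blocks of ten, with position numbers marking the start and end of each row. To recover a plain residue string from it, one option is to drop everything that is not a letter. The following is a minimal Python sketch; `clean_sequence` is a hypothetical helper name, and the approach assumes that the listing contains only residue letters, digits, and whitespace.

```python
import re


def clean_sequence(block: str) -> str:
    """Remove position numbers, spaces, and line breaks from a sequence listing."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


# The first row of the listing, including its position markers.
first_row = "1\nMSFMDQIPGG GNYPKLPVEC LPNFPIQPSL TFRGRNDSHK LKNFISEIML\n50"

residues = clean_sequence(first_row)
print(residues)
print(len(residues))  # 50 residues in the first row
```

Applied to the whole listing, the same function should yield a string whose length matches the 1498 amino acids reported in the summary.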
 


Checksums:
CRC64:4E51C3EFBDEFD7E4
MD5:7047a54ba281aab1c7ea6e388953b9d9
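
A checksum lets you confirm that a local copy of the sequence matches the one stored here. The Python sketch below computes an MD5 digest using the standard `hashlib` module. It assumes that the digest is taken over the uppercase residue string with all numbering and whitespace removed; this page does not state that convention, so treat it as an unverified assumption. `sequence_md5` is a hypothetical helper name. CRC64 is not covered, because several CRC64 variants exist and the one used here is not specified on this page.

```python
import hashlib
import re


def sequence_md5(sequence: str) -> str:
    """Return the MD5 hex digest of a sequence, ignoring digits and whitespace.

    Assumption: the digest is computed over the bare uppercase residue string.
    """
    residues = re.sub(r"[^A-Za-z]", "", sequence).upper()
    return hashlib.md5(residues.encode("ascii")).hexdigest()


# Example with only the first 20 residues; pass the full sequence to
# compare against the MD5 value listed above.
print(sequence_md5("MSFMDQIPGG GNYPKLPVEC"))
```

If the digest of the full 1498-residue sequence differs from the listed MD5, either the local copy differs from the stored one or the assumed convention does not match the one used to produce the listed value.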