Summary
This is the summary of UniProt entry YD21B_YEAST (Q12472).
| Description: | Transposon Ty2-DR1 Gag-Pol polyprotein EC=3.4.23.- EC=2.7.7.49 EC=2.7.7.7 EC=3.1.26.4 |
| Source organism: | Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) (NCBI taxonomy ID 559292) |
| Length: | 1770 amino acids |
Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.
Pfam domains
| Source | Domain | Start | End |
|---|---|---|---|
| disorder | n/a | 1 | 103 |
| Pfam A | TYA | 17 | 114 |
| low_complexity | n/a | 68 | 81 |
| disorder | n/a | 108 | 109 |
| disorder | n/a | 113 | 118 |
| disorder | n/a | 120 | 121 |
| disorder | n/a | 125 | 129 |
| disorder | n/a | 131 | 168 |
| Pfam B | Pfam-B_2595 | 163 | 288 |
| disorder | n/a | 353 | 453 |
| low_complexity | n/a | 369 | 382 |
| disorder | n/a | 455 | 456 |
| disorder | n/a | 466 | 468 |
| disorder | n/a | 476 | 478 |
| disorder | n/a | 482 | 483 |
| Pfam A | gag_pre-integrs | 564 | 641 |
| low_complexity | n/a | 578 | 588 |
| Pfam A | rve | 656 | 778 |
| Pfam B | Pfam-B_20108 | 801 | 835 |
| disorder | n/a | 840 | 845 |
| disorder | n/a | 848 | 854 |
| disorder | n/a | 911 | 949 |
| disorder | n/a | 951 | 1225 |
| Pfam B | Pfam-B_20332 | 1004 | 1108 |
| low_complexity | n/a | 1143 | 1151 |
| disorder | n/a | 1251 | 1254 |
| Pfam A | RVT_2 | 1280 | 1517 |
| disorder | n/a | 1316 | 1317 |
| disorder | n/a | 1524 | 1527 |
| low_complexity | n/a | 1532 | 1543 |
| coiled_coil | n/a | 1534 | 1554 |
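The rows of the table above are easy to process programmatically. The following is a minimal sketch, not part of Pfam itself: the `Region` type and the `parse_regions` helper are hypothetical names introduced here, and the length calculation assumes that Start and End are 1-based, inclusive coordinates. Only a few rows copied from the table are used as sample input.

```python
from typing import NamedTuple


class Region(NamedTuple):
    source: str
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_regions(table: str) -> list[Region]:
    """Parse pipe-delimited rows of the form | Source | Domain | Start | End |."""
    regions = []
    for line in table.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Skip the header, the separator row and blank lines.
        if len(cells) != 4 or not cells[2].isdigit():
            continue
        regions.append(Region(cells[0], cells[1], int(cells[2]), int(cells[3])))
    return regions


# A subset of the rows listed in the table above.
TABLE = """
| Source | Domain | Start | End |
|---|---|---|---|
| Pfam A | TYA | 17 | 114 |
| Pfam A | gag_pre-integrs | 564 | 641 |
| Pfam A | rve | 656 | 778 |
| Pfam A | RVT_2 | 1280 | 1517 |
| disorder | n/a | 951 | 1225 |
"""

for region in parse_regions(TABLE):
    if region.source == "Pfam A":
        print(f"{region.name}: {region.start}-{region.end} ({region.length} residues)")
```

Filtering on the Source column in this way separates the curated Pfam A domains from the predicted disorder, low-complexity and coiled-coil regions.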
Sequence annotations
This section shows a graphical representation of this sequence, with Pfam domains shown in the standard Pfam format. Under the Pfam domain image we show various tracks, illustrating features on this sequence that we found in other databases. You can choose which databases to include using the drop-down panel under the image.
Sequence information
This is the amino acid sequence of the UniProt sequence database entry with the accession Q12472. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy shown here may differ from the one currently stored by UniProt.
| Sequence: | 1
MESQQLSQNS PNLHGSAYAS VTSKEVPSNQ DPLAVSASNL PEFDRDSTKV
50 51
NSQQETTPGT SAVPENHHHV SPQPASVPPP QNGQYQQHGM MTPNKAMASN
100 101
WAHYQQPSMM TCSHYQTSPA YYQPDPHYPL PQYIPPLSTS SPDPIDLKNQ
150 151
HSEIPQAKTK VGNNVLPPHT LTSEENFSTW VKFYIRFLKN SNLGDIIPND
200 201
QGEIKRQMTY EEHAYIYNTF QAFAPFHLLP TWVKQILEIN YADILTVLCK
250 251
SVSKMQTNNQ ELKDWIALAN LEYDGSTSAD TFEITVSTII QRLKENNINV
300 301
SDRLACQLIL KGLSGDFKYL RNQYRTKTNM KLSQLFAEIQ LIYDENKIMN
350 351
LNKPSQYKQH SEYKNVSRTS PNTTNTKVTT RNYQRTNSSK PRAAKAHNIA
400 401
TSSKFSRVNN DHINESTVSS QYLSDDNELS LGQQQKESKP THTIDSNDEL
450 451
PDHLLIDSGA SQTLVRSAHY LHHATPNSEI NIVDAQKQDI PINAIGNLHF
500 501
NFQNGTKTSI KALHTPNIAY DLLSLSELAN QNITACFTRN TLERSDGTVL
550 551
APIVKHGDFY WLSKKYLIPS HISKLTINNV NKSKSVNKYP YPLIHRMLGH
600 601
ANFRSIQKSL KKNAVTYLKE SDIEWSNAST YQCPDCLIGK STKHRHVKGS
650 651
RLKYQESYEP FQYLHTDIFG PVHHLPKSAP SYFISFTDEK TRFQWVYPLH
700 701
DRREESILNV FTSILAFIKN QFNARVLVIQ MDRGSEYTNK TLHKFFTNRG
750 751
ITACYTTTAD SRAHGVAERL NRTLLNDCRT LLHCSGLPNH LWFSAVEFST
800 801
IIRNSLVSPK NDKSARQHAG LAGLDITTIL PFGQPVIVNN HNPDSKIHPR
850 851
GIPGYALHPS RNSYGYIIYL PSLKKTVDTT NYVILQDKQS KLDQFNYDTL
900 901
TFDDDLNRLT AHNQSFIEKN ETEQSYDQNT ESDHDYQSEI EINSDPLVND
950 951
FSSQSINPLQ LDKEPVQKVR APKEVDADIS EYNILPSTIR SRTPHIINKE
1000 1001
STEMGGTIES DTTSPRHSST FTARNQKRPG SPNDMIDLTS QDRVNYGLEN
1050 1051
IKTTRLGGTE EPYIQRNSDT NIKYRTTNST PSIDDRSSNS ESTTPIISIE
1100 1101
TKAACDNTPS IDTDPPEYRS SDHATPNIMP DKSSKNVTAD SILDDLPLPD
1150 1151
LTNKSPTDTS DVSKDIPHIH SRQTNSSLGG MDDSNVLTTT KSKKRSLEDN
1200 1201
ETEIEVSRDT WNNKNMRSLE PPRSKKRINL IAAIKGVKSI KPVRTTLRYD
1250 1251
EAITYNEDNK EKDRYIEAYH KEINQLLRMN TWDTNKYYDR NDIDPKKVIN
1300 1301
SMFIFNKKRD GTHKARFVAR GDIQHPDTYD SDMQSNTVHH YALMTSLSIA
1350 1351
LDNDYYITQL DISSAYLYAD IKEELYIRPP PHLGLNDKLL RLRKSLYGLK
1400 1401
QSGANWYETI KSYLINCCDM QEVRGWSCVF KNSQVTICLF VDDMILFSKD
1450 1451
LNANKKIITT LKKQYDTKII NLGEGDNEIQ YDILGLEIKY QRSKYMKLGM
1500 1501
EKSLTEKLPK LNVPLNPKGK KLRAPGQPGH YIDQDELEID EDEYKEKVHE
1550 1551
MQKLIGLASY VGYKFRFDLL YYINTLAQHI LFPSRQVLDM TYELIQFMWD
1600 1601
TRDKQLIWHK NKPTKPDNKL VAISDASYGN QPYYKSQIGN IFLLNGKVIG
1650 1651
GKSTKASLTC TSTTEAEIHA VSEAIPLLNN LSHLVQELNK KPIIKGLLTD
1700 1701
SRSTISIIKS TNEEKFRNRF FGTKAMRLRD EVSGNNLYVY YIETKKNIAD
1750 1751
VMTKPLPIKT FKLLTNKWIH
1770
|
| Checksums: | CRC64: A116BEF8C45917D0; MD5: 2307fbe2c4e429bda828dffc86d67a64 |
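Because the stored sequence may differ from the current UniProt one, it can be useful to check a copy against the length and MD5 checksum listed above. The following is a minimal sketch using only the Python standard library. The helper names are hypothetical, and it assumes that the MD5 value is computed over the bare, uppercase residue string with position numbers and whitespace removed. CRC64 is not covered because the standard library has no implementation of it.

```python
import hashlib


def clean_sequence(block: str) -> str:
    """Keep only letters, dropping position numbers, spaces and line breaks."""
    return "".join(ch for ch in block if ch.isalpha()).upper()


def md5_hex(sequence: str) -> str:
    """Return the hexadecimal MD5 digest of a residue string."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


def matches_entry(block: str, expected_length: int, expected_md5: str) -> bool:
    """Check a formatted sequence block against a listed length and MD5."""
    sequence = clean_sequence(block)
    return len(sequence) == expected_length and md5_hex(sequence) == expected_md5.lower()


# Toy input: only the first 20 residues of the entry, in the page's layout.
block = "1\nMESQQLSQNS PNLHGSAYAS\n20\n"
sequence = clean_sequence(block)
print(len(sequence), md5_hex(sequence))
```

For the full entry, the complete formatted block would be passed to `matches_entry` together with the length 1770 and the MD5 value from the Checksums row.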

