119 structures, 1 species, 0 interactions, 1 sequence, 1 architecture

Protein: POL_HV1B1 (P03366)

Summary

This is the summary of UniProt entry POL_HV1B1 (P03366).

Description: Gag-Pol polyprotein (EC 3.4.23.16, EC 2.7.7.49, EC 2.7.7.7, EC 3.1.26.4)
Source organism: Human immunodeficiency virus type 1 (isolate BH10 group M subtype B) (HIV-1) (NCBI taxonomy ID 11678)
Length: 1447 amino acids

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. Although some UniProt entries may be removed after a Pfam release, they are not removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P03366. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
   1 MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAVNPGL 50
  51 LETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 100
 101 LDKIEEEQNK SKKKAQQAAA DTGHSSQVSQ NYPIVQNIQG QMVHQAISPR 150
 151 TLNAWVKVVE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 200
 201 LKETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 250
 251 TNNPPIPVGE IYKRWIILGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 300
 301 YKTLRAEQAS QEVKNWMTET LLVQNANPDC KTILKALGPA ATLEEMMTAC 350
 351 QGVGGPGHKA RVLAEAMSQV TNTATIMMQR GNFRNQRKMV KCFNCGKEGH 400
 401 TARNCRAPRK KGCWKCGKEG HQMKDCTERQ ANFLREDLAF LQGKAREFSS 450
 451 EQTRANSPTI SSEQTRANSP TRRELQVWGR DNNSPSEAGA DRQGTVSFNF 500
 501 PQITLWQRPL VTIKIGGQLK EALLDTGADD TVLEEMSLPG RWKPKMIGGI 550
 551 GGFIKVRQYD QILIEICGHK AIGTVLVGPT PVNIIGRNLL TQIGCTLNFP 600
 601 ISPIETVPVK LKPGMDGPKV KQWPLTEEKI KALVEICTEM EKEGKISKIG 650
 651 PENPYNTPVF AIKKKDSTKW RKLVDFRELN KRTQDFWEVQ LGIPHPAGLK 700
 701 KKKSVTVLDV GDAYFSVPLD EDFRKYTAFT IPSINNETPG IRYQYNVLPQ 750
 751 GWKGSPAIFQ SSMTKILEPF KKQNPDIVIY QYMDDLYVGS DLEIGQHRTK 800
 801 IEELRQHLLR WGLTTPDKKH QKEPPFLWMG YELHPDKWTV QPIVLPEKDS 850
 851 WTVNDIQKLV GKLNWASQIY PGIKVRQLCK LLRGTKALTE VIPLTEEAEL 900
 901 ELAENREILK EPVHGVYYDP SKDLIAEIQK QGQGQWTYQI YQEPFKNLKT 950
 951 GKYARMRGAH TNDVKQLTEA VQKITTESIV IWGKTPKFKL PIQKETWETW 1000
1001 WTEYWQATWI PEWEFVNTPP LVKLWYQLEK EPIVGAETFY VDGAANRETK 1050
1051 LGKAGYVTNK GRQKVVPLTN TTNQKTELQA IYLALQDSGL EVNIVTDSQY 1100
1101 ALGIIQAQPD KSESELVNQI IEQLIKKEKV YLAWVPAHKG IGGNEQVDKL 1150
1151 VSAGIRKILF LDGIDKAQDE HEKYHSNWRA MASDFNLPPV VAKEIVASCD 1200
1201 KCQLKGEAMH GQVDCSPGIW QLDCTHLEGK VILVAVHVAS GYIEAEVIPA 1250
1251 ETGQETAYFL LKLAGRWPVK TIHTDNGSNF TSATVKAACW WAGIKQEFGI 1300
1301 PYNPQSQGVV ESMNKELKKI IGQVRDQAEH LKTAVQMAVF IHNFKRKGGI 1350
1351 GGYSAGERIV DIIATDIQTK ELQKQITKIQ NFRVYYRDSR NPLWKGPAKL 1400
1401 LWKGEGAVVI QDNSDIKVVP RRKAKIIRDY GKQMAGDDCV ASRQDED 1447


Checksums:
CRC64:AC3EE1439592E0AD
MD5:7aa7242ed3dde05e49c7e4c740e28b97
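
Because the stored sequence may differ from the one held by UniProt, it can be useful to check a copied sequence against the length and MD5 checksum listed on this page. The following is a minimal Python sketch, not Pfam's own code. It assumes the MD5 is computed over the plain upper-case one-letter sequence with position markers and whitespace removed, which this page does not confirm. The helper names `clean_sequence` and `md5_hex` are hypothetical, and only the first 50 residues are used as sample input.

```python
import hashlib
import re


def clean_sequence(formatted: str) -> str:
    """Strip position markers, spaces, and line breaks; keep residue letters only."""
    return re.sub(r"[^A-Za-z]", "", formatted).upper()


def md5_hex(sequence: str) -> str:
    """Hex MD5 digest of the sequence string (assumed checksum convention)."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


# Sample input: residues 1-50 as listed above, including position markers.
first_block = """1
MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAVNPGL
50"""

seq = clean_sequence(first_block)
print(len(seq))      # number of residues left after cleaning
print(md5_hex(seq))  # digest of this fragment only, not of the full entry
```

To check the whole entry, pass the complete listing to `clean_sequence` and compare the resulting length with 1447 and the digest with the MD5 value above. A mismatch may simply mean that the assumed checksum convention differs from the one Pfam uses.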