
Protein: Q46034_CLODI (Q46034)

Summary

Q46034_CLODI

This is the summary of UniProt entry Q46034_CLODI (Q46034).

Description: Toxin B
Source organism: Clostridium difficile (NCBI taxonomy ID 1496)
Length: 2367 amino acids

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview shown here. Although some UniProt entries may be removed after a Pfam release, they will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q46034. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
1 MSLVNRKQLE KMANVRFRVQ EDEYVAILDA LEEYHNMSEN TVVEKYLKLK 50
51 DINSLTDTYI DTYKKSGRNK ALKKFKEYLV IEILELKNSN LTPVEKNLHF 100
101 IWIGGQINDT AINYINQWKD VNSDYNVNVF YDSNAFLINT LKKTIIESAS 150
151 NDTLESFREN LNDPEFNHTA FFRKRMQIIY DKQQNFINYY KAQKEENPDL 200
201 IIDDIVKTYL SNEYSKDIDE LNAYIEESLN KVTENSGNDV RNFEEFKTGE 250
251 VFNLYEQESV ERWNLAGASD ILRVAILKNI GGVYLDVDML PGIHPDLFKD 300
301 INKPDSVKTA VDWEEMQLEA IMKHKEYIPE YTSKHFDTLD EEVQSSFESV 350
351 LASKSDKSEI FLPLGDIEVS PLEVKIAFAK GSIINQALIS AKDSYCSDLL 400
401 IKQIQNRYKI LNDTLGPIIS QGNDFNTTMN NFGESLGAIA NEENISFIAK 450
451 IGSYLRVGFY PEANTTITLS GPTIYAGAYK DLLTFKEMSI DTSILSSELR 500
501 NFEFPKVNIS QATEQEKNSL WQFNEERAKI QFEEYKKNYF EGALGEDDNL 550
551 DFSQNTVTDK EYLLEKISSS TKSSEGGYVH YIVQLQGDKI SYEAACNLFA 600
601 KNPYDSILFQ RNIEDSEVAY YYNPTDSEIQ EIDKYRIPDR ISDRPKIKLT 650
651 FIGHGKAEFN TDIFAGLDVD SLSSEIETAI GLAKEDISPK SIEINLLGCN 700
701 MFSYSVNVEE TYPGKLLLRV KDKVSELMPS MSQDSIIVSA NQYEVRINSE 750
751 GRRELLDHSG EWINKEESII KDISSKEYIS FNPKENKIIV KSKNLPELST 800
801 LLQEIRNNSN SSDIELEEKV MLAECEINVI SNIETQVVEE RIEEAKSLTS 850
851 DSINYIKNEF KLIESISEAL CDLKQQNELE DSHFISFEDI SETDEGFSIR 900
901 FINKETGESI FVETEKTIFS EYANHITEEI SKIKGTIFDT VNGKLVKKVN 950
951 LDTTHEVNTL NAAFFIQSLI EYNSSKESLS NLSVAMKVQV YAQLFSTGLN 1000
1001 TITDAAKVVE LVSTALDETI DLLPTLSEGL PIIATIIDGV SLGAAIKELS 1050
1051 ETSDPLLRQE IEAKIGIMAV NLTTATTAII TSSLGIASGF SILLVPLAGI 1100
1101 SAGIPSLVNN ELVLRDKATK VVDYFKHVSL VETEGVFTLL DDKVMMQQDD 1150
1151 LVISEIDFNN NSIVLGKCEI WRMEGGSGHT VTDDIDHFFS APSITYREPH 1200
1201 LSIYDVLEVQ KEELDLSKDL MVLPNAPNRV FAWETGWTPG LRSLENDGTK 1250
1251 LLDRIRDNYE GEFYWRYFAF IADALITTLK PRYEDTNIRI NLDSNTRSFI 1300
1301 VPIITTEYIR EKLSYSFYGS GGTYALPLSQ YNMGINIELS ESDVWIIDVD 1350
1351 NVVRDVTIES DKIKKGDLIE GILSTLSIEE NKIILNSHEI NFSGEVNGSN 1400
1401 GFVSLTFSIL EGINAIIEVD LLSKSYKLLI SGELKILMLN SNHIQQKIDY 1450
1451 IGFNSELQKN IPYSFVDSEG KENGFINGST KEGLFVSELP DVVLISKVYM 1500
1501 DDSKPSFGYY SNNLKDVKVI TKDNVNILTG YYLKDDIKIS LSLTLQDEKT 1550
1551 IKLNSVHLDE SGVAEILKFM NRKGSTNTSD SLMSFLESMN IKSIFVNFLQ 1600
1601 SNIKFILDAN FIISGTTSIG QFEFICDENN NIQPYFIKFN TLETNYTLYV 1650
1651 GNRQNMIVEP NYDLDDSGDI SSTVINFSQK YLYGIDSCVN KVVISPNIYT 1700
1701 DEINITPVYE TNNTYPEVIV LDANYINEKI NVNINDLSIR YVWSNDGNDF 1750
1751 ILMSTSEENK VSQVKIRFVN VFKDKTLANK LSFNFSDKQD VPVSEIILSF 1800
1801 TPSYYEDGLI GYDLGLVSLY NEKFYINNFG MMVSGLIYIN DSLYYFKPPV 1850
1851 NNLITGFVTV GDDKYYFNPI NGGAASIGET IIDDKNYYFN QSGVLQTGVF 1900
1901 STEDGFKYFA PANTLDENLE GEAIDFTGKL IIDENIYYFE DNYRGAVEWK 1950
1951 ELDGEMHYFS PETGKAFKGL NQIGDDKYYF NSDGVMQKGF VSINDNKHYF 2000
2001 DDSGVMKVGY TEIDGKHFYF AENGEMQIGV FNTEDGFKYF AHHNEDLGNE 2050
2051 EGEEISYSGI LNFNNKIYYF DDSFTAVVGW KDLEDGSKYY FDEDTAEAYI 2100
2101 GLSLINDGQY YFNDDGIMQV GFVTINDKVF YFSDSGIIES GVQNIDDNYF 2150
2151 YIDDNGIVQI GVFDTSDGYK YFAPANTVND NIYGQAVEYS GLVRVGEDVY 2200
2201 YFGETYTIET GWIYDMENES DKYYFVPETK KACKGINLID DIKYYFDEKG 2250
2251 IMRTGLISFE NNNYYFNENG EIQFGYINIE DKMFYFGEDG VMQIGVFNTP 2300
2301 DGFKYFAHQN TLDENFEGES INYTGWLGLD EKRYYFTDEY IAATGSVIID 2350
2351 GEEYYFDPDT AQLVISE 2367
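
The numbered listing above mixes residues with position markers, so it has to be cleaned before it can be used in other tools. The following Python sketch shows one way to do that, assuming the listing has been copied into a string. The function name `parse_numbered_sequence` is hypothetical and is not part of Pfam or UniProt. It treats every numeric token as a coordinate, checks it against the running residue count, and joins the remaining tokens into a bare sequence.

```python
def parse_numbered_sequence(text: str) -> str:
    """Return the bare residue string from a listing with position markers.

    Hypothetical helper (not a Pfam or UniProt API). A numeric token must be
    either the position of the last residue read so far (a block end) or the
    position of the next residue (a block start).
    """
    blocks = []
    count = 0  # number of residues read so far
    for token in text.split():
        if token.isdigit():
            position = int(token)
            if position not in (count, count + 1):
                raise ValueError(
                    f"coordinate {position} does not match {count} residues read"
                )
        elif token.isalpha():
            blocks.append(token.upper())
            count += len(token)
        else:
            raise ValueError(f"unexpected token: {token!r}")
    return "".join(blocks)


if __name__ == "__main__":
    # First block of the listing above, with its start and end coordinates.
    listing = """
    1
    MSLVNRKQLE KMANVRFRVQ EDEYVAILDA LEEYHNMSEN TVVEKYLKLK
    50
    """
    print(len(parse_numbered_sequence(listing)))
```

Applied to the full listing, the length of the returned string should equal the length stated in the summary (2367 amino acids). A mismatch suggests that the text was truncated or altered when it was copied.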
 


Checksums:
CRC64:EF9823DAE70427F3
MD5:e5797ae50d21fef5ef67871df4296d45
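
The MD5 value above can be used to check that a local copy of the sequence is intact. The Python sketch below assumes that the checksum is computed over the bare, uppercase residue string with no whitespace or position numbers. That convention is an assumption and is not stated on this page. The function name `sequence_md5` is hypothetical.

```python
import hashlib


def sequence_md5(sequence: str) -> str:
    """Return the MD5 hex digest of a protein sequence.

    Hypothetical helper. Assumes the checksum covers only the uppercase
    residue letters, so all whitespace is removed before hashing.
    """
    bare = "".join(sequence.split()).upper()
    return hashlib.md5(bare.encode("ascii")).hexdigest()


if __name__ == "__main__":
    # Illustrative fragment only: the first 20 residues of the entry.
    # The digest of a fragment will not match the checksum listed above.
    print(sequence_md5("MSLVNRKQLE KMANVRFRVQ"))
```

To verify a copy, pass the complete 2367-residue sequence without position numbers and compare the result with the MD5 value listed above. If the values differ, either the local copy has changed or the assumption about how the checksum is computed does not hold.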