Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: CATL1_MOUSE (P06797)

Summary

This is the summary of UniProt entry CATL1_MOUSE (P06797).

Description: Cathepsin L1 EC=3.4.22.15
Source organism: Mus musculus (Mouse) (NCBI taxonomy ID 10090)
View Pfam proteome data.
Length: 334 amino acids

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
sig_p n/a 1 17
low_complexity n/a 3 17
Pfam A Inhibitor_I29 29 88
Pfam A Peptidase_C1 114 332

Show or hide domain scores.

Sequence annotations

This section shows a graphical representation of this sequence, with Pfam domains shown in the standard Pfam format. Under the Pfam domain image we show various tracks, illustrating features on this sequence that we found in other databases. You can choose which databases to include using the drop-down panel under the image. More...

Note: it can take a few seconds for this image to be generated and loaded.

Loading feature alignment...

Show sources update panel.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P06797. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MNLLLLLAVL CLGTALATPK FDQTFSAEWH QWKSTHRRLY GTNEEEWRRA
50
51
IWEKNMRMIQ LHNGEYSNGQ HGFSMEMNAF GDMTNEEFRQ VVNGYRHQKH
100
101
KKGRLFQEPL MLKIPKSVDW REKGCVTPVK NQGQCGSCWA FSASGCLEGQ
150
151
MFLKTGKLIS LSEQNLVDCS HAQGNQGCNG GLMDFAFQYI KENGGLDSEE
200
201
SYPYEAKDGS CKYRAEFAVA NDTGFVDIPQ QEKALMKAVA TVGPISVAMD
250
251
ASHPSLQFYS SGIYYEPNCS SKNLDHGVLL VGYGYEGTDS NKNKYWLVKN
300
301
SWGSEWGMEG YIKIAKDRDN HCGLATAASY PVVN                 
334
 

Show the unformatted sequence.

Checksums:
CRC64:FE6747043307AD98
MD5:9e516f472a17fbee6abd80b968831b75

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.