Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: A1SHC3_NOCSJ (A1SHC3)

Summary

This is the summary of UniProt entry A1SHC3_NOCSJ (A1SHC3).

Description: Stage II sporulation E family protein
Source organism: Nocardioides sp. (strain BAA-499 / JS614) (NCBI taxonomy ID 196162)
View Pfam proteome data.
Length: 346 amino acids

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
low_complexity n/a 14 30
transmembrane n/a 15 42
transmembrane n/a 54 70
low_complexity n/a 57 69
transmembrane n/a 82 102
low_complexity n/a 84 97
Pfam A SpoIIE 160 344
low_complexity n/a 181 203
low_complexity n/a 245 255
disorder n/a 261 263

Show or hide domain scores.

Sequence annotations

This section shows a graphical representation of this sequence, with Pfam domains shown in the standard Pfam format. Under the Pfam domain image we show various tracks, illustrating features on this sequence that we found in other databases. You can choose which databases to include using the drop-down panel under the image. More...

Note: it can take a few seconds for this image to be generated and loaded.

Loading feature alignment...

Show sources update panel.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession A1SHC3. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MTSISQRWRR LDPGLVLGAA VLVVLVLVAL TTNVQLIAVF AAGPIVSSLL
50
51
TSPRRTALLG VAAVGLALLS GRWQHDVGTL DWSVRVVLCV LLSLLAVQGA
100
101
ALREHREGRL QRMTVIAETA QRAVLRSMPT AIGSIGLAAR YVSATAEALV
150
151
GGDLYEVAAT PFGVRVIVGD VRGKGLEAVQ TAAAVLGAFR AAAFTAPDVA
200
201
DLARTIDDTL ARMIGEEEFV TAIVGEFHGD RVALANCGHH PPLLVVDAAV
250
251
TTTDTGEPTL PLGLGAVPVL TEHPWPVGAR MLFYTDGLVE TRDRQGRFFP
300
301
FEDHAAELGE GTVEEALDRL VARLLAWSGQ RMADDLALVL AESEGR    
346
 

Show the unformatted sequence.

Checksums:
CRC64:31E2B6BC703078CB
MD5:9e4b5f9d6cae5f6f64cb97896da44e31