4  structures 113  species 0  interactions 511  sequences 11  architectures

Family: BSD (PF03909)

Summary

BSD domain Add an annotation

This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function.


Literature references

  1. Doerks T, Huber S, Buchner E, Bork P; , Trends Biochem Sci 2002;27:168-170.: BSD: a novel domain in transcription factors and synapse-associated proteins. PUBMED:11943536


InterPro entry IPR005607

The BSD domain is an about 60-residue long domain named after the BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins in which it is found. Additionally, it is also found in several hypothetical proteins. The BSD domain occurs in one or two copies in a variety of species ranging from primal protozoan to human. It can be found associated with other domains such as the BTB domain (see ) or the U-box in multidomain proteins. The function of the BSD domain is yet unknown PUBMED:11943536.

Secondary structure prediction indicates the presence of three predicted alpha helices, which probably form a three-helical bundle in small domains. The third predicted helix contains neighbouring phenylalanine and tryptophan residues - less common amino acids that are invariant in all the BSD domains identified and that are the most striking sequence features of the domain PUBMED:11943536.

Some proteins known to contain one or two BSD domains are listed below:
  • Mammalian TFIIH basal transcription factor complex p62 subunit (GTF2H1).
  • Yeast RNA polymerase II transcription factor B 73 kDa subunit (TFB1), the homologue of BTF2.
  • Yeast DOS2 protein. It is involved in single-copy DNA replication and ubiquitination.
  • Drosophila synapse-associated protein SAP47.
  • Mammalian SYAP1.
  • Various Arabidopsis thaliana (Mouse-ear cress) hypothetical proteins.
  • External database links

    Domain organisation

    Below is a listing of the unique domain organisations or architectures in which this domain is found. More...

    Loading domain graphics...

    Alignments

    There are various ways to view or download the sequence alignments that we store. You can use a sequence viewer to look at either the seed or full alignment for the family, or you can look at a plain text version of the sequence in a variety of different formats. More...

    View options

    Alignment:
    Viewer:  

    Formatting options

    Alignment:
    Format:
    Order:
    Sequence:
    Gaps:
    Download/view:

    Download options

    Very large alignments can often cause problems for the formatting tool above. If you find that downloading or viewing a large alignment is problematic, you can also download a gzip-compressed, Stockholm-format file containing the seed or full alignment for this family.

    You can also download a FASTA format file containing the full-length sequences for all sequences in the full alignment.

    The main seed and full alignments are generated using sequences from the UniProt sequence database. However, we also generate alignments using sequences from the NCBI sequence database and the "metaseq" metagenomics dataset.

    You can view alignments from these two additional datasets using the form above, or you can download alignments of NCBI or metagenomics sequences, as gzip-compressed files.

    Pfam alignments:
    Full length sequences

    External links

    MyHits provides a collection of tools to handle multiple sequence alignments. For example, one can refine a seed alignment (sequence addition or removal, re-alignment or manual edition) and then search databases for remote homologs using HMMER2.

    Pfam alignments:

    HMM logo

    HMM logos is one way of visualising profile HMMs. Logos provide a quick overview of the properties of an HMM in a graphical form. You can see a more detailed description of HMM logos and find out how you can interpret them here. More...

    Trees

    This page displays the phylogenetic tree for this family. We use FastTree to calculate neighbour join trees with a local bootstrap based on 100 resamples (shown next to the tree nodes). FastTree calculates approximately-maximum-likelihood phylogenetic trees from our seed or full alignments.

    Note: You can also download the data files for the seed, full, NCBI or metagenomics trees.

    Curation and family details

    This section shows the detailed information about the Pfam family. You can see the definitions of many of the terms in this section in the glossary and a fuller explanation of the scoring system that we use in the scores section of the help pages.

    Curation View help on the curation process

    Seed source: [1]
    Previous IDs: none
    Type: Domain
    Author: Yeats C
    Number in seed: 83
    Number in full: 511
    Average length of the domain: 60.80 aa
    Average identity of full alignment: 21 %
    Average coverage of the sequence by the domain: 12.35 %

    HMM information View help on HMM parameters

    HMM build commands:
    build method: hmmbuild -o /dev/null HMM SEED
    search method: hmmsearch -Z 9421015 -E 1000 HMM pfamseq
    Model details:
    Parameter Sequence Domain
    Gathering cut-off 20.4 20.4
    Trusted cut-off 21.1 20.6
    Noise cut-off 20.2 19.5
    Model length: 62
    Family (HMM) version: 10
    Download: download the raw HMM for this family

    Species distribution

    Tree controls

    Hide

    The tree shows the occurrence of this domain across different species. More...

    Loading...

    Structures

    For those sequences which have a structure in the Protein DataBank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the MSD group, to allow us to map Pfam domains onto UniProt sequences and three-dimensional protein structures. The table below shows the structures on which the BSD domain has been found.

    Loading structure mapping...