Abstract: Structure prediction methods aim to identify the relationship
between the amino acid sequence of an unknown protein and information comprised
in databases of known protein structures. Towards this end, we created a
database by combining the amino acid sequences and the corresponding
three-dimensional atomic coordinates for all the 25% non-redundant protein
chains available in the Protein Data Bank. It contains information about the
peptide fragments that are 5 to 10 residues long. In addition, options are
provided for the users to visualize the individual motifs and the superposed
fragments in the client machine. Further, useful functionalities are provided
to look for similar sequence motifs in all the sequence databases like PDB,
90% non-redundant protein chains, Genome database, PIR and Swiss-Prot. The
database is being updated at regular intervals and the same can be accessed
over the World Wide Web interface at the following URL:
http://pranag.physics.iisc.ernet.in/sms/.
Keywords: Protein sequences, database, structural rigidity, Protein Data Bank, genome sequences, superposition, World Wide Web, non-redundant structures