ContactLib

The need to obtain structures similar to the new protein from these databases motivates the neighbor protein structure retrieval problem: given a query protein structure and a database of protein structures, retrieve all the structures in the database that are similar to the query structure. We introduce ContactLib, a complete contact group library, which can be used as fingerprints of protein structures to retrieve neighbor protein structures efficiently and effectively.


Download

  • README.txt: the readme document

  • contactlib-bin.Dec-7-2013.tgz: the ContactLib executables compiled for Linux-64bit

  • contactlib-scop40-2.03.Dec-7-2013.tgz: the pre-built ContactLib for SCOP40

  • contactlib-cullpdb40-res2.0-R0.25.Dec-8-2013.tgz: the pre-built ContactLib for CULLPDB40-RES2.0-R0.25


  • User Manual

  • To search similar protein structures from a ContactLib, call "./bin/runContactLib.sh IN_PDB_FILE IN_CONTACTLIB_DIR OUT_SCORE_FILE", where "IN_PDB_FILE" is the input PDB file containing the query protein structure, "IN_CONTACTLIB_DIR" is the input ContactLib directory containing the pre-built ContactLib, and "OUT_SCORE_FILE" is the output file containing the similarities scores. The pre-built ContactLib for SCOP40 is provided.

  • To build your own ContactLib from PDB files, call "./bin/buildContactLib.sh IN_PDB_DIR OUT_CONTACTLIB_DIR", where "IN_PDB_DIR" is the input directory containing all PDB files, and "OUT_CONTACTLIB_DIR" is the output directory containing the ContactLib.


  • Reference

  • Xuefeng Cui, Shuai Cheng Li, Lin He, Ming Li, Fingerprinting protein structures effectively and efficiently, Bioinformatics (2013)