Improving decoy databases for protein folding algorithms

Citation data:

ACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, Page: 717-724

Publication Year:
Usage 3
Abstract Views 3
Captures 5
Readers 5
Citations 1
Citation Indexes 1
Repository URL:;
Lindsey, Aaron; Yeh, Hsin-Yi (Cindy); Wu, Chih-Peng; Thomas, Shawna; Amato, Nancy M.
Association for Computing Machinery (ACM)
Medicine; Computer Science; Engineering; Decoy databases; Protein folding; Sampling methods
conference paper description
Predicting protein structures and simulating protein folding are two of the most important problems in computational biology today. Simulation methods rely on a scoring function to distinguish the native structure (the most energetically stable) from non-native structures. Decoy databases are collections of non-native structures used to test and verify these functions. We present a method to evaluate and improve the quality of decoy databases by adding novel structures and removing redundant structures. We test our approach on 17 different decoy databases of varying size and type and show significant improvement across a variety of metrics. We also test our improved databases on a popular modern scoring function and show that they contain a greater number of native-like structures than the original databases, thereby producing a more rigorous database for testing scoring functions.