(485g) A Branch-and-Reduce Algorithm for the Contact Overlap Problem
AIChE Annual Meeting
2006
2006 Annual Meeting
Discovery, Development and Delivery of Medicines
Advances in Protein Structure, Function, Analysis and Stability - I
Thursday, November 16, 2006 - 10:30am to 10:50am
A fundamental problem in molecular biology is the comparison of 3-dimensional protein folds in order to develop similarity measures and exploit them for protein clustering, database searches, and drug design. Contact map overlap (CMO) is one of the most reliable and robust measures of protein structure similarity [1]. Fold comparison can be done by aligning the amino acid residues of two proteins in a way that maximizes the number of common residue contacts. CMO maximization is gaining increasing attention because it results in protein classification in good agreement with classification by experts. However, CMO maximization is an NP-hard problem and few exact algorithms exist for solving this problem to global optimality.
In this talk, we propose a branch-and-reduce exact algorithm for the CMO problem. Contrary to previous approaches, we do not transform CMO to other combinatorial optimization problems for solution. Instead, we address the problem directly in its natural form. By exploiting the problem's mathematical structure, we develop bounding and reduction procedures that lead to a very efficient algorithm. We present extensive computational results for over 36,000 test problems from the literature. These results demonstrate that our algorithm is significantly faster and solves many more challenging test sets than the best previous algorithms for CMO [2, 3]. Furthermore, the algorithm results in protein clusters that are in excellent agreement with the SCOP database.
References:
[1] A. Godzik and J. Skolnick, Flexible algorithm for direct multiple alignment of protein structures and sequences, Computer applications in biosciences: CABIOS, 10(6), 587?596, 1994
[2] A. Caprara, R. Carr, S. Istrail, G. Lancia and B. Walenz, 1001 optimal PDB structure alignments: Integer programming methods for finding the maximum contact map overlap, Journal of Computational Biology, 11(1), 27?52, 2004
[3] D. M. Strickland, E. Barnes and J. S. Sokol, Optimal protein structure alignment using maximum cliques, Operations Research, 53(3), 389?402, 2005