Abstract
Despite its apparent simplicity, the problem of quantifying the differences between two structures of the same protein or complex is nontrivial and continues evolving. In this chapter, we described several methods routinely used to compare computational models to experimental answers in several modeling assessments. The two major classes of measures, positional distance-based and contact-based, are presented, compared, and analyzed. The most popular measure of the first class, the global RMSD, is shown to be the least representative of the degree of structural similarity because it is dominated by the largest error. Several distance-dependent algorithms designed to attenuate the drawbacks of RMSD are described. Measures of the second class, contact-based, are shown to be more robust and relevant. We also illustrate the importance of using combined measures, utility-based measures, and the role of the distributions derived from the pairs of experimental structures in interpreting the results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Gabanyi M, Adams P, Arnold K, Bordoli L, Carter L, Flippen-Andersen J, Gifford L, Haas J, Kouranov A, McLaughlin W, et al. (2011) Journal of Structural and Functional Genomics, 1–10.
Rose PW, Beran B, Bi C, Bluhm WF, Dimitropoulos D, Goodsell DS, Prlic A, Quesada M, Quinn GB, Westbrook JD, et al. (2011) Nucleic Acids Research 39, D392–D401.
Burra PV, Zhang Y, Godzik A, & Stec B (2009) Proceedings of the National Academy of Sciences 106, 10505–10510.
Kryshtafovych A, Fidelis K, & Moult J (2009) Proteins: Structure, Function, and Bioinformatics 77, 217–228.
Cozzetto D, Kryshtafovych A, Fidelis K, Moult J, Rost B, & Tramontano A (2009) Proteins: Structure, Function, and Bioinformatics 77, 18–28.
Wodak SJ (2007) Proteins: Structure, Function, and Bioinformatics 69, 697–698.
Michino M, Abola E, participants of GPCR Dock 2008, Brooks CL, Dixon JS, Moult J, & Stevens RC (2009) Nat Rev Drug Discov 8, 455–463.
Warren G, Nevins N, & McGaughey G (2011) in 241st ACS National Meeting (Anaheim, CA).
Warren GL, Andrews CW, Capelli A-M, Clarke B, LaLonde J, Lambert MH, Lindvall M, Nevins N, Semus SF, Senger S, et al. (2005) Journal of Medicinal Chemistry 49, 5912–5931.
Kufareva I, Rueda M, Katritch V, participants of GPCR Dock 2010, Stevens RC, & Abagyan R (2011) Structure 19(8), 1108–1126.
Wu B, Chien EYT, Mol CD, Fenalti G, Liu W, Katritch V, Abagyan R, Brooun A, Wells P, Bi FC, et al. (2010) Science 330, 1066–1071.
Chien EYT, Liu W, Zhao Q, Katritch V, WonHan G, Hanson MA, Shi L, Newman AH, Javitch JA, Cherezov V, et al. (2010) Science 330, 1091–1095.
Kryshtafovych A, Venclovas, Fidelis K, & Moult J (2005) Proteins: Structure, Function, and Bioinformatics 61, 225–236.
Zemla A (2003) Nucleic Acids Research 31, 3370–3374.
Shindyalov IN & Bourne PE (1998) Protein Engineering 11, 739–747.
Holm L & Sander C (1993) Journal of Molecular Biology 233, 123–138.
Kleywegt GJ & Jones AT (1997) in Methods in Enzymology (Academic Press), pp. 525–545.
Ortiz AR, Strauss CEM, & Olmea O (2002) Protein Science 11, 2606–2621.
Levitt M & Gerstein M (1998) Proceedings of the National Academy of Sciences of the United States of America 95, 5913–5920.
Shapiro J & Brutlag D (2004) Nucleic Acids Research 32, W536-W541.
Szustakowski JD & Weng Z (2000) Proteins: Structure, Function, and Bioinformatics 38, 428–440.
Kleywegt GJ (1996) Acta Crystallogr D Biol Crystallogr 52, 842–857.
Kawabata T & Nishikawa K (2000) Proteins 41, 108–122.
Kawabata T (2003) Nucleic Acids Res 31, 3367–3369.
Yang A-S & Honig B (2000) Journal of Molecular Biology 301, 665–678.
Lackner P, Koppensteiner WA, Sippl MJ, & Domingues FS (2000) Protein Engineering 13, 745–752.
Krissinel E & Henrick K (2004) Acta Crystallographica Section D 60, 2256–2268.
Zemla A, Venclovas, Moult J, & Fidelis K (2001) Proteins Suppl 5, 13–21.
Zhang Y & Skolnick J (2004) Proteins: Structure, Function, and Bioinformatics 57, 702–710.
Abagyan R & Kufareva I (2009) Methods Mol Biol 575, 249–279.
McLachlan AD (1979) J Mol Biol 128, 49–79.
Damm KL & Carlson HA (2006) Biophysical journal 90, 4558–4573.
Phillips DC (1970) Biochem Soc Symp 30, 11–28.
Nishikawa K & Ooi T (1974) J.Theor.Biol. 43, 351–274.
Liebman MN (1980) Biophys. J. 32, 213–215.
Sippl MJ (1982) Journal of Molecular Biology 156, 359–388.
Abagyan RA & Totrov MM (1997) J Mol Biol 268, 678–685.
Marsden B & Abagyan R (2004) Bioinformatics 20, 2333–2344.
Lensink MF & Wodak SJ (2010) Proteins: Structure, Function, and Bioinformatics 78, 3085–3095.
Bottegoni G, Kufareva I, Totrov M, & Abagyan R (2009) J Med Chem 52, 397–406.
Totrov M & Abagyan R (2008) Curr Opin Struct Biol.
Coupez B & Lewis RA (2006) Curr Med Chem 13, 2995–3003.
Katritch V, Rueda M, Lam PC-H, Yeager M, & Abagyan R (2010) Proteins 78, 197–211.
Jaakola V-P, Griffith MT, Hanson MA, Cherezov V, Chien EYT, Lane JR, Ijzerman AP, & Stevens RC (2008) Science 322, 1211–1217.
Rueda M, Katritch V, Raush E, & Abagyan R (2010) Bioinformatics 26, 2784–2785.
Stroud RM & Fauman EB (1995) Protein Science 4, 2392–2404.
Eyal E, Gerzon S, Potapov V, Edelman M, & Sobolev V (2005) Journal of Molecular Biology 351, 431–442.
Golomb BA, Erickson LC, Koperski S, Sack D, Enkin M, & Howick J (2010) Annals of Internal Medicine 153, 532–535.
Palczewski K, Kumasaka T, Hori T, Behnke CA, Motoshima H, Fox BA, Trong IL, Teller DC, Okada T, Stenkamp RE, et al. (2000) Science 289, 739–745.
Scheerer P, Park JH, Hildebrand PW, Kim YJ, Krausz N, Choe H-W, Hofmann KP, & Ernst OP (2008) Nature 455, 497–502.
Park JH, Scheerer P, Hofmann KP, Choe H-W, & Ernst OP (2008) Nature 454, 183–187.
Warne T, Serrano-Vega MJ, Baker JG, Moukhametzianov R, Edwards PC, Henderson R, Leslie AGW, Tate CG, & Schertler GFX (2008) Nature 454, 486–491.
Rosenbaum DM, Cherezov V, Hanson MA, Rasmussen SGF, Thian FS, Kobilka TS, Choi H-J, Yao X-J, Weis WI, Stevens RC, et al. (2007) Science 318, 1266–1273.
Cherezov V, Rosenbaum DM, Hanson MA, Rasmussen SGF, Thian FS, Kobilka TS, Choi H-J, Kuhn P, Weis WI, Kobilka BK, et al. (2007) Science 318, 1258–1265.
Hooft RW, Vriend G, Sander C, & Abola EE (1996) Nature 381, 272–272.
Vriend G (1990) J Mol Graph 8, 52–56.
Laskowski RA, MacArthur MW, Moss DS, & Thornton JM (1993) Journal of Applied Crystallography 26, 283–291.
Chen VB, Arendall WB, III, Headd JJ, Keedy DA, Immormino RM, Kapral GJ, Murray LW, Richardson JS, & Richardson DC (2010) Acta Crystallographica Section D 66, 12–21.
Maiorov V & Abagyan R (1998) Fold Des 3, 259–269.
Pawlowski M, Gajda MJ, Matlak R, & Bujnicki JM (2008) BMC Bioinformatics 9, 403–403.
Jain A & Nicholls A (2008) Journal of Computer-Aided Molecular Design 22, 133–139.
Clark R & Webster-Clark D (2008) Journal of Computer-Aided Molecular Design 22, 141–146.
Acknowledgments
Authors wish to thank the organizers and the participants of the GPCR Dock 2010 assessment for providing the model statistics, Max Totrov and Eugene Raush for implementing some of the core functions in ICM, Manuel Rueda for helpful discussions and Karie Wright for help with manuscript preparation. We would like to acknowledge financial support by NIH, grants # R01 GM071872, U01 GM094612, and U54 GM094618.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media,LLC
About this protocol
Cite this protocol
Kufareva, I., Abagyan, R. (2011). Methods of Protein Structure Comparison. In: Orry, A., Abagyan, R. (eds) Homology Modeling. Methods in Molecular Biology, vol 857. Humana Press. https://doi.org/10.1007/978-1-61779-588-6_10
Download citation
DOI: https://doi.org/10.1007/978-1-61779-588-6_10
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-61779-587-9
Online ISBN: 978-1-61779-588-6
eBook Packages: Springer Protocols