A bioinformatics view of zinc enzymes
Graphical abstract
Bioinformatics tools and databases that we have developed in recent years are used to achieve a bird's eye view on the major subject matter of Bert Vallee's research, i.e., zinc enzymes.
Highlights
► Zinc enzymology has a strong basis of biochemical and biophysical knowledge. ► Many protein sequence and structural data are now available for zinc enzymes. ► Bioinformatics can offer an unprecedented overview of zinc enzymes. ► Bioinformatics is used to analyze the properties and roles of zinc in enzymes. ► Bioinformatics is used to analyze the usage of zinc enzymes in organisms.
Introduction
Zinc is an essential element for all forms of life. The first demonstration that zinc is indispensable to living organisms dates back to 1869, when Raulin showed that it is required for the growth of Aspergillus niger, a filamentous fungus [1]. However, a specific biological function for zinc was first established only in 1940, when it was shown to be a necessary catalytic cofactor of carbonic anhydrase [2]. Since then, zinc has been found to be associated with an impressive variety of proteins responsible for a wide range of physiological activities, thereby explaining its involvement in many different biological processes such as protein synthesis and degradation, DNA metabolism and repair, and neurotransmission [3], [4]. In particular, a vast wealth of information on zinc enzymes, including data on three-dimensional structures, kinetic and biochemical properties, and reaction mechanisms, has been accumulated over the years, and Bert Vallee has been an unquestioned leader in laying the foundation and building the body of knowledge of zinc enzymology [5], [6], [7], [8], [9], [10], [11], [12].
In the last 10–15 years of his life, Bert Vallee was a witness of the revolutionary changes undergone by biological sciences as a result of the advent of genome sequencing and post-genomic projects. The progress of high-throughput technologies capable of providing omics-type data (e.g., genomics, proteomics) opened the possibility of having virtually complete lists of the molecular components of cells, leading to the renewal of efforts aimed at understanding living organisms at the systems level [13], [14]. In parallel, the number of atomic resolution structures solved was boosted by the numerous structural genomics projects pursuing the determination of protein structures on a genome-wide scale [15]. A fruitful use of these unprecedented amounts of data strongly relies on the development of bioinformatics approaches, which are essential to store, analyze and interpret the experimental results [16], [17]. Examining the diverse data available through the lens of bioinformatics offers researchers a bird's eye view of the molecular mechanisms underlying biological processes, which represents a crucial change of perspective with respect to the classical reductionist approach of investigating individual molecules one by one.
In this work, we illustrate with some examples how bioinformatics resources, including tools and databases developed by ourselves, can be used to achieve a functional overview of the subject of most of Bert Vallee's research, i.e., zinc enzymes. We show that computational analysis can derive useful insights and principles from the structural and functional data on zinc enzymes collected by Bert Vallee and many other scientists, without whose contributions there would simply be nothing to make sense of.
Section snippets
Methods
All the available zinc protein structures were downloaded by querying the Protein Data Bank (PDB, http://www.pdb.org) [18] for entries containing at least one zinc atom. Zinc sites in each structure were identified by taking all the zinc atoms in the structure, and considering zinc atoms at a distance of less than 5.0 Å from one another as belonging to the same site. For each site, zinc ligands were defined as those (protein or non-protein) residues having a non-hydrogen atom at a distance of
Zinc enzymes with known structure: an overview
Until relatively recently, the amount of structural information on zinc proteins has been minimal. It is only since the 1990s, and especially since the advent of Structural Genomics in the late 1990s, that such information has considerably increased over the years, as shown in Fig. 1. At the end of 2010, the Protein Data Bank (PDB) contained 6170 structures of zinc-binding proteins. Out of these, 4882 are to be considered true zinc proteins in that they bind at least one zinc ion with a
Concluding remarks
Bioinformatics methods provide a powerful tool to obtain useful insights from the large amount of structural and functional data available for zinc enzymes, as well as to make predictions about the occurrence of zinc enzymes in living organisms. From the analysis of the PDB, PDBSprotEC and Metal-MACiE databases, it is estimated that 10% of the chemical reactions catalyzed by enzymes involve at least one catalytic mechanism that requires zinc to occur. Zinc ions playing a catalytic role are
Acknowledgements
This work was supported by Ministero Italiano dell'Università e della Ricerca (MIUR) through the FIRB project RBFR08WGXT.
References (50)
- et al.
J. Biol. Chem.
(1955) - et al.
Trends Microbiol.
(2007) - et al.
J. Mol. Biol.
(1995) - et al.
J. Mol. Biol.
(2009) - et al.
Structure
(2002) - et al.
Biochim. Biophys. Acta
(2010) - et al.
J. Mol. Biol.
(2006) - et al.
J. Mol. Biol.
(2002) - et al.
J. Mol. Biol.
(2000) - et al.
J. Mol. Biol.
(1996)