Chemical modelling: applications and theory - PDF Free Download (2024)

Specialist Periodical Reports

Chemical Modelling: Applications and Theory comprises critical literature reviews of all aspects of molecular modelling. Molecular modelling in this context refers to modelling the structure, properties and reactions of atoms, molecules and materials. Each chapter provides a selective review of recent literature, incorporating sufficient historical perspective for the non-specialist to gain an understanding. With chemical modelling covering such a wide range of subjects, this Specialist Periodical Report serves as the first port of call to any chemist, biochemist, materials scientist or molecular physicist needing to acquaint themselves with major developments in the area.

Editor A Hinchliffe

Chemical Modelling: Applications and Theory Volume 5

ISBN 978-0-85404-248-7

www.rsc.org/spr

Hinchliffe

9 780854 042487

Chemical Modelling: Applications and Theory Volume 5

Specialist Periodical Reports Specialist Periodical Reports provide systematic and detailed review coverage in major areas of chemical research. Compiled by teams of leading experts in their specialist fields, this series is designed to help the chemistry community keep current with the latest developments in their field. Each volume in the series is published either annually or biennially and is a superb reference point for researchers.

Specialist Periodical Reports

Chemical Modelling Applications and Theory

Volume 5

A Specialist Periodical Report

Chemical Modelling Applications and Theory Volume 5 A Review of the Literature Published between June 2005 and May 2007 Editor A. Hinchliffe, School of Chemistry, The University of Manchester, Manchester, UK Authors D. J. Evans, Australian National University, Canberra, Australia P. B. Karadakov, University of York, York, UK J. R. Kitchin, Carnegie Mellon University and National Energy Technology Laboratory, Pittsburgh, PA, USA R. A. Lewis, Novartis Institutes for Biomedical Research, Basel, Switzerland S. D. Miller, Carnegie Mellon University, Pittsburgh, PA, USA E. A. Moore, The Open University, Milton Keynes, UK A. J. Mulholland, University of Bristol, Bristol, UK A. H. Pakiari, Shiraz University, Shiraz, Iran D. Pugh, University of Strathclyde, Glasgow, UK D. J. Searles, Griffith University, Brisbane, Australia D. S. Sholl, Carnegie Mellon University and National Energy Technology Laboratory, Pittsburgh, PA, USA T. E. Simos, University of Peloponnese, Tripolis, Greece M. Springborg, University of Saarland, Saarbru¨cken, Germany S. Wilson, University of Oxford, Oxford, UK and Comenius University, Bratislava, Slovakia C. J. Woods, University of Bristol, Bristol, UK

If you buy this title on standing order, you will be given FREE access to the chapters online. Please contact [emailprotected] with proof of purchase to arrange access to be set up. Thankyou.

ISBN 978-0-85404-248-7 ISSN 1472-0965 A catalogue record for this book is available from the British Library r The Royal Society of Chemistry 2008 All rights reserved Apart from any fair dealing for the purpose of research or private study for non-commercial purposes, or criticism or review as permitted under the terms of the UK Copyright, Designs and Patents Act, 1988 and the Copyright and Related Rights Regulations 2003, this publication may not be reproduced, stored or transmitted, in any form or by any means, without the prior permission in writing of The Royal Society of Chemistry, or in the case of reprographic reproduction only in accordance with the terms of the licences issued by the Copyright Licensing Agency in the UK, or in accordance with the terms of the licences issued by the appropriate Reproduction Rights Organization outside the UK. Enquiries concerning reproduction outside the terms stated here should be sent to The Royal Society of Chemistry at the address printed on this page. Published by The Royal Society of Chemistry, Thomas Graham House, Science Park, Milton Road, Cambridge CB4 0WF, UK Registered Charity Number 207890 For further information see our web site at www.rsc.org Typeset by Macmillan India Ltd, Bangalore, India Printed by Henry Ling Ltd, Dorchester, Dorset, UK

Preface DOI: 10.1039/b801788n Welcome to Volume 5 of the ‘Chemical Modelling’ SPR. Naturally, I want to start by thanking my team of Reporters for the hard work they have put into making this the best and most comprehensive volume so far. We hope you will derive benefit and perhaps even pleasure from our efforts. It seems a long time since I wrote the following in my Preface to Volume 1 (1999) . . . ‘Starting a new SPR is never easy, and there was the problem of where the contributors should start their accounts; since time began? five years ago? An SPR should be the first port of call for an up-to-the-minute account of trends in a specialist subject rather than a dull collection of references. My solution was to ask contributors to include enough historical perspective to bring a non-specialist up to speed, but to include all pertinent references through May 1999. Volume 2 will cover the literature from June 1999 to May 2001 and so on. In subsequent Volumes, I shall ask those Contributors dealing with the topics from Volume 1 to start from there. New topics will be given the same generous historical perspective opportunity as Volume 1 but will have to cover the literature to 2001 + n where n = 0, 2, 4, . . . . This process will continue until equilibrium is reached.’ Equilibrium was reached a couple of Volumes ago and some mature topics don’t need cover every Volume. My final sentence for Volume 1 was ‘I am always willing to listen to convincing ideas for new topics’ as indeed I am. Alan Hinchliffe Manchester 2008 alan.hinchliff[emailprotected]

RSC Specialist Periodical Report Chemical Modelling: Applications and Theory (Volume 5) Chapter 1 2 3 4 5

Corresponding author Dr Adrian Mulholland Dr Richard Lewis Prof Michael Springborg Dr Elaine A Moore Prof David Sholl

6

Prof Debra Searles

7

Dr Steven Wilson

8

Dr David Pugh

9

Prof Ali Pakiari

10 11

Dr Peter Karadakov Prof Theodore Simos

Topic Multiscale modelling of biological systems Computer-aided drug design 2005–2007 Solvation effects The solid state DFT studies of alloys in heterogeneous catalysis Fluctional relations, free energy calculations and irreversibility Many body perturbation theory and its application to the molecular structure problem Experiment and theory in the determination of molecular hyperpolarizabilities in solution The Floating Spherical Gaussian Orbital (FSGO) method Advances in valence bond theory Numerical methods in chemistry

Chem. Modell., 2008, 5, 7–8 | 7 This journal is

c

The Royal Society of Chemistry 2008

CONTENTS Cover The icosahedral ’golden fullerene’ WAu12 reproduced by permission of Pekka Pyykko¨, Chemistry Department, University of Helsinki, Finland.

Preface Alan Hinchliffe

7

Multiscale modelling of biological systems

13

Christopher J. Woods and Adrian J. Mulholland Introduction Interfacing QM with MM models Interfacing atomistic with coarse grain models Interfacing particle with continuum models Beyond continuum models Conclusion

13 15 26 41 45 46

Computer-aided drug design 2005–2007 Richard A. Lewis Introduction QSAR and ADMET Structure-based drug design Virtual screening De novo structure generation Fragment-based screening

51 51 51 54 56 60 61

Chem. Modell., 2008, 5, 9–12 | 9 This journal is

c

The Royal Society of Chemistry 2008

Target fishing Library design Conclusions

62 63 64

Solvation effects

67

Michael Springborg Introduction Fundamental methods Recent studies Conclusions

67 68 75 114

The solid state E. A. Moore Introduction Interatomic potential methods Ab initio methods QM/MM Molecular dynamics and related methods Properties Applications Minerals Conclusions

119

Density functional theory studies of alloys in heterogeneous catalysis

150

John R. Kitchin, Spencer D. Miller and David S. Sholl Introduction Segregation Adsorption properties on alloy surfaces Reaction kinetics Miscellaneous Electrocatalysis Conclusions

150 153 158 166 170 172 176

Fluctuation relations, free energy calculations and irreversibility

182

Debra J. Searles and Denis J. Evans Introduction Fluctuation relations Free energy relations

182 184 191

10 | Chem. Modell., 2008, 5, 9–12 This journal is

c

The Royal Society of Chemistry 2008

119 121 124 130 131 133 134 141 142

Fluctuation relations and irreversibility Comment on the interpretation of the fluctuation relation Conclusions

201 201 202

Many-body perturbation theory and its application to the molecular structure problem

208

S. Wilson Introduction An overview of previous reports Applications An overview of applications of second order theory Application area 1: periodic systems Application area 2: DNA bases and amino acids Application area 3: DFT benchmarking Application area 4: basis set extrapolation and the calibration of general energy models Summary and future directions

208

Experiment and theory in the determination of molecular hyperpolarizabilities in solution; pNA and MNA in dioxane David Pugh Introduction General theory of the response to frequency-dependent electric fields The sum over states method General theory of the EFISH experiment Ab initio and DFT calculations of the pNA b tensor Gas phase measurement Solution EFISH studies of pNA and MNA in dioxane Theoretical approaches to the calculation of the EFISH nonlinearity of pNA in solution Recent work on pNA/MNA Appendix I. Conversion of units

249

The floating spherical Gaussian orbital (FSGO) method A. H. Pakiari Introduction Part I Part II, Development of original theory Part III, Application of FSGO method

279

208 209 211 230 237 238 241 242 243

249 251 253 254 257 260 261 271 275 276

279 280 282 296

Chem. Modell., 2008, 5, 9–12 | 11 This journal is

c

The Royal Society of Chemistry 2008

Part IV, Using the FSGO concept in other methods Appendix

298 307

Advances in valence bond theory

312

Peter B. Karadakov Introduction Comparison of the MO and VB approaches Developments in VB methodology Applications of VB theory Concluding remarks

312 313 316 328 346

Numerical methods in chemistry T. E. Simos Newton–Cotes formulae for the numerical integration of the Schro¨dinger equation Stabilization of a multistep exponentially-fitted methods and their application to the Schro¨dinger equation Appendix A Appendix B Appendix C Appendix D Appendix E Appendix F Appendix G Appendix H Appendix I

12 | Chem. Modell., 2008, 5, 9–12 This journal is

c

The Royal Society of Chemistry 2008

350 350 380

399 408 421 427 439 444 453 464

Multiscale modelling of biological systems Christopher J. Woods and Adrian J. Mulholland DOI: 10.1039/b608778g

1. Introduction At what point does a collection of molecules become a biomolecular system? At what length scale does biology begin, and chemistry end? Biological phenomena involve the flow of information across a range of length and timescales. For example, a cell may be placed under physical stress at the macroscopic level, which causes an increase in pressure within its protective membrane. This pressure has the effect of opening1 or closing2 mechanosensitive ion channels, thereby changing the flow of individual ions into the cell. This changes the ionic concentration within the cell, which then acts as the trigger for a signal sent via a protein signalling pathway. A chemist would look at this as a molecular system that was capable of converting mechanical forces into electrical signals. A biologist would however look at this as the mechanism a cell uses to adapt to stress, and thereby stay alive. Biology is full of such examples. Every thought we have involves the passage of signals between neurons, which itself requires the conversion of electrical signals into flows of ions. These ions trigger the release of neurotransmitter molecules, which cross the synaptic gap between neurons, and bind to individual receptor proteins at the synapse. This causes a change in protein conformation, which open nearby ion channels, causing ions to rush in or out of the neuron, thereby continuing the signal. Information is constantly flowing between the macroscopic world and the atomic, chemical world. Indeed it is this interplay between the chemical and macroscopic worlds that is a real beauty of biology, and it is the recent advances made by the science of biochemistry that has revealed the elegance of the chemicals of life to all. However, while it is possible to use a microscope to watch how an individual cell responds to external stimuli, it is not possible to ‘zoom in’ further and observe what is occurring at the chemical level. Experiments can infer what is happening, and can provide supporting evidence for a particular hypothesis, but there is no experimental technique or microscope that allows us to watch a chemical reaction within an enzyme active site. Until such techniques are developed, the most appealing route that currently exists is to use computers to create models of the biochemical world. Computational scientists can create virtual enzymes, and models of cell membranes, and then use these to provide a window through which the interactions of biomolecules can be observed. If the models are constructed on the firm foundations of physics and chemistry, and if their predictions are carefully compared and validated against experiment, then simulations using these models can provide the valuable insight necessary to link the chemical and biological worlds. Computational scientists have developed many tools for modelling molecules. Computer models are not perfect recreations of reality. Instead, approximations and assumptions have to made, and the model compromised for the sake of computational efficiency. As the size of the system gets larger, and so the size and number of molecules increases, so to does the computational expense of the calculation. This means that the larger the system, the more compromises and approximations must be made. This act of compromise has led computational scientists to develop four main levels of biomolecular modelling: 1. Quantum mechanics (QM). Quantum chemical calculations model the fine detail of the electrons in the molecule. They achieve this by modelling the electrons as a quantum mechanics wavefunction that interacts with the electrostatic potential Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, UK BS8 1TS

Chem. Modell., 2008, 5, 13–50 | 13 This journal is

c

The Royal Society of Chemistry 2008

field generated by the atomic nuclei. Quantum chemical calculations provide the most physically realistic and accurate models of molecules, but this accuracy comes at a cost. While methods have been developed that allow QM calculations on complete proteins,3 in general the high computational expense of QM methods limits their application to small molecular systems. 2. Molecular mechanics (MM). Atomistic molecular mechanics calculations apply the assumption that the fine detail information about the behaviour of the electrons can be ignored, and instead they are approximated by representing their effects using simple descriptors such as atomic partial charges or polarisabilities. By modelling the electrons implicitly, MM methods are much less expensive than QM methods, and so they are able to model significantly larger systems. By including atomic detail, MM models are still limited to the molecular level, and even today’s largest applications can only achieve the modelling of hundreds of thousands of atoms over hundreds of nanoseconds. 3. Coarse grain (CG). Coarse grain (or coarse grained) calculations apply the assumption that the fine detail information about the position of each atom in the molecule can be ignored, and instead groups of atoms are approximated by smearing them out into single ‘beads’. So, for example, rather than modelling each atom in a protein, a CG representation would portray each residue as a single bead. This approximation allows CG simulations to achieve length and timescales that are far beyond those possible using atomistic MM models. 4. Continuum. Continuum models apply the assumption that the fine detail information about the location of any particles or groups can be ignored, and instead systems are modelled as continuum regions. For example, implicit solvent models ignore the location of each individual solvent molecule, but instead represent the complete solvent as a fuzzy dielectric continuum. Equally, continuum models of a cell membrane ignore the individual locations of each lipid molecule, and instead model the membrane as a hom*ogenous elastic sheet. By ignoring particles, and instead modelling biological systems as continuous fields or hom*ogenous assemblies, continuum models are able to simulate the largest length scales and longest timescales of any of the four levels. These four levels of biomolecular modelling are each well-suited to modelling phenomena at the length and timescales for which they were designed. However, what makes biology work, and what makes it scientifically interesting, is the interplay and flow of information across the different length and timescales. It is not possible for simulations at any one of these biomolecular modelling levels to represent these complex, multiscale biological phenomena on their own, and so methods that allow the combination of different levels of biomolecular model together must therefore be sought. Multiscale modelling, in which calculations at multiple length and/or timescales are combined together into a single simulation, is now becoming popular, and its development is now the focus of significant research effort. Multiscale modelling is not new, for example combined QM/MM methods, and MM/continuum implicit solvent methods have been used for over 30 years, and multiscale methods have a rich heritage of applications in the fields of materials modelling and nanomaterials,4 and modelling fluid and gas flow.5,6 Recently, there has been a huge increase in the development and application of multiscale methods for biomolecular modelling. This review focuses on these developments, in particular the application of multiscale methods to biomolecules covering the period from 2005 to 2007. Coveney7 has produced a review of biological multiscale modelling that covers the period up to 2005. Before starting this review, there first needs to be a definition of what is meant by a multiscale method. There are several different definitions that vary depending on the type of coupling between the different modelling levels. This review will adopt perhaps the most broad definition of a multiscale method, namely that it is any method that involves a flow of information from one modelling level to another. By definition, if there is a flow of information from one level to another, then there must 14 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

be an interface between the levels through which this information will flow. Throughout this review it will become clear that there are four main classes of interface; 1. One-way, bottom-up interfaces. These involve a one-way, often one-time transfer of information from a lower level of modelling to a higher level. Examples include using a QM calculation to parameterize an MM forcefield, or using an MM simulation to parameterize a CG potential. 2. One-way, top–down interfaces. These involve a one-way transfer of information from a higher level of modelling to a lower level. Examples include using a CG model to reconstruct an atomistic model of a protein, or using a continuum model to provide the boundary conditions for an atomistic simulation. 3. Two-way parallel interfaces. These involve a two-way dynamic transfer of information between two simulations running in parallel at two different modelling levels. An example includes running both an MM and CG simulation of a system and using replica exchange8–10 moves to exchange coordinates between the two levels. 4. Two-way embedded interfaces. These involve embedding a low modelling level region within a simulation at a higher level, e.g. embedding a QM model of a substrate and active site within an MM model of the enzyme, or embedding an MM model of an ion channel within a CG model of a membrane. This review is therefore organised according to the different interfaces between levels (QM/MM, atomistic/CG, particle/continuum), and then by the different classes of interface that are used between these levels.

2. Interfacing QM with MM models The most accurate physical description of atoms and molecules is provided by quantum chemical calculations. Quantum chemical calculations are capable of correctly predicting the energetics and conformations of small molecules from first principles, using broadly applicable approximations (e.g. the Born-Oppenheimer approximation) and nothing more than fundamental physical constants as input.11 Quantum chemical calculations model electrons as a quantum mechanics (QM) wavefunction that interacts with the electrostatic potential field created by the atomic nuclei of the molecule. QM provides the most exact physical model of matter at the atomic scale, and QM calculations are capable of predicting chemical bonding and chemical reactivity. There are several recent reviews of quantum chemical methods,11–13 and QM methods may now be used across a length and timescale that ranges from modelling the femotosecond interactions of infra-red laser light with carbon monoxide,14 to modelling the sub-nanosecond dynamics of a complete protein.15 There are a range of QM methods available with varying degrees of approximation, with a range that includes fast semi-empirical Hamiltonians such as AM116 or PM3,17,18 and highly exact coupled cluster methods such as LCCSD(T).19 Because QM methods include an explicit representation of electrons, they are able to model chemical processes such as charge transfer, bond breaking and formation, and changes of molecular polarisation. However, the high computational expense of QM methods prevents their application to the large length and timescales that are required to understand complex biomolecular processes. MM methods provide a simpler representation of molecules, in which the fine detail of the electrons represented implicitly via partial charges and, is some cases, molecular polarisabilities.20,21 MM models represent molecules as a collection of atoms interacting through classical potentials. There are several MM models (or forcefields), and they differ in the functional forms of the interaction potential used between atoms, and in the means by which these interaction potentials are parameterized. Several good recent reviews of MM forcefields have been produced.22–26 Several MM forcefields have been developed for application to biomolecular systems. The most popular of these are the CHARMM,27 AMBER,28,29 Chem. Modell., 2008, 5, 13–50 | 15 This journal is

c

The Royal Society of Chemistry 2008

GROMOS30–32 and OPLS33,34 forcefields. Each of these forcefields has evolved over time, with different versions produced periodically. However, despite the proliferation and evolution of MM forcefields for biomolecular modelling, their functional forms all remain broadly similar. Each atom is modelled as a single point in space. Pairs of atoms in separate molecules interact through a pairwise non-bonded potential, Enb, which depends only on the distance between the atoms, r. The electrostatic part of this non-bonded potential, Eelec, is modelled using Coulomb’s law, assigning a fixed partial charge to each atom in the molecule. The non-bonded potential must also model the van der Waals (vdW) forces between the molecules, which result from the combination of the Pauli repulsion that results from the inability of two electrons to occupy the same space with the same set of quantum numbers, and the attractive dispersion (London) forces whose physical basis lies in the ‘instantaneous dipoles’ that result from the wavefunctions of close atoms moving in phase. These vdW forces have their origin in the behaviour of electrons, which are not explicitly modelled in MM forcefields. These forces must therefore be approximated. The most common approximation used for biomolecular applications is the Lennard-Jones (LJ) potential. This approximates the vdW interactions using a 12–6 repulsive–attractive potential, ELJ, " 6 # sij 12 sij ELJ ðrij Þ ¼ 4eij ; ð1Þ rij rij where ELJ(rij) is the Lennard-Jones energy between atom i and atom j, rij is the distance between the pair of atoms, and sij and eij are parameters that are tuned to reproduce the strength of the vdW forces between this pair of atoms, often by fitting to macroscopic properties. Note that this is a pairwise potential that acts only between pairs of atoms. This is despite the fact that unlike the permanent electrostatic forces, vdW forces are not pairwise in nature. Indeed, while permanent electrostatic forces are pairwise, MM forcefields use Coulomb’s law and fixed atomic partial charges to model both the permanent electrostatics of the molecule and, implicitly, its polarisation. Charge polarisation is also not a pairwise phenomenon. The non-bonded potential energy between two molecules is given by the sum of the Coulomb and LJ energies between all pairs of atoms in the molecules. It is therefore an effective pair potential, as the derivation of the partial charges and LJ parameters must account for the errors implicit in only using a pairwise sum over atoms, and must therefore include 3-, 4- to n-body effects implicitly. Modelling the electronic detail of a molecule, as well as providing an explicit representation of polarisation and vdW forces, is also responsible for giving a correct representation of chemical bonding. As MM forcefields do not explicitly model electrons, they must include classical interaction potentials that mimic the effects of chemical bonding. MM forcefields include classical intramolecular interaction potentials, e.g. a harmonic bond potential, Ebond that acts between bonded atoms (called 1–2 atoms), a harmonic angle potential, Eangle that acts on the angle between a series of three bonded atoms (1–3 atoms), and a torsional potential, Etorsion, that acts about the dihedral formed by four bonded atoms (1–4 atoms). Atoms that are separated by more than three bonds (1–5+ atoms) are treated as being non-bonded, and so their interaction energy is calculated using the sum of their Coulomb and LJ interaction energies. The total intramolecular energy of a molecule is then given by the sum of the bond energy between all 1–2 atoms, the sum of the angle energy between all 1–3 atoms, the sum of torsion energy between all 1–4 atoms, and the sum of the non-bonded Coulomb and LJ energies between all pairs of 1–5+ atoms. The total energy of a system of molecules can then be calculated as the sum of the intramolecular energies of all of the molecules, together with the sum of the nonbonded potential energies between all pairs of molecules. By using classical potentials, MM models allow for a very rapid evaluation of the energy and forces acting on each atom within a large biomolecular system. This 16 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

rapid evaluation allows these forces and energies to be used by statistical conformational sampling methods, such as molecular dynamics (MD)35,36 or Monte Carlo (MC),36,37 to generate large ensembles of configurations of the system, from which macroscopic (thermodynamic) properties may be evaluated. However, by not explicitly modelling electrons, MM models struggle to model many chemically important phenomena, such as chemical bond breaking and formation, electronic polarisation and charge transfer. There is therefore a strong motivation to combine MM models with quantum mechanics (QM) calculations within a multiscale modelling framework, and combined QM/MM biomolecular simulation methods have a rich history of application and evolution since their original inception in the early 1970s.38,39 Using a broad definition, QM/MM multiscale methods are those that involve a transfer of information across an interface between the QM and MM levels of modelling. There are several different types of interface in use, which fall into four categories: 1. One-way bottom-up methods. These involve a single transfer of information from a QM calculation to a classical simulation, e.g. by using QM calculations to parameterize the classical potentials used in MM force fields. 2. Two-way dynamic parameterisation methods. These involve a dynamic transfer of information between separate classical and quantum calculations, e.g. using successive QM calculations to dynamically re-parameterize the classical potentials the QM atoms of an MM forcefield during a live simulation. 3. Two-way embedded methods. These involve embedding of molecules or parts of molecules modelled using QM into a system of molecules modelled using MM, e.g. using QM to model a substrate, and MM to model the enzyme and solvent. 4. Two-way parallel methods. These involve the running in parallel of classical and quantum simulations, and dynamically sharing information between them at run time. Examples of each of these different types of interface, and recent developments in their methodology, will now be discussed in turn. 2.1 One-way bottom-up QM/MM interfacing methods As described in the last section, molecular mechanics (MM) forcefields use classical potentials to calculate the interaction energy between pairs of atoms. Several types of interaction potential are required to fully describe the MM energy of a set of molecules: 1. Non-bonded potentials. These typically take the form of a Coulomb potential between non-bonded pairs of atoms to describe polarisation and permanent electrostatics, and a Lennard-Jones (LJ) potential between non-bonded atom pairs to describe the vander Waals (vdW) interactions. 2. Bonded potentials. These typically involve harmonic terms that are applied between 1–2 bonded and 1–3 bonded pairs of atoms, and cosine terms between 1–4 bonded pairs. These potentials try to model the effect of chemical bonding. These classical interaction potentials must be parameterized, e.g. the magnitude of the partial charges on each atom in the molecule must be assigned, and the equilibrium bond length and size of the harmonic force constant must be attached to each bond. In the early biomolecular MM forcefields, these parameters were developed to produce molecular models that could reproduce known experimental properties of the bulk system. For example, several MM water models have been developed.26,40,41 One of the earliest successful models, TIP3P,42 was parameterized such that simulations of boxes of TIP3P molecules reproduced known thermodynamic properties of water, such as liquid density and heats of vaporisation. Such a parameterisation scheme is to be applauded, as it ties the molecular model closely to experiment. Indeed many of the common MM models of amino acids were developed by comparison to experiment, e.g. OPLS.33 Indeed it is such a good Chem. Modell., 2008, 5, 13–50 | 17 This journal is

c

The Royal Society of Chemistry 2008

scheme that even some modern water models, like TIP5P,43 are still parameterized in this way. However, it was quickly realised that parameterisation against experiment required large amounts of physical data that was just not available for the novel molecules being conceived during rational drug design. The developers of biomolecular forcefields therefore created recipes that allowed for the parameters of new molecules to be derived from quantum chemical calculations. One popular example of such a recipe is GAFF (generalised AMBER forcefield),44 which is an MM forcefield for small drug-like ligands that is compatible with AMBER. GAFF uses generic atom types that are assigned to each atom in the drug-like molecule, e.g. aliphatic carbon or aromatic hydrogen. These atom types are used to assign LJ, bond, angle and dihedral parameters to the molecule from a large parameter library. The partial charges for the atoms are derived by performing an AM116 semiempirical QM calculation, and calculating BCC45,46 charges. In a very broad sense, this parameterisation is a multiscale method, as information (the charge distribution) from a QM calculation is transferred to an MM simulation via the parameterisation of the atomic partial charges. This multiscale parameterisation therefore represents a one-time, one-way flow of information up from the QM level to the MM level. A similar scheme is available for the OPLS forcefield,47 which uses CM1A charges48 that are also derived from semiempirical AM1 QM calculations. Wang and Sandberg49 used a more complex multiscale parameterisation to derive intramolecular CHARMM parameters for the interaction of DNA bonded to a gold surface. The parameters were calculated by fitting to density functional theory (DFT) QM calculations. 2.2 Two-way dynamic parameterisation methods QM calculations are now used routinely as the source of parameters for MM forcefields. Indeed this application is now so routine that most workers would not consider forcefield parameterisation to be an example of a multiscale method. However, there is a drawback with using QM calculations to provide MM forcefield parameters. The problem is that the information flow is only in one direction, from the QM to the MM level. This means that the QM-derived parameters for a molecule have to be very general, and be able to represent the molecule in a variety of conformations and environments. This is an unreasonable requirement, as it is clear that the charge distribution and polarisation of the molecule depends on both its conformation and its environment, e.g. whether it is in bulk solvent or whether it is bound to the active site of a protein. Because information only flows from the QM to MM level, there is no mechanism that allows information about the environment and conformation of the molecule experienced during the MM simulation to be fed back to the QM calculation. A solution is to modify multiscale parameterisation so that there is a two-way interface between the QM and MM levels of modelling. There have been two recent applications that have made such a modification: one targeted at creating a QM/MM multiscale docking method,50 and another targeted at QM/MM multiscale free energy calculations.47,51,52 Docking is one of the primary tools used during the process of rational drug design.53 The aim of docking is to predict the binding mode of a ligand with a protein. Because docking calculations are typically used to study how libraries of thousands of ligands bind to a protein, the calculations involved must be simple and efficient. This means that the interaction potentials used in docking tend to be based on molecular mechanics forcefields.53 MM forcefields struggle to model the changes in polarisation upon protein-ligand binding, an effect that is thought to account for as much as 10–40% of the binding affinity.54 Multiscale docking methods that attempt to use QM calculations to overcome this problem have therefore been developed.50,55 Cho et al.50 have developed a QM/MM docking method that uses multiscale parameterisation dynamically throughout a docking calculation. The electrostatic interaction energy between the ligand being docked and the protein is 18 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

calculated using Coulomb’s law, with atomic partial charges placed on the protein and ligand atoms. Cho et al. first showed that the docking prediction is improved if the atomic partial charges on the ligand are derived from a QM calculation of the ligand in the bound geometry. They demonstrated this by selecting several test protein-ligand systems whose bound geometries were known via crystal structures available in the protein databank (PDB). They calculated the partial charges by performing a density functional theory (DFT) calculation of the ligand in the bound geometry. The wavefunction was polarised by embedding the partial charges from the protein within the Hamiltonian of the QM calculation. By using the bound geometry, and by embedding the partial charges of the protein, information about the environment of the ligand in the protein active site was made available to the QM calculation. The polarised wavefunction was used to obtain the molecular electrostatic potential (MEP) surface around the ligand. Partial charges were generated using an electrostatic potential fitting procedure to reproduce the QM MEP. By calculating these partial charges from a QM calculation that had information about the bound geometry, it could be argued that these atomistic partial charges were optimised for the bound geometry. Cho et al. demonstrated this50 by running redocking calculations where the ligands were docked using both optimised and generic partial charges. The results demonstrated that docking calculations using the optimised charges were significantly more likely to rediscover the known experimental binding modes. To turn this observation into an effective docking algorithm, Cho et al. created an iterative dynamic reparameterisation algorithm; 1. Dock the ligand using the default partial charges taken from the docking MM forcefield. 2. Perform a QM calculation on the predicted binding mode to obtain optimised partial charges for the ligand. 3. Dock the ligand again, this time using the optimised partial charges. 4. Keep iterating until the charges and predicted binding mode converge to within a set limit. By dynamically reparameterising the ligand throughout the docking calculation, Cho et al. allow information to flow both ways between the QM and MM levels. A similar idea has been developed by Jorgensen and co-workers to create a QM/MM multiscale method for free energy calculations.47,51,52 Jorgensen and co-workers developed a method to obtain atomic partial charges efficiently from a QM calculation that were compatible with the partial charges from the standard OPLS all-atom forcefield.47 The charges were calculated using the charge model 1 (CM1A)48 analysis of an AM1 semiempirical QM calculation. However, as CM1A charges were parameterized to reproduce gas-phase dipole moments,48 they had to be scaled by a factor of 1.2 so that an implicit account could be made for the extra polarising effects of polar solvents. Jorgensen and co-workers first used this method to perform QM/MM hydration free energy calculations. The solute molecule was modelled using QM (AM1), while the solvent molecules were modelled using MM (OPLS). The QM calculation was used to obtain the intramolecular energy of the solute. The QM calculation was also used to obtain the atomic partial charges on the solute using AM1/CM1A. These partial charges were used to calculate the electrostatic interaction energy between the solute and solvent via Coulomb’s law. The LJ equation was used to obtain the vdW interaction between the solute and solvent using pre-assigned OPLS e and s LJ parameters. The QM calculation in this method was used only to dynamically re-parameterize the MM atomic partial charges during the simulation. The only information flow from the MM to QM level was the change in conformation of the solute. The solvent environment around the solute was not passed explicitly, as it was not included within the QM calculation. The effect of the solvent was only felt implicitly in the QM calculation via the application of the scale factor. Despite the lack of explicit modelling of the solvent environment in the QM calculation, Jorgensen and co-workers have successfully used this method to study solution-phase Diels-Alder reactions,56 and to study the enzyme-catalysed Claisen Chem. Modell., 2008, 5, 13–50 | 19 This journal is

c

The Royal Society of Chemistry 2008

rearrangement reaction of chorismate to prephenate.51 Jorgensen and co-workers have since adapted this method57–59 to use the PDDG/PM3 semiempirical QM Hamiltonian,60 using the CM361 method to extract charges, which are then scaled by a factor of 1.14,62 again to provide an implicit account for the extra polarising effects of the solvent. 2.3.

Two-way embedded methods

Periodic multiscale reparameterisation, i.e. periodically during a simulation, provides a two-way conduit for information flow between the QM and MM levels of modelling. However, the coupling between levels is not particularly strong. The interchange of information between levels occurs only periodically, which can lead to the MM level falling out of step with the QM. This problem was experienced during the development of the QM/MM docking method of Cho et al.50 If the ligand was initially docked in a poor configuration, then the partial charges derived for that conformation could bias the subsequent docking runs to rediscover the poor configuration in preference to the correct binding mode. Cho et al. developed a survival of the fittest algorithm50 that ran multiple docking runs in parallel, thereby preventing one poor result from biasing the rest of the calculation. A closer coupling between the QM and MM levels can be achieved using an embedded interace method. A QM region of the biomolecular system is embedded within a larger MM simulation. One of the primary application areas for embedded QM/MM methods is computational enzymology (the computational modelling of enzyme-catalysed reaction mechanisms), where typically a QM model of the substrate and part of the enzyme active site is embedded within an MM model of the rest of the enzyme and solvent.63 Embedding a QM model within an MM simulation creates a dynamic and permanent interface between the two levels, with information flow across that interface having to be managed for each configuration of the simulation. The ONIOM method, developed by Morokuma and co-workers,64 provides such an interface via the use of multilevel corrections. The ONIOM method partitions the system into multiple layers. For example, consider a two-layer system, where a low-level QM region, A, is embedded within a high-level MM region, B. The energy of both regions, A + B, is first calculated using only the MM Hamiltonian, giving EMM(A + B). This total energy is corrected by calculating the difference in energy between the QM and MM energies of the QM region, EQM(A) EMM(A). The total ONIOM energy of the system is therefore EMM(A + B) + EQM(A) EMM(A). The generalisation of this algorithm to multiple levels is straight-forward, e.g. a system can be divided into an ab initio QM region, A, which is embedded within a semiemprical QM region, B, which is itself embedded within an MM system, C. The ONIOM energy in this case would be EMM(A + B + C) + [Esemiemprical(A + B) EMM(A + B)] + [Eab initio(A) Esemiemprical(A)]. The ONIOM method allows the facile combination of QM and MM levels of modelling. However, the electrostatic interaction between the QM and MM regions in the original ONIOM implementation is handled at the MM level only, via EMM(A + B), based on partial atomic charges derived from the QM calculation. The use of this method, called classical or mechanical embedding,65 means that information about the electrostatics of the system flows only from the QM up to the MM level. There is no conduit by which the electrostatic environment of the MM atoms is able to flow down to the QM region, to polarise the QM wavefunction.65 An alternative method of interfacing QM and MM calculations, called electronic embedding, solves this problem. In electronic embedding, the partial charges of the MM atoms are embedded within the QM Hamiltonian. This allows the QM wavefunction to be polarised by the MM atoms, thereby providing a two-way conduit for electrostatic information between the QM and MM regions. The ONIOM method has since been extended to use electronic embedding,65 thereby overcoming one of the fundamental weaknesses of the algorithm. 20 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

Electronic embedding has had a rich and long history of application in other QM/ MM schemes. The first application of electronic embedded QM/MM to a biomolecular system was in the ground-breaking work of Warshel and Levitt in 1976.39 They developed the method to study the reaction mechanism of hen egg-white lysozyme. Warshel has recently produced a detailed, clear and very interesting review63 of the details of the algorithm and the developments in QM/MM methodology since his pioneering work in the 1970s. Acceptance of this approach took a long time,63 but there is now a large body of QM/MM applications that use such methodology (see one of the very many reviews of QM/MM methods available in the literature66–69). The underlying theory of this method has been covered by many different authors68,69 so only a brief description will be given here. The biomolecular system is divided into a QM region, for example a substrate, and an MM region, e.g. an enzyme and surrounding solvent molecules. The total energy of the system is the sum of the energy of the MM region, evaluated using a standard MM forcefield, the energy of the QM region, and the QM/MM interaction energy between the two regions. The QM/MM interaction energy is split into two parts: an electrostatic part and a vdW part. The vdW part is calculated by assigning LJ parameters to all of the QM atoms and calculating the interaction between the QM and MM atoms using the LJ equation. The electrostatic part of the QM/MM interaction is calculated by bundling it together with the calculation of the QM energy of the QM region. This is achieved by embedding within the QM Hamiltonian the locations and partial charges of all of the MM atoms that are within a pre-determined cutoff distance of the QM atoms. Normally the MM atoms are represented as point charges in the QM calculation, but Gaussian charge distributions, with the width of the Gaussian that is similar to the covalent radii of the MM atoms may instead be used. 70 The MM partial charges act to polarise the QM wavefunction of the QM atoms, and therefore the evaluation of the energy of this wavefunction returns both the intramolecular energy of the QM atoms and the electrostatic interaction between the QM and MM atoms. The simple split of the QM/MM interaction into electrostatic and vdW parts is, however, only possible if the interface between regions lies between molecules, i.e. all molecules are either QM or MM, and there are no molecules that sit across the interface. It is desirable (e.g. within computational enzymology) for a single molecule to be able to bridge this interface, e.g. while the majority of an enzyme is modelled at the MM level, it is usually necessary to represent some (e.g. catalytic) active site residues at the QM level. The problem with having a single molecule straddle the interface is that the QM/MM interaction energy, as well as modelling electrostatic polarisation, must now also include terms that account for the chemical bonding between atoms modelled at the QM level and atoms modelled at the MM level. This is a particular problem with the QM side of the calculation, as when the boundary bisects a covalent bond, the electron density is terminated abruptly at the end of the QM region and electrons of the bonded atom are missing, potentially leading to unpaired electrons.66 The three most popular methods for solving this problem are the link atom method,66,71,72 the local selfconsistent field (LSCF) method73,74 and the generalized hybrid orbital (GHO) method.75,76 The ‘‘dummy junction atom’’ or link atom approach introduces so-called link atoms to satisfy the valence of the atoms on the QM side of the QM/MM interface.66,71,72 Usually this atom is a hydrogen, but other atom types have also been used, e.g. halogens such as fluorine or chlorine.77 The link atom method can be used with both the Warshel-type QM/MM methods and ONIOM methods.65,78 The link atom method has been criticised because it introduces extra unphysical atoms to the system, which come with associated extra degrees of freedom. Another problem is that a C–H bond is clearly not chemically exactly equivalent to a C–C bond. Despite these problems, the simplicity of the link atom method means that it is used widely in the QM/MM modelling of proteins and other biological molecules. 79

Chem. Modell., 2008, 5, 13–50 | 21 This journal is

c

The Royal Society of Chemistry 2008

Zhang et al.80,81 solved these problems with their pseudo bond method. If X and Y are the bonded QM and MM boundary atoms, respectively, then the link atom method effectively replaces the Y atom with a hydrogen, thereby changing X–Y into X–H. This has the problem that H is not necessarily chemically similar to Y, and also the X–H bond length may not be the same as the X–Y bond length. In the pseudobond method, Y is instead replaced by Yeff. This is a one-free valence boundary atom that has a parameterized effective core potential that mimics the strength of the X–Y bond. This method has been tested successfully80 with both Hartree Fock (HF), density functional theory (DFT) and MP282 QM/MM calculations. Antes and Thiel83 have also introduced a conceptually similar approach, which they call ‘‘adjusted connection atom’’, that works at the semiemprical QM level. Parameterisation of these effective link atoms is not straight-forward, as they must minimally perturb the electron density compared to a QM calculation of the entire molecule. Ro¨thlisberger and co-workers have developed a sophisticated scheme that derives parameters for these atoms via density functional perturbation theory.84 A second approach to define the bonding interface between the QM and MM regions is the local self-consistent field (LSCF) algorithm developed by Rivail and co-workers.73,74 In the LSCF method the bonds between the QM and MM atoms are represented by strictly localised bond orbitals, which are parameterized by separate QM calculations on small molecules.75 These localised orbitals are assumed to be transferable to the protein system, and are used, and kept constant, throughout the self-consistent field (SCF) QM calculation. An elegant feature of the LSCF method is that it does not require use of link atoms, and a comparison of the LSCF and link atom methods85 showed that both give equivalent energy results. However, the parameters for the localised bond orbitals have to be determined for each new system studied.75 Fonili et al.86 have recently shown that it is possible to use frozen core orbitals on the MM frontier atom within the LSCF scheme. This provides an explicit description of the core electrons of the atoms on the MM frontier atom, thereby improving the physical description of the interface, thus reducing the error on the calculation. While the LSCF method is elegant, and has been shown to work well, the parameters for the localised orbitals are not very portable. Gao et al. addressed this problem by developing generalised hybrid orbitals (GHOs).75,76 The QM boundary atom at the QM/MM interface has the standard valence s and p orbitals as all of the other non-hydrogen atoms in the QM region. These four sp orbitals are transformed into a set of orthogonal hybrid orbitals, which can be defined by the bound geometry of the atom.75 These hybrid orbitals are used, along with the atomic orbitals of the QM region, to determine the QM energy. However, only one hybrid orbital points along the bond between the QM and MM boundary atoms, and it is this active hybrid orbital that needs to be optimised. The complete set of this one active hybrid orbital, plus all of the atomic orbitals from the other QM atoms form the active set that is optimised during the SCF calculation. The remaining three hybrid orbitals, called auxiliary orbitals, act, together with the nucleus charge, to generate an effective core potential for the QM boundary atom. Gao et al. realised that these auxiliary orbitals may be parameterized to mimic the effective core potential for the active electrons from the MM region. Therefore rather than parameterising the charge density of the hybrid orbitals for each specific system, as is the case for the LSCF method, they instead optimise the semiemprical parameters for the boundary atom to reproduce the bonding properties of full QM systems. As a result, the parameters for this GHO method are expected to be general and transferable in the same way as all the semiemprical parameters. 75 The method has since been extended by Gao and co-workers87 to work with the selfconsistent charge density-functional tight binding (SCC-DFTB) method.88 The accuracy and efficiency of SCC-DFTB is making it a popular choice to model the QM region, and its use in computational enzymology has increased greatly in recent 22 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

years.88 Gao and co-workers76,89 have also developed a GHO method that is suitable for ab initio Hartree Fock (HF)89 and density functional theory (DFT) calculations.76 Ko¨nig et al.90 recently compared several different algorithms to model chemical bonds between QM and MM atoms, and concluded that while none of the approaches were perfect, error cancellation meant that enzyme catalysed reaction free energies using an SCC-DFTB QM region were only marginally affected by the choice of algorithm, if total charge was conserved during the reaction. The conclusion contrasts with a comparison study in which we were involved,91 in which we compared the link atom and GHO boundary methods. While we found that both methods, when properly applied, can lead to similar behaviour, the inclusion of conformational sampling amplified the effects of the differences between the methods. This led to the two methods returning different free energy reaction profiles despite being applied to the same enzyme system. It is important to make clear that while the same enzyme system was used, the QM region was not the same for the two boundary methods. This was because of the different constraints on the partitioning of the two methods (the GHO method requires that the QM atom bound to an MM atom must be an sp3 carbon91). The origin of the observed difference may well be in the different QM and MM systems, rather than in the partitioning schemes themselves. Despite these questions over the fine detail of how QM/MM methods are applied, their development and application has now matured to a point where they can provide near-quantitive results for activation enthalpies and free energies of reaction.92 It is now possible to perform electronic structure calculations on large systems approaching chemical accuracy, thus allowing quantitative studies of reaction mechanisms in enzymes.92 2.4.

Two-way parallel QM/MM methods

Quantum mechanics calculations are computationally demanding. This has caused difficulties with their application to the calculation of thermodynamic properties, such as free energies. This is because thermodynamic properties are calculated as an average over a large ensemble of conformations of a system. Sampling methods, such as molecular dynamics (MD)35,36 or Monte Carlo (MC)36,37 must be used for the rigorous generation of such ensembles. The computational expense of QM calculations means that it is only practical to generate small ensembles via standard MD or MC, e.g. current methods are limited to picoseconds of molecular dynamics. Two-way parallel QM/MM interfaces have therefore been developed in an attempt to overcome this problem. We use the term parallel methods to mean those that use both a QM or QM/MM simulation running in parallel with a standard MM simulation, using only periodic exchange of information between the two simulation levels. Warshel and co-workers developed a successful QM/MM parallel method in a set of pioneering papers in the late 1990s.93,94 The aim of this method was to calculate the free energy difference between two systems, A and B. For example, system A could be a substrate bound to an enzyme, while system B could be the transition state. The free energy difference between these two corresponds to the activation free energy of the enzyme catalysed reaction. Warshel and co-workers calculated the relative free energy of A and B by first using a molecular mechanics type potential. Because an MM potential was used, molecular dynamics sampling was efficient, and therefore a large ensemble, and well-converged relative free energy were calculated. This relative free energy, DGMM(A - B), can only be as good as the MM potential used during the calculation. Ideally, this free energy should be calculated using a QM or QM/MM representation of A and B, giving DGQM(A - B). However, the computational expense of the QM calculation prevents the efficient generation of the large ensembles necessary to evaluate converged free energies. Warshel and coworkers solved this by rather than calculating DGQM(A - B) directly, they used the Chem. Modell., 2008, 5, 13–50 | 23 This journal is

c

The Royal Society of Chemistry 2008

Fig. 1 The free energy cycle93,94 used to calculate the QM/MM free energy difference between systems A and B, DGQM/MM(A - B). The free energy difference between A and B is first estimated using an approximate potential (e.g. an MM potential), giving DGMM(A - B). This is then corrected to the QM/MM value by calculating the free energy necessary to perturb system A from MM to QM/MM (DGMM-QM/MM(A)) and the free energy to perturb system B from MM to QM/MM (DGMM-QM/MM(B)).

MM ensembles to calculate the difference in free energy between the QM and MM representations of A and B. In essence, Warshel and co-workers calculated the free energy error associated with using the MM forcefield. By calculating these errors, Warshel and co-workers were able to correct DGMM(A - B) so that it was formally equal to DGQM(A - B)93,94 (see Fig. 1). The correction free energies were calculated by generating ensembles for systems A and B using the MM model. The difference in energy between the QM and MM models was calculated for a subset of each ensemble, and the difference between these energies used as input to a single-step free energy perturbation (FEP)95,96 between the MM model (the FEP reference state) and the QM model (the FEP perturbed state). As long as the MM model is a good approximation of the QM model, i.e. the phase space overlap of the two models is good, then the average calculated via the FEP equation will converge to an accurate estimate of the correction free energy. The key advantage of this method is that all of the thermodynamic sampling is performed using only the MM model of the system. QM or QM/MM calculations are run in parallel with the MM sampling to estimate the correction free energies. The disadvantage of this method is the requirement of good overlap between the QM and MM models. Warshel and co-workers mitigate this disadvantage through the development of the empirical valence bond (EVB)93,94,97 forcefield, which has been designed to give energies that are in good agreement with experiment and QM calculations. In addition, the EVB potential has been developed so that it can be used to study chemical reactions, something that is not possible using most of the biological MM forcefields. The EVB potential and the Warshel parallel QM/MM method have been very successful, and have been used to study a variety of systems.98–102 Warshel and co-workers developed their method to avoid the problem of poor sampling of a QM or QM/MM Hamiltonian. MD methods are currently limited to picoseconds of QM/MM dynamics for typical biomolecular applications, even using relatively low levels of QM theory. Monte Carlo (MC) methods suffer from even greater problems. MC works by performing typically millions of small random moves of the biomolecular system, each of which are tested according to the change in energy associated with that move. MC sampling of a QM/MM Hamiltonian would potentially require millions of QM energy evaluations, which is impractical using current methods and computers. A second class of parallel QM/MM methods attempt to solve this problem. These methods use a novel Monte Carlo algorithm developed by Hastings in 1970.103 This is a multiscale sampling method that uses MC sampling at one modelling level to generate an ensemble which is formally correct for a different modelling level. The algorithm works by creating a new type of Monte Carlo move. The move starts at configuration i. The energy of this configuration is evaluated using both the fast, high-level model, giving Efast(i), and the slower, low-level forcefield, giving Eslow(i). A block of MC moves is then performed using the fast forcefield. This results in a new configuration, j. The energy of this configuration is evaluated using both forcefields, giving Eslow(j) and Efast(j). These 24 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

Fig. 2 Application of the Metropolis-Hastings103 algorithm to accelerate sampling of a system represented using a QM/MM Hamiltonian.104,105 The Monte Carlo move starts at configuration i. The energy of this configuration is evaluated using the target QM/MM Hamiltonian (giving EQM/MM(i)) and on an approximate (MM) Hamiltonian (giving EMM(i)). Standard Metropolis Monte Carlo moves are then attempted from configuration i using only the approximate MM Hamiltonian, until after a set number of moves, the system is in configuration j. The energy of configuration j is evaluated using both the QM/MM and MM Hamiltonians (giving EQM/MM(j) and EMM(j)). Configuration j is then accepted into the QM/ MM ensemble according to the probability min{1,exp( DDE/kBT)} where DDE = (EQM/MM(j) EMM(j)) (EQM/MM(i) EMM(i)).

energies are used to test configuration j according to a new MC acceptance test. Configuration j is accepted into the ensemble if this test is passed (see Fig. 2). Otherwise the whole block of sampling is rejected and the simulation is reset to configuration i. The form of the MC test is such that even though the trial configurations are generated using the fast forcefield, they are accepted into the ensemble of the slow forcefield with the correct Boltzmann probability. This algorithm was popularised for applications to QM and QM/MM systems by Schofield and co-workers,104,106 who coined the phrase ‘‘molecular mechanics-based importance function’’ (MMBIF). One of the problems with this algorithm is that the acceptance ratio of the MC test will be low if there is poor overlap between the fast and slow forcefields. This is similar to the problem encountered in Warshel’s correction free energy method. Effort may therefore need to be spent optimising the fast forcefield such that it is a better match to the slow forcefield. We have developed a parallel QM/MM method105 that combines the advantages of both the Warshel and MMBIF algorithms. The method works by using the MMBIF method to generate QM/MM ensembles from which the Warshel correction free energies can be calculated in full. The correction free energy is calculated using thermodynamic integration (TI)107,108 over a fictional l scaling parameter that maps from the QM model to the MM model. This l parameter allows the QM model to be changed over a series of windows into the MM model. The MMBIF algorithm can then be used to generate ensembles of conformations of the system at different values of l, so that the gradient of the free energy with respect to l can be calculated at several points between the QM and MM models. These gradients can then be integrated across l to return the correction free energy. We overcome the problems associated with potentially poor overlap between the QM and MM models by using replica exchange8,9,109 moves across the l coordinate during the simulation. Replica exchange moves are additional Monte Carlo moves that lightly couple multiple trajectories together. All of the MC simulations at different l values are run in parallel. Neighbouring pairs of simulations are tested periodically according to a replica exchange MC acceptance test. If this test is passed, then the l values of the neighbouring pairs are swapped. This has the effect of allowing each MC simulation to sample multiple l values during the simulation. This enhances convergence of the free energy averages. We have used this method105 to calculate converged relative hydration free energies of water and methane, using an MP2 ab initio QM model of water and methane, solvated by a periodic box of MM waters. While this was a nonbiological application, we are currently using this method to calculate QM/MM relative binding free energies of protein-ligand systems (using a DFT QM model of the ligand and an MM model of the protein and explicit solvent), and are planning to use it to perform some computational enzymology calculations. Chem. Modell., 2008, 5, 13–50 | 25 This journal is

c

The Royal Society of Chemistry 2008

3. Interfacing atomistic with coarse grain models In mid 2007, Leontiadou, Mark and Marrink110 produced a paper in which they used atomistic molecular dynamics simulations to model the effects of ionic concentration on the transport of ionic species across a pore in a lipid membrane. This work involved several large simulations, involving 128 dipalmitoylphosphatidylcholine (DPPC) lipids (each containing 130 atoms) and about 6000 water molecules. This work pushed the limits of what is achievable with current atomistic molecular dynamics, and, by using approximations such as modelling long-range electrostatics using a reaction field, and using bond constraints such that a 5 fs integration timestep could be used, they were able to run several simulations of between 50 ns to 100 ns in length. Despite the impressive size of these simulations, they are still limited to biologically small length and time scales. 128 lipids is merely an 8 8 membrane bilayer, which is too small to model effects such as membrane curvature or membrane waves.111,112 100 ns is also too short a time to capture events such as membrane protein aggregation or lipid raft formation within a membrane.111 Coarse grain models provide a route to longer time and length scales in biomolecular simulations. Coarse grain models are a class of mesoscale model that work by grouping several atoms together and modelling them as a single interaction site. In effect, groups of atoms are smeared together into beads. For example, a coarse grain model could be constructed that represents an amino acid residue as a single bead, and a protein as a string of beads. The use of coarse grain models reduces the computational expense of a simulation, as coarse graining reduces the number of interaction sites. In addition, CG models contain fewer degrees of freedom, and use forcefields that lead to smoother potential energy surfaces. The smoother potential energy surface reduces the problems associated with frustration or non-ergodic trapping, thereby leading to improved sampling and a lower correlation time. Also coarse graining tends to remove the stiffest degrees of freedom from the model (e.g. the C–H bond vibrational modes), thereby allowing a CG model to use a larger timestep. All of these effects mean that CG simulations provide a route to modelling length and time scales that are far beyond that which is practically achievable via atomistic molecular dynamics. Coarse grain modelling is currently undergoing a renaissance, and there is now significant international effort being spent developing and applying coarse grain methods to model biological systems. It is not the purpose of this review to cover all of these recent developments in depth, so the interested reader is directed to several excellent modern reviews of the development and application of CG methods.113–115 Coarse grain models allow simulators to routinely access length and time scales that are not practically possible using atomistic modelling methods. However, in smearing out the atomistic detail, CG models run the possibility of missing out important atomistic effects, much in the same way that molecular mechanics models, in smearing out all of the electronic detail, can fail to model important electronic effects such as polarisation. There is now significant interest in interfacing atomistic and coarse grain models within a multiscale framework, so that this problem may be overcome. Just as there is significant variation in the type and strength of interaction in the different methods developed to interface QM and MM models, so too is there significant variation in the type and strength of interaction of the different methods of interfacing atomistic and CG models. The type of interfaces broadly fall into four categories, which are similar in nature and definition to the interfaces that have been developed for QM/MM interfaces: 1. One-way, bottom-up methods. These involve a single transfer of information from atomistic simulations or calculations to the CG simulation, e.g. by using an atomistic simulation to parameterize a CG model. 2. One-way, top–down methods. These involve a transfer of information from the CG simulation to the atomistic simulation, e.g. using a CG model to enhance

26 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

sampling, and then using an atomistic model to explore interesting conformations that are discovered. 3. Two-way parallel methods. These involve the running, in parallel, of both a CG and atomistic model, and dynamically exchanging information between them at run time. 4. Two-way embedded methods. These involve the running of simulations in which an atomistic region (e.g. a membrane protein) is embedded within a CG model (e.g. a lipid membrane). Examples of each of these different types of interface will now be presented and discussed in turn. 3.1 One-way, bottom-up interfacing methods Coarse grain models can be interfaced using a one-way, bottom-up scheme. In this scheme, an atomistic model of the system (be it MM or QM) is used to parameterize a coarse grain model of the system. This idea is not new, and was the parameterisation method used by Levitt and Warshel116,117 when they first introduced coarse grain models in their pioneering 1970s papers. Levitt and Warshel’s 1975 paper introduced the first coarse grain model of a globular protein. It used two CG particles per residues: one that represented the a-carbon of an amino acid (Ca), and one that represented the side-chain atoms. A torsion potential acted about Ca particles, while a Lennard-Jones type potential acted between pairs of side chain particles. This CG model was built to represent bovine pancreatic trypsin inhibitor (BPTI), and was very successful, being able to correctly refold the protein starting from a completely denatured configuration. This work is interesting not only because it represents the first coarse grain biomolecular model, but also because it represents the first multiscale MM/CG simulation. Levitt and Warshel generated the parameters for their coarse grain model by averaging over interaction energies calculated using an atomistic potential. The parameters for the interaction potential between side chain particles were calculated by assuming that the side chains had spherical symmetry. The effective potential between identical side chain particles was calculated at various distances apart by summing the interaction energies of all of the atoms in one sphere with all of the atoms in the other sphere using a molecular mechanics style potential. This interaction potential between like-particles was then used to parameterize a Lennard-Jones type function, and the parameters between different side chain particles were then obtained using a geometric combining rule. The torsion potential between Ca atoms was also calculated by fitting to atomistic calculations, with the torsion potential between a pair of residues based on a time average of the energy calculated from atomistic simulations of a set of dipeptides. 3.1.1. Coarse grain models of lipid membranes. Levitt and Warshel’s method of obtaining CG parameters from underlying atomistic calculations can now be recognised as an example of a multiscale parameterisation scheme, where there is a one-way, bottom-up, flow of information from the atomistic to the CG calculation. Levitt and Warshel were ahead of their time in using a multiscale parameterisation scheme for their CG model. Coarse grain models only became popular for biomolecular modelling in the 1990s, which saw the beginning of the development of CG representations of lipids. Because the description of the development of coarse grain lipid molecules is outside the scope of this review (excellent reviews of this subject113,115 exist already) only a brief history of CG lipid models will be presented, in particular to highlight how CG lipid models have recently moved to using multiscale parameterisation methods. The first applications of CG lipid models looked for qualitative, rather than quantitative predictions, and therefore did not use any information from atomisticlevel calculations in their parameterisation. One of the first models produced during this time was by Smit et al.118 This model was used to investigate the phases of an oil/ Chem. Modell., 2008, 5, 13–50 | 27 This journal is

c

The Royal Society of Chemistry 2008

water/surfactant system. This was a qualitative model that used just two types of particle; an oil (o) particle and a water (w) particle. An oil molecule was represented using a single o particle, water using a single w, and a surfactant was modelled as a chain of two w particles followed by 5 o particles, all held together with harmonic springs. All particles interacted using a Lennard-Jones potential, with attractive o–o and w–w interactions, and a purely repulsive o–w interaction. This simple representation was qualitatively able to model micelle formation. Goetz et al.119 and Noguchi and Takasu120 also developed qualitative CG models of amphiphiles, and used them to investigate phenomena such as bilayer and vesicle formation. One of the problems with a CG model is that the form of the interaction potential between CG particles is not clear, and there are questions over whether simple potentials are sufficient. Noguchi and Takasu developed their qualitative model of amphiphiles using custom potentials; a pairwise exponential term to give a soft-core repulsive shape to the CG particles, and a many-body attractive potential that aimed to model the hydrophobic effect. Using these potential forms, and a small number of adjustable parameters, they were able to simulate the self-assembly of bilayer vesicles. Brannigan and Brown121 also developed a pure CG model with a complex form. They modelled lipids as soft spherocylinders that interacted via a short-range r 8 repulsive term and an isotropic attractive alignment potential, which was based on an r 2 attractive term that was mediated by an angular dependence. These terms gave the lipids shape and encouraged them to form an aligned lamellar phase. To encourage bilayer formation, they designated one end of the spherocylinder as the lipid tail, and added a r 6 Lennard-Jones type attractive term between tails. This model resulted in a very small number of parameters, which allowed its phase behaviour with respect to a parameter search to be investigated. With the right parameters, this model was observed to self-assemble into a bilayer. Michel and Cleaver122 were still investigating the correct functional form to use for coarse grain amphiphile molecules in 2007, in a paper that investigated the use of the GayBerne123 potential. The Gay-Berne potential is effectively an anisotropic version of the Lennard-Jones potential, where the r 12 repulsive and r 6 attractive terms are attenuated by the angle of interaction between two particles (using the dot product of the particles alignment vectors). This has the effect of elongating the LennardJones sphere along an alignment vector and turning it into an ellipsoidal rod. Michel and Cleaver performed a parameter space search for Gay-Berne particles, and were able to qualitatively observe several different liquid crystal phases. Despite Levitt and Warshel’s demonstration of the effectiveness of using quantitatively parameterized CG models, it took until 2001 before other workers began developing quantitative CG models for biomolecules. The Shelley model, developed by Shelley et al.124,125 was the first quantitative CG biomolecular representation that was used to model lipid membranes. Shelley et al. developed a CG model of the lipid dimyristoylphosphatidylcholine (DMPC). They represented the 46 atoms of DMPC with just 13 spherical interaction sites. Four sites were used for each hydrocarbon chain (three chain SM sites, and one terminating ST site), one site each for the esther link (E1), one site for the glycerol backbone (GL) one site for the choline (CH) and one site for the phosphate (PH) in the headgroup (see Fig. 3). In addition, they also used a spherical W particle, that represented a grouping of three water molecules. The interaction potentials between these spherical groups were based on LennardJones potentials, e.g. interactions between W particles used a LJ 6–4 potential (an r 6 repulsive term coupled with an r 4 attractive term). The parameters for these LJ potentials were obtained by reproducing thermodynamic properties, e.g. the sWW parameter was set to reproduce the experimental density of water at 303.15 K, while calculations of the vapour pressure of a box of W particles, where used to set eWW such that the experimental boiling point of water was reproduced. A Lennard-Jones 9–6 potential was used between non-bonded hydrocarbon particle sites (SM and ST), which the LJ parameters obtained by matching the densities and vapour pressures of boxes of nonane and dodecane. The Shelley CG model was mostly 28 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

Fig. 3 A comparison of different coarse grain lipid models. The Shelley model124,125 of DMPC, and Marrink126 and Essex127 models of DPPC are compared to their atomistic equivalents (for ease of comparison, hydrogen atoms of the atomistic models are not shown). Solid lines represent harmonic bonds connecting CG particles, and the CG particle types for the Shelley and Marrink models are labelled (the labels are the same as those used in the main text). The point charges (represented by + and ) and point dipoles (represented by arrows) are shown for the Essex model (the charges and dipoles are located at the centre of their associated CG particle). The Shelley and Marrink models use LJ particles (represented by spheres), while the Essex model uses a combination of LJ particles (spheres) and Gay-Berne particles (ellipsoids). Finally, the ‘blob’ model proposed by Chao et al.128 is also shown for comparison. This model represents groups of atoms as rigid non-spherical ‘blobs’ that use interaction potentials based on multipole expansions.

parameterized from experimental and thermodynamic data, similarly to how the original atomistic molecular mechanics forcefields were parameterized from experimental and thermodynamic data. However, due to the importance of correctly modelling the interactions between head group particles, Shelley et al. did use a multiscale parameterisation method to generate the parameters for the head group particles from an underlying atomistic simulation. The interactions between head group particles were based on radial distribution functions (RDFs) between head group particles calculated from atomistic simulations. These RDFs were tabulated, and then used to create a potential of mean force (PMF) which was used as the interaction potential between the head group particles. Use of this PMF led to RDFs that disagreed with the atomistic simulation, so corrections to the interaction potential were calculated by comparing the atomistic and CG RDFs, and running an iterative scheme that updated the interaction potential until the differences Chem. Modell., 2008, 5, 13–50 | 29 This journal is

c

The Royal Society of Chemistry 2008

between the atomistic and CG RDFs were minimised. The model semi-quatitatively reproduced the density profile of an aqueous DMPC bilayer, and could be used to simulate the self-assembly of the bilayer. Shelley et al. used this model to investigate diffusion of halothane through the membrane.125 Halothane was modelled using a LJ 6–4 potential, which was parameterized by matching the experimental density and boiling point of the molecule, but a lack of experimental data prevented the detailed parameterisation of the interactions between halothane and the particles in the lipids, and this led to a model which was shown to not be truly predictive. Lopez et al.129 used existing beads from the Shelley DMPC model to model a synthetic antimicrobial polymer, which was inspired by a natural antibiotic. They then qualitatively investigated the interaction between the antimicrobial model and a bilayer of Shelley DMPC molecules. Despite the quantitative foundations of the Shelley model, it has several limiting features which arise from the use of multiscale parameterization:126 the model is optimized for the bilayer phase of DMPC only (as it was an atomistic simulation of this phase that led directly to the head group parameters), the interaction potentials are more complicated, so short time steps must be used in the dynamics integrator, and it evaluates long-range interactions, so it is not possible to reduce the computational expense of the calculation by using a short-range cutoff. These disadvantages of multiscale parameterisation, the lack of portability and the complexity of the interaction potential, led Marrink and co-workers to develop a simple, experimentally parameterized CG lipid model.126 The Marrink model is semi-quantitative, and was developed to target four goals; speed, accuracy, applicability and versatility. Marrink et al. achieved speed by using only short range potentials, which were also very smooth, so that large integration timesteps may be used. Accuracy was maintained by deriving the parameters to match experimental results and by comparing results calculated using their model to results from equivalent atomistic simulations. The simplicity of the forcefield (it is composed as a set of generic particles), together with its flexibility to represent chemical groups achieved the aims of applicability and versatility. The coarse grain model is constructed as follows; on average four atoms are represented by a single interaction site. To keep the model simple, only four main types of interaction site are used; polar (P), nonpolar (N), apolar (C) and charged (Q). The nonpolar and charged groups are then futher divided into subtypes, which depend on whether or not hydrogen bonding is possible to these groups. Fig. 3 shows how a DPPC lipid is represented using this model. The coarse grain particles interact using a LennardJones function. However, only five levels of interaction are defined; attractive, where e = 5 kJ mol 1, semi-attactive, where e = 4.2 kJ mol 1, intermediate (e = 3.4 kJ mol 1), semirepulsive (e = 2.6 kJ mol 1) and repulsive (e = 1.8 kJ mol 1). The level of interaction between a particular pair of coarse grain particles is chosen from a table look-up based on the types of the interacting sites, e.g. a pair of P particles have an attractive level of interaction, while a P and a C have a repulsive level of interaction. The complete lookup table is provided in the original paper describing this model.126 These parameters were obtained by trial and error, running multiple oil and water CG simulations and modifying the small number of parameters to reproduce experimental properties (e.g. density of water and alkanes at room temperature, mutual solubility and diffusion rates). For all interaction levels, the same value of s was used (0.47 nm). To keep the interactions short-ranged, they were force shifted to zero between 0.9 nm and the user-defined parameter rcut using the standard Gromacs130,131 shift function. All coarse grain particles except for nearest neighbours interact through this Lennard-Jones potential. Nearest neighbours are connected via a weak harmonic spring, while next-nearest neighbours are connected via a harmonic angle potential. In addition to these interactions, and unusually for CG lipid models, charged (Q) particles were included, which interact via a Coulomb potential which used a relative dielectric constant of er = 20, to implicitly account for charge screening. This electrostatic interaction is also shifted using the standard 30 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

GROMACS130,131 shift function between 0 nm and rcut. The Q particles were designed to represent groups that have a full charge, and the groups full charge should be used. The exception are small hydrated ions, that use a reduced charge, to take into account the effect of an implicit hydration shell. Energy, density, length, temperature and pressure all have the same meaning for in the Marrink CG model as they do for any atomistic MM model. Time is different though, as the CG interaction potential is very smooth, and therefore conformational sampling is faster as the system is not trapped in local minima. This means that CG time is probably 3–6 times faster than atomistic MD time. To achieve parity, Marrink et al. scale time by four. Part of the reason for the success of the model is that the smooth potential surface allows large timesteps to be used with the dynamics integrator—timesteps up to 50 fs can be used, though 40 fs gives more stable results. The Marrink model is implemented in Gromacs,130,131 and the input parameters for this model can be downloaded from http://md.chem.rug.ml/marrink/ coarsegrain.html. The Marrink model has been used in several applications110,132–134 and has been compared against a united atom atomistic force field (GROMOS)135–137 (a united atom MM forcefield is one in which non-polar hydrogen atoms are not modelled explicitly, but are instead combined into the representation of their bound heavy atom, e.g. ethane is represented as a pair of CH3 particles). This comparison included an estimate of the loss of configurational entropy incurred in moving from the atomistic to coarse grain model.135,136 These comparisons showed that while the CG model compared well, it failed to fully represent some thermodynamic properties, e.g. oil/oil interactions were too weak in the CG model, and water/oil repulsion appeared to be overestimated.137 In addition, the CG model showed less enthalpy/ entropy compensation than an atomistic model for the mixing of oil and water, leading the authors of the comparison to suggest that accurate solvation thermodynamics should be employed to help in the reparameterisation of CG models. 137 Marrink et al. have now performed this reparameterisation and have created the MARTINI forcefield.138 The MARTINI forcefield was parameterized in a consistent manner to reproduce the partitioning free energies between polar and apolar media of a large number of chemical compounds. The advantage of the Marrink model is that it provides a set of basic CG bead building blocks that may be used to create different types of molecules. Bond et al.139,140 used these building blocks to construct CG models of membrane proteins, so as to investigate membrane-protein insertion. They compared their results to fully atomistic simulations, and found qualitative agreement. Marrink and co-workers have also now parameterized CG beads to represent amino acid residues in proteins,134 and have used these parameters to perform a CG study of the selfassembly of G protein-coupled receptors in membrane bilayers.

3.1.2. Multiscale CG parameterisation. Despite the problems of multiscale parameterisation, e.g. the lack of portability of the parameters and the potentially complex form of the interaction potentials, and the success of the experimentally parameterized Marrink CG model, a trend over the last few years is that CG models of lipids have begun to move towards using parameters that are derived systematically from atomistic simulations. This is because multiscale parameterisation provides a route to systematically obtain CG potentials for mixed or complex systems, or for cases where experimental data is difficult to obtain. Parameterisation methods are appearing that use the type of multiscale one-way atomistic to CG information flows that were first used by Levitt and Warshel in the 1970s. This trend parallels the historical development of atomistic MM force fields, which also moved away from complex experiment-based parameterisation schemes towards using wellvalidated recipes (e.g. GAFF44) to get MM parameters from QM calculations. CG parameterisation schemes now seem to be following this trend, and systematic Chem. Modell., 2008, 5, 13–50 | 31 This journal is

c

The Royal Society of Chemistry 2008

methods to obtain CG parameters from atomistic MM simulations are now becoming popular. Izvekov and Voth141,142 have developed a method that automates the generation of CG potentials and parameters from an atomistic MM simulation. Their method, which they term ‘‘multiscale coarse graining’’ (MS-CG), automatically generates pairwise interaction potentials for a defined CG model based on results from an atomistic simulation. They have developed a force matching method, which is an extension of the least-squares force matching approach developed by Ercolessi and Adams.143 The method is used to coarse-grain an underlying atomistic simulation trajectory. As in other coarse grain models, the atoms are combined together into beads, and the system as a whole is therefore represented as a set of interacting beads. The potential of mean force is estimated between each bead based on the average forces experienced during the atomistic trajectory. This is evaluated using the atomistic potential energy surface, and CG potentials are derived that minimise the difference between the CG force and the average atomistic force. An analytical derivation of the method has been published,144,145 and statistial mechanical justification of using pair potentials for the CG particles has been developed.146 If the CG potential depends linearly on the fitting parameters, then the fitting of the parameters can be written as a series of over-determined linear equations, and the least-squares solution to which can be found by orthogonal matrix triangulation (QR decomposition).141 The efficiency of the fitting can be enhanced by breaking the entire atomistic trajectory into small chunks and generating the CG parameters for each chunk independently. The final CG parameters are then found by averaging the parameters over each chunk. This method is very efficient, but it requires that the CG potential depends linearly on the CG parameters. This is achieved by using a series of splines, which are linear in the spline parameters, and combining this with a Coulomb potential for the electrostatics. The effect of this coarse-graining is to return a model of the forces between pairs of CG beads that accurately reflects the average forces experienced between the atoms contained within those beads during the course of the atomistic molecular dynamics trajectory. Izvekov et al. applied this method to generate a CG potential for a DMPC bilayer. They collapsed the forces over a 400 ps atomistic trajectory onto a bead representation of DMPC that was similar to the Shelley125 or Marrink126 models. These forces were input to the force matching equations to yield the CG potentials. Despite the fact that some of the CG beads were charged, they were able to omit the Coulomb term from the fitting, as it was found that due to charge screening, the effect of electrostatics could be handled sufficiently by the short range spline functions of the MS-CG forcefield. This method has since been used to derive a CG model of carbonaceous nanoparticles,147 monosaccharides in water148 and of ionic liquids.149 Because this method systematically derives CG parameters using only an atomistic simulation, it is much more straightforward to develop parameters for mixed systems, for which experimental data is either unavailable or difficult to obtain. Izvekov et al. demonstrated this advantage by deriving a CG model for a mixed DMPC/cholesterol bilayer.150 MS-CG effectively provides a CG model of a system that approximates the free energy surface. However, dynamics simulations using the MS-CG model cannot be related to the true dynamics of the atomistic model. Izvekov et al. have tried to overcome this problem by using the atomistic simulation to parameterize friction coefficients which can be used with a Brownian dynamics integrator.151 While this has only currently been applied to liquid methanol, they plan to apply this method to membrane systems. Lyubartsev152 has also developed a multiscale parameterisation method that has been used to systematically build a CG model of a DMPC bilayer. Lyubartsev uses an inverse Monte Carlo method153 to generate the CG parameters from an underlying atomistic simulation. The atomistic simulation trajectory is analysed to generate the radial distribution functions (RDFs) for the CG bead model. These RDFs can be converted into pairwise interaction potentials between the beads. The 32 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

RDFs calculated from a CG simulation using these initial interaction potentials differ from those calculated from the atomistic simulation. An inverse Monte Carlo algorithm153 is therefore used to iteratively refine these interaction potentials by correcting them by the difference between the CG and atomistic RDFs. This is essentially the same method that was used by Shelley et al.125 to derive the parameters between the CG lipid head group particles, and is also similar to the Boltzmann inversion method,154 which also uses an iterative procedure that uses RDFs measured from atomistic simulations to derive CG interaction potentials. Lyubartsev has used this method to fully parameterize his own CG model of DMPC. Arkhipov et al. have taken multiscale coarse graining to an extreme in their recent work,155 in which they use atomistic models to parameterize CG models of a complete virus capsids. Each CG particle in the simulation represented about 200 atoms. Each CG particle interacted via a Lennard-Jones potential, which was parameterized to match the size of the domain the CG particle represented, as calculated from the radius of gyration of that domain measured from an atomistic model. The resulting CG model was able to simulate complete virus capsids (of dimensions 10 nm to 100 nm) over timescales of 1 ms to 10 ms. Chao et al.128 have developed a CG model that is based on the concept of soft rigid blobs. A blob is a grouping of atoms within a molecule that interacts with other blobs using a non-spherical, soft potential. A molecule is divided into blobs, e.g. see Fig. 3 for a proposed blob model of a phospholipid.128 Each blob represents a rigid grouping of atoms, and the interaction potential between a pair of blobs is calculated based on the underlying atomistic potential energy of interaction between the atoms within those blobs. However, unlike other CG models, which try to approximate this interaction potential using high symmetry beads, the rigid blob model expands this interaction potential as a Taylor series to produce interaction moment tensors between blobs. These are akin to the multipole tensors in a standard multipole expansion. The result is that the blob potential more accurately represents the three dimensional shape of the group of atoms that it represents. A further advantage is that the quality of the model can be increased by adding higher-order moments to the interaction potentials. 3.2 One-way top–down interfacing methods The multiscale atomistic/CG methods discussed so far all involve a single, one-way transfer of information from the atomistic model to the CG model. Essentially the atomistic models are used to parameterize a CG model, so that the CG model can then be used to explore the conformational space of the system more rapidly. However, in building the CG model, the atomistic fine detail of the system has been lost, as it has been smeared out into a set of beads. Recovering the atomistic fine detail would however be very useful, as this would allow the simulator to zoom back in and observe interesting configurations revealed during the CG trajectory at an atomistic level of detail. Knecht and Marrink133 realised the advantages of this approach, and have used it together with the Marrink CG lipid model to model vesicle formation. The CG model is used to self-assemble a vesicle, and this CG structure is then used as a template for the starting structure for an atomistic simulation. The conversion from the CG model to the atomistic model is straightforward. The coordinates from the CG simulation were scaled by a factor of 1.6. Atomistic models of the lipid, taken from an atomistic simulation of the planar bilayer, were fitted on top of their CG counterparts. The CG model had to be scaled to prevent steric clashes caused by the curvature of the CG vesicle bilayer. Molecular dynamics simulations were run on the atomistic model, allowing vesicle membrane fusion to be studied at the atomistic level. It is relatively straight-forward to move from a coarse grain model of a lipid to a fully atomistic model, as the main features of the lipid, namely head group and two tails, are still present in the CG model. It is less straight-forward to reconstruct the Chem. Modell., 2008, 5, 13–50 | 33 This journal is

c

The Royal Society of Chemistry 2008

atomistic coordinates of a protein from a CG model. This is because many CG protein models represent only the backbone of the protein, and if the side chain is represented, it is done so using only one or two beads. There is a lot of atomistic structural information missing from a CG protein model, and reconstructing this fine detail is not trivial. McCammon and co-workers156 attempted in their multiscale exploration of ligand binding to HIV protease to map from a single residue per bead protein CG model back to an atomistic representation. The binding site of HIV protease is covered by two large flaps, which open and close on a timescale that is beyond that is accessible on the atomistic molecular dynamics timescale. McCammon and co-workers have, over a series of papers,157–162 developed a very successful CG model of HIV protease. This model uses a single bead per residue, and was parameterized based on a statistical analysis of the crystal structure of the protein. This CG model is capable of simulating opening and closing motion of the large flaps that gate the entrance to the active site.158,162 McCammon and co-workers used their CG model within a multiscale simulation to investigate the binding pathways of an HIV protease inhibitor. The CG simulation was used to investigate opening and closing of flaps which gate the binding site of the enzyme, to see how the presence of the inhibitor affected flap dynamics, and to test whether the inhibitor could bind with the flaps closed. Because the CG model lacks atomistic detail, McCammon and co-workers selected two configurations from the CG trajectory to act as starting points for fully atomistic, implicit solvent Brownian dynamics simulations of the protease plus inhibitor. While it was straight-forward to rebuild a fully atomistic model of the inhibitor from the CG model, it was not simple to rebuild an atomistic model of the enzyme. This is because the CG model is missing lots of important structural information, e.g. the orientation and location of the amino acid side chains. To avoid the difficulty of rebuilding the atomistic detail of the enzyme, they instead used crystal structure configurations of the protease that had backbone geometries that were similar to those observed in the two snapshots from the CG trajectory. McCammon and co-workers156 used the CG simulation to provide input to a pair of atomistic simulations, thereby performing a pair of one-way, top–down information transfers from the CG to atomistic level. They sidestepped the great challenge of top–down transfers, namely the difficulty of recovering fine atomistic detail, by only using the CG model to locate the binding geometry of the inhibitor, and as a means to select which crystal structures corresponded to that particular binding geometry. Other workers have directly tackled the problem of reconstructing atomistic detail from CG models. The RACOGS algorithm (Reconstruction Algorithm for Coarse Grain Structures)163 is an example of a recently developed algorithm that attempts to solve the reconstruction problem. The algorithm is designed to rebuild an atomistic protein model given only the position of the Ca atoms. The algorithm has several stages; 1. The backbone atoms (C, O, N) are located based on the input positions of the Ca atoms using the algorithm developed by Feig et al.,164 which is itself based on the work by Milik et al.165 The algorithm places the backbone atoms into the average positions based on a statistical analysis of the positions of backbone atoms taken from 4013 non-redundant protein structures from the PDB. 2. The next step is the positioning of side chain atoms. These are positioned using the algorithm developed by Xiang and Honig,166 which uses a rotamer library to initially place the side chain atoms. The algorithm starts by adding the side chain atoms to a residue based on selecting the rotamer for that residue that has the lowest energy of interaction with the backbone atoms of the other residues. The interaction energy is evaluated using the vdW and dihedral energy terms from the AMBER99 forcefield.167 In performing this step, the rotamer library is pruned of any rotamers that involve a steric clash with the backbone. This helps improve the efficiency of the algorithm by removing any structurally unrealistic conformations as quickly as possible. 34 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

3. Once all of the side chains have been constructed, an iterative procedure is employed whereby each residue is considered in turn, and its energy of interaction of all possible rotamers of its side chain with all of the other residues’ side chains and backbone atoms is calculated. If another rotamer in the library has a lower energy, then it is selected in place of the existing rotamer. This iteration is repeated until an entire cycle over all of the residues in the protein is completed without a single residue being replaced. 4. The next step involves a minimisation of any high-energy side chains (defined by whether the energy of interaction of that side chain with the rest of the protein is greater than a specified cutoff). Each high energy side chain is, in turn, energy minimised while holding the rest of the protein fixed. The minimisation is performed using the bond, angle, dihedral and vdW terms from the AMBER99 forcefield. 5. Finally, hydrogen atoms are added using the LEAP module of AMBER 8, 168 and the entire protein is energy minimised using the conjugate gradient method, and the full generalized born/surface area (GB/SA)169 implicit solvent AMBER99 forcefield. Heath et al.163 were able to use this algorithm to hop between CG and atomistic models of the proteins src-SH3, S6wt and S6Alz. One of the problems of hopping from a CG to an atomistic model is that, because different forcefields are used, the free energy surfaces of the two models could be very different. Thus configurations that are sampled by the CG model may not be important for the atomistic model. Heath et al.163 investigated this for the CG model (the Das et al. model170) and atomistic model (AMBER99167) that they used. They found good agreement between the free energy surfaces, and were able to successfully use the CG model to investigate the folding and misfolding of these proteins, and to use their RACOGS algorithm to zoom in to obtain the atomistic detail of the misfolded structures. Ding et al.171 have also developed a method to reconstruct the atomistic fine detail of a protein structure from a CG model. They use a CG model that uses the positions of the Ca and Cb atoms to investigate domain swapping in seven proteins. The CG model enables the running of simulations that cover the timescale on which the domain swapping is seen to occur. A fully atomistic model of the protein is then reconstructed from the configurations generated from the CG simulation. The reconstruction algorithm is composed of the following stages; 1. The positions of the Ca and Cb atoms from the CG model are used to position the N and C backbone atoms so that the correct chirality is maintained (D-amino acids). To restrict the rotational freedom, a strong constraint was used, namely that neighbouring residues must have a planar peptide bond. Harmonic potentials were added between neighbouring backbone atoms, the equilibrium distances of which were taken from average distances calculated from the PDB structures. The Ca atoms were then immobilised by setting their masses to infinity, and a short molecular dynamics simulation was then run to relax the system. 2. Backbone hydrogens were added to create non-specific backbone hydrogen bonds. A CG potential was added between the Cb atoms, and a short molecular dynamics simulation was run to optimise the backbone hydrogen bond network. 3. The side chain and backbone O atoms were added using rotamers and a Monte Carlo simulated annealing algorithm. The scoring function for the simulated annealing used the vdW interactions from the Cedar forcefield,172 the EEF1 implicit solvation model173 and a statistical potential for hydrogen bonds, as proposed by Kortemme et al.174 This reconstruction algorithm was also used by Sharma, Ding and Dokholyan175 to reconstruct the atomistic detail from CG models of nucleosomes. This algorithms is very useful, as, like the RACOGS algorithm, it allows the CG model to be used to sample conformational change on timescale beyond that which is accessible to standard atomistic methods, yet it then allows the simulator to zoom in and reconstruct the atomistic data for interesting configurations.

Chem. Modell., 2008, 5, 13–50 | 35 This journal is

c

The Royal Society of Chemistry 2008

3.3 Two-way parallel interfacing methods Multiscale modelling across the atomistic/CG boundary is not limited to single, oneway transfers of information. Two-way interfacing methods, whereby the CG and atomistic levels exchange information dynamically throughout a simulation, are in active development and use. Two main schemes for linking CG and atomistic simulations have become popular; parallel methods, which use loosely-coupled atomistic and CG simulations running in parallel, and embedded methods, whereby atomistic regions are embedded within a CG model. 3.3.1. Parallel atomistic/CG methods. The replica exchange method, discussed in section 2.4 in terms of atomistic and QM calculations, has also been applied with CG models. For example, Nanias et al.176 have used replica exchange moves over temperature to enhance sampling within a CG protein simulation. Just as there has been interest in developing replica exchange moves that map between QM and MM models,105 so to has there been recent interest in developing replica exchange methodology that can map between atomistic and CG representations. Such replica exchange moves represent a parallel, two-way interface between the atomistic and CG levels, as atomistic and CG simulations are run in parallel, and information is only periodically exchanged between them. Lyman and Zuckerman177 have developed a atomistic/CG replica exchange method that they call ‘‘resolution exchange’’ (ResEx). The aim of this method is to run both an atomistic and CG simulation of a system in parallel, and to periodically attempt replica exchange moves between the simulations. The idea is that the enhanced sampling of the CG model will be shared with the atomistic model via the replica exchange moves. For this method to work, the coarse grain model must use a subset of the coordinates of the atomistic model. In this case, Lyman and Zuckerman achieved this by using a united atom model at the CG level (OPLS united atom forcefield33), and an all-atom model for the atomistic level (OPLS all atom forcefield34). While technically being a CG model, a united atom forcefield is more normally considered to be an atomistic representation. This is because a united atom forcefield does not use beads to represent residues or parts of residues, but instead merely removes aliphatic and aromatic hydrogens from the model by collapsing them into their bonded carbon. Despite this, Lyman and Zuckerman’s method can still be considered as a multiscale method, and this application points to some of the difficulties that more ambitious applications would have to overcome. One major difficulty is that the replica exchange moves between the united atom and all atom representations had a very low acceptance ratio. This is because the CG model uses only a subset of the atomistic coordinates, so replica swap moves can lead to unphysical conformations as this subset is moved out of step with the rest of the atomistic coordinates. To overcome this problem, Lwin and Luo,178 in a similar method, minimised the energy of the atomistic model after each swap, but before the Monte Carlo test that was used to accept the configuration. While this does successfully increase the acceptance ratio of the move, it violates detailed balance, so does not sample the required ensemble correctly. Lyman and Zuckerman’s solution to the problem is to use incremental coarsening. With this method, instead of switching the whole protein between the CG and atomistic models in one go, instead individual residues in the protein are switched. This leads to the problem that CG residues then need to interact with atomistic residues, so CG/atomistic crossterms need to be included in the Hamiltonian. Fortunately the OPLS united atom and all atom forcefields are compatible, and so can be combined in this way, at least for this application. Now multiple replicas of the system may be used, with each replica representing a different percentage of the protein using the CG forcefield. Lyman and Zuckerman tested their method by application to the pentapeptide metenkephalin. They used six replicas, the first replica using a fully atomistic model, then second replica using one CG residue and four atomistic, the second using two 36 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

CG residues and three atomistic, and so on. In addition, they also increased the temperature of each replica, such that while the fully atomistic model was simulated at 298 K, the fully CG model was simulated at 700 K. However, despite the small size of the system, the small difference between the CG and atomistic models (OPLSUA vs. OPLSAA) and the use of incremental coarsening, the acceptance ratio of the swap moves was still very low, running between 2.5% and 5.8%. Applications of this method to larger systems, or using a greater difference between the atomistic and CG models therefore looks problematic. Christen and van Gunsteren10 have developed a novel multiscale method that they call ‘‘multigraining’’, which aims to use the CG model to enable both relaxation of large molecular systems and sampling of slow processes with concurrent atomic detail representation of the results. In this method, both an atomistic and a CG model of a molecule are used simultaneously. Each molecule in the simulation has both a set of atomistic (fine grain) coordinates, rFG, and a set of coarse grain coordinates, rCG. The coarse grain coordinates are constrained to map to the atomistic coordinates. The Hamiltonian of the system is composed of atomistic interaction terms, UFG(rFG), and CG interaction terms, UCG(rCG). Molecular dynamics is then performed using both the atomistic and CG potentials simultaneously. This is referred to as multigraining MD, and the algorithm is as follows; 1. First the coarse grain coordinates are determined by mapping them from the atomistic coordinates (e.g. by placing CG beads at the geometric centre of a residue). 2. Calculate the forces on the CG particles using the CG potential energy terms, and the forces on the atoms using the atomistic potential energy terms. 3. Distribute the forces from the CG particles back onto the atoms. 4. Propagate the coordinates of the atoms using the leap-frog integrator. By running both the CG and atomistic models in parallel, and by mapping the CG coordinates from the atomistic coordinates, this method avoids all of the problems associated with information loss when moving back and forth between the CG and atomistic representations. Also, the atomistic and CG potential energy terms are separate, therefore avoiding the problem of parameterising atomistic/CG crossterms. Also, by keeping the atomistic and CG potentials separate, Christen and van Gunsteren are able to use a l scaling parameter to switch between the atomistic and CG potentials, e.g. at l = 0, only the pure atomistic potential is used, at l = 1 only the pure CG potential is used, while at l = 0.5, a 50–50 mix of the CG and atomistic potentials are used. Christen and van Gunsteren then employ replica exchange moves over this l coordinate, thereby allowing several parallel trajectories to swap dynamically between CG and atomistic representations. They tested the method by using it to simulate liquid octance, but like Lyman and Zuckerman, they experienced problems with a low acceptance ratio for the replica exchange moves, and needed to use 24 replicas across the l coordinate. Using 24 replicas, they achieved an average acceptance ratio of just 24%. This figure masks the very poor acceptance ratio for replicas at l = 0, where the purely atomistic potential was used. Just 8% of the 300 replica exchange moves were accepted at l = 0, which is disappointing, given that it is the sampling at l = 0 which is important for the calculation of correct thermodynamic properties. However this method is useful as a means of performing a rapid equilibration or conformation search at the CG level, and then smoothly switching back to the atomistic model to explore interesting configurations.

3.3.2. Embedded atomistics/CG methods. Parallel atomistic/CG methods still use a simple interface between the CG and atomistic models, with information exchange occurring only during the replica exchange moves. This represents a loose coupling between the atomistic and CG models. An alternative to this loosely-coupled method is to develop a tightly coupled method whereby an atomistic model is linked directly to a CG model. This allows for an atomistic region, e.g. an atomistic model of an ion channel, to be embedded within a CG simulation, e.g. a CG model of a cell Chem. Modell., 2008, 5, 13–50 | 37 This journal is

c

The Royal Society of Chemistry 2008

membrane. This requires that the atomistic and CG models are closely coupled together, with information exchange occurring every timestep via atomistic/CG cross-terms in the Hamiltonian. This is challenging, as it is not immediately obvious how to obtain the parameters that describe the atomistic/CG interaction, nor what the form of the interaction potential should take. In section 3.1.2 the multiscale coarse graining (MS-CG) method, developed by Izvekov and Voth, was presented as a method that could be used to systematically derive the functional form and parameters for MM-CG interactions. Shi,179 working with Izvekov and Voth, has used the MS-CG method to systematically derive the functional form and parameters for the atomistic/CG interaction of an atomistic gramicidin A polypeptide ion channel immersed in a CG membrane. The DMPC membrane and solvent water were modelled using the CG model developed by Izvekov and Voth,150 which was developed using the MS-CG method. This method derives the parameters for the CG interactions from the average forces experienced by the atoms during an atomistic dynamics simulation. These forces are then coarse-grained and a CG potential is fitted. Shi ran a short all-atom simulation of the whole membrane/ion channel system and used this to develop the CG/atomistic forcefield using the MSCG method. The forcefield was obtained by treating the atomistic and CG regions equally within the force matching algorithm. However, to reduce computational cost, some simplifications and approximations were made. First, the forcefield was split into three parts; atom–atom, atom-CG and CG-CG. The atom–atom forces were assumed to be the same as those from the wholly atomistic simulation, while the CG-CG forces were taken from an MS-CG application of a pure lipid bilayer embedded in water. This meant that only the atom-CG forces had to be derived from the fully atomistic simulation of the entire system. The atom-CG forces were obtained by subtracting the atom–atom and CG-CG forces from the reference forces from the fully atomistic simulation, and then fitting the residual forces using the MS-CG method. This method is not as rigorous as applying the full MS-CG method in a single step, but it is much more efficient, and Shi et al. found that this strategy worked quite well for this system. Essex and co-workers have developed a CG model that is designed from the outset to interact with an atomistic forcefield.127,180,181 They have developed a CG lipid model of DMPC127,180 that uses six Gay-Berne123 ellipsoids to represent the hydrocarbon tails, and two Lennard-Jones (LJ) particles to represent the head group. The glycerol region was modelled by 2 further Gay-Berne units, thereby allowing the entire DMPC model to be represented by just ten interaction sites (see Fig. 3). The key advance of the Essex model is that it was designed from the outset to be compatible with atomistic forcefields. The Gay-Berne potential is essentially an anisotropic form of the Lennard-Jones potential, which is the most common potential used to model van der waals (vdW) non-bonded forces between atoms. This means that it is possible to interface the Gay-Berne and LJ potentials so that the vdW interaction energies between Gay-Berne beads and Lennard-Jones (LJ) atoms can be calculated. However, to correctly handle the electrostatic interaction between the CG and atomistic levels, the CG model has to explicitly include a representation of the charge distribution of the lipid. The Essex model achieves this by including point charges in the two head group particles, and point dipoles in the two GayBerne particles that represent the glycerol region. Water was modelled explicitly using the soft sticky dipole model182 (SSD), which despite only using a single interaction site, is capable of accurately reproducing the structural, thermodynamic, dielectric and temperature-dependent properties of water.182–185 The lipid model was parameterized by trial-and-error molecular dynamics, whereby the parameter space was searched and molecular dynamics simulation run to reproduce the experimental structure of DMPC bilayers (to test whether the desired experimental properties were obtained). The point charges and dipoles of the model were chosen to reproduce the net electrostatics (charge and dipole) of each bead from the underlying atomistic level model. To parameterize the strength of interaction between atomistic 38 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

and CG molecules, Essex and co-workers181 calculated solvation free energies of atomistic amino acid side chain analogues in boxes of CG octane, and compared this to experiment. The atomistic and CG interactions were found to be unbalanced, but Essex and co-workers found that simple scaling of the cross terms led to good agreement of comparable quality to atomistic simulations. To calibrate the scaling factor for the interaction of an atomistic inclusion with the CG water model, a similar set of calculations were made of the hydration free energy of atomistic amino acid side chain analogues in the SSD water model. The calculated hydration free energies were also compared with experiment. The calculation of these scaling factors removes the need for the atomistic forcefield to be reparameterized for use with the CG model. This is an interesting approach to the solution of the cross-term parameterisation problem of atomistic/CG multiscale modelling, and has the promise to be transferable across a range of solutes and proteins, thereby removing the need to reparameterize the model for each new system under study. The challenges in creating atomistic/CG multiscale methods parallel those of creating QM/MM methods. The definition of the QM/MM interface is relatively straightforward if the divide is between molecules. However the definition of the interface becomes more complicated if the divide is within a molecule, as then the Hamiltonian has to somehow include bonded terms between QM atoms and MM atoms. In much the same way, the definition of the atomistic/CG interface is more straightforward if it falls between molecules, rather then within a molecule, and that is why the majority of atomistic/CG multiscale simulations avoid this approach. The exceptions are the parallel method developed by Lyman and Zuckerman177 (the ResEx method discussed in the last section) and an embedded CG/atomistic method developed by Neri et al.186 Neri et al. have developed a method where a protein is modelled using a CG representation, but the active site is modelled using an atomistic representation. The aim is to allow efficient sampling of protein conformational change, due to the use of the CG model, but to still capture the fine-detail atomistic motions in the active site, which is the area of interest in protein-ligand binding or computational enzymology calculations. They tested the method by application to two proteins of active pharmaceutical interest: HIV protease, which is a target for anti-HIV medication, and human b-secretase (BASE) which plays a role in the onset of Alzheimer’s disease. The active site atoms were modelled using the GROMOS9630 atomistic forcefield, while the rest of the protein was modelled at the CG level by only considering Ca centroids. Solvent-protein interactions were modelled in terms of viscosity and the addition of random forces in the framework of a stochastic dynamics simulation. The interaction between the atomistic and CG regions was handled by the addition of an interface region. Several residues at the boundary of the atomistic and CG regions were modelled using both a CG and atomistic representation. The atomistic region residues interacted with the interface residues using the standard atomistic potential. To maintain backbone integrity between the interface and CG regions, harmonic bonds are added between the C a atoms of the interface residues with the Ca atoms of the neighbouring CG residues. In addition, an exponential-form non-bonded potential is added between CG and interface residues, and the parameters for the CG model and for the interface were chosen so as to reproduce the root mean square fluctuations (RMSF) of HIV protease as observed during fully atomistic and fully CG simulations. A key parameter is the thickness of the interface region, with the key concerns being that the interface region has to guarantee the correct geometry of the atomistic residues, it has to get the local electrostatics right, and it has to be able to transmit the modes of vibration experienced by the rest of the protein, and which are mimicked by the CG model. Neri et al. found that for both HIV protease and BASE they needed to use an interface region that included all residues that had at least one atom within 6 A˚ of any of the residues in the atomistic region. There is a clear separation in the methods presented so far between the atomistic and CG regions. One molecule, or part of one molecule (the protein), is treated at the Chem. Modell., 2008, 5, 13–50 | 39 This journal is

c

The Royal Society of Chemistry 2008

atomistic level, while the remaining molecules (lipid membrane and solvent) are modelled using CG techniques. An alternative to defining the atomistic and CG levels by molecule is instead to define the levels by region, namely to divide the simulation space into an atomistic region and a coarse grain region. This creates a more dynamic split between the two models, as individual molecules are free to diffuse from one region to another, and therefore algorithms must be developed that account for the interconversion of CG and atomistic models. Praprotnik et al. have, over a series of insightful papers,187–191 developed just such algorithms. They have created a multiscale simulation method whereby the simulation space is divided into an atomistic region and a CG region, and the molecules in the system are free to diffuse between these two regions. The methods developed allow on-the-fly interchange between the molecules’ coarse grain and atomistic representations, enabling large length and time scales to be achieved, while still retaining the atomistic fine detail of parts of the system. The key development in this work is the methodology used to allow molecules to switch dynamically between CG and atomistic representations. They call these methods AdResS (adaptive resolution scheme). The original application of the method was to the multiscale dynamics of liquid methane.187 A CG model of methane was constructed based on the radial distribution function of methane obtained from a simulation of the reference atomistic model. The simulation box was split in half, creating an atomistic region and a CG region, and an identical number of methane molecules were placed in each half of the box. To ensure that the transition between the two regions was smooth, a ‘handshaking zone’ was created between the atomistic and CG regions. To achieve a smooth transition from the atomistic to CG forcefields, the forces are scaled in the handshaking zone. As the switching of the resolution can be considered as a first order phase transition, it is necessary for this scheme to be used in combination with a thermostat. Because the latent heat is generated at the interface region,187 it is important that a local thermostat is used that couples directly to local particle motion, e.g. Langevin or dissipative particle dynamics (DPD) thermostats. The other main challenge of an adaptive resolution method is how to handle the loss and reintroduction of atomic degrees of freedom at the atomistic/CG interface. It is important that the vibrational and rotational degrees of freedom at the atomistic model, which are missing in the CG model, are slowly reintroduced as a molecule diffuses across the interface between the CG and atomistic regions. To achieve this, when a molecule moves from the CG to atomistic region it is mapped onto an atomistic representation that has the same centre of mass and linear momentum as the CG representation. In addition, each atom is given rotational and vibrational velocities that come from a random molecule already in the atomistic region. Praprotnik used AdResS successfully to simulate a multiscale box of liquid methane.187 While the application worked well, there was a small problem with a pressure imbalance at the interface zone between the CG and atomistic regions. Praprotnik et al. solved this problem in a new version of their method,188 in which they also extended the scope of application to include spherical boundaries between the atomistic and CG regions. Since then, they have used the method to run multiresolution simulations of liquid water191 (using the CG parameterisation method of Lyubartsev152 to get the CG model of water) and have investigated the thermodynamic implications of running simulations that have a dynamic, and indeed fractional number of degrees of freedom.189 Ensing et al.192 have also developed a new multiscale method that allows for an atomistic region to be embedded within a CG simulation. Like AdResS, the method uses a spherical atomistic region that is interfaced to the CG simulation via a handshaking buffer zone. To ensure a smooth transition from the CG to atomistic representations, the potential energies, rather than the forces, are smoothly scaled from the CG to atomistic Hamiltonians as the molecule crosses the handshaking zone. Additional terms are also added to the Hamiltonian that account for the kinetic energy that is lost when the number of degrees of freedom is decreased when 40 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

a molecule moves from the atomistic to the CG region. These additional terms ensure that the total energy of the system is conserved even as the number of degrees of freedom change. In addition to using a different method to switch between the CG and atomistic regions, Ensing et al. also use a different method to handle the change in dynamics between the CG and atomistic regions. Instead of destroying and recreating the atomistic degrees of freedom as the molecules move across the boundary, they instead use the RESPA193 multiple timestep molecular dynamics algorithm to freeze out the intramolecular degrees of freedom of molecules as they move into the CG region. RESPA is a molecular dynamics method whereby different timesteps can be used to sample different parts of the Hamiltonian. It has traditionally been used to separate the sampling of ‘‘fast’’ forces, such as bond potentials, from ‘‘slow’’ forces, such as those arising from the non-bonded potentials. Ensing et al. cleverly use RESPA to separate the sampling of the CG and atomistic Hamiltonians by region, and thereby effectively freeze the atomistic representation as it crosses into the CG region. This allows them to tune the molecular dynamics integrator optimally across the entire multiscale system. While this method has been applied only to a model system and liquid methane, it shows great promise as a method that could find widespread use in biomolecular modelling.

4. Interfacing particle with continuum models Continuum methods provide the largest time and space ranges considered in this review. While continuum models have been popular for the modelling of solids194 or of liquid or gas flow,195 they have only recently been applied to the study of biomolecular systems. This is because continuum methods tend to be best suited to periodic or smoothly varying systems, and biomolecular systems are distinctly inhom*ogenous, and exhibit physical behaviours that require the the modelling of fine-level atomistic detail. By far the most popular continuum models used in biomolecular modelling are those that replace atomistic waters with an implicit solvent model. Many implicit solvent models, which include the Poisson Boltzmann (PB)196,197 and Generalized Born (GB)198,199 methods, model the solvent as a dielectric continuum, and calculate the polarisation of that continuum caused by the charge distribution of the atomistic molecule model. This polarisation leads to an electrostatic reaction field with which the atomistic molecular model then interacts. Implicit solvent models have been used for many years in combination with QM,200,201 MM202,203 and CG204,205 calculations. These represent a large class of truly multiscale methods, and have been reviewed in detail many times.204,206–209 Implicit solvent models have been the dominant class of multiscale continuum methods over recent years. However exciting new classes of multiscale continuum models have recently been developed. These new methods fall into the following categories, which are of similar definition and type to the interfaces used between the QM/MM and atomistic/CG levels: 1. One-way bottom-up interfacing methods. These involve a single transfer of information from the atomistic or CG level to the continuum level, e.g. by using the atomistic level to provide parameters for the continuum model. 2. Two-way embedded interfacing methods. These involve embedding an atomistic or CG model within a continuum representation. Implicit solvent models fall into this category. New multiscale methods, which capture hydrodynamic and mechanical effects have now also been developed. 3. Two-way parallel interfacing methods. These involve the running of an atomistic or coarse grain level simulation in parallel with a continuum calculation, and interfacing the two using a lightly coupled interface, e.g. by using properties calculated from the continuum model to update the boundary conditions of the atomistic or CG simulation. Examples of each of these categories of multiscale continuum interfaces will now be discussed in turn. Chem. Modell., 2008, 5, 13–50 | 41 This journal is

c

The Royal Society of Chemistry 2008

4.1 One-way bottom-up interfacing methods Continuum models can be interfaced with atomistic models using a one-way bottomup multiscale parameterisation scheme. In this scheme, an atomistic level calculation is used to provide the input parameters for the continuum model. This scheme is very simple, as it involves a single transfer of information from the atomistic level to the continuum level. Tang et al.210 have recently employed this scheme to get the parameters for a continuum model of a mechanosensitive ion channel embedded in a lipid membrane. The membrane was modelled as a hom*ogenous elastic sheet of thickness 35 A˚, and area of 400 A˚ 400 A˚, using a finite element (FEM) model. The transmembrane helicies were represented by hom*ogenous cylindrical elastic rods of diameter 5 A˚with spherical caps at each end. The helicies were embedded within the membrane, and were also modelled using a FEM model. What makes this a multiscale simulation is that the parameters for this model were calculated using information calculated using atomistic-level models. This represents a one-way, bottom-up information flow from the atomistic level to the continuum level. Elastic parameters for the model were calculated from the mechanical properties of the membrane and helicies measured by experiment and calculated from atomistic molecular dynamics simulations. The interactions between the helicies and the membrane were modelled using a Lennard-Jones style attractive/repulsive potential. The parameters for this potential were determined based on molecular mechanics calculations run using the CHARMM forcefield.27 The channel is comprised of ten helicies. The interaction potential between helicies was parameterized by fitting to CHARMM potential energy calculations. For each pair of helicies, the interaction energy in vacuum was calculated using the CHARMM19 forcefield, with the coordinates set to either X-ray or hom*ology structures. This calculation was performed for different combinations of helix pairs, which effectively samples different orientations, and a range of centre of mass inter-helix separations that varied between 20 A˚ and 20 A˚. The interactions between each helix and the membrane were parameterized based on calculating the insertion energy profile of moving an atomistic model of the helix from an implicit solvent water model into an implicit lipid membrane model. In the implicit membrane models the membrane thickness was taken to be 23.5 A˚, which corresponds to the thickness of the hydrophobic part of the membrane. While this model was constructed as a proof of concept of FEM methods, it was able to reproduce the experimental gating tension of the channel, and the structural variations along the gating pathway as observed from biased atomistic molecular dynamics simulations. This application successfully demonstrated the potential of continuum models of biomolecular systems to reach length (sub-mm) and time (multi ms) scales that are not accessible to atomistic level calculations. 4.2 Two-way embedded interfacing methods Continuum models can be directly interfaced with atomistic or coarse grain models using a two-way embedded interface. In this scheme, the atomistic or CG model is embedded within a continuum model. Implicit solvent methods, in which an atomistic or CG model of a solute is embedded within a continuum model of the solvent, are popular and well-established examples of this type of interface. Implicit solvent models represent the solvent as a dielectric continuum, and allow the electrostatics of the atomistic or CG solute to polarise the continuum, which then results in an electrostatic reaction field that returns to interact with the solute. Implicit solvent models have been reviewed in detail many times before, 204,206–209 and enable the dynamic transfer of electrostatic information across the atomistic/ continuum or CG/continuum interfaces. Recently, new multiscale continuum methods have been developed that allow for the dynamic transfer of mechanical and hydrodynamic information across these interfaces. One example is the work by Villa 42 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

et al.,211,212 who have developed a method that allows the transfer of mechanical information across the atomistic/continuum boundary. They have developed a multiscale model to investigate protein/DNA complexes. The method uses an atomistic model of the lac repressor protein and a continuum model of a large loop of DNA that is bound to the protein. The DNA loop is represented using a continuum elastic rod model, using elasticity theory. Protein MD simulations are used to position the DNA loop end points, and these end points then serve as crucial boundary conditions to elasticity theory to determine the shape of the loop. Elasticity theory then in turn provides the forces acting at the end points, which can be fed back into the protein molecular dynamics, completing the loop. Most atomistic/continuum embedded multiscale methods split the system into continuum and atomistic regions by molecule, e.g. the solvent is defined as being in the continuum, while the solute is atomistic. While being straight-forward to implement, such a split does not allow a complete representation of the physical detail of the system. One problem with continuum solvent models is that by representing all of the solvent as a uniform dielectric, they fail to capture atomic details, such as hydrogen bonding between the solute and solvent, which can be important factors to consider in protein-ligand binding or computational enzymology applications. It is therefore preferable to be able to spit the system spatially into atomistic and continuum regions, e.g. to model the solvent in the active site of the protein using an atomistic model, with the remainder modelled as a continuum. However, such a model must account for the diffusion of solvent between the atomistic and continuum regions. Schemes must be developed that allow for the conservation of momentum, mass and energy across the interface. While we are unaware of any such methods that have been developed for application to protein systems, Coveney et al. have, over a series of papers,6,213–219 developed an ambitious and highly promising multiscale method that allows the embedding of condensed phase atomistic simulations within continuum models for the purpose of studying hydrodynamics. The Coveney method works by dividing space into two main regions; an atomistic (particle) region, called P, and a continuum region, called C, modelled using computational fluid dynamics (CFD). An interface lies between these two regions. This interface is split into two parts; a C to P part, which sits on the continuum side of the interface, and a P to C part, which sits on the particle side. In the P region, the motion of the atomistic particles is solved using molecular dynamics. In the C region the motion of the continuum solvent is solved using standard continuum fluid dynamics (CFD). The hybrid scheme applied at the interface is designed to exchange fluxes of conserved quantities, namely energy, momentum and mass. In the C to P region, the fluxes from the continuum region are imposed on the particles. In the P to C region, the atomistic fluxes are coarse grained in time and space to form boundary conditions for the continuum domain.213 The details of how this is achieved is described in detail elsewhere.213,215,220 In essence, the stress induced in the P region by flow in the C region can be calculated and converted into a local momentum flux at the C to P interface. This flux can be converted into a force which is added to all of the particles in the C to P region. This pressure force from the continuum liquid is enough to stop the escape of particles from the C to P region, and, at a basic level, is responsible for the transfer of information from the continuum to the particle region. The P to C interface, on the other hand, is used to transfer information back to the continuum model. As the continuum model is a mesoscale representation of the solvent, the particle model must be averaged in both space and time. An averaging time of Dtavg is used, and the average momentum flux is calculated. This can then be used to establish the boundary condition of the continuum domain at the C to P interface. In addition to controlling the flow of momentum across the interface, the flow of mass must also be managed to ensure that the total mass in the whole system is conserved. This is achieved by calculating the mass flux at the interface and using that to determine whether particles need to be deleted Chem. Modell., 2008, 5, 13–50 | 43 This journal is

c

The Royal Society of Chemistry 2008

(to represent flow of mass form the particle to continuum region), or whether particles need to be created (to represent flow back from the continuum region). Removing particles is simply a case of choosing those that are closest to the top of the molecular dynamics simulation and thus closest to the continuum region. Inserting particles is more difficult, as sufficient space has to be found for the new particle amongst the existing particles, and atomic fine detail (coordinates and momenta) have to be reconstructed. Coveney et al. have developed the USHER algorithm221 which efficiently finds a location and configuration of the new solvent molecule that releases an energy equal to the mean energy per molecule. The velocity of the new particle is obtained randomly from a Maxwell distribution with a zero mean velocity and temperature equal to the simulation temperature. One of the original applications of this method213 used a simple model system, with a model polymer attached to a wall, solvated by a single-particle model solvent. They were able to use this setup to study the dynamics of this polymer within a solvent which was undergoing shear flow. Coveney et al. refined their method214,219 to correctly couple hydrodynamics, using a fluctuating hydrodynamics (FH) continuum model,222 and were able to show that fluctuations, at both the atomistic and continuum level were thermodynamically consistent. They demonstrated their method by simulating sound waves passing through a continuum, then atomistic model of liquid water, and then being reflected by an atomistic model of a lipid membrane bilayer.214 The light coupling between the different types of simulation (MD and CFD or FH) makes it well-suited to application over a distributed computing cluster, and Coveney et al. have demonstrated the method’s successful deployment over computational GRIDs.6 4.3 Two-way parallel interfacing methods Creating a multiscale method that directly embeds an atomistic region within a continuum model introduces a physical interface between the atomistic and continuum models. This complicates the implementation, as techniques must be developed that allow for the transfer of information across this physical interface, e.g. the mass, momentum and energy flux across the interface must be controlled. Chang, Ayton and Voth223 have developed a multiscale atomistic/continuum method that avoids the creation of a physical interface between the two regions. They achieve this by running both a continuum and an atomistic model of the system in parallel, and only loosely coupling this pair of simulations by periodically transferring structural information between them. Chang et al. developed the method, called multiscale coupling (MSC),112,223,224 originally to study the effect of long range dynamics of a lipid bilayer membrane at the atomistic level. They loosely couple together an atomistic model of a bilayer with a continuum-based mesoscopic model. The continuum model is based on an elastic membrane (EM) implicit solvent model originally proposed by Ayton and Voth.225 This model was parameterized from mechanical properties calculated from atomistic MD simulations of a DMPC bilayer. To better capture the dynamics of a membrane, this model has been updated to include an explicit mesoscale model of water.223 The elastic membrane model was used to simulate a bilayer of dimension, 890 A˚ 890 A˚. To make this a multiscale simulation, a point on the elastic sheet representing the membrane is chosen, and an atomistic model of the membrane at that point is constructued. The atomistic model comprised 64 DMPC lipid molecules solvated by 1312 TIP3P waters. A real biological membrane is not subject to external constraints, and therefore it adopts a tensionless configuration.226 However simulations using the canonical (NVT) ensemble use a fixed number of particles and fixed membrane area. This results in a simulation of the membrane with a non-zero interfacial tension.227 This observation has resulted in the development of a new ensemble, whereby the membrane tension, g, is kept constant.226 This new ensemble, NPgT, allows the membrane tension to become part of the definition of the 44 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

thermodynamic state of the system, as g has become a thermodynamic parameter. Chang et al. use this fact to couple their continuum-based mesoscopic membrane model to the atomistic model. They run the atomistic model using the NPgT ensemble, and update the value of g every 1 ps in response to the changes in surface tension measured at the corresponding location in the continuum-based mesoscopic model. This allows a dynamic, one-way transfer of information from the continuum model to the atomistic model throughout the simulation. The advantage of this approach is that the atomistic level simulation does not directly interface with the continuum level simulation, so standard approaches can be used, e.g. periodic boundaries and the use of the particle mesh Ewald sum to model long range electrostatics. Also, while this application only allows the dynamic flow of information from the continuum to atomistic level, in theory, mechanical properties calculated from the atomistic simulation could then be calculated and used to reparameterize the continuum model, thereby completing the cycle. Ayton and Voth228 have employed this multiscale coupling (MSC) method to study the effect of long range membrane motion on the dynamics of an ion-channel. An atomistic ion channel was embedded within an atomistic simulation of the solvated bilayer, and this was coupled to the continuum-based EM model using the MSC method. Because formally an ensemble of atomistic simulations should be connected to the continuum model,228 they ran eight atomistic simulations in parallel, and connected each of these to the same point in the continuum model. They compared the dynamics of the ion channel using both the MSC boundary conditions and standard NPT boundary conditions and found that the orientation of histidine residues in the channel appeared more disordered when the atomistic level simulation was coupled to the continuum. This suggested that the fluctuations in stress caused by the long range dynamics of the membrane modelled at the continuum level, could indeed affect the dynamics of the ion channel at the atomistic level.

5. Beyond continuum models Continuum models allow biological simulators to access extremely large length and timescales, e.g. there are continuum models of a complete human heart.229 There is, however, a level of modelling that addresses even larger length and time scales. At this level, biological network models are used to represent cells, 230,231 organs232–234 or creatures.235,236 Detailed reviews230,237,238 of how to build models at this level are available. In brief, the methods work by constructing differential equations that describe the flow of material through a biological system, e.g. using a differential equation to represent an ionic current.237 Several equations can be coupled together to form a model, e.g. of the electric current flow across the human heart.232 The whole network of equations can be run together to produce an output signal that describes the biological property that the simulator is interested in modelling, e.g. modelling the oscillations in concentration of cellular metabolites.238 Multiple levels of biological network models can be combined together. The Physiome project239 is an attempt to define standard interfaces between these levels so that they can be combined together easily. To make these models fully multiscale, they could also be interfaced with atomistic, CG, or continuum representations. The conduits for information flow are the kinetic parameters that calibrate the differential equations that make up the network.238 For example, QM/MM computational enzymology calculations can be used to obtain the rate constants of enzyme catalysed reactions, or MM/CG calculations can be used to obtain the diffusion constants of ions through membrane pores. In this way, biological multiscale modelling will allow the coupling of simulations of electrons all the way up to models of a human heart. Chem. Modell., 2008, 5, 13–50 | 45 This journal is

c

The Royal Society of Chemistry 2008

6. Conclusion This review has outlined the techniques and methods involved in building computer models of biological systems that bridge between different length and time scales. Biomolecular multiscale modelling is not new: Levitt and Warshel pioneered much of the development of the foundations of the QM/MM and MM/CG interfaces in the 1970s. However, there has been an explosion of interest and activity in this area over the last few years that has been prompted both by the increases in available computer power, and the maturing of methodology at each of the different biomolecular modelling levels. As each level has matured, and become predictive, simulators are now rightly asking how can these methods be combined to answer the bigger questions of biology such as how does a cell maintain membrane cohesion, how do small changes in protein configuration initiate cell signalling pathways, and what is it about this collection of molecules that makes them a living system? While multiscale modelling methods are not yet capable of providing answers to these questions, the huge progress made over the last few years is now providing an unprecedented insight into the interplay between the biological and chemical worlds. Multiscale modelling methods will certainly be an exciting growth area of increasing importance in biology.

References 1 L. E. Bilston and K. Mylvaganam, FEBS Lett., 2002, 512(1–3), 185. 2 P. J. Booth, Curr. Opin. Struct. Biol., 2005, 15(4), 435. 3 A. Ababou, A. van der Wart, V. Gogonea and K. M. Merz, Biophys. Chem., 2007, 125(1), 221. 4 D. D. Vvedensky, J. Phys.-Condes. Matter, 2004, 16(50), R1537. 5 P. Koumoutsakos, Annu. Rev. Fluid Mech., 2005, 37, 457. 6 R. Delgado-Buscalioni, P. V. Coveney, G. D. Riley and R. W. Ford, Philos. Trans. R. Soc. A-Math. Phys. Eng. Sci., 2005, 363(1833), 1975. 7 P. V. Coveney and P. W. Fowler, J. R. Soc. Interface, 2005, 2(4), 267. 8 Y. Sugita, A. Kitao and Y. Okamoto, J. Chem. Phys., 2000, 113(15), 6042. 9 H. f*ckunishi, O. Watanabe and S. Takada, J. Chem. Phys., 2002, 116(20), 9058. 10 M. Christen and W. F. van Gunsteren, J. Chem. Phys., 2006, 124(15), 154106. 11 R. J. Bartlett and M. Musia, Rev. Mod. Phys., 2007, 79(1), 291. 12 W. A. Lester and R. Salomon-Ferrer, Theochem.-J. Mol. Struct., 2006, 771(1–3), 51. 13 J. Z. Wu and Z. D. Li, Annu. Rev. Phys. Chem., 2007, 58, 85. 14 S. Y. Zou, G. G. Balint-Kurti and F. R. Manby, J. Chem. Phys., 2007, 127(4), 044107. 15 H. Y. Liu et al., Proteins, 2001, 44(4), 484. 16 M. J. S. Dewar, E. G. Zoebisch, E. F. Healy and J. J. P. Stewart, J. Am. Chem. Soc., 1985, 107(13), 3902. 17 J. J. P. Stewart, J. Comput. Chem., 1989, 10(2), 209. 18 J. J. P. Stewart, J. Comput. Chem., 1989, 10(2), 221. 19 M. Schutz and H. J. Werner, Chem. Phys. Lett., 2000, 318(4–5), 370. 20 S. Patel and C. L. Brooks, Mol. Simul., 2006, 32(3-4), 231. 21 A. Warshel, M. Kato and A. V. Pisliakov, J. Chem. Theory Comput., 2007, 3(6), 2034. 22 D. J. Price and C. L. Brooks, J. Comput. Chem., 2002, 23(11), 1045. 23 Y. G. Mu, D. S. Kosov and G. Stock, J. Phys. Chem. B, 2003, 107(21), 5064. 24 A. D. Mackerell, J. Comput. Chem., 2004, 25(13), 1584. 25 M. Patra and M. Karttunen, J. Comput. Chem., 2004, 25(5), 678. 26 B. Hess and N. F. A. van der Vegt, J. Phys. Chem. B, 2006, 110(35), 17616. 27 A. D. MacKerell et al., J. Phys. Chem. B, 1998, 102(18), 3586. 28 D. A. Case et al., J. Comput. Chem., 2005, 26(16), 1668. 29 L. J. Yang et al., J. Phys. Chem. B, 2006, 110(26), 13166. 30 X. Daura, A. E. Mark and W. F. van Gunsteren, J. Comput. Chem., 1998, 19(5), 535. 31 W. R. P. Scott et al., J. Phys. Chem. A, 1999, 103(19), 3596. 32 L. D. Schuler, X. Daura and W. F. Van Gunsteren, J. Comput. Chem., 2001, 22(11), 1205. 33 W. L. Jorgensen, J. D. Madura and C. J. Swenson, J. Am. Chem. Soc., 1984, 106(22), 6638. 34 W. L. Jorgensen, D. S. Maxwell and J. TiradoRives, J. Am. Chem. Soc., 1996, 118(45), 11225. 35 J. A. McCammon, B. R. Gelin and M. Karplus, Nature, 1977, 267(5612), 585.

46 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

36 M. P. Allen and D. J. Tildesley, Computer Simulation of Liquids, Oxford University Press, 1987. 37 N. Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller and E. Teller, J. Chem. Phys., 1953, 21(6), 1087. 38 A. Warshel and M. Karplus, J. Am. Chem. Soc., 1972, 94(16), 5612. 39 A. Warshel and M. Levitt, J. Mol. Biol., 1976, 103(2), 227. 40 J. Zielkiewicz, J. Chem. Phys., 2005, 123(10), 104501. 41 Q. Zhang and Z. Z. Yang, Acta Phys.-Chim. Sin., 2007, 23(10), 1565. 42 W. L. Jorgensen, J. Chandrasekhar, J. D. Madura, R. W. Impey and M. L. Klein, J. Chem. Phys., 1983, 79(2), 926. 43 M. W. Mahoney and W. L. Jorgensen, J. Chem. Phys., 2000, 112(20), 8910. 44 J. M. Wang, R. M. Wolf, J. W. Caldwell, P. A. Kollman and D. A. Case, J. Comput. Chem., 2004, 25(9), 1157. 45 A. Jakalian, B. L. Bush, D. B. Jack and C. I. Bayly, J. Comput. Chem., 2000, 21(2), 132. 46 A. Jakalian, D. B. Jack and C. I. Bayly, J. Comput. Chem., 2002, 23(16), 1623. 47 G. A. Kaminski and W. L. Jorgensen, J. Phys. Chem. B, 1998, 102(10), 1787. 48 J. W. Storer, D. J. Giesen, C. J. Cramer and D. G. Truhlar, J. Comput.-Aided Mol. Des., 1995, 9(1), 87. 49 G. M. Wang and A. C. Sandberg, Nanotechnology, 2007, 18(13), 135702. 50 A. E. Cho, V. Guallar, B. J. Berne and R. Friesner, J. Comput. Chem., 2005, 26(9), 915. 51 C. R. W. Guimaraes, M. Udier-Blagovic, I. Tubert-Brohman and W. L. Jorgensen, J. Chem. Theory Comput., 2005, 1(4), 617. 52 C. R. W. Guimaraes, M. Udier-Blagovic and W. L. Jorgensen, J. Am. Chem. Soc., 2005, 127(10), 3577. 53 R. D. Taylor, P. J. Jewsbury and J. W. Essex, J. Comput.-Aided Mol. Des., 2002, 16(3), 151. 54 C. Hensen et al., J. Med. Chem., 2004, 47(27), 6673. 55 F. Beierlein, H. Lanig, G. Schurer, A. H. C. Horn and T. Clark, Mol. Phys., 2003, 101(15), 2469. 56 J. Chandrasekhar, S. Shariffskul and W. L. Jorgensen, J. Phys. Chem. B, 2002, 106(33), 8078. 57 O. Acevedo and W. L. Jorgensen, J. Am. Chem. Soc., 2006, 128(18), 6141. 58 O. Acevedo and W. L. Jorgensen, J. Org. Chem., 2006, 71(13), 4896. 59 O. Acevedo and W. L. Jorgensen, J. Chem. Theory Comput., 2007, 3(4), 1412. 60 M. P. Repasky, J. Chandrasekhar and W. L. Jorgensen, J. Comput. Chem., 2002, 23(16), 1601. 61 J. D. Thompson, C. J. Cramer and D. G. Truhlar, J. Comput. Chem., 2003, 24(11), 1291. 62 M. Udier-Blagovic, P. M. De Tirado, S. A. Pearlman and W. L. Jorgensen, J. Comput. Chem., 2004, 25(11), 1322. 63 A. Warshel, Annu. Rev. Biophys. Biomolec. Struct., 2003, 32, 425. 64 M. Svensson et al., J. Phys. Chem., 1996, 100(50), 19357. 65 T. Vreven et al., J. Chem. Theory Comput., 2006, 2(3), 815. 66 K. P. Eurenius, D. C. Chatfield, B. R. Brooks and M. Hodoscek, Int. J. Quantum Chem., 1996, 60(6), 1189. 67 J. L. Gao and D. G. Truhlar, Annu. Rev. Phys. Chem., 2002, 53, 467. 68 D. Riccardi et al., J. Phys. Chem. B, 2006, 110(13), 6458. 69 H. Lin and D. G. Truhlar, Theor. Chem. Acc., 2007, 117(2), 185. 70 T. Laino, F. Mohamed, A. Laio and M. Parrinello, J. Chem. Theory Comput., 2005, 1(6), 1176. 71 U. C. Singh and P. A. Kollman, J. Comput. Chem., 1986, 7(6), 718. 72 P. A. Bash, M. J. Field and M. Karplus, J. Am. Chem. Soc., 1987, 109(26), 8092. 73 V. Thery, D. Rinaldi, J. L. Rivail, B. Maigret and G. G. Ferenczy, J. Comput. Chem., 1994, 15(3), 269. 74 L. G. Gorb, J. L. Rivail, V. Thery and D. Rinaldi, Int. J. Quantum Chem., 1996, 60(7), 1525. 75 J. L. Gao, P. Amara, C. Alhambra and M. J. Field, J. Phys. Chem. A, 1998, 102(24), 4714. 76 J. Z. Pu, J. L. Gao and D. G. Truhlar, ChemPhysChem, 2005, 6(9), 1853. 77 Hypercube, Inc., Hyperchem Users Manual, Computational Chemistry, Hypercube, Inc., Waterloo, Ontario, Canada, 1994. 78 E. Derat, J. Bouquant and S. Humbel, Theochem.-J. Mol. Struct., 2003, 632, 61. 79 P. D. Lyne, M. Hodoscek and M. Karplus, J. Phys. Chem. A, 1999, 103(18), 3462. 80 Y. K. Zhang, T. S. Lee and W. T. Yang, J. Chem. Phys., 1999, 110(1), 46. 81 Y. K. Zhang, Theor. Chem. Acc., 2006, 116(1–3), 43. 82 Y. K. Zhang, J. Chem. Phys., 2005, 122(2), 024114.

Chem. Modell., 2008, 5, 13–50 | 47 This journal is

c

The Royal Society of Chemistry 2008

83 I. Antes and W. Thiel, J. Phys. Chem. A, 1999, 103(46), 9290. 84 O. A. von Lilienfeld, I. Tavernelli, U. Rothlisberger and D. Sebastiani, J. Chem. Phys., 2005, 122(1), 014113. 85 N. Reuter, A. Dejaegere, B. Maigret and M. Karplus, J. Phys. Chem. A, 2000, 104(8), 1720. 86 A. Fornili, P. F. Loos, M. Sironi and X. Assfeld, Chem. Phys. Lett., 2006, 427(1–3), 236. 87 J. Z. Pu, J. L. Gao and D. G. Truhlar, J. Phys. Chem. A, 2004, 108(25), 5454. 88 M. Elstner, Theor. Chem. Acc., 2006, 116(1–3), 316. 89 J. Z. Pu, J. L. Gao and D. G. Truhlar, J. Phys. Chem. A, 2004, 108(4), 632. 90 P. H. Konig, M. Hoffmann, T. Frauenheim and Q. Cui, J. Phys. Chem. B, 2005, 109(18), 9082. 91 A. Rodriguez, C. Oliva, M. Gonzalez, M. van der Kamp and A. J. Mulholland, J. Phys. Chem. B, 2007, 111(44), 12909. 92 F. Claeyssens et al., Angew. Chem.-Int. Edit., 2006, 45(41), 6856. 93 M. Sˇtrajbl, G. Y. Hong and A. Warshel, J. Phys. Chem. B, 2002, 106(51), 13333. 94 R. P. Muller and A. Warshel, J. Phys. Chem., 1995, 99(49), 17516. 95 R. W. Zwanzig, J. Chem. Phys., 1954, 22(8), 1420. 96 W. L. Jorgensen, J. F. Blake and J. K. Buckner, Chem. Phys., 1989, 129(2), 193. 97 A. Yadav, R. M. Jackson, J. J. Holbrook and A. Warshel, J. Am. Chem. Soc., 1991, 113(13), 4800. 98 R. H. Wood, E. M. Yezdimer, S. Sakane, J. A. Barriocanal and D. J. Doren, J. Chem. Phys., 1999, 110(3), 1329. 99 M. Sˇtrajbl, A. Shurki, M. Kato and A. Warshel, J. Am. Chem. Soc., 2003, 125(34), 10228. 100 Y. Ming, G. L. Lai, C. H. Tong, R. H. Wood and D. J. Doren, J. Chem. Phys., 2004, 121(2), 773. 101 T. H. Rod and U. Ryde, J. Chem. Theory Comput., 2005, 1(6), 1240. 102 E. Rosta, M. Klahn and A. Warshel, J. Phys. Chem. B, 2006, 110(6), 2934. 103 W. K. Hastings, Biometrika, 1970, 57(1), 97. 104 R. Iftimie, D. Salahub and J. Schofield, J. Chem. Phys., 2003, 119(21), 11285. 105 C. J. Woods, F. R. Manby and A. J. Mulholland, J. Chem. Phys., 2008, 128(1), 014109. 106 R. Iftimie, D. Salahub, D. Q. Wei and J. Schofield, J. Chem. Phys., 2000, 113(12), 4852. 107 D. A. Pearlman, J. Phys. Chem., 1994, 98(5), 1487. 108 M. Mezei, J. Chem. Phys., 1987, 86(12), 7084. 109 C. J. Woods, J. W. Essex and M. A. King, J. Phys. Chem. B, 2003, 107(49), 13703. 110 H. Leontiadou, A. E. Mark and S. J. Marrink, Biophys. J., 2007, 92(12), 4209. 111 G. S. Ayton and G. A. Voth, Biophys. J., 2004, 87(5), 3299. 112 J. W. Chu, G. S. Ayton, S. Izvekov and G. A. Voth, Mol. Phys., 2007, 105(2–3), 167. 113 S. O. Nielsen, C. F. Lopez, G. Srinivas and M. L. Klein, J. Phys.-Condes. Matter, 2004, 16(15), R481. 114 V. Tozzini, Curr. Opin. Struct. Biol., 2005, 15(2), 144. 115 M. Venturoli, M. M. Sperotto, M. Kranenburg and B. Smit, Phys. Rep.-Rev. Sec. Phys. Lett., 2006, 437(1–2), 1. 116 M. Levitt and A. Warshel, Nature, 1975, 253(5494), 694. 117 M. Levitt, J. Mol. Biol., 1976, 104(1), 59. 118 B. Smit et al., Nature, 1990, 348(6302), 624. 119 R. Goetz, G. Gompper and R. Lipowsky, Phys. Rev. Lett., 1999, 82(1), 221. 120 H. Noguchi and M. Takasu, Phys. Rev. E, 2001, 6404(4), 041913. 121 G. Brannigan and F. L. H. Brown, J. Chem. Phys., 2004, 120(2), 1059. 122 D. J. Michel and D. J. Cleaver, J. Chem. Phys., 2007, 126(3), 034506. 123 J. G. Gay and B. J. Berne, J. Chem. Phys., 1981, 74(6), 3316. 124 J. C. Shelley, M. Y. Shelley, R. C. Reeder, S. Bandyopadhyay and M. L. Klein, J. Phys. Chem. B, 2001, 105(19), 4464. 125 J. C. Shelley et al., J. Phys. Chem. B, 2001, 105(40), 9785. 126 S. J. Marrink, A. H. de Vries and A. E. Mark, J. Phys. Chem. B, 2004, 108(2), 750. 127 M. Orsi, D. Y. Haubertin, W. Sandersony and J. W. Essex, J. Phys. Chem. B, in press. 128 S. D. Chao, J. D. Kress and A. Redondo, J. Chem. Phys., 2005, 122(23), 234912. 129 C. F. Lopez, S. O. Nielsen, G. Srinivas, W. F. DeGrado and M. L. Klein, J. Chem. Theory Comput., 2006, 2(3), 649. 130 D. Van der Spoel et al., J. Comput. Chem., 2005, 26(16), 1701. 131 E. Lindahl, B. Hess and D. van der Spoel, J. Mol. Model., 2001, 7(8), 306. 132 S. J. Marrink, J. Risselada and A. E. Mark, Chem. Phys. Lipids, 2005, 135(2), 223. 133 V. Knecht and S. J. Marrink, Biophys. J., 2007, 92(12), 4254. 134 X. Periole, T. Huber, S. J. Marrink and T. P. Sakmar, J. Am. Chem. Soc., 2007, 129(33), 10126.

48 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

135 R. Baron, A. H. de Vries, P. H. Hunenberger and W. F. van Gunsteren, J. Phys. Chem. B, 2006, 110(31), 15602. 136 R. Baron, A. H. de Vries, P. H. Hunenberger and W. F. van Gunsteren, J. Phys. Chem. B, 2006, 110(16), 8464. 137 R. Baron et al., ChemPhysChem, 2007, 8(3), 452. 138 S. J. Marrink, H. J. Risselada, S. Yefimov, D. P. Tieleman and A. H. de Vries, J. Phys. Chem. B, 2007, 111(27), 7812. 139 P. J. Bond and M. S. P. Sansom, J. Am. Chem. Soc., 2006, 128(8), 2697. 140 P. J. Bond, J. Holyoake, A. Ivetac, S. Khalid and M. S. P. Sansom, J. Struct. Biol., 2007, 157(3), 593. 141 S. Izvekov and G. A. Voth, J. Phys. Chem. B, 2005, 109(7), 2469. 142 S. Izvekov and G. A. Voth, J. Chem. Phys., 2005, 123(13), 134105. 143 F. Ercolessi and J. B. Adams, Europhys. Lett., 1994, 26(8), 583. 144 S. Izvekov, M. Parrinello, C. J. Burnham and G. A. Voth, J. Chem. Phys., 2004, 120(23), 10896. 145 J. W. Chu, S. Izveko and G. A. Voth, Mol. Simul., 2006, 32(3–4), 211. 146 W. G. Noid, J. W. Chu, G. S. Ayton and G. A. Voth, J. Phys. Chem. B, 2007, 111(16), 4116. 147 S. Izvekov, A. Violi and G. A. Voth, J. Phys. Chem. B, 2005, 109(36), 17019. 148 P. Liu, S. Izvekov and G. A. Voth, J. Phys. Chem. B, 2007, 111(39), 11566. 149 Y. T. Wang, S. Izvekov, T. Y. Yan and G. A. Voth, J. Phys. Chem. B, 2006, 110(8), 3564. 150 S. Izvekov and G. A. Voth, J. Chem. Theory Comput., 2006, 2(3), 637. 151 S. Izvekov and G. A. Voth, J. Chem. Phys., 2006, 125(15), 151101. 152 A. P. Lyubartsev, Eur. Biophys. J. Biophys. Lett., 2005, 35(1), 53. 153 A. P. Lyubartsev and A. Laaksonen, Phys. Rev. E, 1995, 52(4), 3730. 154 D. Reith, M. Putz and F. Muller-Plathe, J. Comput. Chem., 2003, 24(13), 1624. 155 A. Arkhipov, P. L. Freddolino and K. Schulten, Structure, 2006, 14(12), 1767. 156 C. E. A. Chang, J. Trylska, V. Tozzini and J. A. McCammon, Chem. Biol. Drug Des., 2007, 69(1), 5. 157 J. Trylska, V. Tozzini and J. A. McCammon, Biophys. J., 2005, 89(3), 1455. 158 V. Tozzini and J. A. McCammon, Chem. Phys. Lett., 2005, 413(1–3), 123. 159 C. E. Chang, T. Shen, J. Trylska, V. Tozzini and J. A. McCammon, Biophys. J., 2006, 90(11), 3880. 160 V. Tozzini, W. Rocchia and J. A. McCammon, J. Chem. Theory Comput., 2006, 2(3), 667. 161 V. Tozzini, J. Trylska, C. E. Chang and J. A. McCammon, J. Struct. Biol., 2007, 157(3), 606. 162 J. Trylska, V. Tozzini, C. A. Chang and J. A. McCammon, Biophys. J., 2007, 92(12), 4179. 163 A. P. Heath, L. E. Kavraki and C. Clementi, Proteins, 2007, 68(3), 646. 164 M. Feig, P. Rotkiewicz, A. Kolinski, J. Skolnick and C. L. Brooks, Proteins, 2000, 41(1), 86. 165 M. Milik, A. Kolinski and J. Skolnick, J. Comput. Chem., 1997, 18(1), 80. 166 Z. X. Xiang and B. Honig, J. Mol. Biol., 2001, 311(2), 421. 167 J. M. Wang, P. Cieplak and P. A. Kollman, J. Comput. Chem., 2000, 21(12), 1049. 168 D. A. Case et al., AMBER 8, University of California, 2004. 169 A. Onufriev, D. Bashford and D. A. Case, Proteins, 2004, 55(2), 383. 170 P. Das, S. Matysiak and C. Clementi, Proc. Natl. Acad. Sci. USA, 2005, 102(29), 10141. 171 F. Ding, K. C. Prutzman, S. L. Campbell and N. V. Dokholyan, Structure, 2006, 14(1), 5. 172 J. Hermans, H. J. C. Bernendsen, W. F. van Gunsteren and J. P. M. Postma, Biopolymers, 1984, 23(8), 1513. 173 T. Lazaridis and M. Karplus, Proteins, 1999, 35(2), 133. 174 T. Kortemme, A. V. Morozov and D. Baker, J. Mol. Biol., 2003, 326(4), 1239. 175 S. Sharma, F. Ding and N. V. Dokholyan, Biophys. J., 2007, 92(5), 1457. 176 M. Nanias, C. Czaplewski and H. A. Scheraga, J. Chem. Theory Comput., 2006, 2(3), 513. 177 E. Lyman and D. M. Zuckerman, J. Chem. Theory Comput., 2006, 2(3), 656. 178 T. Z. Lwin and R. Luo, J. Chem. Phys., 2005, 123(19), 194904. 179 Q. Shi, S. Izvekov and G. A. Voth, J. Phys. Chem. B, 2006, 110(31), 15045. 180 L. Whitehead, C. M. Edge and J. W. Essex, J. Comput. Chem., 2001, 22(14), 1622. 181 J. Michel, M. Orsi and J. W. Essex, J. Phys. Chem. B, in press. 182 Y. Liu and T. Ichiye, J. Phys. Chem., 1996, 100(7), 2723. 183 A. Chandra and T. Ichiye, J. Chem. Phys., 1999, 111(6), 2701. 184 M. L. Tan, B. R. Brooks and T. Ichiye, Chem. Phys. Lett., 2006, 421(1–3), 166. 185 C. J. Fennell and J. D. Gezelter, J. Chem. Phys., 2004, 120(19), 9175. 186 M. Neri, C. Anselmi, M. Cascella, A. Maritan and P. Carloni, Phys. Rev. Lett., 2005, 95(21), 218102.

Chem. Modell., 2008, 5, 13–50 | 49 This journal is

c

The Royal Society of Chemistry 2008

187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239

M. Praprotnik, L. Delle Site and K. Kremer, J. Chem. Phys., 2005, 123(22), 224106. M. Praprotnik, L. Delle Site and K. Kremer, Phys. Rev. E, 2006, 73(6), 066701. M. Praprotnik, K. Kremer and L. Delle Site, J. Phys. A-Math. Theor., 2007, 40(15), F281. M. Praprotnik, K. Kremer and L. Delle Site, Phys. Rev. E, 2007, 75(1), 017701. M. Praprotnik, S. Matysiak, L. Delle Site, K. Kremer and C. Clementi, J. Phys.-Condes. Matter, 2007, 19(29), 292201. B. Ensing, S. O. Nielsen, P. B. Moore, M. L. Klein and M. Parrinello, J. Chem. Theory Comput., 2007, 3(3), 1100. M. Tuckerman, B. J. Berne and G. J. Marthyna, J. Chem. Phys., 1992, 97(3), 1990. R. E. Rudd and J. Q. Broughton, Phys. Rev. B, 2005, 72(14), 144104. S. A. Orszag and I. Staroselsky, Comput. Phys. Commun., 2000, 127(1), 165. N. A. Baker, J. Comput. Chem., 2005, 21, 349. N. A. Baker, Curr. Opin. Struct. Biol., 2005, 15(2), 137. J. H. Chen, W. P. Im and C. L. Brooks, J. Am. Chem. Soc., 2006, 128(11), 3728. Z. Y. Yu, M. P. Jacobson and R. A. Friesner, J. Comput. Chem., 2006, 27(1), 72. C. Park, M. J. Carlson and W. A. Goddard, J. Phys. Chem. A, 2000, 104(11), 2498. Y. H. Jang et al., J. Phys. Chem. B, 2003, 107(1), 344. N. J. English, J. Mol. Model., 2007, 13(10), 1081. S. P. Brown and S. W. Muchmore, J. Chem. Inf. Model., 2007, 47(4), 1493. G. Brannigan, L. C. L. Lin and F. L. H. Brown, Eur. Biophys. J. Biophys. Lett., 2006, 35(2), 104. I. Lotan and T. Head-Gordon, J. Chem. Theory Comput., 2006, 2(3), 541. W. Im, J. H. Chen and C. L. Brooks, Adv. Protein Chem., 2006, 72, 173. A. Warshel, P. K. Sharma, M. Kato and W. W. Parson, BBA-Proteins Proteomics, 2006, 1764(11), 1647. J. Carlsson, M. Ander, M. Nervall and J. Aqvist, J. Phys. Chem. B, 2006, 110(24), 12034. P. Koehl, Curr. Opin. Struct. Biol., 2006, 16(2), 142. Y. Y. Tang et al., Biophys. J., 2006, 91(4), 1248. E. Villa, A. Balaeff, L. Mahadevan and K. Schulten, Multiscale Model. Simul., 2004, 2(4), 527. E. Villa, A. Balaeff and K. Schulten, Proc. Natl. Acad. Sci. USA, 2005, 102(19), 6783. S. Barsky, R. Delgado-Buscalioni and P. V. Coveney, J. Chem. Phys., 2004, 121(5), 2403. G. De Fabritiis, R. Delgado-Buscalioni and P. V. Coveney, Phys. Rev. Lett., 2006, 97(13), 134501. R. Delgado-Buscalioni and P. V. Coveney, Philos. Trans. R. Soc. Lond. Ser. A-Math. Phys. Eng. Sci., 2004, 362(1821), 1639. R. Delgado-Buscalioni, E. G. Flekkoy and P. V. Coveney, Europhys. Lett., 2005, 69(6), 959. R. Delgado-Buscalioni and P. V. Coveney, Physica A, 2006, 362(1), 30. E. G. Flekkoy, R. Delgado-Buscalioni and P. V. Coveney, Phys. Rev. E, 2005, 72(2), 026703. G. Giupponi, G. De Fabritiis and P. V. Coveney, J. Chem. Phys., 2007, 126(15), 154903. R. Delgado-Buscalioni and P. V. Coveney, Phys. Rev. E, 2003, 67(4), 046704. R. Delgado-Buscalioni and P. V. Coveney, J. Chem. Phys., 2003, 119(2), 978. G. De Fabritiis, M. Serrano, R. Delgado-Buscalioni and P. V. Coveney, Phys. Rev. E, 2007, 75(2), 026307. R. Chang, G. S. Ayton and G. A. Voth, J. Chem. Phys., 2005, 122(24), 244716. G. S. Ayton, W. G. Noid and G. A. Voth, Curr. Opin. Struct. Biol., 2007, 17(2), 192. G. Ayton and G. A. Voth, Biophys. J., 2002, 83(6), 3357. M. Kranenburg, J. P. Nicolas and B. Smit, Phys. Chem. Chem. Phys., 2004, 6(16), 4142. M. Kranenburg, C. Laforge and B. Smit, Phys. Chem. Chem. Phys., 2004, 6(19), 4531. G. S. Ayton and G. A. Voth, J. Struct. Biol., 2007, 157(3), 570. G. Zini, A. Sarti and C. Lamberti, IEEE Trans. Ultrason. Ferroelectr. Freq. Control, 1997, 44(2), 297. D. Noble, Biochem. Soc. Trans., 2003, 31, 156. S. H. G. Khoo and M. Al-Rubeai, Biotechnol. Appl. Biochem., 2007, 47, 71. A. Michailova, J. Saucerman, M. E. Belik and A. D. McCulloch, Biophys. J., 2005, 88(3), 2234. D. Noble, Biochem. Soc. Trans., 2005, 33, 539. D. Noble, Biosystems, 2006, 83(2–3), 75. C. V. Forst, Drug Discov. Today, 2006, 11(5–6), 220. D. B. Kell, IUBMB Life, 2007, 59(11), 689. J. B. Bassingthwaighte, H. J. Chizeck and L. E. Atlas, Proc. IEEE, 2006, 94(4), 819. M. Stein, R. R. Gabdoulline and R. C. Wade, Curr. Opin. Struct. Biol., 2007, 17(2), 166. P. J. Hunter, W. W. Li, A. D. McCulloch and D. Noble, Computer, 2006, 39(11), 48.

50 | Chem. Modell., 2008, 5, 13–50 This journal is

c

The Royal Society of Chemistry 2008

Computer-aided drug design 2005–2007 Richard A. Lewis DOI: 10.1039/b609116b

1. Introduction At the 2005 Gordon Research Conference on computer-aided drug design, the delegates were polled for a series of big questions facing our field. These covered the fields of QSAR, structure-based design, ligand-based design, cheminformatics, library design and virtual screening. These questions are dispersed among the individual topics, but all focus on extending the scope and rigour of our methods. In the last two years, there has been a burst of activity around the fundamental science, reflecting the growing influence of modelling. It is no longer good enough to work only in the traditional areas of strength. Modelling is expected to make an impact now in all aspects of the drug discovery process.

2. QSAR and ADMET For the field of QSAR, the issues were: How global can good QSAR models be? How can we do simultaneous modelling against multiple pharmacological endpoints? How can we close the feedback loop so QSAR models could be used directly for generation of new structures? How can we make good predictive models of ADMET properties from molecular structure, everything from solubility to bioavailability to idiosyncratic toxicity? The valedictory article by Yvonne Martin 1 provides a reminder that modellers have most impact when they concentrate on the specific problems faced by medicinal chemists. Thus, providing models that we know to be flawed will have more effect than a ‘perfect’ model that is irrelevant to the daily experience of the users, or is not applicable to their compounds. The examples described are substructural alerts for reactive functional groups, which the chemists can modify and hence take ownership of, and C log P. The latter is a commonly used descriptor used as a surrogate for bioavailability despite the fact the calculation is in error by more than 1 log unit about a third of the time. Despite this, there is no pressing demand to devise a better method. 2.1 Model quality Guha et al.2 explore the relationship between global QSAR models and local models. A local model is obtained using only the compounds in the neighbourhood of the query molecule. This would be most useful when the global trend is non-linear, as one is approximating the first derivative of the non-linear curve with the local model. This has analogies to the kNN method. Of course, one must be careful how the distance metric and parameter k are chosen. In tests, the local models gave very similar global error predictions, but explained outliers in the global model much better. Where a local model falls down is when there are activity cliffs, that is, larger changes in activity for only small changes in structure, or when the neighbourhood is sparse. The key advantage is that no model needs to be built a priori, so that the method is well suited for analysing large data sets. A new method for making classification models from diverse structure, called line-walking recursive partitioning, has been published, using cyp450 data as the training set. The authors claim 85% accuracy, similar to SVM methods, and only 9 descriptors are used. The method was also able to uncover errors in the biological data, implying that it is Computer-aided Drug Discovery, Novartis Institutes for Biomedical Research, Basel CH-4002, Switzerland. E-mail: [emailprotected]

Chem. Modell., 2008, 5, 51–66 | 51 This journal is

c

The Royal Society of Chemistry 2008

robust even for noisy data sets.3 Gedeck et al.4 provide another view of QSAR, by looking at the quality of predictions as a function of the data set. Using 1000 datasets and creating QSAR models in identical fashions, they showed that the quality of the predictions correlates with the size of the data set and the dynamic range of the activity values, and that no one descriptor set is the best for all data sets. By performing randomised picking experiments, they also saw a wide variation in q2 values, indicating the instability of many of the data sets. The fraction of good models (q2 4 0.5) was low for all descriptor sets, with 2D descriptors performing better than quasi-3D descriptors such as Almond.

2.2 Matched pairs analysis Matched pair analysis (or Free-Wilson analysis) has come back into vogue with a number of papers employing the technique. As one is looking for changes in activity with small changes in structure, there is some tension with the similarity principle, but it is a valid approach. The argument is that classical QSAR models are subject to issues with descriptor validity, extrapolation and so on. By analysing a dataset of matched pairs, one can derive fragment-based rules for activity, with the hope that the rules are independent of the larger context of the rest of the molecule. Leach et al.5 have built a tool for extracting all matched pairs from a dataset, and for evaluating the statistical significance of each rule. They exemplify using solubility, plasma-protein binding and oral exposure in rats. Thus adding Br decreases solubility by 1 log unit in 98% of the cases investigated. Conversely, analysis of outliers can give hints about why the usual trend is bucked: adding a orthosubstituent may cause some change in conformational preference, i.e. the wider environment is affected and the change of fragment is not independent of the neighbourhood. Haubertin and Bruneau6 also use matched pairs in a similar fashion. Sheridan et al.7 employ the idea of a molecular transformation, that is the difference between two molecules, as a substructure difference vector. The key is to look at molecules and their activities in pairs, rather than as a whole data set. This allows one to pick up small signals that might otherwise be masked by other larger signals and errors. The substructure vector is based on the atoms left after subtraction of the maximal common substructure. By analysing all pairwise transformations, clustering them by similarity, then adding in the change in activity most strongly associated with the cluster, one can quickly see if a transformation was beneficial or deleterious, even if the original absolute activity was weak. A similar strategy can be applied to find which (beneficial) transformations derived from the training set could be applied to a query molecule. In studies using D2 agonists, DHFR inhibitors, ACE inhibitors, they showed that, in comparison with some global QSAR methods, their local method could pick up local trends that the global models missed, and vice versa. In addition, as the transformations have a well-defined neighbourhood, it is not generally possible to apply them multiple times, thus avoiding the unreasonable optimisation issues caused by applying global models to morph query structures in a de novo design paradigm. To provide balance, one can also look for matched pairs that do not show changes in activity at a position which has been shown to be important to the SAR. If this is combined by an analysis of the X-ray complexes, one can assume that there is bioisosterism. The work by Kennewell et al.8 does just that starting with an alignment of the relevant binding sites. The ligands are broken down into fragments by splitting at single bonds and the fragment pairs identified by measuring the volume overlap with a reference molecule. The analysis was performed across several targets and bioisosteric pairs are found. However the authors are keen to stress that bioisosteres are particular to the target and not necessarily universal. 52 | Chem. Modell., 2008, 5, 51–66 This journal is

c

The Royal Society of Chemistry 2008

2.3 Multiobjective optimisation Inverse QSAR and multiobjective optimisation have been combined into a single workflow by Brown, McKay and Gasteiger,9 using a GA-based compound generator, and two QSAR models for solubility and polarizability. The issue of model knowledge domain is handled by Hotelling’s T2 statistic, and the multiobjective optimisation by Pareto ranking. They were able to show that the evolved molecules were consistent with the training sets. 2.4 Predictive models of ADMET The focus on producing general models for ADME phenomena can produce an over-reliance on the many robust methods for deriving QSAR models, while ignoring the underlying structure of the data. Good and Hermsmeier10 have re-examined the problem of predicting druglikeness, and show that there is considerable bias in the data, which can make the models look better than they really are. They argue for training set selection based on target class and ensuring that the decoy set is matched to the true actives by molecular weight as a minimal requirement. Random selection can fall foul of analogue bias, whereby certain classes of molecule are overrepresented, leading to recognition rather than prediction. Mutagenicity is an observation that will often halt the progress of a compound through the clinic. Based on the assumption that the features that give rise to mutagenic effects are quite localised, Kazius et al.11 have created a set of toxicophores based on a dataset of 4337 structures with Ames data. 29 toxicophores could classify the actives and inactives with an error of 18%, around the interlaboratory reproducibility level for the AMES test. An interesting case study is the aromatic nitro group. While is it strongly associated with mutagenic effects, its influence can be mitigated by other substituents on the ring: this observation lead to more specific rules, all of which are published as supplementary information in SMARTS format. This could form part of any assessment for compound purchasing or library assessment. Although it is included in the ADMET section, the review by Kalgutkar et al.,12 listing the bioactivation pathways of organic functional groups, has much wider relevance. Although the authors are careful to stress that no functional group should be ‘black-listed’, this survey could be used to bias away from certain groups during compound selection or library design or de novo design. Two new models for volume of distribution in humans have been published. This involved a very careful assembly of data, coming as it did from different data sources. In the Lombardo model,13 384 compounds were used in the study, most with quite diverse structures. As expected, C log P was an important descriptor, but also terms involving the fraction of the molecule ionised. It is clear that a good predictor for pKa, especially around physiological pH, will be key to future success in the modelling of pharmaco*kinetics. A model that predicts the pKa of amines accurately is not that helpful, if all the predictions are 48.5, as the compounds will be fully ionised. A program that could distinguish between a pKa of 6 and 7 would be immensely valuable. The model for VDss is accurate to 2-fold, as accurate as equivalent models based on animal data, and the underlying, curated data has been made available. The Gleeson model14 also includes rat data and is quoted as having an error of 0.37 and a r2 of 0.53. A curated data set has been put together using drugs marketed in Sweden.15 The total set of 673 pharmaceutical agents was reduced to 24 diverse compounds using multivariate analysis. Experimental data for these compounds was measured in a consistent way, to allow appropriate use in models (pKa, log P, solubility and CACO-2 permeability). Data on VDss, half-life and clearance were obtained from the literature. As all the compounds are commercially available at reasonable prices, this data set could be expanded: even in its present form, it will be a good standard for ADME modelling. Chem. Modell., 2008, 5, 51–66 | 53 This journal is

c

The Royal Society of Chemistry 2008

Herg blockade is mesmerising the pharmaceutical industry, not just due to the implications for drug discovery, but also because the difficulty is understanding the interplay between displacement of dofetilide, patch clamp assays, dog telemetry and the occurrence of toursade des pointes in humans. Several groups have developed pharmacophore models, including one for non-basic molecules.16 However, as Pearlstein has pointed out,17 Herg does not have an endogenous ligand, so that the Herg binding site is under-evolved and promiscuous, and might be better viewed as a host–guest complex. 2.5 Metabolism There have been continued studies into the energetics of hydrogen abstraction as a key factor in both regioselectivity and relative rates of metabolism by Cytochrome P450s. Olsen et al.18 performed studies with B3LYP and AM1 Hamiltonians using 24 substrates covering most commonly found atom types, and a haem surrogate. The surrogates ranged from a full iron-porphyrine to a methoxy radical. By performing full comparisons at the different levels of theory, they showed that the error in a simple AM1 approach using PhOdwas under 4 kJ/mol, a quite acceptable margin. Bond Dissociation Energy was found to be the best correlate for activation energy. All of the results could be summarized into 5 rules based on atom type and local environment. The results were applied to two steroids, and the authors could predict, with good accuracy, the preferred sites of metabolism in both, even without taking into account the accessibility of the atoms to the iron in the cyp450 site. Sheridan et al.19 eschew the energetic approaches to a purely QSAR-based model of regioselectivity for metabolism by Cyp450’s. The best models use only three substructure descriptors and two property descriptors. In comparison with other methods, specifically Metasite,20 the simple QSARs seemed to do as well or better, although the authors acknowledge this could be an artefact, and true comparison of performance is not easy. The authors also not that the models only predict where a molecule might be oxidised, and not how fast or extensively, something the abstraction energies do address.

3. Structure-based drug design The key issues for structure-based drug design are: adequate sampling of protein conformational space; high quality but practical potentials for predicting protein structures and protein/ligand interactions; generation of hom*ology models good enough for hit identification and lead optimization, and to what extent can we refine a hom*ology model based on knowledge of the ligands known to bind? 3.1 Protein flexibility Protein plasticity is a common phenomenon, but not often explicitly addressed except via MD, and even then we cannot be sure that the key conformations have been sampled. The zero-order approximation is to assume rigidity, followed by the use of several rigid snapshots, or side chain flexibility (e.g. GOLD), or even some degree of induced fit is allowed (see below). What is lacking are methods that are fast and cover all the key conformations. To obtain this, we need to understand more about the important motions of protein structures. Kinases are a good test case, as they are plastic, but we have X-ray structures of many of the active and inactive states of the proteins, plus ligands, so we can try to assess the interplay between protein conformation and ligand structure. Subramanian et al.21 have used CDK kinases as a test case: the proteins were aligned to a standard, using all atoms. The size of the binding site was defined using 4 residues common to all the CDKs; this variable was correlated against standard descriptors generated from the ligand structures. The degree of stabilisation of active and inactive forms was found to 54 | Chem. Modell., 2008, 5, 51–66 This journal is

c

The Royal Society of Chemistry 2008

depend strongly on the local hydrophobic character of the ligands, in addition to the usual size and shape constraints. Using the QSAR equations, one can predict which protein conformation is best suited for docking studies with a molecule. Another way to tackle this is to look at the formation of transient pockets. Eyrisch and Helms22 have done this specifically for protein–protein interactions. Starting with a crystal structure and running 10 ns of dynamics, they created multiple snapshots from the trajectories, each of which was examined using PASS, to detect cavities. The cavities were clustered to determine the location of more substantial pockets. The validity of the pockets (and the method) was assessed by superimposing the inhibitor structure: in all cases the pockets opened up in the regions one would expect ligand atoms to bind. The pockets also seemed to be the most non-polar. In docking studies, using the transient pockets gave better results compared to using the original apo-form from the X-ray. Apart from the formation of pockets, one also should consider if they are energetically favourable for binding. Landon et al.23 using computational solvent-mapping with small organic probes to identify energetic hotspots. Using renin as an example, they identified a pocket that is used by aliskiren, but not by other peptidomimetics. Using the Fesik24 training set of druggable and non-druggable proteins, they could shown that targets with a low ranked consensus site (among the probes used) were more likely to be the nondruggable ones. The exceptions to this were small confined sites such as PTB1B. Another way of looking at localised plasticity is to consider the structure ensembles generated by NMR methods.25 X-ray structures of complexes may have higher accuracy, but the protein is collapsed around the ligand, leading to the crossdocking problem, that one will not find other inhibitors of the same target, because the binding site is too closely modelled on one ligand. Using HIV protease as the exemplar, multiple pharmacophore models were built using benzene, ethane and methanol probes within an MD simulation, followed by clustering to identify hotspots of interaction. These models were used to query a conformer database. The effect of using multiple models is to increase the uncertainty (radius) of each feature, compared to using a model from an average structure. This is important when the pharmacophore contains more elements. The confusion matrices for identifying known hits from a database for presumed inactives (renin inhibitors were also found) were very good, with 90% true positive rate, 10% false negative. The study also compared the use of NMR structures to models derived from multiple X-ray structures. The recommendation is that NMR structures give better models, as the X-ray structures contain too little variation. The same group have also used conformations from MD sampling26 with similar findings. 3.2 Prediction of affinity The prediction of free energy, or at least consistent ranking of ligands according to free energy, has attracted much attention over the last few years, partly as a result of better hardware, but mainly due to the availability of more X-ray structures. The usual questions about sampling, the effect of solvent, force field accuracy have all been examined. Michel, Verdonk and Essex27 compared implicit and explicit solvent models for a series of inhibitors of neuraminidase, COX2 and CDK2. While they found that implicit solvent models are as good as explicit models in ranking ligands, while being much quicker to run, they urge caution, as changes in the conformation of the binding site can cause significant errors, so the methods should always be allied to X-rays or reliable docking predictions. Weis et al.28 used a MM/PBSA method to examine the effect of the force field and the starting geometries affected the calculated binding energies. While the force field didn’t matter much overall, mixing the force field for geometry and energy calculations did. They concluded, in contrast with Essex, that for their system explicit water should be used. They also conclude that simulations should be at least 450 ps long and that entropy is the most poorly handled term. Wiseman et al.29 describe their grand canonical Monte Carlo Chem. Modell., 2008, 5, 51–66 | 55 This journal is

c

The Royal Society of Chemistry 2008

method for computing relative affinity. The ensemble is annealed with respect to free energy and the protein and target are treated as rigid entities, and conformational freedom is introduced by means of snapshots. This is less of an issue for fragments or small molecules. For the test case of T4-lysozyme complexes, the method was comparable with much more complex simulations. The goal of this method is to dock fragments, which can then be assembled into larger entities (see sec. 6). Finally, the empirical scoring function used in Glide has been upgraded with new terms, 30 for desolvation, for hydrophobic collapse, and for h-bonding. The force field was parameterised on 198 complexes and 15 screening sets. They claim an accuracy of 2–3 kcal/mol, depending on how well-behaved the system is (that is, can the ligand be docked into the target without substantial reorganisation). 3.3 Selectivity Methods for assessing cross-reactivity a priori are of interest for several classes of targets with similar binding sites, most notable the ATP-binding sites of kinases. Using their dehydron approach, Fernadez and Maddipati31 generate a descriptor for the ATP site that correlates very strongly with the pharmacological activity vector across a panel of 32 kinases. A dehydron is defined by the degree of non-polarity around an h-bonding group. Excessive non-polarity promoted dehydration as a means to enhance the electrostatic contribution of an h-bond to a ligand. A matrix of dehydron patterns can be built by assigning a score if two residues share a dehydron, 0 otherwise. This matrix can be compared between kinases using standard metrics. The descriptor can be sharpened by only looking at dehydrons that are known to interact with inhibitors. To extend the approach, the authors use the fact that there is a strong relation between dehydron wrapping and the disorder score,32 allowing the a priori prediction of cross-reactivity. There is a continuing debate about the ability of F bound to C as an h-bonding group, or rather the extent to which this phenomenon can be observed, and its contribution to affinity. There is also the other effect of F on pKa which can affect affinity. Razgulin and Mecozzi33 have performed a comprehensive analysis using high level ab initio methods of the energetics of C–F H interactions. They find that fluoroaromatics can h-bond to water with an energy of 2.6 kcal/mol; this is weaker than h-bonds involving O or N, but in the context of structure-based drug design, displacement on water could therefore be much more energetically favourable than was first thought.

4. Virtual screening Virtual screening covers both structure- and ligand-based design, in that the output from these fields is used to search for new ligands. The key issues are around scoring, or enriching actives over inactives in the final ranking. 4.1 Docking A significant advance in docking has been made by Sherman et al.34 by iteratively combining the usual rigid protein/flexible ligand approach (in this case GLIDE) with a program for protein structure prediction (PRIME). This allows the protein to be induced to fit a given ligand. Thus one can start with an inappropriate protein conformation (for example the apo-form) and mould the site around the ligand to produce a better model for docking. In self-docking experiments, the average RMSD could be decreased from 5.5 A˚ to under 1.8 A˚ for the 21 cases considered (chosen as they were challenges for the usual paradigm). The first docking starts with a softened steric term, and the induced fit dockings add in an extra term for the conformational strain of the protein. The induced fit method in the paper is restricted to side chain motions; there are obviously cases where there is much more extensive motion of 56 | Chem. Modell., 2008, 5, 51–66 This journal is

c

The Royal Society of Chemistry 2008

loops and secondary structures. This is not an insuperable difficulty, as loop prediction can be added into the protocol. 4.2 Scoring The well-known bias of scoring functions with size is investigated in detail by Jacobsson and Karlen.35 Using a training set of 945 actives and 10k decoys, with 8 different scoring functions, three methods for correcting or improving the ranking of the actives were tried. MASC36 did not generally make the enrichments better. Normalisation by the square root of molecular weight or number of heavy atoms sometimes improved the enrichment rate. As the authors point out, using structural knowledge about the target is generally far more powerful than relying on the score. The other point to note that this study is concerned with improving the enrichment rates of a docking run, rather than improving the prediction of affinity. Cho et al.37,38 have performed an important study into the effect of ligand atomic charges on the accuracy of docking runs. In most protocols, it is not feasible to recalculate charges on the fly, so that standard force field charges are set once and maintained. Apart from the questionable accuracy for some atom types, this approximation neglects the effects of polarisation. In this study, a mixed QM/MM simulation was used to recalculate the ligand charges during several cycles of docking and minimisation. In the test set of redocking, using induced QM charges to seed the next docking, did not cause any harm, and in most cases gave improvements. However, in subsequent studies, if a local minimum is encountered, using QM charges can reinforce the ‘incorrect’ binding mode. This is countered by running the docking in a stochastic parallel mode to increase the sampling. 4.3 Validation The Astex test set for docking studies was updated in 2007.39 This valuable resource has been refined to just 85 structures of high quality, representing an unbiased sampling of available structures, to compensate for the large fraction of kinase and protease structures in the PDB. The Astex set also requires deposited structure factors, so that ligand positioning can be independently regenerated and assessed. Issues arising from steric clashes or symmetry have been addressed directly. The set was also subject to extensive manual curation around tautomer and protonation states. In a experiment using GOLD, it was shown that performance was improved, not due to better docking and scoring, more that the test set was freer from experimental artefacts that would have caused docking programs to fail. Interestingly, using this high quality test set can also reveal other, more subtle, effects that can impact performance, for example the method used to generate the starting conformation. Using Corina rather than the native conformation caused a degradation of 5% in redocking experiments. Huang, Shoichet and Irwin40 have provided another benchmarking set of ligands and decoys for 40 different targets. Given the correlation of scoring functions with size of lipophilicity, the decoy set has been chosen to resemble the active ligands physically while have distinct chemotypes. Each ligand has its own set of decoys. They showed that using uncorrected decoy sets, such as the MDDR, gave enrichments 0.5 log units better, which is clear evidence of bias. One of the key tools in evaluating virtual screening performance is some measure of enrichment, whether by ROC curves, enrichment factor and so on. What one wants to know is whether a screening method identifies the actives early in the overall prioritised list of compounds. Truchon and Bayley41 have performed a theoretical analysis to show that ROC curves may not be ideal for this purpose, despite their widespread use (and advocacy). A method can get a good ROC score, yet still fail to identify any actives in the top 20%. They propose a new metric based on an enhanced form of the ROC curve called BEDROC, which, via a parameter Chem. Modell., 2008, 5, 51–66 | 57 This journal is

c

The Royal Society of Chemistry 2008

alpha, can be used to emphasis, or weight, the influence of the early part of the curve. Other effects, such as statistical error and saturation effects (the variation of a metric with the number of actives), are also discussed, leading to some practical recommendations for the conditions needed to perform a good evaluation of a new method. 4.4 Pharmacophores The old trick of using pharmacophores to pre-screen before performing expensive docking studies42 is enjoying a renaissance. Lu et al. employed this strategy to find leads against the MDM2-p53 interface.43 The pharmacophore constraints screened out all but 2599 structures from a starting set of 11k. Docking gave further enrichment of 10 confirmed hits out of 67 compounds assayed. As a pharmacophore, if properly constructed, will give very few false negatives (false negatives might arise out of different binding modes or poor feature definition), the scoring of docked hits becomes more useful. The strategy was also validated using known inhibitors, all of which passed the filters, and scored well. Baroni et al.44 combine 4-point pharmacophore fingerprints with a shape-based description of the binding site in the FLAP program. Molecular interaction fields (MIFs) are used the locate maxima and minima interaction regions within the site, which describe the pharmacophore points (which are hence independent of substructure). The same procedure can be applied in reverse to small molecules, leading to a fast fingerprint-based method for prefiltering large databases. The FLAP fingerprints can also be used for alignment-free comparison of binding sites to highlight regions of similarity and selectivity. Protein flexibility is handled at the side-chain level during the generation of the MIFs, and ligand flexibility by conformer sampling. FLAP shows good performance when evaluated against other docking programs (even though that it is not its primary purpose). A new methodology for combining pharmacophore detection and 3D QSAR, PHASE, has been described by Dixon et al.45 The workflow is similar to other procedures in the generation of conformers followed by mapping of pharmacophore features. These features can be the standard h-bond donors, acceptors, hydrophobes, charged groups plus other custom features. For hydrogen bonds, either vectors or projected binding points can be used, obviating to some extent the need to have the heavy atoms of the ligand feature superimposed, a limitation in many packages. Detection of a pharmacophore is also made more likely by reducing the restriction that all active molecules in the training set must display the same common pharmacophore. Normally several candidate pharmacophores can be proposed, so the next step is to score the models, which can be done using the fit of the actives to the model, which actives are fitted, conformational strain and volume overlap. Inactives are also used to penalise a model, but as a separate step. The selected model can, as in other cases, be used to define the alignment for a 3D QSAR model. PHASE uses an occupancy grid (that is, is the grid cube occupied by a feature from the ligand) rather than a probe energy grid as the descriptor for the QSAR; this leads to volume exclusion maps in a very facile manner. In a comparison with CATALYST (and with the caveat that this paper was written by the vendors of PHASE), PHASE seems to do better on the test set used by Patel et al.46 in their comparative study of pharmacophore generation methods. Another new tool for pharmacophore alignment, GALAHAD, has been described.47 This works by iterative construction of hypermolecules. A hypermolecule is generated by aligning successive pairs of molecules; it retains common features and individual atoms. It is therefore also dependent on the order in which the alignment pairs are processed, so the strategy is to align the most similar pairs first. The alignment is scored by the overlap of the pharmacophore features rather than just the atoms. Again, on the test set of Patel et al.,46 this method appears to find better pharmacophore models, but with the caveat that frozen conformations were used. 58 | Chem. Modell., 2008, 5, 51–66 This journal is

c

The Royal Society of Chemistry 2008

Rules-based conformer generators often have fairly basic torsional parameters. Sadowski and Bostrom48 have developed a method based on MIMUMBA, to generate all possible for-atom torsional fragments, to search the Cambridge Crystallographic database with these fragments and to use the results to define torsional energy parameters. These more extensive rules improve the ability of OMEGA to find conformations closer to X-ray conformations. Interestingly, if the conformers are subsequently refined using an MMFF-like force field, the improved performance is removed. Agrafiotis et al.49 have performed a comparative study of conformational coverage provided by several common packages as well as their own stochastic proximity embedding method (SPE). They conclude that SPE, along with a poling method such as catCONF, provides better sampling, whereas the others produce more extended or compact geometries. Coverage was measured using pharmacophore triplets and radius of gyration. As the authors point out, the measure affects the relative performance, so that their conclusions are not a universal endorsem*nt of a particular method. Stochastic methods are not perfect, and still produce horrors such as bonds penetrating through the centres of rings. 4.5 Scaffold hopping Scaffold hopping is the technique of identifying molecules that are structurally different but which might form similar interactions with a receptor. In its own way, scaffold hopping is a form of similarity search, but with more abstract, less connectivity-dependent descriptors. Gillet et al.50 describe the use of reduced graphs, which combine essential pharmacophore information within a graph; these graphs can then be compared using classical means. The methods seem to marginally outperform classical fingerprint similarity in enrichment experiments, as they pick up both the structurally similar and dissimilar actives. Stiefl et al.51 also use a reducedgraph formulation in their ErG program. ErG handles h-bond acceptor/donor groups as two distinct states, allowing for exact matches with structures that do not have a matching donor/acceptor, rather just, say, a donor. A new descriptor is generated for each state, leading to 2N descriptors, where N is the number of donor/ acceptor groups. The paper is also noteworthy for its analysis of why different levels of performance are seen, compared to classical 2D fingerprints, for enrichment experiments. Methods that are strongly dependent on pharmacophore features do well in cases where there is a clear pharmacophore exposed, and less well when the pattern is hidden but the class is structurally hom*ogenous. The Cresset field point method52 uses molecular field extrema to create a representation of molecules. These field patterns can be compared to produce lists of similar molecules. This algorithm is effective at finding scaffold hops, particularly for ring systems. The field points convey the hydrophobic and electrostatic features in the neighbourhood of a molecule in one conformation. The overlay is performed using a rigid-body simplex optimisation, to optimise the fit of the extrema of one molecule onto the full field of another (and vice versa). The starting point is derived by matching the extrema patterns using coloured graphs, very much as in the reduced graph approach. One of the harder aspects is when molecules could be formally charged, as this will drastically affect the fields. The power of the method is illustrated using thrombin inhibitors from different classes and trying to regenerate the X-ray overlay without using any information from the X-ray complexes, which they were able to do. Maass et al.53 offer an update on the CAVEAT methodology with the RECORE program. As before, a database of compounds is shredded to create a fragment database. Fragments are created using the notion of cut points, that is, places at which the removal of a bond splits the structure into two. The cut points are further limited to preserve conformational restraints. Fragments are filtered by size and indexed by the vectors created by two or more cuts, as well as by pharmacophore features. Scaffold hopping is therefore performed by searching the database for Chem. Modell., 2008, 5, 51–66 | 59 This journal is

c

The Royal Society of Chemistry 2008

defined vector geometries plus addition features. In three test cases, viable, unstrained replacements for query structures could be obtained, even for multiple vector queries, where one expect the density of suitable fragments to be much sparser. Bergmann et al.54 use a GRID-based method to combine the Cresset and CAVEAT strategies. The anchor points (as for CAVEAT) are provided by the user, and then a database is searched for similar scaffolds, which match the geometry of the anchor points and the molecular interaction field profile. Using protease test cases, the central core could be used as a query, and known active alternative chemotypes found. The author suggest that the other highly ranked scaffolds could well be bioisosteres. 4.6 Ligand preparation Successful virtual screening relies on an appropriate and correct description of the molecules being screened; two particular sources of error are protonation state and tautomeric form. Oellien et al.55 have developed an exhaustive tautomer enumeration approach based on a set of 21 predefined transforms. The group do not attempt to say which tautomer is favoured, only that every form is represented. Databases built using all tautomers show better performance especially when screened with pharmacophore queries, as the bias introduced by most compound registration tools as to the ‘correct’ tautomer is removed.

5. De novo structure generation Fink and Reymond56 have continued their exhaustive exploration of chemical space for molecules with up to 11 atoms, using the elements C, N, O, F. They could enumerate 26.4 million structures, in comparison with the 63k structures with similar constraints described in the literature, including 538 novel ring structures. The universe has been somewhat pruned by rules around ring strain and obvious instability. The druglikeness of the structures was assessed using Bayesian models to predict for GPCR, kinase and ion channel activity, and a surprising number of novel virtual hits were found. Fechner and Schneider57 have also published a de novo design program. It is based on the assembly of molecular building blocks derived from a dataset using a RECAP-like set of disconnections.58 The starting molecule is also disconnected using the same rules, and rebuilt. A population of structures can exchange fragments, leading to a GA implementation with cross-overs and mutations. The fitness function was defined as the similarity, using topological torsions, to a query template structure. There are also various filters, based on upper and lower bounds on defined properties or descriptors, for example, molecular weight. Some of the queries could be generated, but compounds that could not be well handled by the disconnection rules were not. Lameijer et al.59,60 used de novo methods to generate scaffolds that were subsequently hand-modified by chemists. The protocol is to generate molecules from atoms and some fragments, then quickly stop and seek input from the medicinal chemist, who selects and edits the suggestions. This cycle can be iterated. In the paper, 8 examples of evolved molecules were made, and 4 showed some activity against bioamine targets. The generation process was highly constrained to produce molecules with more leadlike properties. While the input from a chemist resolves questions about synthetic feasibility, it raises others around bias and prior knowledge of pharmacophoric requirements. However, the authors argue that it is easier to evaluate a suggestion, than to invent something new, hence the close coupling of the computer program with the human end-user. Boda and Johnson61,62 take a different tack, suggesting that complexity (as a surrogate for synthetic accessibility) can be measured by precedent. A database of fragments was created using standard procedures from the MDDR set. Query 60 | Chem. Modell., 2008, 5, 51–66 This journal is

c

The Royal Society of Chemistry 2008

molecules from de novo generation can be fragmented in the same way. The frequency of occurrence of the fragments (together with terms for stereocentres and rotatable bonds) can be combined as penalty scores to give an index of complexity. Unusual fragments, or unknown fragments, will incur high penalties (for example pentasubstituted phenyl rings) whereas common fragments (3-substituted indoles) will not. Naturally, this does depend on the dataset and the fragmentation rules used to define the chemical precedent. Some well known drugs do have high complexity scores, reflecting this issue.

6. Fragment-based screening Fragment-based screening and drug design is intended to bridge the gap between the size of the druggable chemical space, and the relatively small numbers of compounds we have for screening. The argument is that good ligands can be built up from relatively small numbers of fragments, so by performing more sensitive screens with very small numbers (1000’s) of fragments, we can identify the key interactions, and then assemble the fragments back into more traditional ligands. One unresolved question is how to construct an optimal screening set of fragments, looking at diversity and screening set size as the parameters. Makara63 has conducted a retrospective study of 6 datasets, comparing an HTS deck, and virtual amide library and fragment libraries to look at these questions. There are other limiting factors to screening set size, based on the ability to follow up on any hits. The theoretical conclusion was that a few thousand (20k) compounds are required to represent the chemical space of commercially available amide space, and the hit rate from this library was comparable to screening the whole HTS deck. Mauser and Stahl64 also have looked at fragment selection using a backward approach, in contrast to the forward approach of Makara. A set of fragments was derived from bioactive compounds using RECAP rules,58 and filtered using some general leadlike criteria (ring size, rotatable bonds etc.). The fragments were clustered on the basis of maximal common substructure. Using different subsets of fragments of size 2000–4000 members, the chemical space that could be covered by assembling the fragments was also assessed using the Ftrees descriptors65 in comparison to the original database of bioactives. Each subset could represent about 20% of the original database of 85k compounds. Rings have long fascinated cheminformaticians, from determining the smallest set of rings in one structure, to determining the frequency of rings used in drug discovery,66 to try to understand which rings are acceptable, which are not. Much of this argumentation is, like the rings themselves, circular; rings may be favoured not for their biological properties but instead their synthetic accessibility, both historically and today in the cost of goods for synthesis of a drug in clinical development. Ertl et al.67 analysed the World Drug Index (bioactives) and compared this to compounds for sale, and a database of natural products. The diversity of simple aromatic rings was remarkably low, with only 780 scaffolds found from 150k compounds, including 216 singletons. Using exhaustive enumeration on a small number of frameworks, 580k possible scaffolds were created, and described using simple terms like C log P and PSA, as well as parameters derived from QM calculations. Using a Kohenen network as a classifier between the bioactive scaffolds and the virtual ones, it was observed that the bioactive scaffolds did indeed cluster into discrete regions: the descriptors most important to the classifications seemed to be heteroatom count and hom*o/LUMO energies. The authors conclude that we are only scratching the surface of structural chemistry space, even though bioactivity may be limited to small regions of that space. Mapping of hot spots of interaction has been a key concept in fragment-based design. Most scoring functions (for example GRID68) are based on reproducing crystal structures. In a paper by Ciulli et al.,69 the energetic contributions of fragments are measured experimentally using isothermal calorimetry and NMR. Chem. Modell., 2008, 5, 51–66 | 61 This journal is

c

The Royal Society of Chemistry 2008

Using ketopantoate reductase/NAPHH as the model system, it was shown that the change in affinity per heavy atom added was below the figure proposed by Kuntz for drug-like molecules.70 Looking at individual components, adding ribose did not improve average ligand efficiency, but adding phosphate groups did. Thus two hot spots in the ligand-receptor complex could be identified, and targeted for a fragmentbased strategy. Hajduk71 returns to the question of lead-likeness and the minimal potency required for a fragment to be useful. By examining 18 highly optimised inhibitors, and deconstructing them stepwise back to fragments, he claims a linear relationship between weight and potency, i.e. the binding efficiency is almost constant. As a rule of thumb, one should expect 1 log unit of potency for every 64 Da added, if one is to proceed along a successful optimisation. A drop of Binding Energy Index of more than 10% is probably indicative that a plateau has been reached of the modification is not ideal. Working this through, a starting fragment of 250 Da and an affinity of 1 mM would need to grow to 750 Da to achieve a potency of 1 nM. To stay under 500 Da, the starting fragment would need to be less than 200 Da. Finally, Babaoglu and Shoichet72 have performed experiments to assess the fundamental principle of fragment-based design. If we assume that we can build a ligand from fragments, what happens if we deconstruct a ligand into fragments? Will the individual binding modes of the fragments be maintained? Unfortunately, the experiments suggests that this is not always the case, and that fragments can be too small to bind unambiguously and/or correctly. For the original binding mode to be favoured, a larger fraction of the original inhibitor needs to be in place.

7. Target fishing There is considerable interest in the new field of target fishing. This is basically the identification of the target to which a ligand binds. This can be at the level of finding an active compound in a whole cell assay, then driving down to the molecular target, or having a collection of structures, or combinatorial library, and estimating in silico which proteins are these compounds most likely to bind to, before conducting the in vitro assay. Rognan et al.73 have performed the latter, proposing five targets for a set of triazepandiones, and finding significant activity in one screen, sPLA2. The first step is to prepare an annotated library of binding sites. 5 representatives from the screening library were docked into the 2148 binding sites, and the sites that got the highest score were retained. The target proteins from which the sites were derived were assayed experimentally with members of the screening set, to give micromolar inhibitors. It can be argued that the control experiment, of assaying against all the proteins, was not done, but this was impractical. Also, the most potent compounds in the assay were not the one predicted by docking, again highlighting the errors in the virtual screen. However, it is now possible to deorphanise compounds in silico relatively quickly (docking is slow, but it can be performed in parallel), and this result will have major impact in library design and compound purchasing for enrichment of corporate compound archives. Langer et al.74 tackle the issue using pharmacophore models instead. They have built up a library of models against a variety of targets. By screening a set of compounds against these models, they try to predict the biological profile of the compounds, which they managed for 90 out of a set of 100. The models can be generated either from a set of ligands, or from the protein structure itself, as described elsewhere in this review. Using more extensive conformational analysis leads to more false positives and true positives. The quality of the model was critical (how many features, tolerances etc.), as was the stickiness of the site (how many models could be derived), as to the confidence for which a target could be linked with the query structure. Jenkins et al.75,76 have used ligand activity data from the well-curated WOMBAT database to build a series of 2D models to predict the most likely protein targets for 62 | Chem. Modell., 2008, 5, 51–66 This journal is

c

The Royal Society of Chemistry 2008

the compounds in the MDDR database, as well as using a 3D pharmacophore-like descriptor for the same purpose. On average, the correct association of ligand to target could be found 90% of the time by one of the 2D methods, and some of the broader activity categories (for example, ‘‘kinase inhibitor’’) in the MDDR could be disambiguated to more precise targets. The 2D models could also be used for compound selection and library design, with the caveat that, as they are based on 2D fingerprints, they might bias against novel chemotypes (where 3D methods come more into their own). Indeed, the authors ascribe the very high success rate to the congeneric nature of the data. Cleves and Jain77 have used ligand-based methods to look at the link between similarity, pharmacological profile and target similarity for about 1000 therapeutics. The goal of the study was to rationalize, or even predict, off-target effects of compounds. The first stage was to curate the data, so that pharmacological target was named consistently. Their first result was to show that the similarity principle was generally true for the dataset. When ligand-based models were built for 22 targets, recovery rates of true ligands were between 60–70% with only 1–5% of false positives. By looking at the non-cognate ligands returned by the models, one could hop to a new target family, to hint at possible off-target pharmacology; the example given being the relationship between b-lactam antibiotics and beta2-adrenoreceptors. There is crossover between this topic and library design. Increasingly, groups are trying to introduce a target-family bias into new designs, or to select between designs on this basis. The SIFt methodology of Singh et al.78 encodes the pharmacophoric information within a site as a fingerprint that can be used for fast screening of large libraries. The first step is to dock a training library, followed by R-group analysis. Using the properties of the R-group as descriptors, and a SIFt analysis of the docking modes, more refined fingerprint patterns could be defined (describing which R groups make which interactions) which are used to build predictive QSAR models. These models were shown to be more sensitive to detecting compounds that could bind favourably than just using docking score alone. A related study by f*ckunishi79 used dockings between known binders and inactive compounds, and set of pockets derived from protein active sites. Using PCA, they could show that the actives clustered together, and that using proximity to the active cluster for a given protein pocket was a good method for finding actives in a larger test set. Also they could cluster proteins with similar pockets together. This does require some fairly extensive calculations, and a large compound/target activity matrix, but only needs to be performed infrequently to generate the scoring model.

8. Library design Industry groups are now starting to publish their in-house tools for library design. Most of the innovation has occurred within Pharma research, as off-the-shelf packages often do not fit well into local workflows. Mosley et al.80 from Merck start us off with a tool for selecting reagents. The user selects the synthon, for example primary amine, and then refines the search using physicochemical properties. An important component is that the initial search is from a set of well-curated and readily available building blocks. One can also specify undesired functional groups which might interfere with the reaction scheme of require extra protection/ deprotection steps. Automatic exclusions are also triggered (i.e. choosing a secondary amine excludes the more reactive primary amine). The property refinement is performed with the true synthon, to remove effects of capping groups e.g. tosylates. The user can select the final reagents after a clustering process based on size and features. Truchon and Bayly81 propose a new algorithm, GLARE, for library subsetting, claiming very fast performance on libraries with up to 1012 members. As one of the slow steps has traditionally been the enumeration and scoring of the full product matrix, they use reagent-based approximate scores, that are 90% correlated with the Chem. Modell., 2008, 5, 51–66 | 63 This journal is

c

The Royal Society of Chemistry 2008

product scores. This is well within the accuracy of most scoring metrics anyway, so the approach seems justified. At each iteration of the subsetting process (dimension reduction), the reagent fitness score is incremented by 1 if the product is scored is as good. This is basically reagent frequency filtering, and the reagents with the lower scores are pruned out, and the cycle continues until the product subset has achieved a ‘goodness’ threshold, as defined by the user. The number of reagents to prune at each stage is the novel twist. In methods like PLUMS82 it is one by one. Here, using a formula based on the approach to the final set, many reagents can be pruned at each iteration without having false negatives. Also reagents coming from a larger pool (e.g. amines) are pruned faster than those coming from a small pool (boronic acids). However, the real speed-up comes from the sampling of the whole library space by smaller partitions. The assumption is that the fitness of the reagents should have the same distribution in the small partitions as for the whole library. The ability to handle very large library at interactive speed makes this very attractive for webbased tools intended for use by chemists. The concept of privileged substructures is frequently invoked in library design, with the assumption that addition of such a motif to a structure will confer some specific activity or selectivity. Schnur et al.83 challenge this notion. Substructural analysis was used to generate potential privileged substructures, and the relative frequency of occurrence of these fragments was examined over a number of target sets. In most cases, no selectivity is observed, leading to the conclusion that the motifs are generally conducive to binding. This may in turn lead to undesired sideeffects or even promiscuity. Promiscuity in HTS screening decks is a perceived issue, in that years of experience looking at HTS results tells us that there are some compounds that are ‘old favourites’, but it is not clear how to quantify the extent, or how to assign guilt to scaffold or fragments, except by anecdote. Pearce et al.84 have developed two methods for mapping promiscuity based on linking 180 substructural queries with HTS data. The absolute number of promiscuous compounds (defined as active in 7 or more assays) is relatively small, but their representation in hit lists is significant. Also the polyhalophenol fragment is over represented in the promiscuous compound set. In contrast, property-based filters showed no association with promiscuity, whereas several substructural queries did. The trend is that promiscuous compounds tend to be smaller and more soluble. Using two measures of promiscuity, compounds can be ranked by consensus for potential promiscuity. In practise, this can reduce the number of compounds considered for triage by 12%.

9. Conclusions The impetus to generate models in the field of ADMET has been sustained, as more high quality data becomes available. There is a price to pay, and now more care is being taken to ensure that these models, which can be widely deployed to unsuspecting users, are robust and give confidence estimates with the predictions. As methods for virtual screening are also becoming more robust, there are increasing numbers of applications for target fishing—rather than use the target to find the ligands, structures are being used to deorphanise targets; this is a field of increasing importance as we explore the effects of compounds in whole cell assays. There have been significant advances in docking, scoring, and the handling of protein flexibility. Many of the big questions posed in 2005 remain unresolved, but the good news is that the amount of new methodological research suggests that we think that one day, we will have the answers.

References 1 2 3 4

Y. C. Martin, QSAR Comb. Sci., 2006, 25, 1192. R. Guha, D. Dutta, P. C. Jurs and T. Chen, J. Chem. Inf. Model., 2006, 46, 1836. M. G. Hudelson and J. P. Jones, J. Med. Chem., 2006, 49, 4367. P. Gedeck, B. Rohde and C. Bartels, J. Chem. Inf. Model., 2006, 46, 1924.

64 | Chem. Modell., 2008, 5, 51–66 This journal is

c

The Royal Society of Chemistry 2008

5 A. G. Leach, H. D. Jones, D. A. Cosgrove, P. W. Kenny, L. Ruston, P. MacFaul, J. M. Wood, N. Colclough and B. Law, J. Med. Chem., 2006, 49, 6672. 6 D. Y. Haubertin and P. Bruneau, J. Chem. Inf. Model., 2007, 47, 1294. 7 R. P. Sheridan, P. Hunt and J. C. Culberson, J. Chem. Inf. Model., 2006, 46, 180. 8 E. A. Kennewell, P. Willett, P. Ducrot and C. Luttmann, J. Comput.-Aided Mol. Des., 2006, 20, 385. 9 N. Brown, B. McKay and J. Gasteiger, J. Comput.-Aided. Mol. Des., 2006, 20, 333. 10 A. C. Good and M. A. Hermsmeier, J. Chem. Inf. Model., 2007, 47, 110. 11 J. Kazius, R. McGuire and R. Bursi, J. Med. Chem., 2005, 48, 312. 12 A. S. Kalgutkar, I. Garner, R. S. Orbach, C. L. Shaffer, E. Callegari, K. R. Henne, A. E. Mutlib, D. K. Dalvie, J. S. Lee, Y. Nakai, J. P. O’Donnell, J. Boer and S. P. Harriman, Curr. Drug Metab., 2005, 6, 161. 13 F. Lombardo, R. S. Obach, F. M. DiCapua, G. A. Bakken, J. Lu, D. M. Potter, F. Gao, M. D. Miller and Y. Zhang, J. Med. Chem., 2006, 49, 2262. 14 M. P. Gleeson, N. J. Waters, S. W. Paine and A. M. Davis, J. Med. Chem., 2006, 49, 1953. 15 C. Skold, S. Winiwarter, J. Wernevik, F. Bergstrom, L. Engstrom, R. Allen, K. Box, J. Comer, J. Mole, A. Hallberg, H. Lennernas, T. Lundstedt, A. L. Ungell and A. Karlen, J. Med. Chem., 2006, 49, 6660. 16 A. M. Aronov, J. Med. Chem., 2006, 49, 6917. 17 R. Farid, T. Day, R. A. Friesner and R. A. Pearlstein, Bioorg. Med. Chem., 2006, 14, 3160. 18 L. Olsen, P. Rydberg, T. H. Rod and U. Ryde, J. Med. Chem., 2006, 49, 6489. 19 R. P. Sheridan, K. R. Korzekwa, R. A. Torres and M. J. Walker, J. Med. Chem., 2007, 50, 3173. 20 G. Cruciani, E. Carosati, B. B. De, K. Ethirajulu, C. Mackie, T. Howe and R. Vianello, J. Med. Chem., 2005, 48, 6970. 21 J. Subramanian, S. Sharma and C. Rao, J. Med. Chem., 2006, 49, 5434. 22 S. Eyrisch and V. Helms, J. Med. Chem., 2007, 50, 3457. 23 M. R. Landon, D.R. Lancia, J. Yu, S. C. Thiel and S. Vajda, J. Med. Chem., 2007, 50, 1231. 24 P. J. Hajduk, J. R. Huth and S. W. Fesik, J. Med. Chem., 2005, 48, 2518. 25 K. L. Damm and H. A. Carlson, J. Am. Chem. Soc., 2007, 129, 8225. 26 K. L. Meagher, M. G. Lerner and H. A. Carlson, J. Med. Chem., 2006, 49, 3478. 27 J. Michel, M. L. Verdonk and J. W. Essex, J. Med. Chem., 2006, 49, 7427. 28 A. Weis, K. Katebzadeh, P. Soderhjelm, I. Nilsson and U. Ryde, J. Med. Chem., 2006, 49, 6596. 29 M. Clark, F. Guarnieri, I. Shkurko and J. Wiseman, J. Chem. Inf. Model., 2006, 46, 231. 30 R. A. Friesner, R. B. Murphy, M. P. Repasky, L. L. Frye, J. R. Greenwood, T. A. Halgren, P. C. Sanschagrin and D. T. Mainz, J. Med. Chem., 2006, 49, 6177. 31 A. Fernandez and S. Maddipati, J. Med. Chem., 2006, 49, 3092. 32 C. Bracken, L. M. Iakoucheva, P. R. Rorner and A. K. Dunker, Curr. Op. Struct. Bio., 2004, 14, 570. 33 A. V. Razgulin and S. Mecozzi, J. Med. Chem., 2006, 49, 7902. 34 W. Sherman, T. Day, M. P. Jacobson, R. A. Friesner and R. Farid, J. Med. Chem., 2006, 49, 534. 35 M. Jacobsson and A. Karlen, J. Chem. Inf. Model., 2006, 46, 1334. 36 G. P. A. Vigers and J. P. Rizzi, J. Med. Chem., 2004, 47, 80. 37 A. E. Cho, V. Guallar, B. J. Berne and R. Friesner, J. Comp. Chem., 2005, 26, 915. 38 V. Guallar, B. F. Gherman, S. J. Lippard and R. A. Friesner, Curr. Op. Chem. Bio., Vol. 6, 01. 39 M. J. Hartshorn, M. L. Verdonk, G. Chessari, S. C. Brewerton, W. T. M. Mooij, P. N. Mortenson and C. W. Murray, J. Med. Chem., 2007, 50, 726. 40 N. Huang, B. K. Shoichet and J. J. Irwin, J. Med. Chem., 2006, 49, 6789. 41 J. F. Truchon and C. I. Bayly, J. Chem. Inf. Model., 2007, 47, 488. 42 P. R. Menard, J. S. Mason, M. I. and B. S., J. Chem. Inf. Comp. Sci., 1998, 38, 1204. 43 Y. P. Lu, Z. Nikolovska-Coleska, X. L. Fang, W. Gao, S. Shangary, S. Qiu, D. G. Qin and S. M. Wang, J. Med. Chem., 2006, 49, 3759. 44 M. Baroni, G. Cruciani, S. Sciabola, F. Perruccio and J. S. Mason, J. Chem. Inf. Model., 2007, 47, 279. 45 S. L. Dixon, A. M. Smondyrev, E. H. Knoll, S. N. Rao, D. E. Shaw and R. A. Friesner, J. Comput.-Aided Mol. Des., 2006, 20, 647. 46 Y. Patel, V. J. Gillet, G. Bravi and A. R. Leach, J. Comput.-Aided Mol. Des., 2002, 16, 653. 47 N. J. Richmond, C. A. Abrams, P. R. N. Wolohan, E. Abrahamian, P. Willett and R. D. Clark, J. Comput.-Aided Mol. Des., 2006, 20, 567. 48 J. Sadowski and J. Bostrom, J. Chem. Inf. Model., 2006, 46, 2305.

Chem. Modell., 2008, 5, 51–66 | 65 This journal is

c

The Royal Society of Chemistry 2008

49 D. K. Agrafiotis, A. C. Gibbs, F. Zhu, S. Izrailev and E. Martin, J. Chem. Inf. Model., 2007, 47, 1067. 50 E. J. Barker, D. Buttar, D. A. Cosgrove, E. J. Gardiner, P. Kitts, P. Willett and V. J. Gillett, J. Chem. Inf. Model., 2006, 46, 503. 51 N. Stiefl, T. A. Watson, K. Baumann and A. Zaliani, J. Chem. Inf. Model., 2006, 46, 208. 52 T. Cheeseright, M. Mackey, S. Rose and A. Vinter, J. Chem. Inf. Model., 2006, 46, 665. 53 P. Maass, T. Schulz-Gasch, M. Stahl and M. Rarey, J. Chem. Inf. Model., 2007, 47, 390. 54 R. Bergmann, A. Linusson and I. Zamora, J. Med. Chem., 2007, 50, 2708. 55 F. Oellien, J. Cramer, C. Beyer, W. D. Ihlenfeldt and P. M. Selzer, J. Chem. Inf. Model., 2006, 46, 2342. 56 T. Fink and J. L. Reymond, J. Chem. Inf. Model., 2007, 47, 342. 57 U. Fechner and G. Schneider, J. Chem. Inf. Model., 2007, 47, 656. 58 X. Q. Lewell, D. B. Judd, S. P. Watson and M. M. Hann, J. Chem. Inf. Comput. Sci., 1998, 38, 511. 59 E. W. Lameijer, J. N. Kok, T. Back and A. P. Ijzerman, J. Chem. Inf. Model., 2006, 46, 545. 60 E. W. Lameijer, R. A. Tromp, R. F. Spanjersberg, J. Brussee and A. P. Ijzerman, J. Med. Chem., 2007, 50, 1925. 61 K. Boda and A. P. Johnson, J. Med. Chem., 2006, 49, 5869. 62 K. Boda, T. Seidel and J. Gasteiger, J. Comput.-Aided Mol. Des., 2007, 21, 311. 63 G. M. Makara, J. Med. Chem., 2007, 50, 3214. 64 H. Mauser and M. Stahl, J. Chem. Inf. Model., 2007, 47, 318. 65 M. Rarey and J. S. Dixon, J. Comput.-Aided Mol. Des., 1998, 12, 471. 66 G. W. Bemis and M. A. Murko, J. Med. Chem., 1999, 42, 5095. 67 P. Ertl, S. Jelfs, J. Muhlbacher, A. Schuffenhauer and P. Selzer, J. Med. Chem., 2006, 49, 4568. 68 P. J. Goodford, J. Med. Chem., 1985, 28, 849. 69 A. Ciulli, G. Williams, A. G. Smith, T. L. Blundell and C. Abell, J. Med. Chem., 2006, 49, 4992. 70 I. D. Kuntz, K. Chen, K. A. Sharp and P. A. Kollman, Proc. Nat. Acad. Sci., 1999, 96, 9997. 71 P. J. Hajduk, J. Med. Chem., 2006, 49, 6972. 72 K. Babaoglu and B. K. Shoichet, Nature Chem. Bio., 2006, 2, 720. 73 P. Muller, G. Lena, E. Boilard, S. Bezzine, G. Lambeau, G. Guichard and D. Rognan, J. Med. Chem., 2006, 49, 6768. 74 T. M. Steindl, D. Schuster, C. Laggner and T. Langer, J. Chem. Inf. Model., 2006, 46, 2146. 75 Nidhi, M. Glick, J. W. Davies and J. L. Jenkins, J. Chem. Inf. Model., 2006, 46, 1124. 76 J. H. Nettles, J. L. Jenkins, A. Bender, Z. Deng, J. W. Davies and M. Glick, J. Med. Chem., 2006, 49, 6802. 77 A. E. Cleves and A. N. Jain, J. Med. Chem., 2006, 49, 2921. 78 Z. Deng, C. Chuaqui and J. Singh, J. Med. Chem., 2006, 49, 490. 79 Y. f*ckunishi, S. Hojo and H. Nakamura, J. Chem. Inf. Model., 2006, 46, 2610. 80 R. T. Mosley, J. C. Culberson, B. Kraker, B. P. Feuston, R. P. Sheridan, J. F. Conway, J. K. Forbes, S. J. Chakravorty and S. K. Kearsley, J. Chem. Inf. Model., 2005, 45, 1439. 81 J. F. Truchon and C. I. Bayly, J. Chem. Inf. Model., 2006, 46, 1536. 82 G. Bravi, D. V. S. Green, M. M. Hann and A. R. Leach, J. Chem. Inf. Comp. Sci.s, 2000, 40, 1441. 83 D. M. Schnur, M. A. Hermsmeier and A. J. Tebben, J. Med. Chem., 2006, 49, 2000. 84 B. C. Pearce, M. J. Sofia, A. C. Good, D. M. Drexler and D. A. Stock, J. Chem. Inf. Model., 2006, 46, 1060.

66 | Chem. Modell., 2008, 5, 51–66 This journal is

c

The Royal Society of Chemistry 2008

Solvation effects Michael Springborg* DOI: 10.1039/b608783n

I.

Introduction

The goal of electronic-structure calculations is to provide information on real materials. The field of electronic-structure calculations is therefore placed somewhere between experiment and theory. I.e., by carrying through such calculations on specific systems one obtains results that cannot be manipulated but instead have to be interpreted. This resembles the situation of experiment. On the other hand, by modifying the systems one is able to follow the changes in the properties also to idealized situations that lie outside the range of experiment, which is a typical approach within theory. However, electronic-structure calculations are only then useful, if they can provide results that are in qualitative and quantitative agreement with experimental results, whenever such ones are available. This means that not only idealized situations shall be accessible with such calculations, but also realistic situations that are as close as possible to those found in experiment. Nevertheless, the fact that electronic-structure calculations have to rely on a number of approximations (see, e.g., ref. 1) often puts serious limitations on the systems and situations that can be treated with such calculations. These limitations exist first of all concerning the size of the systems that can be treated, i.e., the number of symmetrically inequivalent atoms and/or the number of structural degrees of freedom. The reason for their occurrence is twofold: At first, the computational requirements of a typical electronic-structure calculation scale as the size of the system to some power that typically is 3 or larger. And second, if the system has no well-known symmetry, the number of independent local total-energy-minimum structures grows faster than any power in the size of the system (see, e.g., refs. 1 and 2). This means that only systems of not too large size can be treated computationally, although, due to the continuous developments in computer technology and computer programs, this limit is continuously been pushed towards larger systems. Nevertheless, there are finite limits for what is computationally possible. Therefore, finite systems without any interactions with any surroundings are the ideal systems for theoretical studies. On the other hand, experimental studies are often carried through under quite different conditions. This includes that the systems of interest may interact with a solvent or with one or more surfaces (for instance of a catalyst). As we shall see below, the presence of such extra media can modify the properties of the system of interest. Therefore, in order to obtain a proper connection to the experimental studies, it is important that the presence of these media are included in the calculations ‘somehow’. In the present contribution we shall discuss how the effects of a solvent can be included. The challenges are, however, huge. A solvent is a macroscopic system containing an enormous number of atoms and most often having essentially no symmetry. Therefore, a standard electronic-structure calculation on the complete system would require more computational resources than ever will be available. Instead, one has to apply special-purpose methods that have been developed with exactly the goal of studying the properties of a single molecule that is moving under the influence of the solvent molecules. Physical and Theoretical Chemistry, University of Saarland, D–66123, Saarbru¨cken, Germany. E-mail: [emailprotected]

Chem. Modell., 2008, 5, 67–118 | 67 This journal is

c

The Royal Society of Chemistry 2008

There are more different ways one can model this situation, depending on the precise type of interaction between the solvated molecule and the solvent. In many cases, these interactions are (partly or purely) of electrostatic nature, meaning that one may model the effects of the solvent as that of creating an external, electrostatic field in which the molecule of interest exists. But, there is also a feed-back between the solute and the solvent. I.e., the solute may in return influence the charge distribution of the solvent, thus ultimately leading to a modified effect of the influence of the solvent. Furthermore, in some cases (that are not at all atypical) hydrogen bonds between the solvent and the solute are important and should, therefore, also be included. It is the purpose of the present overview to discuss the various types of methods. In the next section we shall describe their theoretical foundations, and in the subsequent section we shall give some few selected examples of their applications. We add that the theoretical treatment of solvent effects is a currently very active research field, which can be seen both in the recent review by Tomasi et al.3 as well as from the fact that even special issues of certain journals have been devoted solely to this topic.4 Finally, we emphasize that we have not at all tried to be exhaustive in the choice of recent studies that shall be briefly mentioned below. Rather, they are meant as representing typical studies. Many others, not at all worse (or better), could have been selected.

II. Fundamental methods In this section we shall discuss the basic principles behind the different methods that have been developed for including solvation effects in electronic-structure calculations. Starting with the isolated molecules of interest, without the solvent, we shall subsequently see how the effects of the solvent can be included. We shall also briefly discuss the advantages and disadvantages of the various approaches. In the next section we shall then present different recent studies where the methods of the present section have been applied in calculating a number of different properties of different systems. A.

Without the solvent

In a typical electronic-structure calculation, the Born-Oppenheimer approximation is invoked. This means that for a system of M nuclei and N electrons one first seeks the electronic energy Ee by solving the electronic Schro¨dinger equation Hˆe Ce = EeCe

(1)

with ^e ¼ 1 H 2

N X i¼1

r2! þ ri

N N X M N X X 1X 1 Zk ! þ Vext ðr i Þ: ! ! ! 2 i6¼j¼1 j r i r j j i¼1 k¼1 j !r R j i¼1 i k

ð2Þ

h 1. Moreover, ~ ri is the Here, we have used atomic units, me = |e| = 4pe0 = ~k that of the kth nucleus. Zk is the charge of the kth position of the ith electron, and R nucleus and, finally, we have assumed that the system is under the influence of an external, static field, Vext(~ r). Once Ee has been calculated, the total energy is obtained from Etot ¼ Ee þ

M M X ! 1 X Zk Zl þ Vext ðRk Þ: ! ! 2 k6¼l¼1 j R R j k¼1 k l

ð3Þ

~k) may actually also depend on the charge or mass of the kth nucleus (e.g., Vext(R when being of electrostatic or gravitational nature), but we shall not specify that dependence explicitly. 68 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

In practical calculations, it is never possible to obtain the exact solution to eqn (1). Instead, approximate methods are used that are based on either the Hartree-Fock approximation or the density-functional formalism in the Kohn-Sham formulation. In the case of the Hartree-Fock approximation one may add correlation effects, which, however, is beyond the scope of the present discussion (for details, see, e.g., ref. 1). Then, in both cases the problem of calculating the best approximation to the ground-state electronic energy is transformed into that of solving a set of singleparticle equations, 1 ! ! ! ! ! r2 þ VC ðr Þ þ Vn ðr Þ þ V^ xc þVext ðr Þ ci ðr Þ ¼ ei ci ðr Þ: ð4Þ 2 Here, VC is the Coulomb potential from the electrons, Z ! rðr 0 Þ !0 ! dr VC ðr Þ ¼ ! ! jr 0 r j

ð5Þ

with the electron density being !

rðr Þ ¼

N X

jci ðr Þj2 : !

ð6Þ

i¼1

Moreover, Vn is the Coulomb potential from the nuclei !

Vn ðr Þ ¼

M X

Zk !

!

ð7Þ

k¼1 jr Rk j

and Vˆxc is either (in the Hartree-Fock approximation) the exchange operator that depends on all the occupied orbitals or (in the density-functional theory) the exchange-correlation potential that depends on the electron density. Independently of which approach is used, the (approximate) total energy of eqn (3) is a function of the nuclear coordinates, as is the (approximate) electronic energy, ~1, R ~2, . . . , R ~M), Etot = Etot(R

(8)

and the equilibrium structure is that for which Etot has its minimum. For isolated, finite molecules in the gas phase, the external potential Vext(~ r) = 0, and the computational requirements for solving eqn (4) scales as N to some power, typically 3, whereas the determination of the minimum of Etot as a function of the nuclear coordinates scales essentially exponentially with M. This means that exhaustive calculations are only for smaller systems possible. B. Supermolecule approaches When the molecule of interest is solvated in some solvent, interactions between the solvent and the solute have to be included. The most accurate approach would be to treat the complete system with the methods of the preceding subsection, which, however, due to the above mentioned scaling behaviours is computationally prohibitively demanding. One has, therefore, to restore to simpler approaches. In most cases, the solute consists of well-identified, neutral molecules that interact only weakly with each other. This means that reducing the solvent to consisting of just some few molecules around the solute may provide a reasonable approximation. In this case, the weak bonds (i.e., hydrogen bonds or van der Waals bonds) to more distant solvent molecules are broken, but it is expected that this will lead to only insignificant electronic redistributions. Ultimately, the finite system consisting of the solute and the smaller number of solvent molecules can be treated with the accurate methods of the preceding subsection. This is the supermolecule approach. Chem. Modell., 2008, 5, 67–118 | 69 This journal is

c

The Royal Society of Chemistry 2008

The advantage of this approach is clearly that arbitrarily accurate methods can be applied. Thereby, also the weak interactions (first of all hydrogen bonds) can be treated accurately. On the other hand, when including more than those solvent molecules that are absolutely closest to the solute, the size of the system that shall be treated easily becomes so large that finite computational resources puts serious limits on the calculations. Thus, the computational requirements represent one disadvantage of this approach. Another is that longer-range interactions at first are not considered. For instance, if the solvent molecules possess a dipole moment, the electrostatic potential from more distant solvent molecules may be felt by the solute. Notice that since the number of dipoles scales essentially as r2 (with r being the distance to the solute) whereas the dipole potential scales as r2, long-range effects may be important although screening effects and averaging over different relative orientations lead to a faster convergence with respect to r. One way of introducing the interactions with more distant solvent molecules is to use a supercell approach (see, e.g., ref. 1). Then, a finite system including the solute and a smaller number of solvent molecules is repeated periodically in all three directions. Due to the periodicity, electronic-structure calculations can be carried through, but the drawback is that also the solute is repeated periodically. This means that if the repeated unit is too small, interactions between the solute molecules become non-negligible and may affect the results. C.

QM/MM approaches

A full quantum-mechanical treatment of the solute plus the solvent molecules may appear to be a too complicated approach. It would yield detailed information about the internal properties of the solvent molecules, that is not sought, and it would do so at the cost of large computational requirements. Only the properties of the solute are of interest as well as to which extent they are influenced by the presence of the solvent. Thus, only the effects of the solvent molecules outside their inner parts are of relevance. Therefore, simplified approaches that describe only those effects may be useful. This is the basic idea behind the QM/MM (quantum-mechanical/molecularmechanical) approaches. The solute is treated at the quantum-mechanical level, but the presence of the solvent is modeled through an external potential in which the solute moves. This external potential takes the role of Vext(~ r) discussed in section IIA and, therefore, the solute molecule can be treated using the same methods as for the isolated molecule in the gas phase, except that Vext(~ r) has to be included. The success of such an approach relies on the ability to devise realistic potentials Vext(~ r). One possibility is to distribute a set of solvent molecules at random and with random orientations in a finite, sufficiently large volume under a set of constraints. I.e., the density of the solvent molecules should be realistic, no two solvent molecules should come two close to each other, and no solvent molecule should come too close to the solute. Most often it is then assumed that the external potential Vext(~ r) is of electrostatic nature and can be modeled as a superposition of those of the solvent molecules. These can, e.g., in turn be approximated as a superposition of potentials from point charges placed at the positions of the nuclei. Improvements will allow the point charges to be placed at other positions, and also higher-order multipoles can be included. These potentials are devised so that selected properties of the solvent are reproduced accurately. Since the results may be biased by the randomly chosen (fixed) positions and orientations of the solvent molecules, one may repeat the calculation for more such arrangements and ultimately average the calculated properties of the solute. In refined models there may be a feedback between solute and solvent molecules. Thus, through the charge distribution of the solute, the solvent molecules feel the presence of the former. By calculating the forces acting on the solvent molecules, these may be allowed to move, ultimately leading to a molecular-dynamics simulation for the solvent. In turn, the shifted positions of those will modify the external 70 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

potential acting on the solute, which, therefore, may respond with a changed charge distribution. Ultimately, the structure of the lowest total-energy for the complete system may be identified. In such simulations it is important that the interatomic interactions go beyond those of point charges (and, eventually, multipoles) in order to avoid that the atoms come too close. At least, short-ranged repulsions shall be included. Moreover, one may also include the internal degrees of freedom of the solvent molecules and, accordingly, not consider them solely as rigid bodies. Since the solvent molecules are included explicitly in the calculations, such approaches are called ‘explicit’. They may be quite accurate of the simple reason that, in many cases, there is a natural separation of the system into entities that interact only weakly (i.e., the solute and solvent molecules). Because the force fields that are used in the molecular-mechanics part are not unique, different models may differ in these force fields. Also differences in the quantum-mechanical approach for treating the solute may lead to different models. The advantage is that quite large systems can be treated. Disadvantages include that relatively stronger interactions between the molecules (first of all hydrogen bonds but also van der Waals interactions) are seldom described accurately. Moreover, there is no electrostatic feedback between solute and solvent molecules at this level of theory. I.e., the charge distribution of the individual solvent molecules remains largely unaffected by the solute. More recent developments aim to at improve this situation, as we shall discuss in the next subsection. D.

Polarizable-molecules approaches

Since the main parts of the solute–solvent interactions are of electrostatic nature, one may improve the models mentioned in the previous subsection by including the response of the solvent molecules to the electric field generated by the solvent and the solute. In the general case, the dipole moment of a (solvent) molecule changes under the influence of an electrostatic field, !

!

!

m ¼ m0 þa E DC þ ;

ð9Þ

~DC is the field vector, a is the where ~ m0 is the dipole moment in the field-free case, E polarizability tensor, and higher-order terms have been ignored. By knowing the polarizability (e.g., from experiment or from calculations), one can calculate the dipole moment of the solvent molecules when these are influenced by the electrostatic field generated by the other solvent molecules and by the solute. This leads to a modified dipole moment that in terms leads to a new electrostatic potential created by the solvent molecule, etc. Ultimately this leads to improved QM/MM models compared to those described in the preceding subsection. Although these approaches go back to the end of the 1970s,5–9 they are still widely used as we shall see in the next section. Finally, as was the case for the approaches of the preceding subsection, different descriptions of the interatomic interactions for the MM part lead to different models, and further differences in the results of such calculations may be due to differences in the approach used for the quantum-mechanical treatment of the solute. E. Continuum approaches The approach of the preceding section was improved over the one of section IIC by including an electrostatic response of the solvent to the presence of the solute (and of the other solvent molecules). It is thereby taken into account that the most important inter-molecular interactions in such a system most often are of electrostatic nature and an accurate description of those is essential. Since the approach considers the individual molecules, it is an explicit approach. Chem. Modell., 2008, 5, 67–118 | 71 This journal is

c

The Royal Society of Chemistry 2008

A similar rationale is behind the polarizable-continuum approaches. Here, the solvent is modeled by a hom*ogeneous continuum, i.e., the individual solvent molecules are not treated, and, therefore, this corresponds to a so called implicit approach. There are two central features of the model that can influence the outcomes of the calculations critically. At first, the solute is supposed to occupy some cavity in the solvent, whereby the precise definition of the size and shape of the cavity is not unique. Second, the continuum is able to respond to the charge distribution of the solute and, thereby, in turn by creating an electrostatic potential in which the solute exists to influence the properties of the solute. Again, the precise description of this response can differ among different approaches. We shall here briefly outline the main ideas behind this class of models. For more details, the interested reader is referred to the recent review by Tomasi et al.3 The size and shape of the cavity is one critical issue. The calculations become easier when simple shapes like spheres and ellipsoids are used, but in very many cases it is unrealistic to assume that the solute has such a simple shape. Instead, it is common practice to construct the cavity by superposing spheres placed at the positions of the atoms. Thus, any point that is inside at least one such sphere is considered a part of the cavity, and all other points are part of the polarizable continuum. Often, the radii of the spheres around each atom are chosen as van der Waals radii as defined, e.g., by Bondi10 or by Pauling,11 but other approaches exist, too (see, e.g., ref. 3). The surface of a cavity constructed as the overlap of atom-centered spheres has kinks where two spheres meet. Connolly12,13 has proposed a method that can lead to a more smooth surface and that very often is used nowadays. The idea is to let a smaller sphere ‘roll’ on the outer surface of the cavity. In most cases, this smaller sphere touches the cavity at exactly one point, but close to the regions where two (or more) spheres meet, the smaller sphere will touch all these spheres. In that case, that part of the smaller sphere that is closest to the cavity is used as a part of the cavity surface replacing the parts where the kink occurs. The presence of the continuum modifies the electrostatic potential. Formally, one may split this into one part from the solute and one part from the continuum, V(~ r) = Vm(~ r) + Vc(~ r),

(10)

but only for the total potential, one may set up the Poisson equation, r[e(~ r)rV(~ r)] = 4pr(~ r),

(11)

r(~ r) = rm(~ r) + rc(~ r),

(12)

where

is the total charge distribution from the molecule (rm) and from the continuum (rc). In the electronic-structure calculation for the solute, we replace in eqn (4) V(~ r) = VC(~ r) + Vn(~ r) + Vext(~ r).

(13)

In order to solve eqn (11) we also need the boundary conditions. First of all, very far from the system, r - N, we require that limr-N rV(~ r) and limr-N r2V(~ r) remain finite. More important is it that V(~ r) is continuous across the cavity/continuum interface, !

!

lim Vðr Þ limþ Vðr Þ ¼ 0;

!

!

!

r!rC

!

ð14Þ

r!rC

rC (~ rC+) is a point just inside (outside) the where ~ rC is a point on the interface, and ~ cavity next to ~ rC. Moreover, the derivative of the potential experiences a discontinuity !

!

@Vðr Þ @Vðr Þ e limþ ¼ 0: ! ! ! ! @n @n r!rC r!r lim

C

72 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

ð15Þ

Here, ~ n is an outward pointing normal to the interface. Moreover, we have assumed that inside the cavity the relative dielectric constant takes the value 1 (as in vacuum) and in the continuum the constant value e. The systems of eqns (11)–(15) is under-determined because we know neither the potential V, nor the charge density rc. However, ultimately we are only interested in the potential inside the cavity and can therefore choose V and rc arbitrarily as long as they satisfy all equations above. One suggestion is then to assume that rc vanishes except that we have a charge density on the cavity/continuum interface that is determined so that it satisfies the Poisson equation with boundary conditions. This is the strategy behind the polarizable continuum model that goes back to the beginning of the 1980s.14 Then e is given the value that is typical for the macroscopic solvent. A modified version, the conductor-like screening model (COSMO), was later proposed by Klamt and Schu¨u¨rmann,15 where the equations above were solved for the continuum being a conductor with e - N. Subsequently, the potential Vc was scaled in order to take the finite value of e into account. Since the shape of the cavity is that of overlapping spheres, an alternative, natural strategy could be to expand the potential V(~ r) in harmonics (i.e., potentials from monopoles, dipoles, quadropoles, . . . ). Also here various strategies have been proposed.16–19 F. Hybrid approaches None of the methods we have mentioned is new, and all of them are still the subject of improvements. Here we shall not discuss those, as we consider that beyond the scope of this presentation. However, as a single example of recent developments we mention the work of Li and Gordon.20 These authors presented a hybrid method that aims at merging the two approaches of sections IID and IIE into one hybrid method. G.

Frozen-density approaches

Alternatively, there have been other approaches earlier that, currently, seem to be less frequently used than those discussed above. Among those is the frozen-densityfunctional approach of Wesolowski and Warshel.21,22 This is an explicit approach where solvent molecules are directly included in the calculations. It is based on density-functional theory and a separation of the total electron density into that of the solute, r1, and that of the solvent, r2. According to the density-functional theory, we may write the total electronic energy as a functional of r(~ r) = r1(~ r) + r2(~ r),

(16)

Ee[r] = Ee[r1 + r2] = Ee[r1] + Ee[r2] + Ee,non-add[r1,r2],

(17)

i.e., as

where the last term is the non-additive part of the total electronic energy. Since we are mainly interested in the properties of the solute, we may choose to use less accurate approaches for describing those part of the total electronic energy that originate from the solvent. Doing so we end up with a method that scales much more favourable with system size than the supermolecule approaches of section IIB. H.

Recent developments

As may have become clear, methods for the quantum-mechanical treatment of molecules in solutions were originally developed more than 20 years ago. But as in any other active field, the methods have continuously been improved, partly in order to increase accuracy and computational speed and partly to make possible the Chem. Modell., 2008, 5, 67–118 | 73 This journal is

c

The Royal Society of Chemistry 2008

calculation of further properties, for instance such that are experimentally accessible or relevant. Many of the developments are directly tied to specific scientific questions and will in some cases be discussed further in the next section where we focus on applications, but here we shall briefly discuss some few recent works where methodological aspects have been put at the foreground. Iozzi et al.23 considered systems where the solute was experiencing surroundings that had some kind of structure. When modifying the polarizable continuum model of section IIE, this means in effect that the continuum was heterogeneous, i.e., e takes different values in different parts of space. As examples the authors mention different biological systems like water-soluble proteins where some residue interact both with the water and with the bulk proteins. Furthermore, molecules in lipid bilayers interact both with the water solvent and with the lipid bilayer. In order to treat such situations the authors suggested an extension of the continuum method where different parts of the surroundings have different relative dielectric constants. This leads to some complications that, however, could be solved. Finally, the authors demonstrated the applicability of the approach by addressing two biological issues, i.e., the protonation energies of the histidine residue, and the calculation of the nitrogen hydrogen-fine-splitting of the tempo radical in biological membranes. Here, we do not consider it relevant to discuss these issues further. In another recent development, Mennucci24 presented an extension of the polarizable continuum method that allows for studying time-dependent phenomena. Such approaches are, e.g., relevant when studying fast processes like electronic excitations or transfers. Due to the changed charge distribution of the solute following electronic excitations or transfers, the interaction with the solvent will change which in turn may change the properties of the solute, as we have discussed in this section. Since these are dynamical processes, a theoretical study of the time evolution is highly relevant. In particular when the time scale of the processes in the solute is comparable with that of the response of the solvent, resonance phenomena beyond those of a static treatment may be important. Mennucci presented an approach for such studies. Subsequently, she applied it to two processes, a charge transfer from a donor to an acceptor in a smaller molecule, as well as to the calculation of the time dependence of a Stokes shift which is used spectroscopically to measure polar solvation dynamics. An interesting approach was proposed by Yamamoto and Kato.25 They observed that calculations for a solute in some solvent bear a close resemblance to a standard electronic-structure calculations within the Born-Oppenheimer approximation. In ~1, R ~2, . . . , R ~M) R ~ [together with the the latter case, the nuclear coordinates, (R ~ define some external parameters on which nuclear charges, (Z1, Z2, . . . , ZM) Z], the solution to the (electronic) Schro¨dinger depends. When including the solvent, one in essence introduces one further dependence of the solution to the electronic Schro¨dinger equation, i.e., on the potential generated by the solvent Vext(~ r). In their approach, Yamamoto and Kato25 let Vext(~ r) be determined through a non-polarizable model for the solvent so that Vext(~ r) can be written as a superposition of Coulomb potentials from point charges representing the solvent. They showed, subsequently, how electron-transfer processes could be treated within their approach, even in the case when quantum effects of light nuclei (most notably, protons) need to be included. Finally, they presented results for a specific electrontransfer process. Tan and Luo26 have presented an improved implicit model for solvation effects. In the discussion above, in section IIE, the continuum is considered being a medium with a certain charge density. This charge density may be vanishing in large parts of space, except in those parts where the presence of the solute is being felt. The solute is in most cases treated quantum-mechanically, which for larger (e.g., biologically relevant) systems may provide a computationally too large challenge. Therefore, simpler treatments of the solute could be useful. Instead of replacing the continuum 74 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

description of the solvent with a force field approach that, then, could include the solute, too, Tan and Luo26 suggested to use a polarizable model for describing the properties of the solute. The interior of the solute is modeled through a dipolemomentum density that then replaces the quantum-mechanically treatment of the solute. A feed-back between the solvent and the solute adjusts the densities of both media. This approach is significantly simpler than that of section IIE and could, therefore, be relevant for more complex systems. Tan and Luo carried through a set of test calculations using this proposal. They found that the model was robust, i.e., model parameters could be transferred between different structures and surroundings of the solute. In some cases, the performance was even better than that of a force field, although the computational requirements of the latter are larger. Not only the development of theoretical approaches and accompanying computer programs has led to an increase in the accuracy that can be achieved and the sizes of the systems that can be treated, i.e., number of atoms in the system as well as the length of the time for which a molecular-dynamics simulation can be carried through. Also developments in the computer powers are important. This was recently demonstrated impressively by Saito and Okazaki.27 They carried through a molecular-dynamics simulation over 45 ns for hemoglobin in water using the ‘Earth Simulator’ computer that 2002–2004 was the worldwide fastest supercomputer. They considered a system consisting of the human adult hemoglobin molecule surrounded by water molecules so that the complete system occupied a sphere with a radius of 66 A˚. This system, consisting of 119 421 atoms, was treated with a force-field approach, and longer-range solvent effects were treated using a polarizable continuum approach. From the simulation the authors were able to study the dynamics of even the tertiary and quaternary structures of the hemoglobin molecule.

III.

Recent studies

In the preceding section we outlined briefly the main ideas behind the different methods/models that have been introduced for the theoretical treatment of solvated molecules. We emphasize that we concentrated on presenting the foundations and not on discussing the many, also recent, improvements that have been introduced in order to increase both the accuracy and the efficiency of the different approaches, which has led to a large variety of methods, each with its advantages and disadvantages. Thus, if the reader gets the impression that all currently applied methods were developed long time ago, this is only partly the case. However, the reader who is interested in the recent technical and methodological developments is referred to the original literature. Here, we shall instead present results of recent studies on various systems and in the discussion use the rough classification of the methods from the preceding section. We shall briefly discuss the results of various, subjectively chosen, recent studies, emphasizing that these were chosen not as an attempt of identifying the scientifically best studies, but more as representing typical studies at the time of writing. A.

Comparing the methods

None of the approaches that has been proposed is exact. Instead, each approach has its advantages and disadvantages, and each approach may be more accurate for certain systems and/or properties than for others. Thus, one of the first questions to address is, how well the various approaches perform. This has been the topic of some recent studies. Jacob et al.28 compared a frozen-density and a polarizable-molecules approach for the calculation of various molecular properties. In both approaches they used a density-functional method for the quantum-mechanical treatment of the solute. Chem. Modell., 2008, 5, 67–118 | 75 This journal is

c

The Royal Society of Chemistry 2008

Table 1 Various properties for a water molecule in the gas phase and in an water solvent. The FD results were obtained with a frozen-density approximation, and the QM with a polarizablemolecules approximation. Listed are the size of the dipole moment, the three independent components of the quadrupole moment, excitation energies, static polarizability, and static hyperpolarizability. All results are from ref. 28 Property

Quantity

Dipole moment (D) Quadrupole moment (a.u.)

m 1.80 Qxx 1.79 Qyy 1.86 Qzz 0.07 7.76 11A1 - 11B1 11A1 - 11A2 9.61 11A1 - 21A1 9.72 a 9.40 g 0.91

Excitation energy (eV)

Static polarizability (a.u.) Static hyperpolarizability (a.u.)

Gas phase

Solution (FD) Solution (QM) 2.66 2.05 2.15 0.11 8.41 10.38 10.40 9.62 0.72

2.71 2.09 2.17 0.08 8.88 10.69 11.01 8.67 0.50

Their test system was a single water molecule embedded inside a solvent consisting of 127 further water molecules. For the latter a fixed structure was used. In Table 1 we summarize their main findings. For the frozen-density calculations they considered two different approaches, one where the solvent-molecule density was kept fixed and one where it was allowed to relax. In the table we have only shown the results for the latter, which according to the authors led to an improved accuracy. The table shows that the dipole and the quadrupole moments are very similar for both approaches, which is to a lesser extent the case for the excitation energies and the static (hyper)polarizabilities. The latter were calculated using timedependent density-functional theory. In order to understand this discrepancy the authors used also a supermolecule approach with just two solvent molecules. By comparing with results from calculations with the frozen-density and the polarizable-molecule approaches on the same system they concluded that the frozen-density approach was the more accurate one in calculating the responses to electromagnetic fields. Although we in this report mainly is concerned with the behaviour of a single solute molecule in some solvent, the study of mixed systems can give some insight into the accuracy of the approaches. Therefore, we shall next discuss the work of Chevrot et al.29 who reported a study of the interface between water and the ionic liquid [BMI][PF6] (1-butyl-3-methyl-imidazolium hexafluorophosphate). They used molecular-dynamics simulation with different force fields to describe the interactions between the atoms of the system. They constructed an interface by joining a box of 2119 water molecules with a box of 181 BMI+PF6 molecules. In addition, they studied demixing of the two systems by considering a system that initially consisted of a mixture of water and the ionic liquid. In Table 2 we summarize some of their findings. The table shows the amount of water in the ionic liquid, the amount of ionic liquid in the water, and the width of the interface separating the two phases at the end of the simulation for the different force fields. It is seen that the popular TIP3P model gives quite different results from those of the other force fields and in the simulation of the mixing process there is not even a well defined interface. The other two force fields give similar results. All simulations support that the ionic liquid is essentially water-free, as should be. In order to understand the differences between the results of the different models, the authors also calculated the interaction energies between water and the ions of the ionic liquid. The results are reproduced in Table 3. The table shows at first that the variation in the interaction energies for the different computational approaches is small. This demonstrates that even small inaccuracies can have significant effects on the output of a simulation. Moreover, the table shows also that the TIP5P model 76 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 2 Molar fraction of water in the ionic liquid and of the ionic liquid in water, as well as width of the interface as found in the molecular-dynamics simulations with different force fields. q(IL) describes the effective charge of the ions in the ionic liquid used in the force field for this system, and FF(H2O) describes the force field for the water. The results for the ‘mixing’ simulations were obtained by considering a system where the ionic liquid and the water initially were separated, whereas ‘demixing’ marks results where the two systems initially formed a mixture. All results are from ref. 29 Simulation

q(IL)

FF(H2O)

xH2O

xIL

Width (A˚)

Mixing

1 1 0.9 0.9 0.9 0.9 0.9 0.9

SPC TIP5P TIP3P SPC TIP5P TIP3P SPC TIP5P

0.65 0.45 0.66 0.23 0.27 0.71 0.29 0.25

0.023 0.004 0.008 0.004 0.001 0.009 0.002 0.001

— 14.3 18.9 10.9 9.3 22.1 10.6 9.4

Demixing

with scaled effective charges of the ionic liquid is the model that at best reproduces the results of the parameter-free calculations. In section II we focused on an accurate description of electrostatic interactions between solute and solvent. Although these interactions account for the largest parts of the free energy of solvent, other interactions may contribute as well. These interactions, due to dispersion and repulsion, were studied by Curutchet et al.30 Amoville and Mennucci31,32 had earlier developed a formalism for including these interactions in the polarizable continuum model and Curutchet et al. tested it on a larger set of solvent-solute systems for which they also calculated these interactions with force-field methods. Since these interactions only rarely are discussed, we list typical results from their work in Table 4. It is seen that fairly large basis sets are needed in order to obtain results that can be considered close to converged. Moreover, the density-functional B3LYP method yields larger absolute values than does the Hartree-Fock method. Finally, the repulsion contribution amounts to only some 10–20% of the dispersion contribution. Using the B3LYP method for reference they compared subsequently the performance of four force-field methods for reproducing the B3LYP results for the dispersion and repulsive contributions to the free energy of solvation for 22 neutral molecules in four solvents. The results, Table 5, show that in most cases the force fields do not perform very well with the MST being an exception. Aidas et al.33 calculated NMR parameters for formaldehyde and acetone solvated in water using different models. Their main findings are summarized in Table 6. In Table 3 The interaction energies (in kcal/mol) between a water molecule and an ion from the ionic liquid, as obtained from calculations with Hartree-Fock (HF), density-functional (DFT), or force-fields. All results are from ref. 29 Method

q

PF6q H2O

BMI+q H2O

HF DFT TIP3P SPC TIP5P TIP3P SPC TIP5P

1 1 1 1 1 0.9 0.9 0.9

9.1 9.5 11.5 11.4 10.5 10.2 10.1 9.3

9.7 9.5 10.2 10.0 10.3 8.8 9.1 9.3

Chem. Modell., 2008, 5, 67–118 | 77 This journal is

c

The Royal Society of Chemistry 2008

Table 4 The dispersion and the repulsion contributions (in kcal/mol) to the free energy of solvation for C2H4O solvated in water or chloroform. The table shows the dependence of the results on the basis set and on the quantum-mechanical method for treating the solute. All results are from ref. 30 Solvent

Method

Basis set

DGdis

DGrep

Water

HF

6-31+G(d) cc-pVDZ aug-cc-pVDZ cc-pVTZ aug-cc-pVTZ 6-31+G(d) cc-pVDZ aug-cc-pVDZ cc-pVTZ aug-cc-pVTZ cc-pVDZ aug-cc-pVDZ cc-pVTZ aug-cc-pVTZ cc-pVDZ aug-cc-pVDZ cc-pVTZ aug-cc-pVTZ

7.4 7.3 9.2 9.1 10.2 8.4 8.5 10.9 10.3 12.0 8.6 10.8 10.6 11.9 10.0 12.8 12.0 14.0

3.1 2.6 3.1 2.9 3.1 3.4 2.7 3.5 3.2 3.4 1.7 2.0 2.0 2.0 1.8 2.4 2.1 2.3

B3LYP

Chloroform

HF

B3LYP

their calculations they compared the polarizable continuum models with QM/MM models and, moreover, they examined the effects of including two water molecules in the quantum-mechanical calculations. The results of Table 6 show that both longand short-range interactions are important. Thus, a quantum-mechanical treatment of the finite C3H6O + 2H2O system is not capable of reproducing the experimental results, showing the importance of long-range interactions. But without the two water molecules, the polarizable continuum model does not yield satisfactory results, which indicates that also short-range interactions are important. In this context it is interesting to notice that including the two water molecules in the part that is treated quantum-mechanically does not seem to be very important for the results with the QM/MM simulations. Sun et al.34 studied the properties of thirteen smaller peptides in an aqueous solution using both explicit models with different force fields for the solvent Table 5 A comparison between quantum-mechanical and force-field treatment of dispersion and repulsive contributions to the free energy of solvation. The force field is given under method, and for each solvent the first line gives the root mean square deviation (rmsd, in kcal/ mol) and the second line gives the coefficient c that gives the best fit of y = cx with y being the force-field results and x being the quantum-mechanical results. All results are from ref. 30 Method

Property

Water

Octanol

Chloroform

Carbon tetrachloride

Claverie

Rmsd c Rmsd c Rmsd c Rmsd c Rmsd c

1.9 0.87 4.0 0.71 5.7 0.59 5.6 0.59 1.2 1.01

5.4 0.68 7.5 0.56 5.4 0.68 5.6 0.67 3.1 1.17

9.8 0.48 8.8 0.54 9.2 0.51 9.1 0.52 3.6 0.82

10.3 0.47 8.9 0.54 9.6 0.50 9.6 0.50 3.8 0.81

MM3 AMBER OPLS MST

78 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 6 Isotropic shielding constant (in ppm) sX for X being oxygen or carbon and the shift dX compared to the gas-phase value. QM denotes the system that is treated quantum mechanically, and the water part describes the approach for treating the solvent. For the latter, PCM1 and PCM2 denotes polarizable continuum models and TIP3P and SPC two different force fields. marks gas-phase calculations. All results are from ref. 33 QM part

Water part

sO

dO

sC

dC

C3H6O C3H6O C3H6O C3H6O + 2H2O C3H6O C3H6O + 2H2O C3H6O + 2H2O C3H6O + 2H2O C3H6O + 2H2O Experiment

— PCM1 SPC SPC TIP3P TIP3P — PCM1 PCM2

339.2 285.6 259.3 258.7 250.6 251.0 297.3 262.7 250.1

— 53.6 79.9 80.5 88.6 88.2 41.9 76.5 89.1 75.5

21.5 33.2 37.0 37.7 38.2 38.9 31.1 36.6 38.2

— 11.7 15.5 16.2 16.7 17.4 9.6 15.1 16.7 18.9

0.7 0.8 0.6 0.6 0.5 0.5 0.5

0.7 0.8 0.6 0.6 0.5 0.5 0.5

0.1 0.1 0.1 0.1 0.1 0.1 0.1

0.1 0.1 0.1 0.1 0.1 0.1 0.1

molecules and implicit models. Their interest focused on the interactions of the peptide with a hydrophobic surface, and to this end they studied the so called potential of mean force (PMF) for the solvated peptides using the different solvation methods. They calculated the PMF by considering the interaction between a selfassembled monolayer of some molecules on some surface. The important point is that the molecules were terminated by CH3 groups, which then are the ones that the peptide will interact with. The interactions will, of course, depend on the distance of the peptide to the CH3 group, but also on the relative orientation of the two as well as on the internal properties of the peptide. Here, the latter depends on the changes induced by the solvent, i.e., on the model for the solvent. Sun et al. found significant differences between the implicit and the explicit solvent models. However, lacking further information they were not able to conclude which of the models gave more accurate results. In a recent study, Kongsted and Mennucci35 compared explicit and implicit solvation models for the calculation of various properties of aqueous solutions of pyrazine, pyrimidine, and pyridazine, i.e., three diazines that can be obtained from benzene upon replacing two CH groups by two N atoms (cf. Fig. 1). For the quantum-mechanical part they used a density-functional approach and, moreover, they also considered cases where 2 or 4 H2O molecules of the solvent were included in the QM treatment. Some of their results are summarized in Table 7. Some spread in the results depending on the model is observed, and in this case we also notice that the results that are obtained with the continuum approaches depend on the size of the spheres that are used in the construction of the cavity. Ultimately, however, the authors showed that when including 2 water molecules in the quantum-mechanical treatment of the system (thereby allowing for hydrogen bonds) and allowing the complete quantum-mechanical part of the system to relax to the structure of the lowest total energy, a good agreement with experimental values could be obtained with the unscaled polarized-continuum model (Table 8).

Fig. 1 Schematic representation of (from left to right) pyrazine, pyrimidine, and pyridazine. Open and closed circles represent CH groups and N atoms, respectively.

Chem. Modell., 2008, 5, 67–118 | 79 This journal is

c

The Royal Society of Chemistry 2008

Table 7 Lowest electronic excitation energy (in eV) and isotropic shielding constant (in ppm) sN. n gives the number of water molecules that are explicitly treated quantum theoretically. PCM denotes the standard polarized continuum method, whereas in PCM* the radii spheres surrounding each atom in the construction of the cavity have been scaled by 91.7%. Finally, MM marks results obtained with an explicit description of the solvent. All results are from ref. 35 Property

n

Model

Excitation energy (eV)

0 0 0 2 2 0 2 4 0 0 0 2 2 0 2 4

Vacuum PCM PCM* Vacuum PCM MM MM MM Vacuum PCM PCM* Vacuum PCM MM MM MM

sN (ppm)

Pyrazine 3.95 4.00 4.00 3.95 4.00 4.05 4.03 113.3 101.8 97.9 110.4 99.7 98.9 100.3

Pyrimidine

0.01 0.01 0.01 0.01

0.8 0.3 0.8 0.8

4.28 4.45 4.49 4.30 0.01 4.48 0.01 4.52 0.01 4.49 0.01 4.48 0.01 72.1 59.9 55.7 67.8 0.5 57.4 0.4 56.4 0.5 58.4 0.5 58.8 0.5

Pyridazine 3.55 3.89 3.95 3.65 3.92 3.85 4.03 210.4 176.1 168.8 198.5 171.0 178.5 179.7

0.01 0.01 0.01 0.01

1.0 0.6 1.0 1.0

As the last example we mention the study of Takemura and Kitao.36 They studied different models for molecular-mechanics simulations on water. To this end, they studied the dynamics of a ubiquitin molecule solvated in water. They performed molecular-dynamics simulations for a system with periodic boundary conditions. At first they considered pure water without the solute and studied boxes with 360, 720, 1080, and 2160 water molecules. It turned out that even for these fairly large systems, finite-size effects could be recognized. Thus, the translational diffusion constant was found to depend linearly on V1/3, where V is the volume of the repeated unit. Subsequently, they studied a periodically repeated volume containing the ubiquitin molecule as well as 4908, 9133, 13 200, or 29 182 water molecules. Again, the linear dependence of the translational diffusion constant of the protein on V1/3 was observed, which is a confirmation of well-known finite-size effects.37–39 Correcting the results for the finite-size effects the authors obtained the results of Table 9. A clear scatter in the calculated values is observed, making it non-trivial to determine the correct model for molecular-dynamics simulations for aqueous solutions. B. Aqueous solutions of atoms The transport of ions in aqueous solutions is of fundamental importance in chemistry and biology. However, despite its apparent simplicity, the transport of Table 8 Change in lowest electronic excitation energy (in eV) and in isotropic shielding constant (in ppm) sN due to solvation for the molecules of Fig. 1. PCM denotes results with the polarized-continuum model and Exp marks experimental results. Two water molecules have been included in the quantum-mechanical treatment. All results are from ref. 35 Property

Model

Pyrazine

Pyrimidine

Pyridazine

Excitation energy (eV)

PCM Exp. PCM Exp.

0.16 0.19 20.8 16.9

0.31 0.33 20.2 16.8

0.55 0.48 59.2 41.6

sN (ppm)

80 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 9 The self-diffusion constant for water Dtw and the ubiquitin molecule Dtp in aqueous solutions as calculated with different force fields for the water. In addition, t1 and t2 are rotational relaxation times for the ubiquitin molecule. Where possible, the results are compared with experimental values. All results are from ref. 36 Model

Dtw (105 cm2/s)

Dtp (105 cm2/s)

t1 (ns)

t2 (ns)

TIP3P SPC SPC/E SPC/Fw Exp.

6.35 4.96 3.06 3.28

36.9 30.6 17.1 14.2 14.3, 14.9

4.09 4.88 9.13 8.18

1.33 1.78 2.97 2.69 4.09, 4.1

ions in water is a non-trivial process, where not only the ions but also a certain cloud of water molecules surrounding the ions is being transported. The study of these water shells around the ions is, therefore, still being pursued. Here, we shall discuss two such recent studies.40,41 Using a molecular-mechanics force-field model in combination with moleculardynamics simulations, Glezakou et al.40 studied the structure of K+ ions solvated in water. The simulation considered the potassium ion as well as 550 water molecules and was carried through at different temperatures. Among the results is the (timeaveraged) pair distribution function. Due to the finite temperature, such a function does not contain sharp peaks but more broad features from which one may try to extract the number of nearest neighbours. Using two different approaches to this end, Glezakou et al.40 obtained two different values, cf. Table 10. However, it is clear that an increase in the temperature results in a slight decrease in the number of water molecules in the first shell and a small increase in the distance between the ion and the water molecules. Glezakou et al.40 also performed experiment at 300 K, resulting in the values given in the table, too. With a supermolecule approach, Bock et al.41 studied the first and second shell of water molecules surrounding various metal ions. They studied M+q(H2O)18 clusters using parameter-free density-functional calculations and trying to optimize the structures in largely unbiased calculations. Due to the size of the system (55 atoms, or 19 building blocks when assuming that the H2O units stay intact), this is a far from trivial endeavour (see, e.g., ref. 2). Their results are summarized in Table 11. For all systems, a structure with one shell containing 6 water molecules and another shell with 12 water molecules is found, but for some cases (Be, Li, and Na) another structure with three shells of water with, respectively, 4, 8, and 6 molecules is also stable. However, the fact that the metal-oxygen distances are not constant for each shell shows that the structure is one of a low symmetry. As the last example we mention the study of Blumberger and Sprik42 who were interested in the electron-transfer redox reaction Ru2+ " Ru3+ in an aqueous solution. This reaction can be studied in the framework of the Marcus theory of electron transfer43–46 which provides a simple and effective treatment of the polar solvent. Blumberger and Sprik demonstrated how the ingredients of the Marcus theory could be extracted from electronic-structure calculations on Ru2+ and Ru3+ ions in a water solvent. To this end, they carried through density-functional calculations on a periodically repeated box containing 32 H2O molecules and one metal ion using the Car-Parrinello approach (see, e.g., ref. 1). In order to keep the system neutral, a compensating constant background density was added. Their results indicated that the Ru–O distance for the nearest water shell around the metal ion decreases from 2.18 A˚ for Ru2+ to 2.10 A˚ for Ru3+. These values compare reasonable well with the experimental values of 2.11 and 2.03 A˚, respectively. Ultimately, the authors estimated that the free-energy barrier for the Ru2+ " Ru3+ reaction is 0.38 eV, whereas the experimental value is 0.55 eV. The authors Chem. Modell., 2008, 5, 67–118 | 81 This journal is

c

The Royal Society of Chemistry 2008

Table 10 The number of water molecules in the first shell around the K+ ions, n1, estimated in two different ways, as well as the average distance between the potassium ion and the oxygen atom of the water molecules, all quantities as a function of temperature. The experimental values were obtained at 300 K. All results are from ref. 40 T (K)

n1

d1 (A˚)

300 350 400 Exp.

6.6/5.7 6.6/5.6 6.5/5.3 5.7

2.770 2.777 2.782 2.732

Table 11 The number of water molecules around a metal M+q ion. n1, n2, and n3 give the number of molecules in the first, second, and third shell, respectively, and d1, d2, and d3 the distances between the metal ion and the oxygen atoms of the water molecules. All results are from ref. 41 M

q

n1

d1 (A˚)

Al Be Mg Na Ti Li Zn Be Li Na

3 2 2 1 4 1 2 2 1 1

6 6 6 6 6 6 6 4 4 4

1.922 1.858 2.099 2.415 1.993 2.191 2.120 1.638 1.956 2.279

0.001 0.010 0.001 0.16 0.28 0.10 0.006 0.018 0.05 0.06

n2

d2 (A˚)

12 12 12 12 12 12 12 8 8 8

3.988 4.103 4.130 4.184 4.092 4.075 4.123 3.672 3.955 4.086

0.010 0.4 0.017 0.19 0.4 0.13 0.07 0.15 0.18 0.28

n3

d3 (A˚)

6 6 6

4.826 0.18 4.808 0.16 4.904 0.17

suggested that the small simulation box is the reason for the discrepancy. That also long-range effects can be important was seen already in the previous subsection. An interesting finding of the authors on this system and, earlier, on Ag+ in water47 is that the volume of the water does not change upon adding the metal ion. The contraction of the liquid around the cation can be ascribed to electrostriction effects. C.

More complex systems

In this subsection we shall concentrate on the solvent-solute structures of more complex systems. We shall not discuss other properties of those systems, but solely consider those recent studies where the structural arrangement of solvent and solute has been the central issue. Other properties will be discussed later in this overview. Methanol-water mixtures were studied by Adamovic and Gordon48 who used the so called effective-fragment-potential method, an approach of the type discussed in section IID, for finite (MeOH/H2O)n clusters with n = 2,3, . . . ,8. Some of their findings were tested by using accurate, parameter-free, electronic-structure methods. The main outcome is the organization of the H2O and MeOH molecules relative to each other, which then is expected to be representative for macroscopic mixtures of water and methanol. Their results for the energetically three lowest structures are summarized in Table 12. Most useful is it to describe the structures in terms of the different types of hydrogen bonds, i.e., the donor and acceptor molecules for the hydrogen bonds. According to Table 12 the most stable structures are found for a large number of H2O–MeOH hydrogen bonds. A related study was presented by Kumar et al.49 who studied alkali-metal hydroxides (MOH, M being Na or K) hydrated by up to six water molecules, i.e., MOH(H2O)n, with n = 1,2, . . . ,6. Their calculations were performed using 82 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 12 Characteristics for the three energetically lowest isomers of (MeOH/H2O)n clusters. m labels the isomer, and DE gives the total energy relative to the most stable isomer. The donor and acceptor entries, X and Y, show the number of hydrogen bonds between the donor X and the acceptor Y. All results are from ref. 48 n Donor Acceptor

m

2

1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3

3

4

5

6

7

8

DE (kcal/mol) H2O H2O

MeOH MeOH

MeOH H2O

H2O MeOH

0 1 1 1 2 3 1 1 4 1 5 5 4 5 4 2 2 1 5 4 6

0 1 1 1 2 2 0 0 4 1 4 3 2 3 2 1 0 0 1 2 2

2 1 1 2 1 1 4 4 0 4 1 2 4 3 4 6 7 7 7 6 6

2 1 1 3 2 1 6 5 2 8 2 4 7 6 7 11 11 11 10 11 8

0.0 0.0 0.5 0.0 0.6 1.5 0.0 1.5 3.5 0.0 1.2 2.7 0.0 0.2 1.8 0.0 1.8 3.0 0.0 1.6 3.3

different parameter-free, electronic-structure methods. With successive addition of water molecules, both the number of hydrogen bonds and the coordination number of the metal atom increase. By monitoring the M–OH distance (see Table 13) the authors could also identify the change from undissociated, via partly dissociated, to dissociated metal hydroxide upon the addition of water molecules. Tongraar et al.50 studied NO3 in an aqueous solution using QM/MM molecular dynamics simulations for a system consisting of one NO3 ion and 199 water molecules. They found that the oxygen atoms of the NO3 ion participate in weak hydrogen bonds to the water molecules that easily are broken and formed. Parthasarathi et al.51 examined the surroundings of a H3O+ ion in phenol or benzene by studying a complex consisting of one H3O+ ion and three phenol or benzene molecules. They performed parameter-free, electronic-structure calculations on these systems using three different computational methods, i.e., Hartree-Fock calculations, Hartree-Fock calculations with Møller-Plesset perturbational inclusion of correlation effects, and density-functional calculations within the B3LYP framework. They identified three clusters, all shown in Fig. 2. The binding energies (i.e., the energy needed for dissociating the complex in the H3O+ ion and the three phenol or benzene molecules) are shown in Table 14, too, and it is seen that in particular the relative ordering is the same for all computational approaches. Finally, by analysing the topology of the electron density the authors were able to identify the bonding between the H3O+ ion and the surrounding molecules, i.e., for the most stable structure a hydrogen bond was found, whereas for the others they found a p complexation (Table 14). Kokubo and Pettitt52 studied mixtures of urea and water at different concentrations through molecular-dynamics simulations for seven different systems ranging from 1 urea molecule and 1305 water molecules to 530 urea molecules and 0 water Chem. Modell., 2008, 5, 67–118 | 83 This journal is

c

The Royal Society of Chemistry 2008

Table 13 Binding energy, Eb, for MOH(H2O)n clusters obtained either with Møller-Plesset calculations (MP2) or with density-functional calculations (B3LYP). NHB and NC give the number of hydrogen bonds in the cluster and the coordination number of MOH, R is the distance between the metal atom and the oxygen atom of the hydroxy group, respectively. All results are from ref. 49 M

n

Eb,MP2 (kcal/mol)

Eb,B3LYP (kcal/mol)

NHB

NC

R (A˚)

Na

1 2 3 4 5 6 1 2 3 4 5 6

21.25 39.27 55.08 66.50 76.96 88.75 21.75 40.03 55.46 66.52 78.45 87.68

22.52 41.61 56.83 69.02 79.54 90.12 21.62 39.64 54.18 65.44 75.57 85.22

1 2 3 4 5 9 1 2 3 4 6 7

2 3 4 4 4 3 2 3 4 5 5 4

2.097 2.247 2.674 2.835 2.404 4.031 2.376 2.523 2.734 2.838 2.898 3.017

K

molecules. The interatomic interactions were described using force fields, whereby they examined the performance of two different such models for urea. A main focus of their work was to study how hydrogen bonds are formed and broken, and they

Fig. 2 The four optimized structures of the H3O+ ion surrounded by three phenol or benzene molecules. Reproduced with permission of The American Chemical Society from ref. 51.

84 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 14 Binding energy, Eb, for the complexes consisting of a H3O+ ion surrounded by three phenol or benzene molecules. The results were obtained with Hartree-Fock calculations (HF), Møller-Plesset calculations (MP2), or with density-functional calculations (B3LYP). For each of the four structures (see Fig. 2), also the type of the interaction between the ion and the molecules is shown, respectively. All results are from ref. 51 Isomer

Eb,HF (kcal/mol)

Eb,MP2 (kcal/mol)

Eb,B3LYP (kcal/mol)

Type of bond

1 2 3 4

72.89 51.99 50.16 46.10

83.21 63.06 61.35 55.65

81.18 65.65 63.09 54.73

H-bond OH p OH p OH p

found that both the average number of hydrogen bonds and their average lifetimes do not change very much at different concentrations (see Table 15), but it was also clear that there was some dependence of the results on the model that was used. As a final example in this subsection we mention the recent study of Zillich and Whaley53 who examined LiH solvated in 4He clusters with up to 100 atoms. The authors used a path integral Monte Carlo simulation approach, whose details shall not be discussed further here. The LiH–He interaction potential was found to be highly anisotropic with attractions for He approaching the molecule in a direction parallel to the molecular axis, but with strong repulsions for He approaching the molecule in a direction perpendicular to the molecular bond. Despite these repulsions, the authors found that LiH prefers to occupy central regions of the LiH4Hen clusters for n larger than 10–15. D.

Solvation energies

A fundamental aspect of the theoretical study of solvation is to be able to calculate the solvation energy, i.e., the energy that is required (or released) when a molecule is brought from vacuum into the solution. Therefore, many studies have been devoted to this, but here we shall just briefly discuss three of those. Petrosyan et al.54 have presented a new method for describing solvated systems. It is based on the density-functional formalism and related to the approaches of section IIG. It combines a quantum-mechanical density-functional treatment of the solute Table 15 Various properties related to hydrogen bonds in urea-water mixtures. xu gives the mole fraction of urea, and nx gives the number of hydrogen bonds around molecules of the type x. txy gives the life time (in fs) for hydrogen bonds between molecules x and y, with x being the donor and y the acceptor. The two entries for each concentration shows the results for the two different models for urea. All results are from ref. 52 xu

nurea

nwater

0.00077

3.13 2.27 3.15 2.19 3.11 2.15 3.07 2.10 3.04 2.05 3.00 1.96 2.68 1.37

2.79 2.78 2.77 2.77 2.74 2.75 2.72 2.74 2.69 2.71 2.66 2.66

0.038 0.081 0.129 0.184 0.273 1.0

tuu

100 43 92 32 94 38 90 37 95 39 97 40

tuw

twu

tww

99 37 94 28 97 29 96 30 100 29 99 30

140 210 150 250 150 260 150 270 140 270 150 290

180 180 180 180 180 190 180 200 190 200 190 220

Chem. Modell., 2008, 5, 67–118 | 85 This journal is

c

The Royal Society of Chemistry 2008

Fig. 3 Calculated energies of solvation vs. experimental values for (from left to right) water, ethanol, methanol, and methane in water. The diagonal line gives perfect agreement, the open circles results with the joint density-functional theory for electronic structure of solvated systems, and the other symbols results from different continuum approaches. Reproduced with permission from ref. 54.

and a classical density-functional treatment of the solvent, i.e., it is based on minimizing a functional of the electron density of the solute and the averaged density of the solvent. The resulting so called joint density-functional theory for electronic structure of solvated systems was used to calculate the solvation energies for some molecules in water. The results are reproduced in Fig. 3, and it is seen that the new method gives accurate results. The calculation of solvation energies for ions is much more complicated. As discussed, e.g., by Kelly et al.,55 the fact that it is not possible to measure the electrostatic potential between two media56,57 means that the free energy of formation or chemical potential of an individual ion has no operational meaning.58 Instead, it is common practice to set, arbitrarily, the free energy of a proton equal to 0 and, then, to study well-defined free energies of solvation for neutral combinations of anions and cations, whereby those of the individual cations and anions can be estimated. Kelly et al.55 determined theoretically absolute values for the solvation energies of ions as well as of ions surrounded by a finite cluster of water molecules. Their approach was based on the approximation that the difference between the absolute solvation free energy of a positive and a negative cluster ion vanishes when the cluster becomes infinite, lim fDGsol ½ðH2 OÞn M DGsol ½ðH2 OÞn Mþ g ¼ 0:

n!1

ð18Þ

Through this approximation they were able to arrive at an absolute value for the free energy of solvation of the proton by analysing a large number of experimental results. Subsequently, they could determine the free energy of solvation for any other ion, without or with a cluster of water molecules. Some representative results are listed in Table 16, where it is seen that the aqueous solvation energy for clustered ions show a clear convergence trend when the number of water molecules in the cluster exceeds 2. Ultimately, the authors also compared calculated values with experimental values and found that the calculated values in most cases were within 3–5 kcal/mol of the experimental values. Reddy et al.59 used a special method to calculate the differences in the solvation free energies for different molecules. The method was based on continuously transferring one (solute) molecule into the other using the so called thread method.60 Thus, when comparing the A molecule with the B molecule, they introduce a parameter l so that l = 0 corresponds to the A molecule and l = 1 to the B 86 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 16 Selected absolute aqueous solvation free energies (in kcal/mol) for M and M+ ions clustered by n water molecules. All results are from ref. 55 M +

H Li+ Ag+ H3O+ F I OH HO2

n=0

n=1

n=2

n=3

n=4

n=5

n=6

265.9 128.4 118.7 110.3 104.4 59.9 104.7 97.3

105.6 98.3 90.2 87.9 59.0 89.3 84.7

91.1 83.9 81.7 78.8 59.4 82.3

82.3 79.8 76.7 75.2 60.8 78.3

79.2 78.1 75.4 73.9 63.0 77.1

79.1 77.8 75.4 74.0 65.8 77.3

81.0 78.5 76.9 74.9 77.4

molecule. This means that when passing from l = 0 to l = 1, some atoms will disappear and others will show up, bonds will be broken and created, and atoms may move. For the simulations, they used a QM/MM approach. Intermediate values of l were treated in the MM part by scaling the force-field parameters according to l. On the other hand, the quantum-mechanical part is more tricky. Reddy et al. calculated the total energies and forces for the two molecules A and B separately, and combined afterwards these results using a linear interpolation with l being the interpolation parameter. They tested the approach on various systems for which the A - B transition led to a number of different changes in bonding and structural properties. Moreover, they used different approaches for the QM part, but as Table 17 shows, the method is reliable and leads to accurate results. In order to obtain more information on the solvation process Yang and Cui61 performed a so called natural energy decomposition analysis (NEDA)62–64 on monomethyl phosphate ester (MMP) solvated in water. They used a supermolecular approach where the solute plus a number of water molecules (up to 34) were treated quantum-mechanically. A further set of water molecules was treated with a forcefield model. Their results indicate that there is a substantial charge transfer between the solute and the nearest solvent molecules. The interaction energy due to this transfer was found to amount to some 70–80% of that of the electric interactions. Since MMP forms hydrogen bonds with the water molecules, all results together suggests that for such a system it is important to include the nearest solvent molecules in the quantum-mechanical treatment, whereas a continuum approximation or a force field may not be sufficiently accurate.

Table 17 The difference in free solvation energies for the A and B molecules in aqueous solution, DGsol(B) DGsol(A), in kJ/mol. HF, AM1, FF, and exp. marks results from parameter-free Hartree-Fock calculations, semiempirical AM1 calculations, force-field calculations, and experiment, respectively. All results are from ref. 59 A

B

HF

AM1

CH3OH AcCH3 C6H6 CH3CCl3 C6H6 Serine Phenylalanine Cytosine Adenine

(CH3)2 AcNH2 C6H5OH (CH3)2 C5H5N Cysteine Isoleucine Thymine Guanine

29.5 25.9 22.6 9.0 13.7 17.7 11.2 24.8 37.3

1.8 2.1 1.7 2.1 1.6 1.7 3.1 2.1 2.5

29.7 27.9 22.2 9.1 13.2 18.8 10.7 25.6 38.1

FF

1.8 2.3 1.7 2.1 1.5 1.8 3.2 2.3 2.6

31.5 28.8 21.6 9.3 12.6 18.5 10.5 24.1 36.3

Exp.

1.9 2.2 1.7 2.2 1.5 1.8 3.1 2.2 2.6

29.0 24.8 23.5 8.4 14.7 16.0 12.1

Chem. Modell., 2008, 5, 67–118 | 87 This journal is

c

The Royal Society of Chemistry 2008

E. Solvation-induced structural changes As should be evident from the discussion in section II, a solvent leads to an extra external potential in which the solute moves. This extra potential is not spatially constant and may, therefore, influence the different parts of the solute differently. Ultimately this means that a solute may change its properties due to the presence of the solvent. In this and the next subsections we shall discuss some recent studies where this issue has been addressed. At first, we shall in this subsection discuss structural changes due to the solvent. Balakina and Nefediev65 studied four different conjugated organic molecules, related to that of Fig. 4. These so called push-pull systems (i.e., conjugated molecules with an electron donor at one end and an electron acceptor at the other) had been studied earlier experimentally by Flipse et al.66 for their non-linear optical responses. Since the experiments were carried through in solutions, the best theoretical approach was to perform calculations for solvated molecules. The three other molecules that were considered by Balakina and Nefediev were obtained by replacing the NH2 group by a NH(CH3)2 group, as well as subsequently replacing the NO2 group by a conjugated (CH)Q(CH)–NO2 group or by a conjugated (CH)Q(C(CN)2) group. Balakina and Nefediev compared the geometries in the gas phase with those found in chloroform or acetone using the polarizable continuum method. It turned out that the structures of the molecules were very robust against solvation. The bond lengths changed less than 0.01 A˚ and the bond angles less than 21. Cucinotta et al.67 studied the equilibrium between the keto and enol tautomers of acetone, cf. Fig. 5, both in vacuum and in an aqueous solution. They used parameter-free, electronic-structure calculations for a periodically repeated supercell (see section IIB) with the Car-Parrinello approach (see, e.g., ref. 1). In static calculations for solely the acetone molecule they found a total-energy difference between the enol and keto forms of 11.8 kcal/mol. The barrier between the two forms was found to be around 58 kcal/mol (Fig. 5). Adding a single water molecule per supercell, the barrier was reduced by somewhat more than 10 kcal/mol. The simulations showed clearly that the water molecule was active in the proton-transfer process that is needed to change the keto form to the enol form. Thus, the solvent does not only provide an external potential in which the molecule is moving, but takes active part in the transition. Having not only one water molecule per supercell but 28, the barrier was reduced by another 8 kcal/mol. In this case, the hydrogen transfer involves several water molecules and can be described as a Grotthuss like mechanism. A related study was undertaken by Zeng and Ren68 who examined the tautomerization of 2-aminothiazole in the gas phase and in solution. They considered more different scenarios, including the isolated molecules, the isolated molecules interacting with 1, 2, or 3 water molecules, and two interacting molecules. In all cases, they considered a gas phase tautomerization as well as reactions occurring in water or

Fig. 4 Schematic representation of one of the donor–acceptor substituted conjugated chromophores studied by Balakina and Nefediev.65 Open circles, crosses, closed circles, and closed triangles mark carbon, hydrogen, nitrogen, and oxygen atoms, respectively.

88 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Fig. 5 Schematic representation of the keto and enol tautomers of acetone. Open circles, closed circles, and closed triangles mark hydrogen, carbon, and oxygen atoms, respectively.

CCl4. In the latter case, the solvents were treated with a polarizable continuum method (Fig. 6). Similar to the study above, the presence of other molecules (e.g., water or 2aminothiazole) leads to a significant lowering of the reaction barrier, as shown in Table 18. On the other hand, the presence and precise nature of a solvent are both less important, at least as long as it is treated within the continuum approach. The marginal differences when including more water molecules (see Table 18) suggest that in this case a Grotthuss mechanism is not important. As Balakina and Nefediev, also Kar et al.69 studied the properties of donor– acceptor substituted conjugated molecules, this time sesquifulvalene of Fig. 7. They used a polarizable continuum approach in taking into account the effects of a solvent and varied the dielectric constant of the solvent from 1 (i.e., vacuum) to 78.4 (i.e., water). In one case (when the triangle in Fig. 7 represents NH2 and the square represents NO2) they observed a reduction in the bond-length alternation due to solvent effects (i.e., the ‘double’ bonds became longer and the ‘single’ bonds became shorter), which was not the case when the NO2 and NH2 groups were interchanged. Also Perpe`te and Jacquemin70 studied the properties of push-pull systems, in this case NH2(C2H2)nNO2 oligomers for n = 2, . . . ,16. An example of the structure is shown in Fig. 8. The effects of a solvent (water) were included using a polarizable continuum model. Moreover, the authors carried through parameter-free HartreeFock calculations without and with the inclusion of correlation effects. As above, a fundamental quantity describing the structure is the bond-length alternation parameter, Dr, which the authors defined as the difference in the bond lengths of the central units. The evolution of this parameter with the size of the system is shown in Table 19. It is remarkable that the solvent has the most pronounced effects for the shortest chains, whereas for larger chains Dr is almost unaffected by the presence of the water. Meng et al.71 studied another class of conjugated molecules, i.e., the oligothiophenes H(C4H2S)nH for which we in Fig. 9 show the case of n = 2. They also considered cases where some of the hydrogen atoms were substituted by OCH3 groups. Using molecular dynamics simulations for the oligothiophenes in, e.g., water they found that the chains did not stay linear even for quite small values of n. A similar behaviour was found in other solvents like n-hexane, 1,4-dioxane, and chloroform. Furthermore, from parameter-free density-functional calculations with a polarizable continuum approach the authors found that the structures of these

Fig. 6 Schematic representation of the two tautomers of 2-aminothiazole. Open circles, closed circles, closed triangles, and crosses mark carbon, sulphur, nitrogen, and hydrogen, atoms, respectively. The left part shows the amino form and the right part the imino form.

Chem. Modell., 2008, 5, 67–118 | 89 This journal is

c

The Royal Society of Chemistry 2008

Table 18 The relative Gibbs free energies (in kcal/mol) of (A) the amino and (B) the imino forms and (TS) the transition state in the tautomerization of isolated, mono-, di-, and trihydrated as well as dimeric species of 2-aminothiazole (cf. Fig. 6). All results are from ref. 68 System

Gas

Water

CCl4

A TS B A + H2O TS + H2O B + H2O A + 2H2O TS + 2H2O B + 2H2O A + 3H2O TS + 3H2O B + 3H2O 2A 2TS 2B

0 50.2 7.5 0 18.3 6.2 0 15.0 5.5 0 17.9 5.2 0 12.6 10.2

0 54.5 7.3 0 19.1 5.7 0 15.0 4.4 0 19.2 5.3 0 13.3 8.3

0 51.3 7.3

0 12.1 9.9

Fig. 7 Schematic representation of sesquifulvalene. Open circles and closed circles mark hydrogen and carbon atoms, respectively, and for the two donor–acceptor substituted structures considered by Kar et al., one of the symbols marked by the closed triangle and square represents a NH2 group and the other a NO2 group.

Fig. 8 Schematic representation of one of the push-pull oligomers that were studied by Perpe`te and Jacquemin. In this case, n = 2. Open circles, closed circles, closed triangles, and closed squares mark hydrogen, carbon, nitrogen, and oxygen atoms, respectively.

systems for n = 2 and 3 are very insensitive to the presence of a solvent, similar to what we have reported above for other conjugated systems. Li et al.72 studied the monochalcogenocarboxylic acids CH3COXH with X being S, Se, or Te (Fig. 10) both in vacuum and in a THF (tetrahydrofuran) solution. They used the polarizable continuum model and calculated the electronic properties of the solute by using Hartree-Fock calculations with a second order Møller-Plesset treatment of correlation effects. They compared the enol and the keto forms and calculated also the transition state between the two. Finally, they considered both 90 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 19 The bond-length alternation parameter (in A˚) for the push-pull oligomers of Fig. 8. The phase shows whether the calculations were performed in vacuum or in the water phase, and HF and MP2 mark Hartree-Fock calculations without (HF) or with (MP2) the inclusion of correlation effects via Møller-Plesset perturbation theory. All results are from ref. 70 n

Phase

HF

MP2

2 4 6 8 10 12 14 16 2 4 6 8 10

Vacuum Vacuum Vacuum Vacuum Vacuum Vacuum Vacuum Vacuum Water Water Water Water Water

0.1089 0.1116 0.1166 0.1193 0.1206 0.1213 0.1216 0.1218 0.0556 0.0873 0.1106 0.1181 0.1205

0.0854 0.0720 0.0682 0.0658 0.0645 0.0638 0.0634 0.0632 0.0542 0.0578 0.0668 0.0673 0.0661

Fig. 9 Schematic representation of one of the oligomers of thiophene that were studied by Meng et al. In this case, n = 2. Open circles, closed circles, and closed squares mark hydrogen, carbon, and sulphur atoms, respectively.

the isolated molecules and ones that were allowed to interact with a dimethyl ether, CH3OCH3, molecule. This molecule was considered as a simpler form of the THF solvent, having, however, similar properties as THF. Table 20 summarizes their findings. It is seen that both in the gas phase and in the THF phase, the keto form is found to be the most stable form. However, when extra molecules are explicitly considered (i.e., the dimethyl ether), solvent effects can change the relative stability of the two forms. In a combined experimental and theoretical study, Borovkov et al.73 examined the properties of n-alkane radical cations in solution. Here, we shall just focus on one single aspect of their work, i.e., the conformers of one of the alkanes, n-nonane, C9H20. In its simplest form of the highest symmetry, its structure can be described as a zigzag chain formed by the 9 carbon atoms. To each of the carbon atoms, 2 hydrogen atoms

Fig. 10 Schematic representation of (left part) the enol and (right part) the keto forms of monochalcogenocarboxylic acids CH3COXH with X being S, Se, or Te. Open circles, closed circles, closed triangles, and closed squares mark hydrogen, carbon, oxygen, and the chalcogen atom, respectively.

Chem. Modell., 2008, 5, 67–118 | 91 This journal is

c

The Royal Society of Chemistry 2008

Table 20 The relative total energies in kJ/mol for systems containing n CH3OCH3 molecules plus either the keto or the enol form of the monochalcogenocarboxylic acids of Fig. 10. TS shows the results for the transition state between the two forms. The phase marks whether gas phase or a THF (tetrahydrofuran) solution is considered. All results are from ref. 72 X

n

Phase

Keto

TS

Enol

S S Se Se Te Te S S Se Se Te Te

0 0 0 0 0 0 1 1 1 1 1 1

Gas THF Gas THF Gas THF Gas THF Gas THF Gas THF

0.0 0.0 0.0 0.0 0.0 0.0 18.4 9.1 14.9 7.7 11.3 1.4

117.0 118.8 117.6 118.8 120.4 121.7 82.2 75.0 77.2 63.3 79.0 64.6

15.4 14.8 18.4 14.5 23.3 20.2 16.7 13.8 15.5 15.7 11.2 11.0

are bonded, and one further hydrogen atom is bonded to each of the terminating carbon atoms. More complex structures can be obtained by rotating 1201 about one or more C–C bonds. Using the B3LYP density-functional method for different such structures and the polarizable continuum method for the treatment of a n-heptane solvent, Borovkov et al.73 studied the relative stability of different structures both in the gas phase and in the solution. The results are reproduced in Table 21 and here it is seen that the presence of the solvent leads to an enhanced stabilization of the highsymmetry conformation. In addition, in gas phase the total energy change due to a rotation about one bond depends only very weakly on the bond about which is rotated, but much stronger when the molecule exists in the solution. Finally, the solvent may also be indirectly responsible for structural changes. By solvating a molecule and subsequently applying pressure on the complete system, the solvent molecules transfer the externally applied forces to the solute that, therefore, may change its structure. This was studied by Floriano et al.74 who performed molecular-dynamics simulations on some large biological molecules in aqueous solution for different external pressures. Due to the external pressures, the volume of the system changed and, subsequently, the structure of the solute changed, too. They used different measures in quantifying the structural changes of the solute molecules, but we shall here not discuss this work further.

F. Chemical reactions involving the solvent Already in the previous subsection we saw examples where the solvent molecules take active part in chemical reaction. This was, e.g., the case for proton transfer Table 21 The relative total energies in kcal/mol for different conformations of n-nonane either in the gas phase or in a solution of n-heptane. The energies are given relative to that of the undistorted system (conformation I), and for the other conformations, the bonds about which a 1201 otation has been performed are given through the numbers of the two carbon atoms forming the bond (numbering the atoms from one end). All results are from ref. 73 Conformation

Bonds

DE (vac)

DE (sol)

I II III IV

— 2–3 4–5 3–4, 6–7

0.00 0.83 0.85 1.8

0.00 1.02 4.26 6.56

92 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Fig. 11 Schematic representation of the pyridine containing molecule studied by De Angelis et al.75 Open circles, closed circles, closed triangles, and closed squares mark hydrogen, carbon, nitrogen, and the halogen atom, respectively.

reactions where one tautomer was changed into another. Here, the proton transfer could take place via water molecules of an aqueous solvent, ultimately resulting in a significant reduction of the energy barrier of the transition state between the two tautomers. In this subsection we shall discuss some further recent theoretical work where an active role of the solvent has been studied. Recently, De Angelis et al.75 studied the b elimination of the compound of Fig. 11. Upon reaction with an OH group, a water molecule and the negatively charged halogen ion will be eliminated. In an earlier work,76 the same group had considered this reaction by modeling the solvent through a polarizable continuum model, thereby obtaining the reaction path shown in Fig. 12. In the more recent work, De Angelis et al.75 performed parameter-free density-functional calculations on a periodically repeated cell containing one solute molecule and 56 water molecules using the Car-Parrinello approach. After an initial molecular-dynamics calculation,

Fig. 12 The variation of the Gibbs free energy (in kcal/mol) for the b-elimination reaction for the molecule of Fig. 11, as well as selected structures along the reaction. Compared to the system of Fig. 11, the NH group has been replaced by a NCH3 group. Reproduced with permission of The American Chemical Society from ref. 76.

Chem. Modell., 2008, 5, 67–118 | 93 This journal is

c

The Royal Society of Chemistry 2008

Table 22 The energy of the transition state compared to the initial energy for the hydrolysis of cisplatin in various solvents. All results are from ref. 77 Solvent

HF

CHF3

CH2Cl2

NH3

H2O

CH4

Vacuum

Barrier (eV)

0.54

0.97

1.11

1.11

1.15

1.36

1.59

they selected a single water molecule close to the reagent and replaced that by an OH group, and the simulation was continued. Ultimately, the calculations led to essentially the same results concerning the reaction path, supporting the applicability of both methods. Song and Hu77 studied the hydrolysis of cisplatin, i.e., the reaction of PtCl2(NH3)2 with H2O, in solution. They carried through a supercell calculation in which the two reacting molecules plus a number of solvent molecules were repeated periodically. In the case that the solvent is water, the reacting H2O molecule is part of the solvent, but they considered actually also other solvents. They focused on the reaction energy barrier, i.e., the energy of the transition state relative to that of the initial state, with the ultimate goal of understanding how a solvent can modify this. At first, they examined how many water molecules was needed in the supercell in order to obtain converged results with respect to this parameter. They found that for around 7 molecules this was the case, and this number they used for all their subsequent calculations. Table 22 summarizes their findings. It is clear that a properly chosen solvent can modify the activation energy significantly. A couple of years ago, Kelly et al.78 developed an implicit solvation model for aqueous solutions, related to those we discussed above in section IIE. In a more recent work,79 they tested the model for the calculation of aqueous acid dissociation constants, i.e., the free energy change associated with the reaction AH(aq) - A(aq) + H+(aq)

(19)

with AH and A being an acid and its conjugate base. In aqueous solutions the proton H+ may interact with the water molecules forming, e.g., hydronium H3O+. Therefore, an implicit model where the solvent molecules are not included directly may not be sufficiently accurate. Kelly et al.79 suggested to include one or more water molecules in the quantum-mechanical part of the calculation. They found that in many cases just the addition of a single water molecule can improve the accuracy of the calculations significantly, which could be seen for instance through their calculated pKa values. It may not surprise that the importance of the inclusion of explicit water molecules increases when strong solute–solvent interactions exist. In some cases, with the CO32 ion being special, the inclusion of more explicit water molecules could be important. The case of CO32 was studied in some details by Kelly et al.79 who considered the three arrangements of Fig. 13. In this case, the accuracy depended not only on the number of explicit water molecules but also on the precise form of the solvent model.

Fig. 13 Schematic representation of the CO32 surrounded by 1, 2, or 3 water molecules. Closed circles, open circles, and closed triangles represent oxygen, hydrogen, and carbon atoms, respectively.

94 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Also Gutowski and Dixon80 studied the reaction of eqn (19) and compared it with the same reaction in the gas phase. Using a polarizable continuum model to describe the effects of solvating the acids and the bases, and setting the free energy of solvation for the proton at 298 K equal to 264.3 kcal/mol, Gutowski and Dixon could calculate pKa values as follows. They first calculate the free energy of reaction for the reaction of eqn (19) in the gas phase, DG298. To this they add the changes in the free energies due to solvation, DGsolv. The sum gives the reaction free energy in the solution, DGsolution = DG298 + DGsolv

(20)

and ultimately pKa is determined from pKa ¼

DGsolution 2:303RT

ð21Þ

with R being the gas constant and T = 298 K being the temperature. The results are listed in Table 23 where they are compared with experimental values whenever possible. It is seen that the two contributions to DGsolution in eqn (20) almost cancel, whereby the main contribution to DGsolv comes from the free energy of solvation of the proton. As we have discussed above, this number is related with some uncertainty. Taking all these issues into accounts, the agreement in the table is nice. In addition, the authors considered also the proton exchange reactions HA + CH3CO2 - A + CH3CO2H and HA + NO3 - A + HNO3. Also from those, values of pKa could be estimated. These are listed in the table, too, as pK 0 a and pK00 a for the first and second reaction, respectively. The fact that these reactions lead to other values of pKa is an indication of the uncertainty of the approach. On the other hand, one may speculate that the inclusion of one or more water molecules from the solvent in the quantum-mechanical treatment would lead to even more Table 23 The free energy of the reaction of eqn (19) in the gas phase, the free energy of solvation, and the free energy of the reaction in an aqueous solution, together with the thereby obtained pKa value in comparison with experimental values (denoted Exp) for the acids given by HA. pKa 0 and pKa00 were calculated by considering other proton exchange reactions. All energies are in kcal/mol and the results are from ref. 80 HA

DG298

DGsolv

DGsolution

pKa

CF3COCH3 (CN)2CH2 (CH3CO)3CH (CF3)2NH (CF3)3COH CH3CO2H CF3SO2NH2 (FSO2)2CH2 (CF3CO)2NH (CF3CO)2CH2 CF3CO2H (CF3SO2)2CH2 CF3COSH HNO3 (CF3CO)3CH CH3SO3H H2SO4 (CF3SO2)2NH FSO3H CH3SO3H (CF3SO2)3CH

343.3 327.9 322.9 323.2 324.0 340.3 320.8 306.0 308.5 309.8 316.9 297.4 312.7 317.5 295.1 312.2 303.8 286.0 294.7 292.4 274.0

319.7 307.8 307.7 310.7 313.3 329.9 311.6 299.3 303.4 305.1 316.9 299.2 313.8 321.8 301.1 320.7 315.8 302.6 312.5 311.7 299.6

23.6 20.1 15.2 12.5 10.7 10.4 9.2 6.7 5.1 4.7 0.0 1.8 1.1 4.3 6.0 8.5 12.0 16.6 17.8 19.3 25.6

17.3 14.7 11.2 9.2 7.8 7.6 6.8 4.9 3.8 3.4 0.0 1.3 0.8 3.2 4.4 6.2 8.8 12.2 13.0 14.2 18.8

pKa (Exp.) 11.2 5.9 5.1 4.76 6.3

0.6, 0.3, 0.5

pKa 0

pKa00

14.4 11.8 7.4 6.4 5.2 4.0 2.3 1.4 0.0 2.8

1.2 1.8 1.0

1.4 2.6, 1.9 3, 10 6.4, r12 5.9, 14

3.8 4.6 7.0 10.6 11.4 12.5 17.4

Chem. Modell., 2008, 5, 67–118 | 95 This journal is

c

The Royal Society of Chemistry 2008

Fig. 14 Schematic representation of the different conformers of the fluoroscein anion. Open and closed circles represent carbon and oxygen atoms, respectively. Hydrogen atoms are not shown. For the three isomers the positions 1, 2, and 3 represent –OH, QO, –O (conformation 1), –O, QO, –OH (conformation 2), and –O, –OH, QO (conformation 3), respectively.

accurate results. However, this possibility was not considered by Gutowski and Dixon. As the last example we discuss the study of Friedman et al.,81 who considered proton transfer within a single molecule, fluoroscein, exposed to water. The single anionic fluoroscein molecule can exist in three different conformations, differing in the oxygen atom to which a proton is attached, cf. Fig. 14. According to Friedman et al., the total-energy differences between the different conformations are in the gas phase less than 8 kcal/mol, when using parameter-free electronic-structure calculations. Friedmann et al. carried through QM/MM molecular-dynamics simulations in order to identify possible proton-transfer paths. Subsequently, supermolecule calculations with the fluoroscein molecule plus some extra water molecules were carried through. Here, the water molecules were placed so that a proton transfer could take place through these molecules. According to the authors, two or three water molecules would be involved in the proton-transfer process. The energy barrier they found was some 10–22 kcal/mol, depending on the details of the calculations, which is in good agreement with the experimental value of 11 kcal/mol. Once such a chain of water molecules is involved in a proton-transfer process, it may be speculated whether the process is concerted, semiconcerted, or stepwise. In the first case, the protons along the hydrogen bonds in the water bride move simultaneously, whereas in the last case, they move one by one (i.e., as according to the Grotthuss mechanism). In the case of a semiconcerted process, some but not all protons move simultaneously. The calculations, however, indicated that the process was concerted. G.

Chemical reactions in solution

In this section we shall discuss reactions where the solvent does not take active part but merely provides a medium for the reactions. Of course, in reality the distinction between this type of reactions and the ones we have discussed in the preceding subsection is not sharp. Nevertheless, here we hope that it provides a help for the reader to organize the very many studies that have been devoted to chemical reactions in solution. Improta et al.82 studied large molecules with a well-separated electron donor and electron acceptor pair. In this case, the electron-transfer process may lead to a dissociation of the molecule. Improta et al. applied the Marcus theory43–45 to obtain an expression for the electron transfer rate constant. Subsequently, the values of the parameters entering this expression were calculated using a density-functional approach for the solute together with the polarizable continuum model for the solvent. They applied their approach for a specific system (for the present purpose, the details are less important) and found the results of Table 24. Here, three different 96 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 24 The calculated rate constants for the electron-transfer reaction for three different systems in comparison with experimental values. The two different calculated values correspond to two different basis sets. All energies are from ref. 82 System

log kDET (calc.)

log kDET (exp.)

1 2 3

4.20/5.61 2.86/3.56 0.61/0.39

4.70 0.18

modifications of the system were considered and the results compared to experiment. The agreement is good, but it is also seen that the results depend somewhat on details of the calculations like size of the basis set. A related issue was addressed by Scholes et al.83 who studied how energy is transferred over longer distances in a larger donor–acceptor molecule after this has been photoexcited. The presence of a solvent can influence the rate of this energy transfer, as originally described by Fo¨rster.84 According to the proposition of Fo¨rster, the presence of the solvent will reduce the electronic energy transfer rate by roughly a factor of 4, independent of the relative arrangement of the donor and acceptor. As pointed out by Scholes et al., it may be more correct to expect a distance dependent screening effect. Therefore, they applied a recently developed approach,85,86 that is based on the polarizable continuum model, to study the screening effects due to a solvent. They studied several different donor–acceptor pairs and found an overall exponential behaviour of the screening factor s = A exp(bR) + s0

(22)

(R being the donor–acceptor distance) for which a fit gave A = 2.68, b = 0.27 A˚1, and s0 = 0.54, although the results possessed some scatter around this fitted curve. Kong et al.87 studied the photodissociation products of carbon tetrahalides, CX4, with X being Cl, Br, or I, in different solvents. They used the B3LYP densityfunctional method for the quantum-mechanical treatment of the solute and a polarizable continuum approach for the inclusion of solvation effects. Ultimately, the final outcome of the photodissociation and subsequent reactions is C2X4 and X2. In order to identify possible reaction paths the authors calculated, therefore, the total energies of various species. Some of their results are reproduced in Table 25 for X = Br. It is remarkable that the total energies show only a weak dependence on the presence of the solvent, with the energy of the transition state being the main exception. Ramos et al.88 studied the Hoffmann elimination of (N-Cl),N-methylethanolamine in the gas phase and in aqueous solution, i.e., the reaction CH2OH–CH2–NCl–CH3 Table 25 The calculated relative total energies (in kcal/mol) for various dissociation and recombination products of CBr4. TS marks the energy of the transition state between CBr4 and CBr3 + Br. All energies are from ref. 87 Species

Vacuum

Methanol

Cyclohexane

CBr4 CBr3 + Br CBr2 + Br2 CBr2 + 2Br Br2CBr–Br (C2Br6 + Br2)/2 (C2Br5 + Br + Br2)/2 (C2Br4 + 2Br + Br2)/2 (C2Br4 + 2Br2)/2 TS

0 44.6 50.8 96.4 32.2 1.6 19.2 19.9 2.9 44.6

0 45.6 49.4 97.5 25.9 1.8 19.4 20.1 4.0 36.7

0 44.5 50.5 95.9 30.1 1.6 19.1 19.7 3.0 41.6

Chem. Modell., 2008, 5, 67–118 | 97 This journal is

c

The Royal Society of Chemistry 2008

Table 26 The calculated relative total free energies for the Hoffmann elimination reaction. The energies are given in kcal/mol relative to that of the isolated reactants and for the molecules interacting via hydrogen bonds. R, TS, and P denote reactants, transition state, and products, respectively, and n gives the number of water molecules used in the quantum-mechanical treatment. Results for the reaction in the gas phase (gas) and in the solution (sol) are shown. All energies are from ref. 88 n

DG (R, gas)

DG (R, sol)

DG (TS, gas)

DG (TS, sol)

DG (P, gas)

DG (P, sol)

0 1 2

13.89 7.37 9.91

24.20 23.44 22.84

10.14 2.98 2.52

29.02 34.39 23.62

97.20 76.62 79.57

39.95 42.54 43.16

+ OH - CH2OH–CH2–NQCH2 + H2O + Cl. They calculated the total free energies using density-functional methods and treated the solvent within the polarizable continuum method. Moreover, they included n = 0, 1, or 2 water molecules in the density-functional calculations in order to allow for hydrogen bonding. In Table 26 we list the calculated total free energies for the reactants, the transition state, and the products for the structures where hydrogen bonds between the different molecules exist, i.e., the results are not for the well separated molecules. The table shows clearly that the presence of water is very important for the reaction, both for the short-range and for the long-range interactions. A similar conclusion was drawn by Tian and Fang.89 These authors studied the photodissociation of formic acid, i.e., the reaction HCOOH - HCO + OH, H + COOH, or HCOO + H. They compared gas-phase results with those found in aqueous solution when including water molecules both explicitly (i.e., in that part that is treated quantum-mechanically) and via the polarizable continuum model. Hydrogen bonding would modify the properties of the molecules, including explaining a blue shift of an excitation energy. Moreover, long-range effects due to the solvent influence the structures of the molecules.

H.

Electronic properties in solution

After having discussed energetic and structural properties for solvated molecules we shall now turn our attention towards other properties of such systems. These include responses to external perturbations like electromagnetic fields and the topic of the present subsection, the distribution of the electrons. Within several of the models we have discussed in section II, an important ingredient is the ability of the solvent, whether described as a continuum or as a discrete set of atoms or molecules, to provide an external electrostatic potential in which the solute exists. Thus, first of all the charge distribution of the solute will be affected by the presence of the solvent. This may in turn modify other properties, like the structure, of the solute, but changes in the charge distribution of the solute may be considered the immediate response to the presence of the solvent. One way of quantifying this is obtained through the dipole moment. This was considered, e.g., by Balakina and Nefediev65 in the study we already discussed in section IIIE. They studied four different conjugated molecules related to that of Fig. 4. As we have discussed above, the structural changes upon solvation were minor. On the other hand, the dipole moment changes significantly, as shown in Table 27. In this study the solvent was treated as a polarizable continuum. The presence of a solvent may also lead to a reduction in the dipole moment. For the Hoffmann elimination, discussed above, that was studied by Ramos et al.,88 the authors considered the dipole moment for the initial complex consisting of the (NCl),N-methylethanolamine, the OH ion, and 0, 1, or 2 water molecules as well as a polarizable continuum describing the rest of the aqueous solvent. They found that it changed from 6.44, via 5.82, to 6.88 D when including 0, 1, or 2 water molecules. For 98 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 27 Various properties for the molecule of Fig. 4 as well as four modifications of it. m, a, and b denote the dipole moment, the average polarizability, and a certain averaged first hyperpolarizability. The three other headings show the phase. All results are from ref. 65 Molecule

Property

Vacuum

Chloroform

Acetone

PNA

m(D) a (1024 esu) bz (1030 esu) m(D) a (1024 esu) bz (1030 esu) m(D) a (1024 esu) bz (1030 esu) m(D) a (1024 esu) bz (1030 esu)

7.4 13.89 6.67 7.94 17.68 9.75 9.39 23.71 27.87 11.94 33.87 58.69

9.42 17.40 20.3 10.3 22.40 33.14 11.95 30.07 82.16 15.32 42.87 145.26

10.08 18.78 28.15 11.14 24.19 46.17 12.80 32.63 113.61 16.38 46.19 184.57

DMA

DNAP

DACP

the transition state, it changed from 6.31, via 4.26, to 4.08 D in the same three cases. In this case, no clear trend can be identified. These authors studied also the Mulliken atomic charges. Equivalent to the findings above, the inclusion of explicit water molecules in the calculations leads in some cases to an increase, in other cases to a decrease in the atomic charges. In their study of the donor-accepted substituted sesquifulvalene (see Fig. 7), Kar et al.69 found a systematic increase in the dipole moment as a function of the relative dielectric constant of the solvent, which was treated within the polarizable continuum approach. These authors also calculated the hardness, Z = 12(eLUMO ehom*o),

(23)

(eLUMO and ehom*o is the energy of the lowest unoccupied orbital, LUMO, and of the highest occupied orbital, hom*o) which to some extent describes the ability of the system to participate in chemical reactions. Kar et al. observed an only weak dependence of Z on the dielectric constant of the solvent. In the study by Li et al.72 on the tautomerism of monochalcogenocarboxylic acids CH3C(QO)XH, with X being S, Se, or Te, that we discussed above (see, e.g., Table 20 and Fig. 10), the authors calculated also the atomic charges for the different structures in the gas phase. The results are reproduced in Table 28, where it is seen that the presence of a single solvent molecule, i.e., the CH3OCH3 molecule, leads to some modifications. The fact that this molecule takes active part in the proton transfer process explains why the largest effects due to its presence are found for the hydrogen atom at the transition-state structure. Of the atoms listed in the table, the C atom is the one most far away from the CH3OCH3 molecule and, therefore, the one that is the least affected by its presence. Table 28 shows also the total dipole moment for the different systems and structures. Thus, when a CH3OCH3 molecule is included, the dipole moment contains also contributions from this molecule. This explains that in this case the dipole moment shows much larger variations than when the CH3OCH3 molecule is not included. In particular at the transition state some charge transfer between the two molecules is thereby indirectly seen. The electronic chemical hardness, defined in eqn (23), gives a measure of the chemical reactivity of a given compound. Also the chemical potential, m = 12(eLUMO + ehom*o),

(24)

90

is a useful property in this concept. Meneses et al. have therefore studied this for a set of smaller molecules and, in particular, how it changes when the molecule is solvated. They used a supermolecule approach, where the solute was surrounded by Chem. Modell., 2008, 5, 67–118 | 99 This journal is

c

The Royal Society of Chemistry 2008

Table 28 Results from a natural-population analysis for systems containing n CH3OCH3 molecules plus either the keto or the enol form of the monochalcogenocarboxylic acids of Fig. 10. TS shows the results for the transition state between the two forms. q shows the effective atomic charge and C, O, X, and H represent the carbon atom of the CQO or C–O group, the oxygen atom of that group, the chalcogen atom, and the hydrogen atom that is been transferred, respectively. m is the total dipole moment for the different structures. All results are from ref. 72 X

n

Form

q(C)

q(O)

q(X)

q(H)

m(D)

S

Keto TS Enol Keto TS Enol Keto TS Enol Keto TS Enol Keto TS Enol Keto TS Enol

0.543 0.478 0.355 0.539 0.486 0.391 0.502 0.423 0.288 0.495 0.446 0.333 0.448 0.371 0.225 0.441 0.405 0.281

0.648 0.758 0.735 0.680 0.824 0.773 0.645 0.755 0.739 0.673 0.810 0.776 0.643 0.761 0.748 0.669 0.805 0.783

0.025 0.159 0.149 0.054 0.493 0.225 0.106 0.057 0.077 0.074 0.472 0.165 0.296 0.081 0.013 0.276 0.423 0.092

0.121 0.391 0.494 0.168 0.596 0.538 0.033 0.351 0.499 0.082 0.603 0.542 0.093 0.293 0.499 0.051 0.600 0.544

2.23 2.23 2.03 0.80 6.31 1.78 2.30 2.35 2.32 1.16 6.71 2.19 2.59 2.32 2.36 1.76 7.29 2.68

1

Se

1

Te

1

eight water molecules, and in order to be able to use the definitions of eqns (23) and (24) they identified those orbitals closest to the hom*o and LUMO orbitals of the complete supermolecule that were localized to the solute and for those used the definitions of the chemical potential and chemical hardness. In their calculations they used the B3LYP density-functional method. Table 29 summarizes their findings. It is clear that in all cases the chemical potential gets closer to zero and the hardness becomes smaller when the molecules are solvated. Using the definitions of eqns (23) and (24) this means that both the hom*o and the LUMO moves upward in energy. As a simple approximation (see, e.g., ref. 1), the orbital energy can be interpreted as the energy that is required to move an electron infinitely far away from the molecule (hom*o) or adding an electron from infinitely far away to the molecule (LUMO), assuming that relaxation effects can be ignored. Thus, the energies for these electron-transfer processes are reduced, i.e., the solvent screens the electrostatic interactions. The last two examples of Table 29 are exceptions to these rules, which, however, may be related to the very delocalized nature of the hom*o and/or LUMO orbitals, whereby these orbitals have positive energies both in the gas phase and in the solution. In their study on solvent effects on the properties of oligothiophenes, Fig. 9, Meng et al.71 calculated also the dipole moment for chains consisting of 2–6 thiophene units, either in the gas phase or in either n-hexane or water. The calculations were carried through using the B3LYP density-functional method for the solute and the polarizable continuum method for the solvent. As Table 30 shows, the solvent leads to an increase of the dipole moment, but in all cases a clear even–odd oscillatory pattern is identified. The latter can be related to the zigzag-like structure of the systems, cf. Fig. 9. In their study of the properties of CX4 (with X being a halogen) and their photodissociation products, Kong et al.87 also reported the atomic populations obtained from a natural population analysis for the Br2CBr–Br system in various 100 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 29 The chemical potential (m) and the chemical hardness (Z) for various small molecules in the gas phase (gas) as well as in an aqueous solution (sol). All quantities are given in eV and are from ref. 90 Molecule

m (gas)

m (sol)

Z (gas)

Z (sol)

CH3NH3+ NH4+ H3O+ CH3NH2 NH3 H2O NH2 OH

11.13 14.11 13.40 1.96 2.36 3.11 7.38 7.81

5.94 7.31 5.41 0.08 1.13 2.18 4.92 4.54

11.98 16.75 12.33 8.52 9.01 9.63 5.87 6.87

4.54 6.62 2.40 1.44 1.61 3.70 0.17 1.49

Table 30 The dipole moment, m (in D), for oligothiophenes (cf. Fig. 9) in various solvents as well as in the gas phase. xT represents the oligothiophene containing x units. All results are from ref. 71 Solvent

m(2T)

m(3T)

m(4T)

m(5T)

m(6T)

Gas n-Hexane Water

0.27 0.30 0.32

0.73 0.83 1.05

0.46 0.49 0.54

0.83 0.94 1.09

0.61 0.61 0.60

solvents. The results are reproduced in Table 31. The calculations were performed with a polarizable continuum method for the treatment of the solvent. Thus, the table shows that even this approach leads to significant redistributions of the electronic charges when the system is inserted into the solvent. What may not surprise is that the solvent with the largest relative dielectric constant (water) leads to the largest changes in the atomic populations.

I.

Excitation energies in solution

The determination of responses to external perturbations is the field where experiment and theory meet. Therefore, very much effort has been invested in determining such responses, also for solvated molecules. In this and the following two subsections we shall discuss some of the results that have been obtained when calculating the responses to electromagnetic fields. In this subsection we focus on the (optical) excitation energies. In a work towards the understanding of the optical properties of peptides, Sˇebek et al.91 studied those of an amide-group containing molecule (NMA), N-methylacetamide, CH3–NH–CO–CH3. Using time-dependent density-functional theory Table 31 The atomic net populations on various atoms of Br2CBr–Br in different solvents. The atom is marked in bold face. er gives the relative dielectric constant for the solvent. All results are from ref. 87 Solvent Vacuum Cyclohexane Dodecane 2-Propanol Acetonitrile Propylene carbonate Water

er 1.000 2.023 20.18 36.14 66.14 78.39

Br2CBr–Br

Br2CBr–Br

Br2CBr–Br

Br2CBr–Br

0.244 0.274 0.273 0.355 0.365 0.369 0.370

0.587 0.567 0.568 0.504 0.496 0.493 0.492

0.522 0.515 0.516 0.455 0.446 0.442 0.442

0.422 0.495 0.495 0.662 0.679 0.688 0.689

Chem. Modell., 2008, 5, 67–118 | 101 This journal is

c

The Royal Society of Chemistry 2008

Table 32 Transition wavelengths and dipole strengths for N-methylacetamide in the gas phase and in an aqueous solution. Both the COSMO model and a supermolecule approach (denoted clusters) have been applied for treating the solvent. In the latter case, averages over 90 configurations have been calculated, resulting in the error bars for those calculations. All results are from ref. 91 State

l (nm) Vacuum

l (nm) COSMO

l (nm) Clusters

D (debye2) Vacuum

D (debye2) COSMO

D (debye2) Clusters

1A00 2A00 3A 0 4A 0 5A00 6A 0 7A00 8A 0 9A00 10A 0

217 216 214 191 190 183 182 176 175 175

208 202 193 175 183 169 175 180 164 169

208 12 244 23 229 19 183 8 187 9 184 8 189 8 182 9 188 17 182 13

0.03 0.00 0.55 0.57 0.38 0.56 0.28 3.35 0.00 2.68

0.03 0.07 0.90 1.31 0.30 0.33 0.43 9.10 0.01 1.82

0.09 0.05 0.11 1.37 0.86 0.78 0.87 2.29 0.27 1.96

0.16 0.04 0.09 1.61 1.07 0.91 0.90 0.87 0.24 1.84

(TD-DFT) they calculated the transition energies and dipole moments for NMA both in vacuum and in an aqueous solution. Moreover, in the treatment of the solvent they compared two different approaches, i.e., a polarizable-continuum method (COSMO) and a supermolecule approach. For the latter, the authors performed molecular-dynamics calculations using a force-field model and, subsequently, extracted a cluster containing the solute and 3–4 water molecules that form hydrogen bonds to the solute. Averages over 90 such configurations were ultimately determined. Some of their results are reproduced in Table 32. At best, the supermolecule and continuum model give similar trends, but the large scatter in the results for the latter indicates a strong dependence of those on the precise geometry of the supermolecule that is being studied. By calculating the complete absorption spectra, Sˇebek et al.91 confirmed the sensitivity of the results on all kinds of details of the theoretical approach. Thus, these results suggest that the present models for the theoretical treatment of solvation effects are not yet able to provide precise information on such spectra. On the other hand, it has to be mentioned that the calculation of excitation energies (in particular with time-dependent density-functional theory that not yet has reached a mature level) is connected with some uncertainty that has nothing to do with the presence of a solvent. In their detailed study of the photodissociation of CX4 molecules (with X being a halogen), Kong et al.87 reported also various spectroscopic properties. The calculations were performed using a polarizable-continuum model for the solvent and B3LYP density-functional calculations for the solute. In Tables 33–35 we show their results for vibrational properties (Table 33), absorption energies (Table 34), and singlet-triplet excitation energies (Table 35). The tables shall give an impression of what currently is possible, but the tables show also that where a comparison with experiment is possible, there is still some discrepancy, although trends are well reproduced by the calculations. In their study of the photodissociation of formic acid, Tian and Fang89 compared the gas-phase reaction with that of an aqueous solution. For the solvent they used a combined model with a supermolecule treatment of the solute plus the nearest water molecules together with a polarizable continuum model for the long-range solvent effects. They could demonstrate that the explicit treatment of water molecules closest to the solute was important for the theoretical study of the photoabsorption process. The study of Zeng and Ren68 on the tautomerization of 2-aminothiazole that we have discussed above, included also an examination of the excitation properties of the systems in the gas phase and in solutions. Zeng and Ren treated the solvent via a 102 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 33 Calculated and experimental vibrational energies (in cm1) for Br2CBr–Br in different solvents. All results are from ref. 87

Vibration

Acetonitrile with Cyclohexane Cyclohexane Water Acetonitrile traces of water calculation experiment calculation calculation experiment

C–Br str. Br–C–Br wag. Br2–C–Br wag. Br2–C–Br sym. str. Br2–C–Br bend BrCBr bend + Br–Br str. Br–Br str. + BrCBr bend Torsion Br2CBr–Br bend

818 762 344 283 182 175 147 50 36

828

797 775 411 279 185 170 108 48 19

179 155

796 777 408 279 185 170 109 48 20

828

280

polarizable-continuum approach, but included 0–3 water molecules in the quantummechanical part for the aqueous solutions. Their results are summarized in Table 36. For the gas phase, there is a clear dependence on the number of extra water molecules that is included, but in the aqueous solutions the results are quite insensitive to this aspect. This suggests that for optical properties the explicit treatment of (parts of) the solvent may not be needed, although, as we have seen above, the structure may depend on this, and, thereby indirectly, also the optical properties. This finding is partly in contrast to some of those we have found above. Kar et al.69 examined the properties of the sesquifulvalene molecule of Fig. 7. An interesting aspect is whether the molecule can change structure so that the two rings no longer are parallel but perpendicular. This phenomenon is called sudden polarization. One possibility could be to electronically excite the system. Therefore, Kar et al. calculated the relative total energy between the parallel and the perpendicular conformation in different excited states for the two different molecules of their study, both in the gas phase and in a solvent. We show their results in Table 37 for the two different donor–acceptor substituted molecules that they studied. It is interesting that the results depend very sensitive on the substitution. Table 34 Calculated absorption energies (in nm) for Br2CBr–Br in different solvents. For the 6th excitation, the second entry gives experimental values. For the solvents, also the relative dielectric constant is given. All results are from ref. 87 State

Vacuum Cyclohexane er = 1.000 er = 2.023

Dodecane 2-Propanol er = 20.18

Acetonitrile Propylene carbonate er = 36.64 er = 66.14

2 3 4 5 6

883 799 719 651 459

7 8 9 10 11 12 13 14 15 16

417 374 360 357 354 349 342 324 315 310

895 821 749 684 480 480 406 387 365 354 354 346 342 333 326 310

997 955 914 845 650 635 434 404 381 373 364 360 354 349 347 339

897 823 751 685 482 480 406 388 365 354 354 346 342 333 326 310

985 936 893 824 622 540 429 340 382 368 361 358 352 348 347 336

1000 962 922 853 663 635 436 406 381 375 366 360 355 350 347 340

Chem. Modell., 2008, 5, 67–118 | 103 This journal is

c

The Royal Society of Chemistry 2008

Table 35 Calculated singlet–triplet energies (in kcal/mol) for three different CX2 molecules in different solvents. All results are from ref. 87 Molecule

Vacuum

Cyclohexane

Methanol

CCl2 CBr2 CI2

17.4 14.3 7.0

17.6 14.4 7.2

18.0 14.7 7.4

Moreover, in one case, the S3 state possesses a sudden polarization when being solvated in a solvent with a not too large dielectric constant. In a study on coumarin derivatives, Nguyen et al.92 compared the performance of different theoretical approaches. They considered different derivatives and different solvents and compared two different polarizable-continuum models for the solvents and four different density functionals for the quantum-mechanical treatment of the solute. In addition, the dependence on the size of the basis set was explored. We show some of their findings in Table 38. We shall not discuss the performance of the different theoretical approaches in detail, but rather emphasize that the spread in the calculated values can be taken as an estimate of the inaccuracies such calculations will have. For more details, the reader is referred to the original study,92 where many more results are reported, too. In their study on oligothiophenes, Meng et al.71 also calculated the lowest excitation energies for the molecules as a function of length, solvent, and sidegroups. They used the time-dependent density-functional approach combined with a polarized-continuum method for the inclusion of solvent effects. Their results are summarized in Table 39, where they are compared with available experimental information. The decrease in the excitation energy upon increased chain length is readily observed. Moreover, the shifts due to the solvent are not as large as are those due to substitution. Finally, even the experimental values show a significant scatter, which most likely is due to the fact that many of the experimental studies were performed on substituted oligothiophenes. Barone et al.93 studied the optical properties of solvated p-benzoquinone, i.e., a benzene molecule in which two opposite C atoms have been replaced by oxygen (cf. Fig. 15). They used time-dependent density-functional calculations in the determination of the optical properties of the solute and a polarizable-continuum approach for the solvent. By using a recent development that allows for optimizing the structure of the excited molecule in solution, they could determine both the ground-state structure and that of the excited state. Then, they could calculate both vertical

Table 36 The calculated vertical excitation energies DE (in eV) and the corresponding oscillator strengths of (A) the amino and (B) the imino forms of isolated, mono-, di-, and trihydrated as well as dimeric species of 2-aminothiazole (cf. Fig. 6). All results are from ref. 68 System

Gas DE

Water DE

CCl4 DE

Gas f

Water f

CCl4 f

A B A + H2O B + H2O A + 2H2O B + 2H2O A + 3H2O B + 3H2O 2A 2B

5.14 5.20 5.07 4.95 5.02 4.94 5.03 4.96 4.85 4.79

5.07 4.98 5.05 4.98 5.02 4.96 5.03 4.96 4.86 4.81

5.09 5.29

0.12 0.002 0.16 0.13 0.16 0.14 0.16 0.13 0.03 0.004

0.15 0.13 0.16 0.15 0.16 0.16 0.16 0.16 0.05 0.05

0.11 0.002

4.86 4.81

104 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

0.04 0.02

Table 37 The relative total energy (in kcal/mol) of the parallel and the perpendicular conformation of the two donor–acceptor substituted sesquifulvalene molecules of Fig. 7. The system denotes the left and right group in the figure, and results are given for the ground and different excited singlet states as a function of the relative dielectric constant of the solvent. (cf. Fig. 7). All results are from ref. 69 System

State

er = 1.0

er = 4.9

er = 20.7

er = 78.4

NH2/NO2

S0 S1 S2 S3 S4 S0 S1 S2 S3 S4

23.7 6.6 18.0 14.1 14.0 38.4 20.5 25.7 15.6 31.9

11.2 28.1 28.8 3.0

8.8 32.7

6.0 25.8

0.16

6.6

38.8 21.7 26.7 15.8 32.7

38.9 22.2 26.9 15.9 33.0

39.1 22.6 27.6 16.1 33.5

NO2/NH2

Table 38 Calculated excitation energies (in eV), oscillator strength, and transition dipole moment (in D) for the coumarin derivative C1 in different media. Two different polarizablecontinuum models (SCRF-S and PCM) and four different density functionals (CAMB3LYP, mCAMB3LYP, PBE0, and B3LYP) have been used. In all cases, the same size of the basis set has been used, except for the B3LYP calculations in vacuum, for which the second and third entry list results for a smaller and a larger basis set, respectively. All results are from ref. 92 Medium

Solvent model

DFT

DE

f

m01

Vacuum

CAMB3LYP PBE0 B3LYP

Ethanol

SCRF-S

Water

PCM SCRF-S PCM Experiment SCRF-S

CAMB3LYP mCAMB3LYP PBE0 B3LYP B3LYP

Cyclohexane

PCM Experiment SCRF-S

4.24 4.04 3.94 4.00 3.93 4.00 3.83 3.74 3.69 3.64 3.51 3.99 3.81 3.68 3.63 3.65 4.13 3.97 3.84 3.87 3.77 3.76

0.368 0.363 0.344 0.329 0.338 0.497 0.451 0.477 0.406 0.445 0.318 0.495 0.447 0.401 0.439 0.325 0.477 0.446 0.415 0.495 0.468

4.79 4.87 4.80 4.66 4.76 5.72 5.57 5.80 5.39 5.68 5.54 5.72 5.56 5.36 5.65 4.85 5.51 5.44 5.33 5.81 5.72

CAMB3LYP mCAMB3LYP B3LYP B3LYP CAMB3LYP mCAMB3LYP B3LYP PBE0 B3LYP

PCM Experiment

and non-vertical excitation and emission energies, which are reported in Table 40. Structural relaxation clearly has a significant effects on the optical properties. In a mainly experimental work, Fujihara et al.94 performed photoelectron spectroscopy experiments on Na2 in small water clusters, i.e., Na2(H2O)n with n r 6. In addition, they performed electronic-structure calculations with Hartree-Fock plus 2nd order Møller-Plesset perturbation treatment of correlation effects. Further Chem. Modell., 2008, 5, 67–118 | 105 This journal is

c

The Royal Society of Chemistry 2008

Table 39 Calculated excitation energies (in eV) for the nT oligothiophenes of Fig. 9. Except for the n = 2 case, hydrogen atoms are attached everywhere to the carbon atoms. For n = 2, two different structures with two OCH3 groups attached have been considered, too. In all cases, the second entry lists experimental results. Also the relative dielectric constants of the solvents are listed. All results are from ref. 71 n

Sidegroups Gas er = 1.000

Acetonitrile n-Hexane er = 36.64 er = 1.89

1,4-Dioxane Chloroform er = 2.21 er = 4.90

Water er = 78.39

2

4.00

4.02 4.05, 4.11

4.02 3.96, 4.11

4.00

3.25 3.50–3.51 2.81 3.17 2.53 2.98–3.00 2.34 2.85

3.25 3.50, 3.61 2.80 3.04–3.34 2.53 2.98, 3.16 2.34 2.71–3.06

3.25

4.12

3.84 3.90–3.91 3.70 3.87–3.88

OCH3 OCH3 3

3.36

4

2.92

5

2.64

6

2.45

4.03 4.11–4.14 3.86 3.93 3.70 3.89 3.25 3.51–3.55 2.81 3.21–3.22 2.55 3.03 2.35 2.85

2.80 2.51 2.32

calculations on the optimized structures with more advanced methods were carried through in order to calculate the experimental spectra. They were able to identify several different structures for each value of n. For the structures of the lowest total energy for n r 3 the water molecules are bound preferentially to one of the Na atoms by O atoms, so that Na2 becomes asymmetrically solvated. For these structures, the Na–Na+(H2O)n structure grows with n resulting in one Na+ ion and a solvated Na ion. Nikitina et al.95 studied the Stokes shift of some organic chromophores due to the ~0 and R ~1 being the coordinates for the molecule of interest in solvent effects. Letting R ~ and E1(R) ~ its ground state and its first excited state, respectively, and letting E0(R) being the total energy of the ground state and of the first excited state, the Stokes shift is defined as ~0) E0(R ~0)] [E1(R ~1) E0(R ~1)] DESS = [E1(R ~0) E1(R ~1)] + [E0(R ~1) E0(R ~0)] = [E1(R l1 + l0 = (l1,s + l1,v) + (l0,s + l0,v).

(25)

In the first identity, we have written the Stokes shift as a sum of two parts, one from the first excited state and one from the ground state. These two are called the

Fig. 15 Structure of p-benzoquinone. Closed circles, open circles, and closed triangles represent carbon, hydrogen, and oxygen atoms, respectively.

106 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 40 Calculated energies (in eV) for the p-benzoquinone in various media. DEv gives the vertical excitation energy, DEabs is the non-vertical excitation energy, and DEemis is the vertical emission energy. TD-DFT, CASPT2, and SAC-CI represent results obtained with timedependent density-functional calculations as well as with two post-Hartree-Fock wavefunction-based approaches. Properties for the three excitations with the largest oscillator strengths are shown. All results are from ref. 93 Medium Gas

CCl4

DMSO

DEv TD-DFT

DEv CASPT2

DEv SAC-CI

DEabs TD-DFT

DEemis TD-DFT

3.09 3.05 4.11 3.06 3.13 4.09 3.08 3.28 4.08

2.80 2.82 3.56

2.71 3.50 3.74

2.88 2.79

2.67 2.28

2.80 2.87

2.57 2.36

2.67 2.28

2.18 2.48

reorganization energies, l, and they are in the last identity split into one contribution from a solvent, li,s, and another part from the internal structure of the molecule, li,v. In their work, Nikitina et al.95 focused on the solvent contribution, ls = 12(l1,s + l0,s).

(26)

To calculate this they applied a methodology developed by their group earlier.96 Ultimately, they applied it on various larger, organic chromophores and demonstrated a fair agreement with available experimental information. We shall not discuss their results further here. Optical rotation spectroscopy (ORD) can by used in characterizing chiral molecules, like (R)-3-methylcyclopentanone of Fig. 16. This molecule can exist in two conformations, as shown in the figure. Al-Basheer et al.97 studied this molecule experimentally and theoretically in various solvents, including cyclohexane. They calculated the spectra of the two isomers, separately, as shown in Fig. 17. Experimental information on the mole fractions of the two conformers was subsequently used in calculating the spectrum that is shown in Fig. 18, where it is also compared with the experimental results. The agreement between experiment and theory is excellent. A related study was presented by Kundrat and Autschbach.98 They calculated the optical rotation for aromatic amino acids in solution. For each molecule, three primary rotamers, g, h, and t (for ‘gauche’, ‘hindered’, and ‘trans’), may exist. Fig. 19 shows one example from the study of Kundrat and Autschbach.

Fig. 16 Structure of the two different conformations of (R)-3-methylcyclopentanone. Reproduced with permission of The American Chemical Society from ref. 97.

Chem. Modell., 2008, 5, 67–118 | 107 This journal is

c

The Royal Society of Chemistry 2008

Fig. 17 Calculated optical rotation spectra for the two isomers of (R)-3-methylcyclopentanone in cyclohexane. Reproduced with permission of The American Chemical Society from ref. 97.

Fig. 18 Calculated and experimental optical rotation spectra for (R)-3-methylcyclopentanone in cyclohexane. Reproduced with permission of The American Chemical Society from ref. 97.

Fig. 19 Optimized rotamers of the phenylalanine zwitterion. Reproduced with permission of The American Chemical Society from ref. 98.

Kundrat and Autschbach calculated the relative total energies of these in the solvent. They used density-functional calculations for the solute and a polarizable continuum method for the solvent. Moreover, they compared the results obtained with two different basis sets. From the relative total energies, Kundrat and Autschbach could calculate the populations for a temperature of 293 K using a Boltzmann distribution, and compare those with available experimental information, see Table 41. Subsequently, by calculating the optical rotation for each rotamer and a wavelength of 589.3 nm (that of the sodium D line) and weighting the results with the Boltzmann factors, they obtained the values for the optical rotations of the 108 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 41 Different properties from the study of the optical rotation (ORD) in solution. DE gives the relative total energy in kJ/mol, x1 and x2 the Boltzmann populations from the calculations with two different basis sets, and xexp that from experiment. The calculated optical rotations [in (deg cm3)/(g dm)] with the two different basis sets and from experiment is listed, too. Two different molecules are considered and for each three different rotamers. DE has been calculated with the first basis set, and the corresponding Boltzmann populations have been used. The entry ‘mix’ gives the Boltzmann averaged results. All results are from ref. 98 Molecule

Structure DE

Phenylalanine zwitterion

g

7.6

0.03 0.03

0.24

h t mix g

3.0 0.0

0.21 0.19 0.76 0.78

0.27, 0.28 0.48, 0.50

1.8

0.26 0.24

150.9 6.7 22.3 0.24, 0.28, 0.32, 0.36 221.8

h t mix

2.5 0.0

0.19 0.20 0.55 0.57

0.26, 0.27 0.37, 0.42, 0.46, 0.50

Phenylalanine cation

x1

x2

ORD1

xexp

162.4

69.1 25.8 59.0

ORD2 ORDexp 142.8 120.6 18.2 7.7 35.1 210.4 66.7 18.0 52.2

7.4

mixture that are given in Table 41, too. Also for those, a comparison with experimental values is presented. Table 41 contains the results for only two molecules. The original study considered several further molecules, and in some cases a better agreement with experiment was found than what appears to be the case for the results of Table 41. As seen in the table, the ORD is very sensitive to the structure. Therefore, if small inaccuracies occur in the relative total energies, the populations may be quite inaccurate and, consequently, wrong results for the total ORD signal are found. Such inaccuracies could, e.g., have their origin in the lack of explicit solvent molecules in the near neighbourhood of the solute. Mukhopadhyay et al.99 calculated the optical rotation of methyloxirane in water using different approaches and compared the results with experimental information. They considered both explicit and implicit models for the solvent, i.e., they treated the solute as well as some parts of the solvent quantum-mechanically, whereas the long-range effects of the solvent were treated within a polarizable-continuum approximation. They found that without water molecules in the quantum-mechanical treatment, the agreement between experiment and theory was not good. Thus, in this case it is important to include short-range solute–solvent interactions explicitly. In that case, a good agreement was found. J. Linear and nonlinear optical properties in solution The fact that materials respond to electromagnetic fields can be used to manipulate light. In particular the higher-order responses are then of interest. Therefore, many (experimental and theoretical) studies have been devoted to understand, exploit, and optimize these responses. One way of quantifying the responses is to consider the dipole moment of the molecule of interest. In the presence of an external electric field, the ith component of the dipole moment can be expanded as ð0Þ

mi ¼ mi þ

X

aij Ej þ

j

1X 1X b Ej Ek þ g Ej Ek El þ ; 2 jk ijk 6 jkl ijkl

ð27Þ

is the dipole moment in the absence of the external field, aij is the where m(0) i polarizability, and bijk, gijkl, . . . is the first, second, . . . hyperpolarizability. Em is the mth component of the external field. If the external field is frequency dependent, Chem. Modell., 2008, 5, 67–118 | 109 This journal is

c

The Royal Society of Chemistry 2008

so will the response be. Through the hyperpolarizablities it becomes thereby possible to obtain responses with frequencies that are different from those of the incoming electric field. The conjugated, organic molecules that we have discussed a couple of times during this presentation have turned out to be materials with particularly large values for the hyperpolarizabilities. Therefore, many studies have been devoted to those materials. Among those is the work of Balakina and Nefediev,65 who studied systems like the one of Fig. 4 in the presence of static fields. Assuming that the molecule is lying in the (x, z) plane with the z axis being the main axis of the molecule, they calculated an average polarizability, a = 13(axx + ayy + azz),

(28)

and an average first hyperpolarizability along the molecular axis, bz = bzxx + bzyy + bzzz.

(29)

These two quantities are listed in Table 27 for the four different molecules of their study both in the gas phase and in solution. It is interesting to see that the responses can be increased significantly upon solvation. The authors found that the structure is hardly affected by the solvation, so the effect is mainly a purely electronic one. This was confirmed by calculating the responses in the solution but for the structures of the gas phase. Also this led to significant enhancements of the values of a and gz compared to the gas-phase values, although slightly smaller. Also Perpe`te and Jacquemin70 studied such systems for static fields. They calculated the first hyperpolarizability using a continuum-model for the solvent and Hartree-Fock + Møller-Plesset approaches for the solute. As solute they considered the push-pull systems of Fig. 8. They optimized the structure both in the gas phase and in the solvent. Subsequently, they calculated the properties for the molecules considering both geometries in both phases, whereby they could distinguish between structure and media effects. The results are shown in Table 42 where it is confirmed that the main effects of the solvent are purely electronic, whereas structural changes are less important. It is also interesting to see that bzzz is significantly enhanced in the solution and that bzzz/n goes through a maximum as a function of n. Li et al.100 examined the properties of two organometallic tungsten–carbon complexes, tungsten pentacarbonyl pyridine (TPCP) and tungsten pentacarbonyl trans-1,2-bis(4-pyridyl)-thylene (TPCB), that also had been studied experimentally. They considered the isolated monomers as well as dimers and studied the systems in solutions. They used density-functional methods in order to calculate the linear and nonlinear responses to electric fields, and the solvents were treated with a Table 42 The first hyperpolarizability bzzz (in a.u.) for the push-pull oligomers of Fig. 8 as a function of n. The geometry was optimized in either gas or water phases, and the calculations were carried through for both structures in both media. All results are from ref. 70 n Geometry Medium

bzzz/n Gas Gas

bzzz/n Gas Water

bzzz/n Water Gas

bzzz/n Water Water

2 4 6 8 10 12 14 16

1571 4635 8628 12 301 14 903 16 330 16 824 16 696

7199 29 743 48 727 52 273 48 380 42 864 37 731 33 407

1691 5284 9323 12 851 15 378

6772 39 309 58 041 55 410 50 634

110 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 43 The static dipole moment, the average polarizability, and the average first hyperpolarizability for different tungsten–carbon complexes. The second entry gives the conditions, and the third gives the method. All results are from ref. 100 System

m(0) (D) a (1024 esu) a (1024 esu) b (1030 esu) b (1030 esu) b (1030 esu) static l = 1907 nm static l = 1907 nm l = 1907 nm Calc. Calc. Calc. Calc. Calc. Exp.

TPCP monomer TPCP dimer TPCB monomer TPCB dimer

7.5 5.6 6.1 4.2

28.9 56.5 52.2 102.2

29.2 56.9 53.3 104.2

11.0 5.0 66.0 11.1

15.0 6.5 109.3 21.5

4.4 4.4 7.0 7.0

polarizable-continuum model. In order to compare with experiment, they considered toluene and chloroform for TPCP and TPCB, respectively, as the solvent. Moreover, the experiments were performed for electric fields with a wavelength of l = 1907 nm, and, therefore, both that wavelength and an electrostatic field were considered. Table 43 summarizes some of their findings. It is seen that in particular the hyperpolarizability responds very sensitively to the formation of dimers, although the interaction between the monomers is an only weak p–p stacking effect. The fact that such weak interactions are important suggests that also solvent molecules should be treated explicitly. On the other hand, Table 43 shows that, in the present case, the theoretical results for the dimers are able to reproduce the experimental results, whereas those of the monomers are not. K.

Magnetic properties in solution

Not only the responses of materials to electric fields can be used in characterizing materials as well as be exploited in applications, but also those to magnetic fields. Therefore, theoretical studies of the magnetic responses are as relevant as are those to electric fields. Here, we shall briefly discuss a few recent such ones. Jansik et al.101 calculated the magnetochiral axial birefringence. This effect appears as a change in the optical index of chiral systems when being exposed to a static magnetic field parallel to the direction of the propagation of the light. For a chiral molecule, the axial birefringence due to the presence of a magnetic field can be defined as the difference between the refractive indices for unpolarized light propagating parallel and antiparallel relative to the direction of the externally applied magnetic field, Dn = nmm nmk.

(30)

Here, the first upper index represents the direction of propagation of the light and the second represents the direction of the magnetic field. In an earlier work, the same group had used Hartree-Fock calculations in calculating Dn for the most stable isomer of some chiral organic molecules in vacuum.102 In the more recent work,101 they extended the study in several directions. At first, they applied the B3LYP density-functional method, whereby also correlation effects were included. Second, they compared gas-phase results with those obtained for a solution, whereby they applied the polarizable-continuum method for the treatment of the solvent. And third, for some of the larger molecules they included the effects of having a mixture of more different stable structures in an approach very similar to that we discussed above in section III I for the calculation of the optical rotation. Their results are summarized in Table 44. The table shows that the changes when passing from the Hartree-Fock approximation to the B3LYP approach are large, which the authors interpret as indicating the importance of correlation effects. Moreover, when considering several different isomers with closeby energies, the Chem. Modell., 2008, 5, 67–118 | 111 This journal is

c

The Royal Society of Chemistry 2008

Table 44 The magnetochiral birefringence, Dn [in 1012 cm3/(T g)], per unit magnetic field and density for various chiral molecules. The theoretical method is either the Hartree-Fock or the B3LYP density-functional method, and the calculations were performed for either the gas phase or a solution. For the last two molecules, not only the most stable isomer was considered but also mixtures of six stable structures with populations according to a Boltzmann distribution. [a] is the specific optical rotation [in deg/(cm3 dm g)]. All results are from ref. 101 Molecule

Method

No. of isomers

Phase

Dn

(R)-C3H6O

HF B3LYP HF B3LYP HF B3LYP HF B3LYP B3LYP B3LYP B3LYP HF B3LYP B3LYP B3LYP B3LYP

1 1 1 1 1 1 1 1 1 6 6 1 1 1 6 6

Gas Gas Gas Gas Gas Gas Gas Gas Solution Gas Solution Gas Gas Solution Gas Solution

7.53 18.50 0.74 3.64 23.36 0.24 0.60 53.91 28.43 1.23 4.61 16.97 33.09 35.96 36.24 33.56

(S)-C3H3OF (R)-C4H6O (S)-C10H14O

(R)-C10H16

[a]

205.1 238.7 41.6 40.5 18.2 18.6 87.5 86.0

results may change significantly. That solvation has a strong effect should be visible in the table, too. NMR spectroscopy is a very useful tool for determining the local chemical surroundings of various atoms. Komin et al.103 studied theoretically this for the adenine molecule of Fig. 20 both in vacuum and in an aqueous solution using different computational approaches. In all cases, density-functional calculations were used for the adenine molecule, but as basis functions they used either a set of localized functions (marked loc in Table 45) or plane waves (marked pw). Furthermore, in order to include the effects of the solvent they used either the polarizable continuum approach (marked PCM) or an explicit QM/MM model with a force field for the solvent and a molecular-dynamics approach for optimizing the structure (marked MD). In that case, the chemical shifts were calculated as averages over 40 snapshots from the molecular-dynamics simulations. Finally, in one case, an extra external potential from the solvent acting on the solute was included, too, marked by the asterisk in the table. The table shows a strong dependence of the nitrogen chemical shifts on the basis set. A similar dependence was not observed for 13C chemical shifts on the same

Fig. 20 Schematic representation of the structure of the adenine molecule. Open circles, closed circles, and closed triangles represent carbon, nitrogen, and hydrogen atoms, respectively. The atom numbering is also shown.

112 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

Table 45 Calculated and experimental values of the 1H and 15N chemical shifts of adenine of Fig. 20 where also the atom numbering is given. For further details, see the text. All results are from ref. 103 Method

Phase

H2

H1

N1

N3

N6

N7

N9

loc loc/PCM pw pw/MD pw/MD pw/MD* Exp.

Gas Sol Gas Gas Sol Sol Sol

8.61 8.63 8.07 8.21 7.99 7.98 8.07

7.79 8.35 7.65 7.74 8.22 8.17 8.11

112.3 124.6 151.8 144.8 162.3 158.5 157.3

121.0 132.8 159.1 154.3 166.7 162.6 158.2

295.3 290.1 271.1 264.8 268.8 273.9 304.0

103.6 119.9 160.9 154.1 180.6 174.3 166.8

212.7 203.4 213.0 213.9 209.0 211.1 209.9

molecule (not shown). Adding solvent effects with the polarizable continuum method brings the chemical shifts in general closer to the experimental values (obtained in solution). The same is true when adding the solvent effects using a QM/MM approach. However, the overall agreement between theory and experiment as seen in Table 45 is not perfect, which actually may suggest that such studies are very sensitive to the accuracy of the theoretical approach and, accordingly, may provide an excellent field for testing new developments. As the final example we mention the recent study of Pollet and Marx.104 They studied the properties of a gadolinium-containing complex in aqueous solution. This complex is used as a contrast agent to enhance the image contrast in magnetic resonance imaging that in turn is used for diagnostic purposes in medicine. In order to be able to use theoretical calculations as a support for the experimental studies, it is mandatory that the theory is able to provide accurate information. Therefore, Pollet and Marx compared different theoretical approaches for studying the complex in water, including the polarizable continuum approach, a QM/MM approach, and a Car-Parrinello supercell approach where also the solvent is treated fully quantummechanically. They found that only the last approach was able to give structural information that was in good agreement with experiment. The authors did, however, not extend the study to the calculation of magnetic responses.

L. Including surfaces Computationally, solvated molecules adsorbed on surfaces represent an enormous challenge. The surface is usually extended in two dimensions, but the presence of the adsorbed molecules breaks the symmetry. Moreover, the material is extended in the third dimension, although the presence of the surface leads to a broken symmetry there, too. Finally, also the presence of the solvent leads to a significant reduction of symmetry. Therefore, theoretical studies of such systems are much less common than those of finite, solvated but otherwise isolated molecules. However, some studies exist, and here we shall briefly discuss two that illustrate the possibilities and approaches. Pool et al.105 studied the adsorption of alkyl thiols on gold surfaces, both without and with a solvent. As solvent they considered n-hexane. For the simulations they used a force field with a so called united-atom model. This means that the thiol heads, the CH2 and CH3 groups, and the gold atoms are treated as individual united ‘atoms’. The interactions between these united pseudoatoms are described in terms of the force fields. Subsequently, molecular dynamics simulations were carried through for the different systems. At first, the authors analysed the adsorption behaviour for a clean perfect Au(111) surface in vacuum. For alkyl thiols with three segments in the alkyl tail they found the structures of Fig. 21. At a temperature of T = 300 K, they found two different structures, depending on whether the simulation was started from a fully loaded Chem. Modell., 2008, 5, 67–118 | 113 This journal is

c

The Royal Society of Chemistry 2008

Fig. 21 Structure of alkyl thiols adsorbed on a gold surface. Light spheres, dark spheres, and thin lines represent gold atoms, thiol heads, and carbon chains, respectively. The left part shows the two-dimensional, ordered, ‘crystalline’ phase, and the right part the two-dimensional, disordered, ‘liquid’ phase. Top (top) and side (bottom) views are shown in each case. Reproduced with permission of The American Chemical Society from ref. 105.

surface or from a completely empty surface. The former case leads to an ordered, ‘crystalline’ phase, as shown in the left part of the figure, whereas the latter leads to a disordered, ‘liquid’ phase that is shown in the right part of Fig. 21. Similar simulations for finite gold nanocrystals with 561 atoms led to a similar hysteresis between adsorption and desorption. Moreover, the adsorption starts at a lower thiol concentration for the nanocrystals than for the macroscopic crystal surface. Adding the solvent led to significant changes. There results a competitive adsorption between the thiols and the solvent and the interactions between the tails are modified. Thus, as the authors conclude, phenomena that are observed in vacuum may be different from those observed in a solution.

Fig. 22 The model system for the electrochemical double layer above a Pt(111) electrode. The large dark spheres represent the Pt atoms, whereas the smallest spheres represent hydrogen and the intermediate-sized ones oxygen. Hydronium ions, H3O+, and transition states are explicitly marked. The top shows a top view and the bottom a side view. Reproduced from ref. 106.

114 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

As the last example we discuss the recent study of Sku´lason et al.106 who examined the hydrogen evolution at a platinum electrode. They studied the model system of Fig. 22 that contains a slab of Pt atoms, representing the Pt(111) surface. On top of that, two layers of water molecules plus an excess of H atoms are placed. Thereby, protons become solvated in the water, leading to the formation of H3O+ ions, whereas electrons are transferred to the metal. In their simulations, they use a supercell approach with the cell of Fig. 22 repeated periodically in all three dimensions (with, however, some further layers of Pt atoms). All atoms were treated quantum-mechanically within the density-functional formalism. At the surface, the (fast) so called Volmer reaction H+ + e - Had takes place. Subsequently, either the hom*olytic Tafel reaction, 2Had - H2, or the heterolytic Heyrovsky reaction, Had + H+ + e - H2, may take place. Using the nudgetelastic-band approach (see, e.g., ref. 2) the authors studied the three reactions on the surfaces. They found that the Heyrovsky reaction may dominate, but the similarity of the calculated activation energies for the Heyrovsky and the Tafel reactions suggests that both reactions may take place in parallel.

IV. Conclusions The theoretical treatment of molecules in solution represents on the one hand a methodological and computational challenge and on the other a possibility for bringing experimental and theoretical studies in closer contact. Since many experiments are performed in solution it is important for theoretical studies to be able to treat this situation. But the presence of a solvent means that the solvated molecules may interact more or less strongly with those of the solvent. Ultimately, the solvent is structured, extended, and able to respond to the presence of the solute. Thus, in the perfect case a full quantum-mechanical treatment of the complete system, solute and solvent, is needed. Such a treatment is not possible and one has to restore to approximations. We discussed some of those in section II. They could, as a first approximation, be split into three different classes. In one set of approximations the solvent was treated within a simplified, parameterized approach. Thereby, the structure of the solvent was retained. As an alternative, the solvent could be approximated as a polarizable continuum that would respond to the presence of the solute but otherwise was without any internal structure. These two types of approaches are coined explicit and implicit methods, respectively. Finally, a full quantum-mechanical treatment of either a smaller, finite supermolecule or an infinite, periodic supercell could be carried through. The first approach suffers from the dependence of the parameterization of the simple model. We saw examples where quite different results could be obtained for different ‘realistic’ models. On the other hand, it has as an advantage that also large, complex systems can be treated, and chemical interactions like the formation of hydrogen bonds can be included. The second approach is computationally simple and efficient, which is a significant advantage. On the other hand, the lack of true atoms of the solvent and of possible chemical or hydrogen bonds between the solvent and the solute may in some cases be serious drawbacks. Also the dependence of the results on the shape of the cavity occupied by the solute may be a problem. The third approach is, in principle, the most accurate one that should be able to give arbitrarily accurate results. However, the computational demands of this put serious limitations on the system sizes that can be treated with this. On the other hand, only this approach allows directly for a proper description of possible charge transfer processes between solvent and solute. Independent of the approach used, we saw several examples of the influences of a solvent on the properties of a solute. Structure, relative stability, optical, and Chem. Modell., 2008, 5, 67–118 | 115 This journal is

c

The Royal Society of Chemistry 2008

magnetic properties could all be altered due to the presence of a solvent. We also saw many examples where an impressive agreement between experiment and theory could be obtained. Finally, this presentation has hopefully also indicated where the developments in the theoretical treatment of solvated systems will bring us in the not too distant future. This includes for instance that even more complex systems will be treated, including heterogeneous catalysis in solution. Moreover, additional, experimentally accessible quantities will be accessible with calculations, too. And, as our examples have shown, both long-range and short-range interactions are important, which ultimately may lead to the consensus that a combined explicit/implicit treatment of a solvent is most useful. This means that an implicit treatment of the long-range effects of the solvent as a polarizable continuum is combined with an explicit treatment (for instance a full quantum-mechanical one) of the closest solvent molecules to the solute. An increasing number of studies is based on such approaches that appears to lead to some of the currently most accurate results. One may hope that through such multi-scale approaches an optimal accuracy can be obtained and that the arbitrariness of the results due to the parameterization may be reduced as much as possible.

References 1 M. Springborg, Methods of Electronic-Structure Calculations, Wiley, Chichester, UK, 2000. 2 M. Springborg, in Specialist Periodical Reports: Chemical Modeling, Applications and Theory, ed. A. Hinchliffe, Royal Society of Chemistry, Publisher, 2006, vol. 4, p. 249. 3 J. Tomasi, B. Mennucci and R. Cammi, Chem. Rev., 2005, 105, 2999. 4 E. S. Marcos, Theor. Chem. Acc., 2006, 115, 75. 5 F. J. Vesely, J. Comput. Phys., 1977, 24, 361. 6 M. Neumann, F. J. Vesely, O. Steinhauser and P. Schuster, Mol. Phys., 1978, 35, 841. 7 F. H. Stillinger and C. W. David, J. Chem. Phys., 1978, 69, 1473. 8 P. Barnes, J. L. Finney, J. D. Nicholas and J. E. Quinn, Nature, 1979, 282, 459. 9 A. Warshel, J. Phys. Chem., 1979, 83, 1640. 10 A. Bondi, J. Phys. Chem., 1964, 68, 441. 11 L. Pauling, The Nature of the Chemical Bond, Cornell University Press, Ithaca, New York, 1960. 12 M. L. Connolly, J. Appl. Crystallogr., 1983, 16, 548. 13 M. L. Connolly, J. Mol. Graph., 1993, 11, 139. 14 S. Miertusˇ , E. Scrocco and J. Tomasi, J. Chem. Phys., 1981, 55, 117. 15 A. Klamt and G. Schu¨u¨rmann, J. Chem. Soc., Perkin Trans., 1993, 2, 799. 16 K. V. Mikkelsen, H. A˚gren, H. J. A. Jensen and T. Helgaker, J. Chem. Phys., 1988, 89, 3086. 17 D. Rinaldi and J. L. Rivail, Theor. Chim. Acta, 1973, 32, 57. 18 J. L. Rivail and D. Rinaldi, Chem. Phys., 1976, 18, 233. 19 C. J. Cramer and D. G. Truhlar, Chem. Rev., 1999, 99, 2161. 20 H. Li and M. S. Gordon, J. Chem. Phys., 2007, 126, 124112. 21 T. A. Wesolowski and A. Warshel, J. Phys. Chem., 1993, 97, 8050. 22 T. Wesolowski and A. Warshel, J. Phys. Chem., 1994, 98, 5183. 23 M. F. Iozzi, M. Cossi, R. Improta, N. Rega and V. Barone, J. Chem. Phys., 2006, 124, 184103. 24 B. Mennucci, Theor. Chem. Acc., 2006, 116, 31. 25 T. Yamamoto and S. Kato, J. Chem. Phys., 2007, 126, 224514. 26 Y.-H. Tan and R. Luo, J. Chem. Phys., 2007, 126, 094103. 27 M. Saito and I. Okazaki, J. Comp. Chem., 2007, 28, 1129. 28 C. R. Jacob, J. Neugebauer, L. Jensen and L. Visscher, Phys. Chem. Chem. Phys., 2006, 8, 2349. 29 G. Chevrot, R. Schurhammer and G. Wipff, Phys. Chem. Chem. Phys., 2006, 8, 4166. 30 C. Curutchet, M. Orozco, F. J. Lique, B. Mennucci and J. Tomasi, J. Comp. Chem., 2006, 27, 1769. 31 C. Amovilli, Chem. Phys. Lett., 1994, 229, 244. 32 C. Amovilli and B. Mennucci, J. Phys. Chem. B, 1997, 101, 1051. 33 K. Aidas, A. Møgelhøj, H. Kjær, C. B. Nielsen, K. V. Mikkelsen, K. Ruud, O. Christiansen and J. Kongsted, J. Phys. Chem. A, 2007, 111, 4199. 34 Y. Sun, B. N. Dominy and R. A. Latour, J. Comput. Chem., 2007, 28, 1883.

116 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86

J. Kongsted and B. Mennucci, J. Phys. Chem. A, 2007, 111, 9890. K. Takemura and A. Kitao, J. Phys. Chem. B, 2007, 111, 11870. B. Du¨nweg, J. Chem. Phys., 1993, 99, 6977. I.-C. Yeh and G. Hummer, Biophys. J., 2004, 86, 681. I.-C. Yeh and G. Hummer, J. Phys. Chem. B, 2004, 108, 681. V.-A. Glezakou, Y. Chen, J. L. Fulton, G. K. Schenter and L. X. Dang, Theor. Chem. Acc., 2006, 115, 86. C. W. Bock, G. D. Markham, A. K. Katz and J. P. Glusker, Theor. Chem. Acc., 2006, 115, 100. J. Blumberger and M. Sprik, Theor. Chem. Acc., 2006, 115, 113. R. A. Marcus, J. Chem. Phys., 1956, 24, 966. R. A. Marcus, Discuss. Faraday Soc., 1960, 29, 21. R. A. Marcus, J. Chem. Phys., 1965, 43, 679. R. A. Marcus, Rev. Mod. Phys., 1993, 65, 599. J. Blumberger, L. Bernasconi, I. Tavernelli, R. Vuilleumier and M. Sprik, J. Am. Chem. Soc., 2004, 126, 3928. I. Adamovic and M. S. Gordon, J. Phys. Chem. A, 2006, 110, 10267. A. Kumar, M. Park, J. Y. Huh, H. M. Lee and K. S. Kim, J. Phys. Chem. A, 2006, 110, 12484. A. Tongraar, P. Tangkawanwanit and B. M. Rode, J. Phys. Chem. A, 2006, 110, 12918. R. Parthasarathi, V. Subramanian, N. Sathyamurthy and J. Leszczynski, J. Phys. Chem. A, 2007, 111, 2. H. Kokubo and S. M. Pettitt, J. Phys. Chem. B, 2007, 111, 5233. R. E. Zillich and K. B. Whaley, J. Phys. Chem. A, 2007, 111, 7489. S. Petrosyan, J.-F. Briere, D. Roundy and T. A. Arias, arXiv:cond-mat/0606817. C. P. Kelly, C. J. Cramer and D. G. Truhlar, J. Phys. Chem. B, 2006, 110, 16066. E. A. Guggenheim, J. Phys. Chem., 1929, 33, 842. E. A. Guggenheim, J. Phys. Chem., 1930, 34, 1540. I. M. Klotz and R. M. Rosenberg, Chemical Thermodynamics, Wiley, New York, USA, 1994. M. R. Reddy, U. C. Singh and M. D. Erion, J. Comput. Chem., 2007, 28, 491. U. C. Singh and S. J. Benkovic, Proc. Natl. Acad. Sci. USA, 1988, 85, 9518. Y. Yang and Q. Cui, J. Phys. Chem. B, 2007, 111, 3999. E. D. Glendening and A. Streitwieser, J. Chem. Phys., 1994, 100, 2900. E. D. Glendening, J. Am. Chem. Soc., 1996, 118, 2473. G. K. Schenter and E. D. Glendening, J. Phys. Chem., 1996, 100, 17152. M. Yu. Balakina and S. E. Nefediev, Int. J. Quant. Chem., 2006, 106, 2245. M. C. Flipse, R. de Jonge, R. H. Woudenberg, A. W. Marsman, C. A. van Walree and L. W. Jenneskens, Chem. Phys. Lett., 1995, 245, 297. C. S. Cucinotta, A. Ruini, A. Catellani and A. Stirling, Chem. Phys. Chem., 2006, 7, 1229. Y. Zeng and Y. Ren, Int. J. Quant. Chem., 2006, 107, 247. T. Kar, N. Panja and P. K. Nandi, J. Phys. Chem. A, 2006, 110, 12684. E. A. Perpe`te and D. Jacquemin, Int. J. Quant. Chem., 2007, 107, 2066. S. Meng, J. Ma and Y. Jiang, J. Phys. Chem. B, 2007, 111, 4128. Q.-G. Li, G.-M. Liang, X. Wang, S.-Y. Chu and Y. Ren, Int. J. Quant. Chem., 2007, 107, 921. V. I. Borovkov, N. P. Gritsan, I. V. Yeletskikh, V. A. Bagryansky and Y. N. Molin, J. Phys. Chem. A, 2006, 110, 12752. W. B. Floriano, G. B. Domont and M. A. C. Nascimento, J. Phys. Chem. B, 2007, 111, 1893. F. De Angelis, F. Tarantelli and S. Alunni, J. Phys. Chem. A, 2006, 110, 11014. S. Alunni, F. De Angelis, L. Ottavi, M. Papavasileiou and F. Tarantelli, J. Am. Chem. Soc., 2005, 127, 15151. T. Song and P. Hu, J. Chem. Phys., 2006, 125, 091101. C. P. Kelly, C. J. Cramer and D. G. Truhlar, J. Chem. Theory Comput., 2005, 1, 1133. C. P. Kelly, C. J. Cramer and D. G. Truhlar, J. Phys. Chem. A, 2006, 110, 2493. K. E. Gutowski and D. A. Dixon, J. Phys. Chem. A, 2006, 110, 12044. R. Friedman, S. Fischer, E. Nachliel, S. Scheiner and M. Gutman, J. Phys. Chem. B, 2007, 111, 6059. R. Improta, V. Barone and M. D. Newton, Chem. Phys. Chem., 2006, 7, 1211. G. D. Scholes, C. Curutchet, B. Mennucci, R. Cammi and J. Tomasi, J. Phys. Chem. B, 2007, 111, 6978. T. Fo¨rster, Ann. Phys., 1948, 2, 55. M. F. Iozzi, B. Mennucci, J. Tomasi and R. Cammi, J. Chem. Phys., 2004, 120, 7029. C. Curutchet and B. Mennucci, J. Am. Chem. Soc., 2005, 127, 16733.

Chem. Modell., 2008, 5, 67–118 | 117 This journal is

c

The Royal Society of Chemistry 2008

87 Q. Kong, M. Wulff, S. Bratos, R. Vuilleumier, J. Kim and H. Ihee, J. Phys. Chem., 2006, 110, 11178. 88 D. R. Ramos, R. Castillo, M. L. Canle, M. V. Garcı´ a, J. Andre´s and J. A. Santaballa, Chem. Phys. Lett., 2006, 429, 425. 89 Y.-C. Tian and W.-H. Fang, J. Phys. Chem. A, 2006, 110, 11704. 90 L. Meneses, P. Fuentealba and R. Contreras, Chem.Phys. Lett., 2006, 433, 54. 91 J. Sˇebek, Z. Kejı´ k and P. Bourˇ , J. Phys. Chem. A, 2006, 110, 4702. 92 K. A. Nguyen, P. N. Day and R. Pachter, J. Chem. Phys., 2007, 126, 094303. 93 V. Barone, R. Improta, G. Morelli and F. Santoro, Theor. Chem. Acc., 2007, 118, 143. 94 A. Fujihara, C. Miyata, A. Maekawa, K. f*cke, K. Daigoku, N. Murata and K. Hashimoto, J. Phys. Chem. A, 2007, 111, 7364. 95 E. A. Nikitina, A. V. Odinokov, F. V. Grigoriev, M. V. Basilevsky, A. A. Khlebunov, V. A. Sazhnikov and M. V. Alfimov, J. Phys. Chem. B, 2007, 111, 3953. 96 M. V. Verner, A. V. Tovmash, I. V. Rostov and M. V. Basilevsky, J. Phys. Chem. B, 2006, 110, 14950. 97 W. Al-Basheer, R. M. Pagni and R. N. Compton, J. Phys. Chem. A, 2007, 111, 2293. 98 M. D. Kundrat and J. Autschbach, J. Phys. Chem. A, 2006, 110, 12908. 99 P. Mukhopadhyay, G. Zuber, M.-R. Goldsmith, P. Wipf and D. N. Beratan, Chem. Phys. Chem., 2006, 7, 2483. 100 Q. Li, R. Sa, C. Liu and K. Wu, J. Phys. Chem. A, 2007, 111, 7925. 101 B. Jansı´ k, A. Rizzo, L. Frediani, K. Ruud and S. Coriani, J. Chem. Phys., 2006, 125, 234105. 102 S. Coriani, M. Pecul, A. Rizzo, P. Jørgensen and M. Jaszun´ski, J. Chem. Phys., 2002, 117, 6417. 103 S. Komin, C. Gossens, I. Tavernelli, U. Rothlisberger and D. Sebastiani, J. Phys. Chem. B, 2007, 111, 5225. 104 R. Pollet and D. Marx, J. Chem. Phys., 2007, 126, 181102. 105 R. Pool, P. Schapotschnikow and T. J. H. Vlugt, J. Phys. Chem. C, 2007, 111, 10201. 106 E. Sku´lason, G. S. Karlberg, J. Rossmeisl, T. Bligaard, J. Greely, H. Jo´nsson and J. K. Nørskov, Phys. Chem. Chem. Phys., 2007, 9, 3241.

118 | Chem. Modell., 2008, 5, 67–118 This journal is

c

The Royal Society of Chemistry 2008

The solid state E. A. Moore DOI: 10.1039/b801632c

1. Introduction Solid state modelling has long been the poor relation of organic modelling as far as chemists are concerned. There are reasons for this. Firstly it was easier to develop relatively small sets of parameters that would describe a wide range of organic molecules including biologically important molecules such as amino acids and DNA bases, whereas the solid state modeller requires parameters for the entire periodic table. Secondly the initial molecular modellers could work with small molecules and hence make progress with the computer resources available in the 1960s. Solid state modelling requires methods of dealing with infinite arrays. Finally many solids of interest contain open shell atoms which are harder to deal with theoretically and require more computer resource than, the usually closed shell, organic molecules. Ideas for the simulation of solids have been around for a long time—for example the lattice energy calculations such as those of Kapustinskii1 and the electron gas theory,2 but it required the advent of computers for simulation of solid state materials to become widespread. The developments that have led to the recent surge of interest in solid state modelling and the adoption of modelling by bench top chemists have been the vast increases in readily-available computer memory, processor speed and storage space, the development of efficient methods of calculation and the development of graphical interfaces linked to modelling software. It is now possible to run many jobs on a PC, using a graphical interface to construct a crystal and set up a run in a way that our organic chemist colleagues have been able to do for over a decade. As with organic modelling, some knowledge of the programs behind the interface leads to more appropriate use. Solid state modelling is now well-established and this review is long overdue. There have been several issues of journals devoted to the topic, more recent ones include the Journal of Materials Chemistry3 and the Journal of Solid State Chemistry4 and there has been a review on this topic in Annual Reports of Progress in Chemistry.5 The introductory article in the Journal of Solid State Chemistry issue gives a very good summary of the history of quantum mechanics applied to the solid state. In this review I shall look at the different approaches to modelling solids and at some examples of the use of such techniques. The field is now so wide that I am unable to cover all aspects and so I have put more emphasis on areas that interest me while trying to at least mention other aspects that are currently attracting modellers. As in other branches of chemistry there are both ab initio methods based on solving Schro¨dinger’s equation and methods based on classical physics. The latter methods are based around lattice energy calculations and I shall refer to them as interatomic potential methods. In the first section of this review, I shall discuss the development and current position of interatomic potential calculations. The first atomistic simulation programs for solids using interatomic potentials were developed in the 1970s.6 Zeolites and isolated point defects in solids were applications that illustrated the use of such methods early on. Work continues in these areas and other applications have been developed. One such is the use of these methods in molecular dynamics calculations as their efficiency makes possible calculations on large supercells or for relatively long lifetimes. Department of Chemistry and Analytical Sciences, The Open University

Chem. Modell., 2008, 5, 119–149 | 119 This journal is

c

The Royal Society of Chemistry 2008

The second section deals with ab initio methods. One way to approach solids is to start from the solid as a structureless box. The solutions to the Schro¨dinger equation for this are of course the well-known ‘particle-in-a-box’ wavefunctions eik r. Physicists have long used such functions to represent the valence electrons of solids, particularly for metals and semiconductors, and consider one of these methods (linear augmented plane waves, described in section 3.1) as the standard against which other methods should be tested. Such methods have not been popular with chemists until this century. A striking current trend is the adoption of plane wave methods by chemists. At the same time physicists are increasingly studying solids of chemical interest. In particular both chemists and physicists are interested in modelling solids with novel magnetic properties such as the manganates which exhibit colossal magnetoresistance. I have therefore spent some time discussing the use of these methods. An alternative approach is to describe the valence electrons by localised functions. In the linear combination of crystalline orbitals (LCCO) which is similar to the LCAO approach to molecules, the valence electrons are represented by sums of atomic orbitals. As well as being in some respects more familiar, this method is capable of all-electron calculations and in principle should be better for properties which need a good description of core electrons. For some of these properties however, extensions to the plane wave method are available which can cope with inner electrons around nuclei of interest. At present, the performance of both approaches is comparable. For some applications, it is useful to treat one part of the system rigorously and employ a more approximate technique for the remainder. This is a technique wellknown for studying active sites of enzymes. Here I consider its application to solid state catalysts where the more rigorous method can be applied to molecules on surfaces and in zeolite cages. In organic/biological chemistry the technique is usually referred to as QM/MM (quantum mechanics/molecular mechanics) and I retain this label with QM referring to ab initio methods and MM to interatomic potential methods. A technique that is increasingly popular is molecular dynamics. This enables the study of free energies and of the effects of changing temperature and pressure. This technique is notoriously computer resource-hungry but increases in storage capacity, memory and processor speed have made it more feasible and it is now possible to combine ab initio and molecular dynamics calculations. The next section is devoted to this and related topics. The now wide-spread use of solid state NMR techniques such as magic angle spinning (MAS) led to a need for calculations of NMR parameters such as chemical shifts and quadrupole coupling constants. A notable achievement this century has been the development of a method to calculate NMR chemical shielding in periodic solids. This will be described in the following section along with developments in the calculation of other properties. Finally I shall present some interesting applications of solid state modelling. Zeolites were one of the early success stories and there is still considerable interest in modelling these and related compounds such as ALPOs. Calculations in this area can predict new structures, determine the distribution of different cations over the available sites and throw light on the use of zeolites as catalysts by investigating the interaction of molecules with the framework. Another area that has been of interest since the development of this field is that of metal oxides. Simple metal oxides such as MgO provide a test bed for new methods. More complex oxides have attracted much interest for their commercially important properties—solid electrolytes, ferroelectrics, catalysts, semiconductors, superconductors, multiferroics. Relatively simple calculations can, for example, track the path of ions through ionic conductors and suggest alternative solids for fuel cells or batteries. Solids with interesting electrical and magnetic properties such as high TC superconductors and solids showing colossal magnetoresistance (CMR) have been 120 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

the focus of much work in recent years and I shall consider examples of studies in this area in a separate subsection. I shall however take my examples from solids of chemical interest and will not therefore cover metals, alloys and III–V semiconductors. I shall omit some areas which are currently flourishing and of interest to solid state chemists. These are surfaces, polymers, biological materials such as proteins and DNA, metal organic frameworks and organic crystals. These will perhaps be reviewed later. To keep this review to a reasonable length I have regretfully left out references to calculations on nanomaterials, although there is much current work in using solid state methods in this area, particularly on nanotubes. The last application I shall consider is the modelling of minerals. These are often complex solids formed under high pressures or temperatures or in the presence of water and are not easy to model. Nevertheless there is considerable work on minerals both those found on Earth and those on other planets and advances in molecular dynamics and related methods should boost this area. I have been selective in my choice of examples of application and apologise to those whose work I may have omitted either because it did not fall into the categories of compounds I have chosen or because it slipped by me in the huge number of papers on this topic being published. I have tried to include at least some examples of work from most of the large groups in this field. I hope that some of the omissions will be remedied in the next review in this series.

2. Interatomic potential methods Ten years ago when I attended a Faraday Discussion on Solid State Chemistry: New Opportunities from Computer Simulations,7 interatomic potential methods were well developed and the use of ab initio methods starting to become widespread. In his Introductory Lecture Prof. C. R. A. Catlow asked ‘With the continuing growth of the applicability of electronic structure techniques, can we see them as replacing interatomic potential based methods?’ His reply then was ‘there will be a continuing role for interatomic based potential based methods as the field moves to more complex systems.’ Over the last decade, ab initio electronic structure methods have progressed rapidly and for many applications plane-wave ab initio methods are now the first choice for calculations. Nevertheless that reply still holds true. This section will only give a brief overview of interatomic potential methods. The article by Gale and Rohl8 on the widely-used program GULP (General Utility Lattice Program) provides an extensive discussion of the methods used in this and similar programs with a comprehensive list of references. The basis of interatomic potential methods when applied to solids, particularly ionic solids, is the calculation of lattice energies. The starting point is to place each atom on a lattice site and assign it a charge. For simple ionic compounds, the formal charge is used. Perhaps surprisingly the formal charge has also proved successful in calculations on covalently-bonded networks such as zeolites and ALPOs, where the apparently unrealistic charges of +4 on Si and +5 on P are used. However in a study aimed at modelling the effect of radiation damage in solids, Thomas et al.9 found Mulliken charges from ab initio calculations gave a better description of TiO2. The use of formal charges has advantages when studying defects as the defect must necessarily have an integral charge. For covalently-bonded entities within the crystal such as ammonium, nitrate and sulfate ions and water molecules, the atoms within the molecule are assigned partial charges as in organic modelling with the overall charge on the molecule fixed at the ionic charge or at zero. The calculation of the energy due to the electrostatic interaction of all the ions is generally done using the Ewald method.10 Because the electrostatic interaction converges only slowly with distance, the Ewald method splits the summation into two parts. The short range part is evaluated in real space but the long range part is evaluated in reciprocal space. Modifications that accelerate the evaluation of the Ewald summation include the Chem. Modell., 2008, 5, 119–149 | 121 This journal is

c

The Royal Society of Chemistry 2008

particle-mesh approach11 and the fast multipole method.12,13 Wolf et al.14 have put forward an alternative real space-only summation. In most simulation calculations, the charges on the atoms are fixed. However geometry-dependent charges are used in some cases.15–17 Most such approaches have been based on the concept of electronegativity equalisation.18 The next set of terms usually added allow for van der Waals-type interactions between the ions. The most popular form of the potential in solid state modelling is the Buckingham potential A exp(r/r)C/r6 but there are other forms in use such as the Lennard-Jones rA12 rC6 , the general form A expðr=rÞ rCn and potentials derived rm from splined fits. For molecules and ions containing hydrogen, it is usual to include a Morse potential De((1 ea(rre))2 1) to describe the covalent bond between H and a heavier atom. Such potentials have also been used for molecular anions, for example the carbonate anion in calcite,19 perchlorate and perbromate20 and sulfate.21 Water and OH ions are important in many technological and mineralogical applications but it is difficult to find satisfactory potentials for these species. Chroneos et al.22 have put forward two transferrable new models for hydroxide ions, one using partial charges and one using formal charges. In place of the Morse potential, they employ a screened Coulomb potential for the O–H interaction, qO qH rOH V¼ ð1Þ 1 exp 4pe0 rOH r An idea of the range of available two-body potentials can be obtained from the list of those included in GULP.8 For solids containing covalent bonds, including salts of molecular ions and alumino-silicate framework and similar structures, potential terms akin to those used in molecular mechanics calculations on organic molecules are used. These are bond, angle and torsion terms. A simple harmonic model is often used, V = 12kb(r r0)2 +

1 2

k(y y0)2 + kt(1 S cos nf)

(2)

but there are other options. Many body potentials e.g. Sutton-Chen,23 Tersoff,24,25 Brenner26,27 can be used to describe metals and other continuous solids such as silicon and carbon. The Brenner potential has been particularly successful with fullerenes, carbon nanotubes and diamond. Erhart and Albe28 have derived an analytical potential based on Brenner’s work for carbon, silicon and silicon carbide. The Brenner and Tersoff potentials are examples of bond order potentials. These express the local binding energy between any pair of atoms/ions as the sum of a repulsive term and an attractive term that depends on the bond order between the two atoms. Because the bond order depends on the other neighbours of the two atoms, this apparently two-body potential is in fact many-body. An introduction and history of such potentials has recently been given by Finnis29 in an issue of Progress in Materials Science dedicated to David Pettifor. For a study of solid and liquid MgO Tangney and Scandolo30 derived a many body potential for ionic systems. Polarisability of ions is usually modelled via the shell model of Dick and Overhauser.31 The ion is divided into a point core and a massless spherical shell which interact with each other by a spring force. Two body interatomic potentials such as the Buckingham potential act on the shells. Both core and shell are charged, the sum of their charges being the formal charge of the ion. The distribution of charge between core and shell is purely empirical and can be quite counter-intuitive; it is not intended to represent an inner core and a shell of valence electrons. For example a +3 ion may have a shell charge of 4.97 and a core charge of 1.97. This model makes some allowance for the finite size of ions and can also model distortion of the electron cloud to a limited extent by allowing the shell centre to be non-coincident with the core position. It has been found also that this model can be used to 122 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

represent a degree of covalency.8 The shell model only deals with dipole polarisability. An alternative approach is to use a point ion dipolar polarisability; this has the advantage that it can easily be extended to higher order polarisabilities. Such an approach has been used for ionic materials by Wilson and Madden.32 An interesting recent development has been a method based on the angular overlap model for the inclusion of ligand field effects.33 The parameters in all these potentials are obtained in one of two ways. The first is by calculation. The earliest method of calculation was the electron gas model popularised by Gordon and Kim.34,35 This model divides the repulsion energy between two closed shell atoms into three terms—electrostatic energy, kinetic energy and exchange energy. Methods of obtaining these terms were reviewed by Wood and Pyper.36 This method has now been superseded by ab initio methods in which the geometry of pairs, clusters or periodic arrays of atoms is varied systematically. The variation of energy with distance is then used to fit the parameters in the chosen interatomic potential model. The second method of obtaining potentials is the empirical method in which the parameters are varied until a good fit to experimental cell constants and, wherever possible, other experimental data is obtained. Typical experimental data used for fitting are elastic constants and dielectric constants. Empirical potentials are often only a realistic approximation to the true potential over a limited range of interatomic distances. Fig. 1 compares energy curves for a typical empirical potential and potentials derived from ab initio calculations on the rock salt structure. The potentials have similar values at typical interionic distances found in crystals but care should be taken in using empirical potentials if the problem involves the ions moving to distances that are much smaller or larger.

Fig. 1 Calculated and empirical potentials for the O2–O2 interaction. From J. Harding, Computer Simulation of Defects in Solids, in Reports on Progress in Physics, 1990, 53, 1403. Reproduced with permission of the publisher, IOP Publishing, Bristol.

Chem. Modell., 2008, 5, 119–149 | 123 This journal is

c

The Royal Society of Chemistry 2008

However empirical potentials have proved remarkably successful and a large number of studies of oxides have used empirical potentials based on the O–O potential of Lewis and Catlow38 derived from Hartree–Fock calculations on two O2 ions. Parker et al.39 have shown that quantum mechanical studies can be used to screen possible empirical potential sets and suggest improvements. The potentials obtained by this procedure can then be used with some confidence in problems where ab initio calculations would be prohibitively expensive, for example molecular dynamics runs. An interesting development is the use of a neural network to fit empirical potentials from a large amount of data with a genetic algorithm.40 A review by Harding37 contains a discussion of the advantages and disadvantages of empirical and ab initio fitting and although it was published in 1990 I would still recommend it. While for many applications, ab initio methods are now preferred as they are more accurate and there is no longer such a large penalty in computer resource, one area in which interatomic potentials still have an advantage is in studying the effect of isolated defects. Defect calculations with programs using interatomic potentials generally use the Mott-Littleton approach.41 In this the solid is divided into two spherical regions. In the region immediately surrounding the defect, the defect interacts with the surrounding ions explicitly. The interaction of the ions in region 2 with the defect is approximated. Positions of ions in the bulk solid are allowed to relax to accommodate the defect which can be intrinsic or extrinsic. The concentration of intrinsic defects in crystals is often very small and such calculations are well-suited to determining the energetically most favourable intrinsic point defect. For a Schottky defect for example, this method is used to calculate the energies when one cation is removed from the lattice and when one anion is removed separately from the lattice. Subtracting the lattice energy gives the energy of a Schottky defect for widely-separated cations and anions, but it is of course possible that the vacancies are clustered and this is also easily studied using this method. Early examples of such studies include work by Lewis and Catlow42 on intrinsic defects in alkali halides and a study by Mackrodt and Stewart43 of defects in alkaline earth (Group 2) oxides. Because these calculations are relatively cheap, they are also a good way to produce a first guess for the mode of incorporation of extrinsic ions. The defect structure found this way can then be used as a starting point for Rietveld analysis of powder XRD patterns or to predict the pattern of substitution of ions with very similar XRD response e.g. F and O2. Islam44–46 has for example investigated the position of fluorine in a number of oxyhalides and we have used it in our own work to locate dopant ions in iron oxides.47–50 The Mott-Littleton approach is incorporated into several programs including METADISE,51 GULP.8 Details of the implementation of this scheme in the widely-used program GULP are discussed by Gale.8

3. Ab initio methods The most striking advance in solid state modelling in chemistry in the last decade is the rapid increase in the use of plane wave methods, mostly because there are several readily-available programs and graphical interfaces have made them more userfriendly. I shall discuss both the basis and developments of these methods and of the alternative localised function approach, but first I shall mention some concepts that are common to both approaches. Both approaches treat the solid as an infinite three-dimensional array of unit cells. This enables Bloch’s theorem to be applied so that the electronic wave function of the solid can be written Cnk(r) = unk(r)eik r

(3)

where Cnk(r) is the nth wavefunction for wavenumber k. unk(r) has the periodicity of the unit cell. In the plane wave approach, unk(r) is chosen to be a sum of plane 124 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

waves eiG r where the wavevectors G are chosen to be commensurate with the cell. Thus the electronic wavefunction in this approach becomes X Cnk ðrÞ ¼ unk ðGÞeiGr eikr ð4Þ G

In the localised function approach, unk(r) is chosen to be a linear combination of functions localised on the atoms in the reference cell X X Cnk ðrÞ ¼ ain ðkÞ ji ðr Ai gÞeikg ð5Þ i

g

where ji(r Ai g) is a local function (such as an atomic orbital) centred on the nucleus whose coordinates are given by Ai in the zero reference cell. I shall introduce here a concept that is common to all methods. The Brillouin zone (BZ) is described in most solid state physics textbooks. It is defined as a reciprocal lattice cell bounded by the planes that are perpendicular bisectors of the vectors from the origin to the reciprocal lattice points. Fig. 2 illustrates the first BZ cell for a hexagonal lattice. To evaluate energies and electron densities in ab initio methods, it is necessary to integrate over the Brillouin zone. The integrations needed are performed over a set of discrete points within the BZ. For insulators, it is only necessary to use a few special points thus cutting down on computational time. The most common method for choosing these points, used in many of the packages described below, is the Monkhorst-Pack method.52 The set of points generated by this approach for a threedimensional system is given by 3 X 2ni Ni 1 Gi 2Ni i

where Gi represents the primitive vectors of the reciprocal lattice, ni = 1,2,3, . . . ,Ni and Ni is selected by the user. For many systems, small values of Ni can produce reasonable results. Another reason for introducing the Brillouin zone is that it is common in physics papers to include diagrams of the change in energy as a function of k along a path from one point in the Brillouin zone to another, for example from G to K. Modern diagrams tend to show several paths and a large number of energy levels giving them the appearance of spaghetti. However gaps between the valence

Fig. 2 Brillouin zone for a hexagonal unit cell showing some of the special points. G is the centre point.

Chem. Modell., 2008, 5, 119–149 | 125 This journal is

c

The Royal Society of Chemistry 2008

levels and the conduction levels can be distinguished and give the variation of band gap with k, and partially filled bands can be seen crossing the Fermi level. A recent trend has been to use Wannier functions53 to calculate properties. Wannier functions are orthonormal localized functions spanning the same space as the eigenstates of a particular band or group of bands and are Fourier transforms of the Bloch eigenstates. For one band, i, Wannier functions, wi, are given by Z Vcell dkeikTm cki ðrÞ ð6Þ wi ðr Tm Þ ¼ ð2pÞ3 BZ where Tm is a lattice point in the unit cell associated with wi, Vcell is the volume and BZ refers to integration over the Brillouin zone. Highly localised Wannier functions play the role in periodic solids of localised orbitals, such as Boy’s orbitals, in molecular orbital theory. One widely-used method of obtaining maximally-localised Wannier functions is to minimize the mean square spread, O, where O¼

N X

½hr2 ii hri2i :

ð7Þ

i¼1

N is the number of bands considered in forming the Wannier function. 3.1 Plane wave methods Plane wave methods have long been the preferred approach to solid state calculations for physicists and much of the literature in this field is written for physicists. However it is now becoming popular among the chemical community with plane wave methods included for example in the graphics-led solid state modelling package Materials Studios and so I have included brief descriptions of the various approaches here. A clear and comprehensive textbook in this area is that by Martin.54 There are two main approaches to plane wave methods, both using density functional theory (DFT)—pseudopotential methods and linear methods. Pseudopotential methods are implemented in the program CASTEP55 and in the GNU-licenced ABINIT program.56,57 A good introduction to the background theory and philosophy of implementation is given in the paper describing the previous version of CASTEP.58 An impressive and unusual feature of the 2002 version of CASTEP is that the code was completely redesigned. Most large computational chemistry packages have evolved from codes written in the 1970s or 1980s in Fortran77 (or earlier Fortran versions). As they evolved, more efficient algorithms for parts of the calculation were included, extra properties and the ability to optimise structures added and the code modularised. Added functionality is still dealt with today by adding modules to the existing code. The CASTEP programmers however decided to redesign the code from scratch taking advantage of object oriented design to produce a program that would be easier to debug and update. As in all plane wave methods, the set of plane waves eiG r form the basis set for the valence electrons. Note that they do not contain any reference to the particular elements present in the solid. This has both advantages and disadvantages. One advantage is that there is no need to develop basis sets tailored to each element. The basis set is chosen for any system simply by choosing the number of plane waves to be used. In practice, this number is determined by selecting a cut-off energy EC where EC is the energy of a free electron whose wavefunction has the same energy as the largest wave vector in the basis set, EC ¼

h2 ðGC þ kÞ2 : 2m

126 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

ð8Þ

Sufficient plane waves have to be chosen to cover the whole unit cell including empty volumes. A disadvantage stemming from this is that many plane waves are needed to fill the empty volume for structures with cages, tunnels or irregular shaped ions or molecules, making calculations on such structures expensive. Core electrons are represented by pseudopotentials. These are determined by applying the restraints that (1) the eigenvalues of the valence orbitals in the presence of the pseudopotential must be the same as those obtained in the all-electron case. (2) the continuity of the wavefunctions and their first derivatives must be continuous across the core boundary. In norm-conserving pseudopotentials a third restraint—that integrating the charge in the core region must give the same answer as for the all-electron case— is applied. This ensures that scattering properties remain correct to linear order.59 Pseudopotentials may be derived separately for wavefunctions of different angular momentum quantum number, l. Such sets are known as non-local. In recent years ultrasoft potentials, introduced by Vanderbilt,60,61 have become popular. These are generated by relaxing the norm-conservation restraint and to compensate for the loss of the guarantee that the scattering properties remain correct to linear order, several reference energies are used with a set of projectors for each energy. Such potentials were introduced because of the difficulty of representing transition metals and second row (Li–Ne) elements by norm-conserving pseudopotentials for a plane wave basis. Computational advances which have led to pseudopotential methods becoming efficient are Fast Fourier Transfoms and the approach of Car and Parrinello62 in which wavefunction coefficients are treated as dynamical variables. Parts of the calculations on periodic systems are easier to calculate in real space and others in reciprocal space. Fourier transforms are needed to convert between the two spaces. The introduction of Fast Fourier Transforms substantially reduced the computational costs of this conversion. The key feature of the Car-Parrinello approach is the use of minimisation rather than diagonalisation of the Kohn–Sham equations to reach the ground state. Linear methods originated in the augmented plane wave (APW) method of Slater63 and were developed initially by Andersen.64 Like the pseudopotential method, the linear augmented plane wave (LAPW) approach uses density functional theory, but the ionic cores are not represented by pseudopotentials. Instead each core is modelled by a sphere inside which the wave function has the form of a linear combination of radial functions times spherical harmonics. X jkn ¼ ðAlm ðkn Þul ðr; El Þ þ Blm ðkn Þ½dul ðr; El Þ=dEEl ÞYlm ðrÞ ð9Þ lm

where Alm and Blm are obtained by matching basis functions of the same kn inside and outside the atomic spheres. ul(r,El) is the value at the origin of the solution of the radial Schro¨dinger equation for energy El and the spherical part of the potential inside the atomic sphere. In the full potential linear augmented plane wave method (FLAPW)65 the electron density inside the sphere is replaced by a smoothed density with the exact multipole moments. PAW (projector augmented-wave)66 is a method designed to have the flexibility of FLAPW methods with the simplicity of pseudopotential methods. In the PAW method, all-electron one electron wave functions, Cn, are derived from pseudo-one electron wave ~ n, where n refers to the nth band by means of a linear transformation functions, C X ~ ih~ ~ ni þ ~ jCn i ¼ jC ðjfi i jf ð10Þ i pi jjCn iÞ i

The all-electron partial waves, fi, are chosen to be solutions of the scalar relativistic Schro¨dinger equation for a non-spin-polarised atom with a specific angular Chem. Modell., 2008, 5, 119–149 | 127 This journal is

c

The Royal Society of Chemistry 2008

~ n are the variational momentum and at a specific energy in the valence region. C quantities and are usually expanded in plane waves. Although described as an allelectron method, the core wavefunctions are usually frozen. A recent paper67 suggests a method of relaxing the frozen core restraint without increasing substantially the computational time. This solves for a self-consistent core charge density while re-establishing the PAW potential parameters at each minimisation step in such a way that the plane wave valence functions remain orthogonal to the core states. This obviates the need to increase the number of electrons in the plane wave part of the Hamiltonian. Mortensen et al.68 have developed a grid-based implementation of PAW. For DFT methods, it is necessary to calculate the electron density and because this is p|C2|, in plane wave methods it involves N2G operations where NG is the number of G vectors used. This can be reduced by obtaining the Bloch states on a grid of NB points in real space when the density can then be found in NB operations. Usually the wave function c(G) is converted via a Fast Fourier Transform to a real space wave function. This is then squared and the density summed over all electrons and all wave vectors, k. A second Fast Fourier Transform converts the electron density back to reciprocal space. Mortenson realised that if the wavefunction is described by a grid in real space then the Fast Fourier Transform can be avoided. By carrying out all calculations in real space and using grids Mortensen et al. were able to put forward a method that could make use of massively-parallel computers and reduce the time for calculations on large systems.

3.2 Methods using localised functions The augmented spherical wave method (ASW) describes the valence electrons by a set of spherical waves centred round ionic cores described by atomic orbital-like functions.69 The linear combination of crystalline orbitals method, as implemented in CRYSTAL06,70 for example, is based on the use of atomic orbitals. In contrast to the methods described in the previous section, both Hartree–Fock (HF) and DFT calculations are possible. The Gaussian basis sets are similar to molecular basis sets but there are crucial differences, one of the most important being that diffuse functions should generally be avoided. Such functions are known to improve molecular calculations especially of negative ions but in a periodic solid they are very time-consuming and can lead to unphysical states.70 The Hartree–Fock (HF) method is not much used these days but HF calculations often converge more easily than DFT calculations and give the correct energy order of spin states so that they can be used for a cheaper preliminary survey of spin states before using a more accurate method to investigate band structure or density of states. BAND71–73 from the Amsterdam Density Functional (ADF) group uses atomic orbital basis sets but these are composed of either Slater or numerical functions rather than Gaussians. The program includes relativistic effects at the zero order relativistic approximation (ZORA) level and it is possible to include spin–orbit coupling. LSDA and GGA functionals are available. This method ought to deal well with strongly-correlated systems but there does not seem to be much work in this area using BAND, possibly because earlier versions lacked the functionality of other solid state programs. The updated BAND2007 has more functionality, for example geometry optimisation, but is too recent for papers to have been published using the results of this program. SIESTA74 is an order-N method that uses norm-conserving pseudopotentials (see above) with strictly localised basis sets. The basis orbitals are products of a numerical radial function and a spherical harmonic and are zero beyond a certain radius from the atomic centre. The radial function is defined by a cubic spline interpolation. The cut-off radius can vary for different radial functions. A more recent reference75 gives a fuller description of the program and some applications. 128 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

3.3 Density functionals and strongly-correlated solids As in molecular calculations, DFT solid state calculations can use both local density approximation (LDA) and generalised gradient approximation (GGA) density functionals. However in general these methods underestimate the band gap and often do not describe solids where there is strong correlation very well. Familiar examples of this are the monoxides MnO and NiO whose properties are more easily explained by assuming the 3d electrons are localised on the transition metal centres rather than occupying delocalised bands. This was attributed by Liechstenstein et al.76 to the existence of orbital polarisation driven by screened on-site Coulomb interactions (repulsion between d electrons on the same ion) resulting in the survival of atomic characteristics of the electronic states in the region of the transition metal ions. A 2003 review by Matar77 discusses strongly correlated oxides with particular reference to the ASW method. There has been much recent work on ways to overcome this problem. A development in pseudopotential programs has been the inclusion of non-local exchange–correlation functionals. Examples of such functionals are the screened exchange (sX),78,79 exact exchange (EEX or EXX),80–82 weight density approximation (WDA),83–85 metaGGA86–88 and Hartree–Fock.89 A good discussion of these functionals is given in the thesis of M. C. Gibson.90 The method that has been widely adopted to solve this problem for linear methods is the addition of a term describing the Coulomb interaction between d electrons on the same ion.91–94 Such calculations are designated +U. U is a spherically averaged Hubbard parameter describing the energy required for adding an extra d electron to an atom. Filled d orbitals localised on one particular site are moved to lower energies by (U J)/2 and empty orbitals raised by (U J)/2 where J is a parameter describing the screened exchange interaction. The +U method can be used with both LDA and GGA calculations. The optimum value of U varies from system to system and is basis set dependent so that each researcher must determine the best value for their problem. Rather than looking for a universal value of U, it is possible to take the view that the value of U needed provides information on the degree of localisation in the solid. Mosey and Carter95 suggest calculating U and J from HF orbitals and show that this technique gives values that describe Cr2O3 better than conventional DFT. An alternative to GGA (or LDA) +U is the use of hybrid density functionals such as B3LYP.96–98 These seem better at describing band gaps and other properties of highly correlated solids than pure DFT. Franchini et al.99 obtained very good results for the structural, electronic and magnetic properties of MnO using the hybrid functional PBE0100,101 in the PAW-based program VASP.102 PBE0 is derived from the Perdew, Burke and Ernzerhof (GGA) functional103,104 (PBE) which is composed of the Perdew-Wang correlation functional and an exchange functional given by ExPBE ¼

bx2 1 þ ax2

ð11Þ

where a and b are fixed and x ¼ rrr 4=3 with r the electron density. The 0 family of functionals have no adjustable parameters in the expression for the mixing of functionals to form the hybrid functional, that is GGA E0XC = EGGA + 14(EHF ) XC X EX

(12)

This functional which is effectively the PBE functional with 25% of exact Hartree– Fock exchange included, performed better than GGA + U. Alkauskas and Pasquarello105 found that PBE0 increased the calculated band gap of a-quartz to 8.3 eV close to the experimental value of B9 eV and a great improvement on the value of 5.8 eV obtained with PBE. Blaha’s group106 showed that the hybrid functionals PBE0 and B3PW91 gave comparable or superior results to LDA + U for the transition metal monoxides MnO, FeO and CoO. Chem. Modell., 2008, 5, 119–149 | 129 This journal is

c

The Royal Society of Chemistry 2008

It would be interesting to see how PBE0 compares with GGA + U for other systems and with B3LYP. Hybrid functionals have also proved useful in localised orbital approaches. The B3LYP functional used in CRYSTAL0670 can be written as LSDA + x(EHF ) + yDEGGA Eex = ELSDA ex ex Eex ex

(13)

with x = 0.2 and y = 0.9. Feng and Harrison107 have suggested that a value of x = 0.35 is better for reproducing magnetic coupling constants and work by Catlow’s group5,108 found that x = 0.6 was needed to reproduce the structural and polarisation properties of BaTiO3. In our own work109 we found x = 0.1 reproduced the band gaps of the strongly-correlated oxides a-Fe2O3 and a-Cr2O3. However Montanari et al.110 found that the standard B3LYP functional produced results very close to experiment for the vibrational frequencies of a-Al2O3 and Wilson et al.111 considered this functional good for describing the structure, magnetic moment and band gap of ilmenite (FeTiO3). Hybrid functionals other than B3LYP have also proved successful in localised orbital methods. Yang and Dolg112 found that the hybrid functional B3PW including spin–orbit coupling reproduced the band gap of BiB3O6 well, while Prodan et al.113 found that the HSE (Heyd, Scuseria and Emzerhof)114 screened coulomb hybrid potential described the oxides of uranium and plutonium well. The performance of hybrid functionals is discussed by Cora et al.115 Just as there is no universal value of U for the +U methods, there is not yet an optimised hybrid functional for all solid state properties. In general, though, the use of hybrid functionals does seem to be a way forward for calculating solid state properties of correlated solids. A third approach to the problem of strongly correlated solids is Novak’s Orbital Polarisation Method.116 Finally a method which shows promise for the future is dynamic mean field theory. Dynamical mean field theory uses an approximation to the local spectral density functional (rather than energy density functional) and a set of correlated local orbitals. For solids this local description is combined with a periodic description such as DFT using LDA to provide a method of dealing with both localised and delocalised electrons.117–119 Anisimov et al.117 applied this method to the photoemission spectrum of La1xSrxTiO3. Kotliar120 has recently reviewed the use of dynamical mean-field theory for strongly correlated systems and shown that it models a variety of systems well including metallic Pu, NiO, manganates and fullerenes. Kuchinskii et al.121 have extended DMFT for correlated systems by including a self energy term S(k). This method shows promise for describing cuprate superconductors.

4. QM/MM Combined quantum mechanical/molecular mechanics calculations have become popular over the last decade or so in solid state modelling. Typically a molecule attaching to a surface or the side of a cage structure or an isolated defect is treated quantum mechanically as a cluster embedded in a bulk solid described by an interatomic potential method. One of the earliest examples of this approach was Vail’s work on F-centres which began in 1983122,123 and which treated a cluster around the F-centre quantum-mechanically and the rest of the solid via an interatomic potential method. The quantum mechanical and molecular mechanics methods used are those we have discussed earlier; molecular mechanics in the case of ionic solids and zeolites generally meaning the use of interatomic potential methods. The art of QM/MM is how to deal with the boundary of the two regions. One method used for ionic materials is to have a boundary region which features in both the quantum mechanical and molecular mechanics calculations. The addition boundary scheme 130 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

which has been adapted widely for ionic materials, gives the total energy as a sum of three terms inner,boundary Etot + Eouter,boundary + Einner,outer,boundary QM/MM = EQM MM QM/MM

(14)

where the first term represents the energy due to interacting quantum particles, the second that due to particles interacting through molecular mechanics and the third the interaction of quantum particles with particles described by molecular mechanics. The third term generally considers the effect of the charge and sometimes the polarisability of one set of particles on the other. It is important to match the charges derived from quantum mechanics with those derived from molecular mechanics. Since programs such as GULP generally use formal charges, this means they work best with highly ionic materials. An alternative to the addition method is the subtraction method. In this the entire system is treated using molecular mechanics and then quantum mechanics calculations and molecular mechanics calculations performed on the inner region or cluster. Usually there is no boundary region but link atoms are added to the inner region or cluster to make a molecule-like entity. The energy is then given by inner,outer Etot + Einner,link Einner,link QM/MM = EMM QM MM

(15)

Link schemes are often used in modelling biological materials such as proteins but have been used for zeolites by, for example, Sauer’s group.124–126 Zeolites are suited to the link method because of their covalent framework. Problems such as reactions of molecules on a catalytic surface are better handled using boundary methods. For all these methods, there are a number of ways of treating the QM/MM interaction. The least sophisticated is to simply regard the QM region or cluster as a free ‘molecule’. Calculations on the MM region are then modified by including point charges derived from QM. More realistic is to solve for the QM region in the presence of a set of point charges representing the MM region. These charges can be partial charges used in the MM scheme employed but better results may require rederivation of the charges. Sauer’s group, for example, in work on zeolites127,128 rederived the charges from a standard force field by fitting to electrostatic potentials. Other groups have used charge equilibration schemes.129,130 Beyond this there are calculations that allow for the polarisation of the MM area due to the QM charge distribution. Finally the QM and MM calculations can be made self-consistent either through iterative solution of both calculations or by matrix inversion (direct reaction field method131,132). Sherwood133 has written a useful guide to QM/MM methods for both solid state and biological systems. The papers134,135 describing the ChemShell program also contain a useful brief description of the QM/MM approach.

5. Molecular dynamics and related methods Static minimization methods, whether ab initio or interatomic potential, strictly describe solids at 0 K. However it is often important to model phase changes and the behaviour of solids under high temperatures and pressures especially in geochemistry. Such modelling requires methods that sample more than one configuration and/or are time-dependent. The least expensive approach is quasiharmonic lattice dynamics. The quasiharmonic approximation136 assumes that the free energy of a crystal can be written as the sum of a static contribution and a vibrational contribution. For periodic structures, this vibrational contribution is given by X 1 ð16Þ Fvib ¼ 2hn j ðkÞ þ kB T lnð1 expðhn j ðkÞ=kB TÞÞ k;j

where nj(k) is the frequency of a vibrational mode with wavevector, k. Chem. Modell., 2008, 5, 119–149 | 131 This journal is

c

The Royal Society of Chemistry 2008

The equilibrium structure is obtained by minimising the total free energy with respect to a set of internal strain variables. The static potential energy and vibrational frequencies can be calculated using either interatomic potential or ab initio methods. Despite the increasing popularity of molecular dynamics methods, lattice dynamics is still used to investigate solids at high temperatures or undergoing phase transitions. For example, studies in the period 2005-7 included the phase transition of MgSiO3 from perovskite,137 thermal expansion of CaMoO4138 and the origin of negative thermal expansion.139 Monte Carlo methods generate a large number of structures and calculate properties over a weighted average of these. Starting from a reasonable structure, atoms are displaced/removed/added and the energy of the new structure calculated. The most common method for choosing which structures to include in the calculation of properties is Metropolis sampling.140 In this the probability that a structure is selected or rejected depends on a Boltzmann-weighted probability, exp(E/kBT), where E is the energy relative to the initial configuration. Selection of a change from one structure to the next that is neither too small nor too large is crucial to efficient calculation. Molecular dynamics, which is used extensively in modelling biological systems, solves Newton’s equations of motion with the system subject to forces derived from the potentials between atoms. The system is followed as a function of time and as well as time-averaged properties, time-dependent properties such as diffusion can be studied. The coordinates and energy of the system are followed by solving the equations of motion numerically using discrete time steps. The time step is typically of the order of femtoseconds. Using a suitable algorithm such as that due to Verlet,141 simulations remain stable for long runs; these days typically ten or so nanoseconds. The calculations can be run for different temperatures and under constant pressure or constant volume restraints. It is thus possible to obtain free energies. Details of Molecular Dynamics can be found in the classic text book of Allen and Tildesley.142 Molecular dynamics calculations are computationally expensive and are usually performed using interatomic potential methods rather than ab initio methods. For example Martin et al.143 undertook studies of inorganic materials, particularly surfaces, using the DL_POLY144 code. One of their examples is a very nice study of the migration of oxide ions in ceria both in the bulk and in the 111 surface. By setting an oxide ion off towards a vacancy at constant velocity, they obtained free energies of activation and pathways for several mechanisms. The activation energies were considerably modified by the presence of a Ce3+ ion adjacent to the vacancy thus underlining the importance of considering defect clusters as well as isolated defects in these calculations. A study by Kerisit and Rosso145 illustrates how the solid state modeller is not confined to one of the available techniques but can choose different techniques to tackle different aspects of a problem. They used a combination of molecular dynamics with interatomic potentials, periodic Hartree–Fock calculations and ab initio cluster calculations to study charge transfer in FeO. They used molecular dynamics to calculate the reorganisation energy and free energy of reaction and to provide transition state configurations. Ab initio methods were used to calculate electronic coupling. Using molecular dynamics to study reaction rates involves choosing a suitable reaction coordinate. This is not always easy and Dellago et al.146–149 proposed a method based on Monte Carlo sampling of the Molecular Dynamics pathway. van Erp et al.150–152 have more recently introduced an algorithm that makes transition path sampling more efficient, the Transition Interface Sampling technique. Mohn and Stølen153 have suggested a route using a genetic algorithm154 and energy minimizations for dealing with highly disordered systems and show that it leads to accurate calculations of thermodynamic properties when only a small fraction of energy minima are thermally accessible. The hybrid Monte Carlo method of Allan155 combined Monte Carlo and molecular dynamics steps within the same simulation. Allan et al.156 have discussed 132 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

the relevant merits and limitations of lattice dynamics, Monte Carlo and molecular dynamics, illustrated by examples.

6. Properties Calculated properties of solids include structural, optical, electronic, magnetic and spectroscopic properties. An important aspect of solids is the bulk magnetisation. Many solid transition metal compounds can be considered as having localised net spins on the metal centre. These spins couple ferro- or antiferromagnetically with spins on adjacent metal centres. Spin-polarised ab initio calculations permit the setting up of ferro-, ferri- or antiferromagnetic states by setting the sign of the spin on magnetic centres in the unit cell. In cases where the crystallographic unit cell contains only one atom of a particular type, it becomes necessary to use supercells. The classic example of this is NiO where in order to reproduce the observed ground state antiferromagnetism it is necessary to use the magnetic supercell found from neutron diffraction studies. Electronic structure calculations strictly deal with the ground state at 0 K so that such calculations generally give an ordered magnetic state even though the solid may be above its Curie or Neel temperature at room temperature and hence paramagnetic. The energy difference between ferro- and antiferro-magnetic states can however give information on the temperature at which the onset of paramagnetism occurs. In contrast to this picture of isolated spins interacting via spin–spin coupling (Ising model), is the magnetism of metals in which the magnetic susceptibility arises from alignment of spins in delocalised bands. Magnetic moments can be extracted from population analysis. Hartree–Fock magnetic moments are often closer to experimental values than DFT values are. The nature of the electronic conductivity (insulator, semiconductor, metallic conductor) and band gap values are obtained from the band structure of the solid. For example, a recent paper by Erhart et al.157 used DFT +U calculations to investigate the band gap of indium oxide. Optical measurements suggested an indirect band gap around 1 eV less than the direct band gap at G, however they concluded that this observation could not be explained on the basis of the band structure of the defect free solid. More difficult to calculate are the properties which depend on the response of the solid to an outside influence (stress, electric field, magnetic field, radiation). Elastic constants are obtained by considering the response of the crystal to deformation. Interatomic potential methods often provide good values for these and indeed experimental elastic constants are often used in fitting the potential parameters. Force constants for lattice vibrations (phonons) can be calculated from the energy as a function of atomic coordinates. In the frozen phonon approach, the energy is obtained explicitly as a function of the atom coordinates. Alternatively the deriva2E tive, @R@i @R can be calculated at the equilibrium geometry. j Properties depending on the response to electric and/or magnetic fields such as optical properties and NMR shielding constants have only been implemented relatively recently. A way to calculate electrical polarization in periodic solids, for example, was only found in the 1990s. Spontaneous polarisation, that is polarisation in the absence of an electric field, has been calculated using both a Wannier function approach and a Berry’s phase158 approach. Berry’s phase involves an adiabatic change around a closed loop which results in a change of phase without change in energy. A recent paper by Ferretti et al.159 used the PAW method with ultrasoft pseudopotentials and Wannier functions to calculate the spontaneous polarisation of AlN in its wurtzite phase. Gajdosˇ et al.160 have put forward a method of calculating dielectric tensors using density functional perturbation theory which they extended for the PAW method. Calculations of NMR shielding is difficult for molecules and more so for periodic solids. With the advent of modern solid state NMR techniques such as MAS (magic angle spinning), many experimental data have become available and computational Chem. Modell., 2008, 5, 119–149 | 133 This journal is

c

The Royal Society of Chemistry 2008

chemists naturally wanted to complement these results. Until a decade ago, however, the only available approach to calculating NMR shielding in solids was to use clusters. The results of course depended strongly on the cluster chosen. In 1996, Mauri et al.161 put forward a method for calculating shielding in periodic solids. Their innovation was to modulate the uniform external field with a finite wave vector, q, giving B(r) - B(r)eiq r and then take the limit of the perturbation energy as q - 0. This group went on to use PAW in conjunction with an approach similar to the gauge invariant (or including) atomic orbital (GIAO) approach used for molecules to calculate NMR chemical shifts in solids.162,163 This approach is now available in NMRCASTEP part of the Materials Studios Package. It should be noted that the approach in NMRCASTEP uses pseudopotentials and therefore does not calculate the contribution to the shielding from core electrons. Sebastiani and Parrinello164 proposed a different approach. They used the R = r variant of the CSGT (continuous set of gauge transformations) method and locally maximised Wannier orbitals. For insulators, the Wannier function decays exponentially and it is therefore possible to choose virtual unit cells whose walls lie in regions where the density for a particular orbital is close to zero. Sebastiani and Parrinello define the position operator as running from l/2 to +l/2 across the virtual cell, then dropping back to –l/2 at the cell wall. Their method is claimed to be more efficient than that of Mauri for systems with large unit cells but again uses pseudopotentials. It has been implemented in the CPMD (Car-Parrinello Molecular Dynamics) package.165 Mauri’s group have published a number of papers on calculated 17O NMR parameters.166–168 Ref. 167 is particularly interesting as it considers vibrational effects on the NMR shielding by averaging the chemical shift over fluctuations of the nuclear positions. They show that vibrational corrections are crucial to reproducing the temperature dependence of the chemical shift. Truflandier et al.169 have published the first periodic calculations of NMR shielding for a transition metal nucleus, albeit on a d0 system (VO43). A number of studies of optical properties of chalcogenides have appeared recently. Weng et al.170 studied magneto-optical Kerr effects in transition metal chalcogenides, Singh et al.171 investigated optical and magneto-optical properties of europium chalcogenides, Reshak et al.172 studied linear and non-linear optical properties of zinc chalcogenides. These studies employed FLAPW calculations and were motivated by the use or potential use of these semiconductors in electronic devices such as switches, light beam addressable memory systems or high density storage. Building on the use of Wannier functions to calculate properties, a very recent paper173 uses both electron Wannier functions and phonon Wannier functions (that is localised orbitals and vibrations) to calculate electron–phonon interaction, a property thought to be important in high TC superconductors.

7. Applications 7.1 Zeolites, ALPOs and other microporous/nanoporous materials Zeolites are now defined as solids that possess a framework of tetrahedra which are all corner-sharing and include a degree of openness such as channels or cavities. Each framework type is issued a unique three letter code by the Structure Commission of the International Zeolite Association.174 At the end of February 2007, there were 176 framework types. This definition does not specify atom types. The original zeolites were alumino-silicate minerals in which the frame work was composed of SiO2 with some Si replaced by Al and the charge balance restored by the presence of other cations within the channels or cavities of the structures. These 134 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

proved an ideal problem for interatomic potential methods but difficult for ab initio methods. The unit cell is often large thus making calculations computationally expensive and random substitution of Si by Al is easy to represent by interatomic potential methods but difficult for ab initio methods. There have thus been many interatomic potential studies on these solids, particularly by Catlow and his group, and there have been several reviews and books175,176 on this subject. An extensive discussion of the work of Catlow’s group in modelling and predicting zeolite structures is given in a recent Annual Reports of Progress in Chemistry.5 Nonethelesss there have been ab initio studies of these solids and these are likely to increase in future due to the greater computing power available. Early work by the CRYSTAL group employed Hartree–Fock calculations to study zeolites, for example silico-chabazite.177 There have also been a number of DFT studies. Corminboeuf178 has discussed the application of DFT to zeolites and concluded that the way forward was ab initio molecular dynamics. Nachtigall179 reviewed the application of quantum mechanical methods to zeolites. Poulet et al.180 discussed the relative merits of linear (VASP) and localised orbital (SIESTA) methods as applied to the related aluminium phosphate structures (ALPOs). QM/MM methods are useful for considering site occupancy of atoms substituting for Si and positions of molecules and counterions in the cages of these materials. As noted in section 4, QM/MM has been used extensively for zeolites by Sauer’s group. Modelling of zeolites and other microporous materials is mainly focussed on three aspects—structural studies, studies of the interaction of molecules or ions with the materials (particularly in relation to their use as catalysts) and templating. Structural studies can include topological studies, crystallisation and the position of substituted atoms in the framework. One basic question is the number of possible zeolitic frameworks and topological techniques have been used to tackle this, the zeolite framework being viewed as a 4-connected net. A recent paper by O’Keeffe et al.181 provides a clear discussion of the use of tiling in designing materials. Catlow5 in his Annual Report gives references to reviews of earlier work and summarises work in this field from 1990–2005. Foster et al.182 identified possible binodal (two types of tetrahedral sites) structures using tiling theory. These mathematically possible structures were then investigated using interatomic potential methods resulting in the identification of 98 previously unknown toplogies. Some of these were not feasible zeolite structures but a number of very open structures were noted which the authors suggest may be feasible structures for elements other than those normally found in zeolites. An example of these is the D3R (double 3 ring) family containing trigonal prisms (Fig. 3). Double 4 rings (D4R) have also attracted attention. Sastre183 has suggested, based on interatomic potential calculations with the Vessal-Leslie-Catlow184 force field, that D4Rs in some zeolite frameworks are less strained than previously thought. Kamakoti and Barckholtz185 used LDA DFT calculations to study the role of germanium in stabilising D4Rs. Auerbach186 has published a useful, short review of the insights gained from molecular modelling of zeolite formation. Mora-Fonz187 has investigated the role of solvation and pH in the nucleation of silicaceous zeolites. An article by Agger et al.188 in an issue of Journal of Crystal Growth dedicated to new advances in crystal growth and nucleation, uses interatomic potential methods to study the crystal growth of analcime by considering low energy hydrated surfaces. van Erp et al.189 propose the use of transition interface sampling (see section 5) for theoretical studies of the synthesis of zeolites as the efficiency of this method is less sensitive to the reaction coordinates chosen than conventional free energy techniques. This could be an advantage in the study of zeolite synthesis because the reaction pathway is usually difficult to find. Modellers often use silica zeolites but most zeolites contain other cations as well as silicon in the framework. Recent work by Gale190 uses ab initio methods and molecular dynamics to study the location of titanium within TS-1. To et al.191 Chem. Modell., 2008, 5, 119–149 | 135 This journal is

c

The Royal Society of Chemistry 2008

Fig. 3 The most feasible structure of the D3R family [adapted from ref. 182. Reproduced with permission of the International Union of Crystallography (http://journals.iucr.org/)].

employed QM/MM methods to investigate the active site in this zeolite. They concluded that a tetrapodal site (Ti bonded to four O–Si bonds) would be the substitutional site under anhydrous conditions, but that on hydration tripodal structures would prevail and act as the active site for catalysis. Bulanek et al.192 used DFT to investigate the position of Cu+ in Cu-FER. Because zeolites are much used as catalysts, there have been many studies on the interaction of molecules with the framework. Recent studies (2005–2007) include the effect of incorporation of impurity anions into the zeolite DSP,193 adsorption of NO in iron-exchanged ferrierite,194 adsorption of CO in Li+-FER,195 adsorption of CO on Cu+ in Cu-FER,192 adsorption of methane in zeolites A and Y,196 methane permeation through zeolite membranes,197 reactions of 1,3-dimethylbenzene in zeolites198 and the separation of CO2 and CH4 in silicalite.199 Thompson et al.200 used Monte Carlo methods to study the absorption of HCN and methyl ethyl ketone in silicalite, mordenite and zeolite beta. Their motivation was the desire to explore the use of zeolites in removing methyl ethyl ketone (an irritant found in glues and varnishes) from industrial gases. The absorption was found to depend heavily on pore size and cation loading. In an extension of the QM/MM approach, Tuma and Sauer201 treat the protonation of isobutene in ferrierite by using MP2 calculations on the reaction site and hybrid DFT for the zeolite. Sholl202 has written a review of molecular dynamics applied to diffusion of molecules in nanoporous materials including zeolites and carbon nanotubes. Bougeard and Smirnov203 have reviewed the important, but difficult to model, area of water in nanoporous materials. They discuss both quantum mechanical and molecular dynamics studies of water in clays and zeolites. Ruiz-Salvador et al.204 used interatomic potential methods to study the effect of water, framework atoms and extra-framework cations on the structure of the zeolite Goosecreekite (GOO). A modification of the potential for water developed by de Leeuw205,206 was found to be suitable. This zeolite (GOO) contains an ordered arrangement of Al3+ on the framework sites and Ca2+ in the cavities. All water in the zeolite is coordinated to Ca2+ making modelling more tractable. Their model confirmed that the extra-framework cations were the structural driving force for collapse on dehydration. Organic molecules are traditionally used as structure-directing agents for zeolites and work in this area continues. Huang et al.207 studied organic cations in a 136 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

borophosphate network. Gomez-Hortiguela et al.208 studied fluorinated dibenzyldimethylammonium as a structure-directing agent for ALPOs. Sastre209 has used interatomic potential calculations to compare the roles of framework atom substitution, extra-framework anions and organic molecules as structure directing agents. 7.2 Metal oxides Simple metal oxides were the subject of much early work and provide test systems for new methods. There is still work to be done though as evidenced by recent papers on simple solids such as MgO,5,210,211 and ZnO.210,212–215 MgO remains popular as the test crystal for new methods; ZnO has received renewed attention due to increasing exploitation of its semiconducting properties in applications. Transition metal oxides remain problematic. These oxides are strongly correlated and as explained above these systems are not well described by pure DFT calculations; indeed in some cases such calculations predict a metallic state when the true ground state is an insulator. As an example of an apparently simple oxide that is difficult to model, Table 1 gives some results of ab initio calculations on the oxide a-Fe2O3. Both plane wave and LCCO methods can now reproduce the band gap and nature of the valence and conduction bands but only if recent improvements are used. Neither have yet managed to correctly predict both the band gap and the equilibrium structure. A further point to note is that the calculations in Table 1 refer to a-Fe2O3 at low temperatures. At the Morin transition the alignment of the spins with respect to the crystal axes changes. This effect can only be modelled by including spin–orbit coupling. Table 1 Calculated band gaps for a-Fe2O3 Method

a/A˚

Experimental

5.035 13.747 4.9 B2 5.4241 (55117 0 ) 5.112 13.820 4.74 11

HFa LSDA/fully relativistic ASWb LSDAc LSDA + Uc U = 2 eV LSDA + Ud U = 5 eV GGA (PAW)e GGA + Ue (PAW) U = 4 eV B3LYPf B3LYPf 10%

c/A˚ (a)

m/mB Eg/eV Nature of valence band

3.72 0.75 5.0347 13.7473 3.43 0.51 (fixed) (fixed) 3.84 1.42 5.345 (55113 0 ) 4.1

1.88

5.007 13.829

3.45 B0.3

5.067 13.882

4.11 B2

5.057 13.883

4.21 3.31

5.055 13.939

4.04 1.93

Fe3d/O2p upper edge dominated by O2p states Weak Fe3d/O2p mixing. Upper edge dominated by O2p Strong mixing of Fe3d/O2p. Fe3d concentrated at top of band Strong mixing of Fe3d/O2p. Increase in O2p states close to band edge. Mixing of Fe3d/O2p. Fe3d shifted strongly to bottom of valence band. Strong mixing of Fe3d/O2p. Fe3d and O2p at upper edge in comparable amounts. Weaker Fe3d/O2p mixing. O2p states at band edge. Fe3d/O2p mixing. Fe3d and O2p at band edge O2p dominant Fe3d/O2p mixing. Fe3d and O2p at band edge O2p dominant

Rhombohedral cell. a M. Catti, G. Valerio and R. Dovesi, Phys. Rev. B: Condens. Matter, 1995, 51, 7441. b L. M. Sandratskii, M. Uhl and J. Ku¨bler, J. Phys. Condens. Matter, 1996, 8, 983. c M. P. J. Punkkinen, K. Kokko, W. Hergert and I. J. Va¨yrynen, J. Phys. Condens. Matter 1999, 11, 23421. d A. Bandyopadhyay, J. Velev, W. H. Butler, S. K. Sarker and O. Bengone, Phys. Rev. B: Condens. Matter, 2004, 69, 174429. e G. Rollman, A. Rohrbach, P. Entel and J. Hafner, Phys. Rev. B: Condens. Matter, 2004, 69, 165107. f E. A. Moore, Phys. Rev. B: Condens. Matter, 2007, 76, 195107.

Chem. Modell., 2008, 5, 119–149 | 137 This journal is

c

The Royal Society of Chemistry 2008

Oxides are important industrially and rather than list all the work on oxides, I shall consider three classes of compounds—perovskites, the layered componds AMO2 where M is a transition metal and A an alkali metal, and solid ionic conductors. Perovskites have a wide range of uses because of their electric (e.g. ferroelectrics such as barium titanate), optical (e.g. lithium niobate) and magnetic properties. Studies of perovskites with interesting magnetic or electrical properties such as superconductivity and colossal magnetoresistance are dealt with in a later subsection. Interatomic potential methods continue to play a useful role in structural studies and there have been a number of studies of defects in perovskites using this approach. Jackson216,217 has studied the ferroelectric LiNbO3 (used, for example, in wave guides where carefully-controlled doping is crucial) , Thomas218,219 has studied defects in SrTiO3 (used in capacitors) , Diniz and Paschoal220 have studied the compounds Ln(TaTi)O6 where Ln is a lanthanide. Fisher et al.221 used molecular dynamics to study oxide ion diffusion in Ba1xSrxCo1yFeyO2.5 and conclude that the conductivity is stongly dependent on the values of x and y. Other groups have used ab initio methods to study MgTiO3,222 PbTiO3,223 Ba0.5Sr0.5TiO3,224 BaCd1/3Ta2/3O3 and BaZn1/3Ta2/3O3,225 BiAlO3,226,227 BiGaO3,226 CaGeO3,228 KNbO3,229 KTaO3,230 and Bi2FeCrO6, Bi2MnNiO6 and Bi2CuCrO6.231 The layered compounds AMO2 where M is a transition metal and A an alkali metal, Cu or Ag have attracted interest because of their electrical properties. LiCoO2 is used in rechargeable lithium ion batteries and NaxCoO2 is metallic except at x = 0.5 and for values of x of about 0.35 becomes superconducting in the presence of intercalated water. Li et al.232 and Xu and Zeng233 have studied Na0.5CoO2. A FLAPW GGA study of the structure and optical properties of AgFeO2 and CuFeO2 has been carried out by Blaha’s group.234 Ogata235 has reviewed band models of NaxCoO2. This solid has also been the subject of a study by the recently-developed dynamic mean field theory (DMFT).236 This study used VASP with an LDA functional, a value of U = 3 eV for Co and an on-site potential for Na. It was found that the Na potential was the key to explaining the stronger correlation for x = 0.7 than for x = 0.3. Idemoto et al.237,238 have used FLAPW to obtain calculated electron density in the solids LiMn1xMxO2, where M = Mn, Co, Ni, Zn, which they were investigating as cathode material for Li ion batteries. This is primarily an experimental paper and is an example of an increasing trend whereby experimentalists are using modelling to complement their practical results. Park et al.239 have used VASP with a GGA density functional to calculate the structure, density of states and ionic charge of Li(M1/3Mn1/3Ni1/3)O2 with M = Al, Ti, Cr, Fe and Mo. Using LDA + U calculations, Mazin240 has shown that in LiCrO2 the antiferromagnetism arises from Cr–Cr overlap rather than superexchange. Ion migration is important for solids with applications in solid oxide fuel cells, gas sensors and batteries. Interatomic potential methods are particularly useful in this type of work as the migrating ion can be moved along a suggested path and its local environment minimised at each step thus producing an energy profile for the migration. Islam’s group has been particularly productive in this area. An account of much of the group’s work up to 2000 is given in an article in Journal of Materials Chemistry241 and a 2002 paper242 presents work on ion transport in perovskite oxides. Initially this work calculated the potential energy surface for the migration of oxygen vacancies. Molecular dynamics was then used to calculate the rate of diffusion. Finally dopant ions were introduced and the effect of these on the migration calculated. More recent work by this group includes oxide ion migration in apatites,243–247 ionic conduction in Sc2(WO4)3,248 lithium ion transport in LiFePO4249 and oxide ion transport in CaTiO3,250 and in Sr2Fe2O5 and Sr4Fe6O13.251 A number of other groups have recently used modelling to study ion migration in ceria, CeO2. The reaction pathway for migration of oxygen vacancies in ceria has been studied using the PAW method with GGA + U.252 The results confirm the 138 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

pathway found by interatomic potential methods.253 This ab initio study was used to test the potentials in the molecular dynamics study on CeO2143 described in section 5. Gotte et al.254 used large scale molecular dynamics to study reduced CeO2 at high temperatures and find that only a h100i oxygen vacancy diffusion occurs. Stølen et al.255 used GGA calculations in VASP on doped Ba2In2O5 to support their hypothesis that a high density of low energy local structures was a prerequisite for high ionic conductivity. They predicted that the Sr-doped solid would be a better ionic conductor than the un-doped solid in contrast to the effect of doping with Ga and in agreement with experimental results. 7.3 Solids exhibiting superconductivity or novel magnetism The discovery of the high TC superconductors naturally sparked much interest among solid state modellers. More recently interest has been aroused by the discovery of solids displaying colossal magnetoresistance. Perhaps surprisingly, since these properties are electronic properties, interatomic potential methods have proved of some use. Such modelling can investigate ordering of atoms, charges or vacancies and the ordering may provide clues as to the conditions necessary for solids to possess interesting electronic and magnetic properties. Allan and Mackrodt,256 for example, argued that since the nature and distribution of defects apparently played a crucial role in determining TC, studies of defects might indicate why some cuprates were high temperature superconductors and others not and studied a series of compounds M2CuO4 plus La2NiO4. They concluded that La2CuO4 had a unique combination of lattice and defect properties. Islam257 studied HgBa2Ca2Cu3O8+d and found interstitial oxygen defects to be the most favourable hole-doping mechanism. Two groups have used interatomic potential methods to study ordering in the colossal magnetoresistance material La1xCaxMnO3.258,259 Ab initio methods as well as looking at the structure can of course consider band structure. In the first rush of excitement following the discovery of high TC superconductors, a number of studies on the band structure of cuprates were undertaken. The results of these up to 1989 are reviewed in detail by Pickett.260 These calculations established that the bands that crossed the Fermi level were antibonding Cu 3d /O 2p bands. Most of them used an LDA functional but these compounds are expected to be strongly correlated and there is experimental evidence for local Coulomb repulsion on Cu(II) sites. Larbaoui et al.261 found that using a GGA functional led to considerable improvement for YBa2Cu3O7 but such systems really need methods such as +U or the use of hybrid functionals. Blaha et al.262 showed that the use of LDA +U with a value of 6eV for U on Cu(II) for YBa2Cu3O7 improved the calculated value of the electric field gradient at Cu. Electron–phonon interactions are important in conventional superconductors and there have been suggestions that this is true for high TC superconductors as well. Zhang et al.263 found that the use of LDA + U enhances electron–phonon coupling of a Cu–O stretching mode. However Ohkawa264 in a study of Bi2Sr2CaCu2O8d concludes that electron– phonon interaction plays little or no role in cuprate d wave superconductivity and that Cu–Cu superexchange is responsible. Kuchinskii et al.265 have employed DMFT with an LDA + U functional and an additional self energy term s(k) to study Bi2Sr2CaCu2O8d and find good semiquantitative agreement with angular resolved photoemission (ARPES) results. The discovery of colossal magnetoresistance (CMR) in alkaline-earth-doped rare earth manganates, Ln1xAxMnO3, naturally inspired a rush to model these solids. The earliest explanation was in terms of double exchange, although there are now known examples of solids exhibiting colossal magnetoresistance in which double exchange is not thought to be present. In the undoped solids manganese is present as Mn3+, high spin d4, with three electrons occupying t2g bands and one occupying eg bands. The eg bands are close to the Fermi level. On doping, a fraction of the Chem. Modell., 2008, 5, 119–149 | 139 This journal is

c

The Royal Society of Chemistry 2008

Fig. 4 Unit cell of double perovskite. Small open spheres M/M 0 , large open spheres Sr, shaded spheres O.

managanese ions are oxidised to Mn4+ leaving these ions without any eg electrons. The eg electrons migrate through the solid and because on-site repulsion causes them to line up with the t2g electrons with Mn3+ couples ferromagnetically to Mn4+. Ramakrishnan266 has reviewed the problems of modelling the manganates and discusses the interactions involved. Structurally these solids are often found to form several phases close in energy and it has been suggested that regions (on the nanometre to micron scale) of different ‘phase’ exist in a single sample. The dopant atoms may also be ordered or disordered. Other problems are that there is evidence of strong on-site correlation and the behaviour of interest is not that at 0 K. However examining the strength of the on-site interaction, double exchange coupling and Jahn–Teller effects is important in understanding these solids. Ramakrishnan himself reports results of DMFT calculations with large values of U. The interatomic potential calculations mentioned above explore cation ordering. Akhtar et al.259 find, using a Monte Carlo method that at low temperatures for x = 0.5 there is ordering of the La3+ and Ca2+ ions in Ln1xCaxMnO3such that along the b axis there are alternate planes of La3+ and Ca2+. However at temperatures typically used to prepare these solids, they find that random occupation is favoured. Tang and Zhang267 also favour a layer ordering. Birsan268 has used a Monte Carlo method to explore the influence of superexchange in CMR manganates. Experimentally there has been much recent interest in solids based on the perovskite SrFeO3. SrFeO3 itself is metallic and shows no Jahn–Teller distortion although it formally contains the d4 ion Fe4+. SrFeO3 is rarely synthesised as stoichiometric; its formula being better described as SrFeO3d. Shein et al. have calculated the magnetic moments and band structure for both stoichiometric SrFeO3269,270 and SrFeO2.875.271 The calculations for the stoichiometric solid using VASP with an LSDA + U functional give a ferromagnetic ground state. Substitution of Fe by other metals gives a series of double perovskites, Sr2MM 0 O6, Fig. 4. The properties of these solids depend on the nature of M and M 0 . Shein’s group has considered the mixed double perovskites ASn1xMxO3272 where A = Ca, Sr, Ba and M = Fe, Mn, Co. They find that Sr2SnMnO6 is a ferromagnetic semiconductor, Sr2SnCoO6 a ferromagnetic metal and Sr2SnFeO6 a ferromagnetic semimetal. However although they use the accurate FLAPW method, the density functional employed is LSDA. A very recent intriguing result from this group273 is the prediction that substitution of B, C or N for O in the perovskites SrMO3 (M = Ti, Zn, Sn) yields a spin-polarised impurity band. FLAPW + lo was used for this study. Stoeffler and Colis274 studied the effect of oxygen vacancies in Sr2FeMoO6, a half-metallic ferromagnet that they suggest may be a good candidate for use in 140 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

magnetic tunnel junctions. Again they use FLAPW but employ a GGA density functional. It might well be worth seeing if +U, hybrid functional or other methods aimed at strongly-correlated systems give the same results. An interesting approach to spin ordering and spin glasses is the Monte Carlo analysis used by Gibb.275 This simply considers the spin interactions, Js1 s2. Applied to Sr2FeRuO6,276 it shows that the next nearest neighbour interaction, although small, can lead to spin frustration.

8. Minerals Minerals are particularly challenging for modellers as they are often formed under conditions of high temperature and/or pressure and may contain several randomlydistributed cations and water. Nevertheless modelling solids at the temperatures and pressures found in the Earth’s core and mantle and in the core of other planets is possible whereas it is not always possible to reproduce these conditions experimentally. Molecular dynamics is particularly suited to disordered systems and high temperature simulations and the advances in computer hardware offer an opportunity for an expansion in modelling of minerals. However it is sometimes possible to obtain useful insights from static interatomic potential calculations. Parker et al.277 discuss examples of applications of interatomic potentials to minerals using HADES, METADISE and PARADISE. Other studies using interatomic potentials include the diffusion of sodium and potassium ions in nepheline,278 the interaction of heavy metals with dolomite surfaces,279 the dissolution of jarosite [KFe3(SO4)2(OH)6],280 properties of lizardite.281 de Leeuw’s group have conducted a number of simulations on apatites.282–284 An interesting recent set of studies, which show that simulations can be of use to the mining industry are those by Ngoepe285,286 on platinum and palladium sulfide minerals. Vinograd et al.287 use lattice energy calculations to study carbonate solid solutions. Breu has made a number of studies on clays. Natural clays are inhom*ogeneous and severely disordered but by considering ordered structures he was able to gain insights into the factors determining long range order of pillars.288 This led to the preparation of artificial ordered clays which may prove better in applications than natural clays. Many minerals occur as solid solutions and to understand these, it is necessary to consider the enthalpy and, preferably, free energy of mixing of the components. Allan et al.289,290 describe how Monte Carlo and lattice dynamics methods can be combined to study solid solutions and highly disordered systems and apply this to garnets289,290 and spinels and carbonates.291 Reich and Becker292 combined Monte Carlo and ab initio methods to study the solution of arsenic in pyrites and concluded that for a wide range of geologically relevant temperatures, a mixture of two phases (FeS2 and FeAsS) is more stable than a solid solution. Ab initio methods have been used to calculate vibrational frequencies of kaolinite293 and katoite.294 It is not only minerals in the Earth’s crust that have been studied. There have been a series of studies of planetary interiors. Gillan et al.295 review work on the interiors of Earth, Jupiter, Saturn, Uranus and Neptune. The materials of interest are magnesium silicate in the Earth’s mantle, hydrogen in the interior of Jupiter and Saturn and methane, ammonia and water in the interiors of Uranus and Neptune. Calculations on the post-perovskite phase of (Mg, Fe)SiO3 explained aspects of the behaviour of the D layer, the lowermost part of the Earth’s mantle.296 Most of these studies exploited the increase in computer power to use molecular dynamics or lattice dynamics in conjunction with ab initio calculations. Study of the Earth’s mantle has continued. Carrez et al.297 studied dislocation structures in the D layer starting from generalised stacking faults which they generated using the ab initio package VASP. Blanchard et al.298,299 have derived new interatomic potentials to Chem. Modell., 2008, 5, 119–149 | 141 This journal is

c

The Royal Society of Chemistry 2008

study magnesium silicate. Jung and Oganov300 used PAW + GGA and fitting of the calculated energy–volume curve to study CaSiO3 at high pressure. The eMinerals project was set up to exploit grid computing in the simulation of minerals. This project links computers in London, Bath and Cambridge enabling workers in the consortium to undertake demanding simulations, using both interatomic potential and ab initio methods. An issue of Molecular Simulation301 contains a number of papers resulting from this project.

9. Conclusions Solid state modelling is currently a flourishing and exciting area. Simple lattice energy-based calculations based on interatomic potentials have been used to model solids for the past three decades and continue to provide useful insights into the structures of materials often of industrial (e.g. ionic conductors, catalysts) or geological importance. A marked trend in recent years is the use of plane wave ab initio methods for solids of chemical interest. The use of these methods by chemical modellers and even bench top chemists has been encouraged by graphics-led packages. Until recently, most calculations were of static solids at 0 K but increasing computer performance now means that molecular dynamics is an increasingly popular tool and it is even feasible to combine this technique with ab initio calculations. This enables modellers to tackle problems such as highly-disordered solids and phase diagrams. The properties that can be modelled have increased in recent years, a notable addition being NMR chemical shifts. The use of localised orbitals (Wannier functions) has helped in this area. Zeolites were one of the earliest types of inorganic solids to be modelled and are still a popular area for modelling with the approach moving from simple interatomic methods to QM/MM and ab initio and the systems studied expanding into related areas such as ALPOs. Strongly-correlated transition metal oxides have been of much interest experimentally due to their novel electronic and magnetic properties. These pose a particular problem to modellers in that it is necessary to allow for localisation of spins on the transition metal ions. Currently the two most popular approaches to this problem are the use of hybrid DFT functionals and the addition of an on-site Coulomb term +U. Minerals, being often disordered and formed under high pressures and temperatures are ideal candidates for molecular dynamics or similar approaches. There has been some exciting work on the minerals found in the Earth’s mantle and this is an area which should expand.

References 1 A. F. Kapustinskii, Q. Rev. Chem. Soc., 1956, 10, 283. 2 A. Sommerfeld and H. Bethe, ‘Elektronentheorie der Metalle’, Springer Verlag, Heidelberg, 1933. 3 Issue 20, J. Mater. Chem., 2006, 16, pp. 1873–2012. 4 Issue 2, J. Solid State Chem., 2003, 176. 5 R. Catlow, R. Bell, F. Cors, B. Slater, S. A. French and A. Sokol, Ann. Rep. Prog. Chem. Sect. A, 2005, 101, 1. 6 M. J. Norgett, A User’s Guide to HADES. Technical Report AERE-R7015, AERE Harwell Laboratory, 1972. 7 Faraday Discuss., 1997, 106. 8 J. D. Gale and A. L. Rohl, Mol. Simul., 2003, 29, 291. 9 B. S. Thomas, N. A. Marks and B. D. Begg, Nucl. Instrum. Methods Phys. Res., Sect. B, 2005, 228, 288. 10 P. P. Ewald, Ann. Phys., 1921, 369, 253. 11 U. Essman, L. Perera, M. L. Berkowitz, T. Darden, H. Lee and L. G. Pedersen, J. Chem. Phys., 1995, 103, 8577. 12 L. Greengard and V. Rokhlin, J. Comput. Phys., 1987, 73, 325. 13 H. G. Petersen, D. Soelvason, J. W. Perram and E. R. Smith, J. Chem. Phys., 1994, 101, 8870. 14 D. Wolf, P. Keblinski, S. R. Philpot and J. Eggebrecht, J. Chem. Phys., 1999, 110, 8254.

142 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57

58

K. A. van Genechten, W. J. Mortier and P. Geerlings, J. Chem. Phys., 1987, 86, 5063. A. K. Rappe and W. A. Goddard III, J. Phys. Chem., 1991, 95, 3358. S. I. Njo, J. F. Fan and B. van de Graaf, J. Mol. Catal. A, 1998, 134, 79. R. T. Sanderson, Science, 1951, 114, 670. A. Pavesi, M. Catti, S. C. Parker and A. Wall, Phys. Chem. Miner., 1996, 23, 89. G. B. Telfer, J. D. Gale, K. J. Roberts, R. A. Jackson, P. J. Wilde and P. Meenan, Acta Crystallogr., Sect. A: Found. Crystallogr., 1997, 53, 415. N. L. Allan, A. L. Rohl, D. H. Gay, C. R. A. Catlow, R. J. Davey and W. C. Mackrodt, Faraday Discuss., 1993, 95, 273. A. Chroneos, K. Desai, S. E. Redfern, M. O. Zacate and R. W. Grimes, J. Mater. Sci., 2006, 41, 675. A. P. Sutton and J. Chen, Philos. Mag. Lett., 1990, 61, 139. J. Tersoff, Phys. Rev. Lett., 1986, 56, 632. J. Tersoff, Phys. Rev. B: Condens. Matter, 1988, 37, 6991. D. W. Brenner, Phys. Rev. B: Condens. Matter, 1990, 42, 9458. D. W. Brenner, O. A. Shenderova, J. A. Harrison, S. J. Stuart, B. Ni and S. B. Sinnott, J. Phys.: Condens. Matter, 2002, 14, 783. P. Erhart and K. Albe, Phys. Rev. B: Condens. Matter, 2005, 71, 035211. M. W. Finnis, Prog. Mater. Sci., 2007, 52, 133. P. Tangney and S. Scandolo, J. Chem. Phys., 2003, 119, 9673. B. G. Dick and A. W. Overhauser, Phys. Rev., 1958, 112, 90. P. A. Madden and M. Wilson, Chem. Soc. Rev., 1996, 339. S. M. Woodley, P. D. Battle, C. R. A. Catlow and J. D. Gale, J. Phys. Chem. B, 2001, 105, 6824. R. G. Gordon and Y. S. Kim, J. Chem. Phys., 1972, 56, 3122. Y. S. Kim and R. G. Gordon, J. Chem. Phys., 1974, 60, 1842. C. P. Wood and N. C. Pyper, Mol. Phys., 1981, 43, 1371. J. H. Harding, Rep. Prog. Phys., 1990, 53, 1149. G. V. Lewis and C. R. A. Catlow, J. Phys. C: Solid State Phys., 1985, 8, L413. S. C. Parker, S. Kerisit, A. Marmier, S. Grigoleit and G. W. Watson, Faraday Discuss, 2003, 124, 155. S. Bukkapatnam, M. Malshe, P. M. Agrawal, L. M. Raff and R. Komanduri, Phys. Rev. B: Condens. Matter, 2006, 74, 224102. N. F. Mott and M. J. Littleton, Trans. Faraday Soc., 1938, 34, 485. C. R. A. Catlow, N. K. M. Diller and M. J. Norgett, J. Phys. C: Solid State Phys., 1997, 10, 1395. W. C. Mackrodt and R. F. Stewart, J. Phys. C: Solid State Phys., 1979, 12, 5015. S. D’Arco and M. S. Islam, Phys. Rev. B: Condens. Matter, 1997, 55, 3141. T. Baikie, M. S. Islam and M. G. Francesconi, J. Mater. Chem., 2005, 15, 119. M. S. Islam and R. C. Baetzold, J. Phys. Chem. Solids, 1992, 53, 1105. F. J. Berry, A. Bohorquez, C. Greaves, J. McManus, E. A. Moore and M. Mortimer, J. Solid State Chem., 1998, 140, 428. E. A. Moore, A. Bohorquez, F. J. Berry, O¨. Helgason and J. F. Marco, J. Phys. Chem. Solids, 2001, 62, 1277. I. Ayub, F. J. Berry, R. L. Bilsborrow, O¨. Helgason, R. C. Mercader, E. A. Moore, S. J. Stewart and P. G. Wynn, J. Solid State Chem., 2001, 156, 408. E. A. Moore, H. M. Widatallah and F. J. Berry, J. Phys. Chem. Solids, 2002, 63, 519. G. W. Watson, E. T. Kelsey, N. H. de Leeuw, D. J. Harris and S. C. Parker, J. Chem. Soc., Faraday Trans., 1996, 92, 433. H. J. Monkhorst and J. D. Pack, Phys. Rev. B: Condens. Matter, 1976, 13, 5188. G. Wannier, Rev. Mod. Phys., 1962, 34, 645. R. M. Martin, ‘Electronic Structure: Basic Theory and Practical Methods’, Cambridge University Press, Cambridge, UK, 2004. S. J. Clark, M. D. Segall, C. J. Pickard, P. J. Hasnip, M. J. Probert, K. Refson and M. C. Payne, Z. Kristallogr., 2005, 220, 567. X. Gonze, J.-M. Beuken, R. Caracas, F. Detraux, M. Fuchs, G.-M. Rignanese, L. Sindic, M. Verstraete, G. Zerah, F. Jollet, M. Torrent, A. Roy, M. Mikami, Ph. Ghosez, J.-Y. Raty and D. C. Allan, Comput. Mater. Sci., 2002, 25, 478. X. Gonze, G.-M. Rignanese, M. Verstraete, J.-M. Beuken, Y. Pouillon, R. Caracas, F. Jollet, M. Torrent, G. Zerah, M. Mikami, Ph. Ghosez, M. Veithen, J.-Y. Raty, V. Olevano, F. Bruneval, L. Reining, R. Godby, G. Onida, D. R. Hamann and D. C. Allan, Z. Kristallogr., 2005, 220, 558. M. D. Segall, P. J. D. Lindan, M. J. Probert, C. J. Pickard, P. J. Hasnip, S. J. Clark and M. C. Payne, J. Phys.: Condens. Matter, 2002, 14, 2717.

Chem. Modell., 2008, 5, 119–149 | 143 This journal is

c

The Royal Society of Chemistry 2008

59 M. J. Gillan, in ‘Computer Simulation in Materials Science’, eds. M. Meter and V. Pontikis, Dordrecht, Kluwer, 1991, p. 257. 60 D. Vanderbilt, Phys. Rev. B: Condens. Matter, 1990, 41, 7892. 61 C. Y. Lee, D. Vanderbilt, K. Laasonen, R. Car and M. Parrinello, Phys. Rev. B: Condens. Matter, 1993, 47, 4863. 62 R. Car and M. Parrinello, Phys. Rev. Lett., 1985, 55, 2471. 63 J. C. Slater, Phys. Rev., 1937, 51, 846. 64 O. K. Andersen, Phys. Rev., B: Condens. Matter, 1975, 12, 3060. 65 M. Weinert, J. Math. Phys., 1981, 22, 2433. 66 P. E. Blo¨chl, Phys. Rev. B: Condens. Matter, 1994, 50, 17953. 67 M. Marsman and G. Kresse, J. Chem. Phys., 2006, 125, 104101. 68 J. J. Mortensen, L. B. Hansen and K. W. Jacobsen, Phys. Rev. B: Condens. Matter, 2005, 71, 035109. 69 V. Eyert, Inter. J. Quantum Chem., 2000, 77, 1007. 70 R. Dovesi, V. R. Saunders, C. Roetti, R. Orlando, C. M. Zicovich-Wilson, F. Pascale, B. Civalleri, K. Doll, N. M. Harrison, I. J. Bush, Ph. D’Arco and M. LLunell, Crystal06 User’s Manual, University of Torino, Torino, 2006. 71 G. te Velde and E. J. Baerends, Phys. Rev. B: Condens. Matter, 1991, 44, 7888. 72 G. Wiesenekker and E. J. Baerends, J. Phys.: Condensed Matter, 1991, 3, 6721. 73 BAND2007.01, SCM, Theoretical Chemistry, Vrije Universiteit, Amsterdam, The Netherlands, http://www.scm.com. 74 J. M. Soler, E. Artacho, J. D. Gale, A. Garcia, J. Junquera, P. Orde´jon and D. Sa´nchezPortal, J. Phys.: Condensed Matter, 2002, 14, 2745. 75 D. Sa´nchez-Portal, P. Ordejo´n and E. Canadell, Struct. Bonding, 2004, 113, 103. 76 A. I. Liechtenstein, V. I. Anisimov and J. Zaanen, Phys. Rev. B: Condens. Matter, 1995, 52, R5467. 77 S. F. Matar, Progress in Solid State Chemistry, 2003, 31, 239. 78 F. Gygi and A. Baldereschi, Phys. Rev. B: Condens. Matter, 1986, 34, 4405. 79 A. Seidl, A. Go¨rling, P. Vogl, J. A. Majewski and M. Levy, Phys. Rev. B: Condens. Matter, 1996, 53, 3764. 80 A. Go¨rling, Phys. Rev. B: Condens. Matter, 1996, 53, 7024. 81 M. Sta¨dele, J. A. Majewski, P. Vogl and A. Go¨rling, Phys. Rev. Lett., 1997, 79, 2089. 82 M. Sta¨dele, M. Moukara, J. A. Majewski, P. Vogl and A. Go¨rling, Phys. Rev. B: Condens. Matter, 1999, 59, 10031. 83 O. Gunnarsson, M. Jonson and B. I. Lundqvist, Phys. Lett. A, 1976, 59, 177. 84 O. Gunnarsson, M. Jonson and B. I. Lundqvist, Solid State Commun., 1977, 24, 765. 85 J. A. Alonso and L. A. Girifalco, Phys. Rev. B: Condens. Matter, 1978, 17, 3735. 86 S. K. Ghosh and R. G. Parr, Phys. Rev. A: Atom., Mol. and Opt. Phys., 1986, 34, 785. 87 A. D. Becke and M. R. Roussel, Phys. Rev. A: Atom., Mol. and Opt. Phys, 1989, 39, 3761. 88 J. Tau, J. P. Perdew, V. N. Staroverov and E. Scuseria, Phys. Rev. Lett., 2003, 91, 146401. 89 S. Chawla and G. A. Voth, J. Chem. Phys., 1998, 108, 4697. 90 M. C. Gibson, PhD Thesis, University of Durham, 2006, http://cmt.dur.ac.uk/sjc/ thesis_mcg/thesis/thesis.html. 91 V. I. Anisimov, F. Aryasetiawan and A. I. Liechtenstein, J. Phys.: Condens. Matter, 1997, 9, 767. 92 S. L. Dudarev, A. I. Liechtenstein, M. R. Castell, G. A. D. Briggs and A. P. Sutton, Phys. Rev. B: Condens. Matter, 1997, 56, 4900. 93 S. L. Dudarev, G. A. Botton, S. Y. Savrasov, C. J. Humphreys and A. P. Sutton, Phys. Rev. B: Condens. Matter, 1998, 57, 1505. 94 A. Rohrbach, J. Hafner and G. Kresse, J. Phys.: Condens. Matter, 2003, 15, 979. 95 N. J. Mosey and E. A. Carter, Phys. Rev. B: Condens. Matter, 2007, 76, 155123. 96 A. D. Becke, J. Chem. Phys., 1993, 98, 5648. 97 S. H. Vosko, L. Wilk and M. Nusair, Can. J. Phys., 1980, 58, 1200. 98 C. Lee, W. Yang and R. G. Parr, Phys. Rev. B: Condens. Matter, 1988, 37, 785. 99 C. Franchini, V. Bayer, R. Podloucky, J. Paier and G. Kresse, Phys. Rev. B: Condens. Matter, 2005, 72, 045132. 100 M. Enzerhof and G. E. Scuseria, J. Chem. Phys., 1999, 110, 5029. 101 C. Adamo and V. Barone, J. Chem. Phys., 1999, 110, 6158. 102 G. Kresse and J. Furthmu¨ller, Comput. Mater. Sci., 1996, 6, 15. 103 J. P. Perdew, K. Burke and M. Ernzerhof, Phys. Rev. Lett., 1996, 77, 3865. 104 J. P. Perdew, K. Burke and M. Ernzerhof, Phys. Rev. Lett., 1997, 78, 1396. 105 A. Alkauskas and A. Pasquarello, Physica B, 2007, 401–2, 670. 106 F. Tran, P. Blaha, K. Schwarz and P. Novak, Physical Review B: Condens. Matter, 2006, 74, 155108. 107 X. B. Feng and N. M. Harrison, Phys. Rev. B: Condens. Matter, 2004, 70, 092402.

144 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

108 F. Cora, Mol. Phys., 2005, 103, 2483. 109 E. A. Moore, Phys. Rev. B: Condens. Matter, 2007, 76, 195107. 110 B. Montari, B. Civalleri, C. M. Zicovich-Wilson and R. Dovesi, Inter. J. Quantum Chem., 2006, 106, 1703. 111 N. C. Wilson, J. Muscat, D. Mkhonto, P. E. Ngoepe and N. M. Harrison, Phys. Rev. B: Condens. Matter, 2005, 71, 075202. 112 J. Yang and M. Dolg, J. Phys. Chem. B, 2006, 110, 19254. 113 I. D. Prodan, G. E. Scuseria and R. L. Martin, Phys. Rev. B: Condens. Matter, 2006, 73, 045104. 114 J. Heyd, G. E. Scuseria and M. Ernzerhof, J. Chem. Phys., 2003, 118, 8207. 115 F. Cora, M. Alfresson, G. Mallia, D. S. Middlemiss, W. C. Mackrodt, R. Dovesi and R. Orlando, Structure and Bonding, 2004, 113, 171. 116 P. Nova˘k, F. Boucher, P. Gressier, P. Blaha and K. Schwarz, Phys. Rev. B: Condens. Matter, 2001, 63, 235114. 117 V. I. Anisimov, A. I. Poeryaev, M. A. Korotkin, A. O. Anokhin and G. Kotliar, J. Phys.: Condens. Matter, 1997, 9, 7359. 118 A. I. Lichtenstein and M. I. Katsnelson, Phys. Rev. B: Condens. Matter, 1998, 57, 6884. 119 A. I. Lichtenstein, M. I. Katsnelson and G. Kotliar, Phys. Rev. Lett., 2001, 87, 067205. 120 G. Kotliar, S. Y. Savrasov, K. Haule, V. S. Oudovenko, O. Parcollet and C. A. Marianetti, Rev. Mod. Phys., 2006, 78, 865. 121 E. Z. Kuchinskii, I. A. Nekrasov and M. V. Sadovskii, JETP Lett., 2005, 82, 198. 122 J. H. Harding, A. H. Harker, P. B. Keystra, R. Pandey, J. M. Vail and C. Woodward, Physica B and C, 1985, 131, 151. 123 J. Vail, ‘Topics in the Theory of Solid Materials in Series in Materials Science and Engineering’, Institute of Physics, 2003. 124 M. Sierka and J. Sauer, Faraday Discuss., 1997, 106, 41. 125 U. Eichte, C. M. Ko¨lmel and J. Sauer, J. Comput. Chem., 1997, 18, 463. 126 D. Nachtigallova, P. Nachtigall, M. Sierka and J. Sauer, Phys. Chem. Chem. Phys., 1999, 1, 2019. 127 J. R. Hill and J. Sauer, J. Phys. Chem., 1995, 99, 9536. 128 K. P. Schroder and J. Sauer, J. Phys. Chem., 1996, 100, 11043. 129 D. Bahomes and W. Thiel, J. Comput. Chem., 1996, 17, 87. 130 R. A. Bryce, M. A. Vincente and I. H. Hillier, J. Phys. Chem. A, 1999, 103, 4094. 131 B. T. Thole and P. Th. van Duijen, Theor. Chim. Acta, 1980, 55, 307. 132 A. H. de Vries, P. Th. van Duijen, A. H. Juffer, J. A. C. Bullmann, J. P. Dijkman, H. Merenga and B. Thole, J. Comput. Chem., 1995, 16, 37. 133 P. Sherwood, in ‘Hybrid quantum mechanics/molecular mechanics approaches in Modern Methods and Algorithms of Quantum Chemistry’, Proceedings, ed. J. Grotendorst, John van Neumann Institute of Computing, Ju¨lich, NIC series, second edn., 2000, vol. 3, pp. 285–305 available on-line. 134 P. Sherwood, A. H. de Vries, M. F. Guest, G. Schreckenbach, C. R. A. Catlow, S. A. French, A. A. Sokol, S. T. Bromley, W. Thiel, A. J. Turner, S. Billeter, F. Terstegen, S. Thiel, J. Kendrick, S. C. Rogers, J. Casci, M. Watson, F. King, E. Karlsen, M. Sjøvoll, A. Fahmi, A. Scha¨fer and C. Lennartz, J. Mol. Struct.: THEOCHEM, 2003, 632, 1. 135 A. A. Sokol, S. T. Bromley, S. A. French, C. R. A. Catlow and P. Sherwood, Inter. J. Quantum. Chem., 2004, 99, 695. 136 A. A. Maraudin, The Theory of Lattice Dynamics in the Harmonic Approximation, Solid State Physics, Academic Press, New York, 1971, vol. 3. 137 M. Sternik and K. Parlinski, J. Phys. Chem. Solids, 2006, 67, 796. 138 A. Senyshyn, H. Kraus, V. B. Mikhailik, L. Vasylechko and M. Knapp, Phys. Rev. B: Condens. Matter, 2006, 73, 014104. 139 G. D. Barrera, J. A. O. Bruno, T. H. K. Barron and N. L. Allan, J. Phys.: Condens. Matter, 2005, 17, R217. 140 N. Metropolis, A. E. Rosenbluth, M. N. Rosenbluth, A. H. Teller and E. Teller, J. Chem. Phys., 1953, 21, 1087. 141 T. Verlet, Phys. Rev., 1967, 159, 98. 142 M. P. Allen and D. J. Tildesley, ‘Molecular Dynamics Simulation—elementary methods’, Wiley, New York, 1987. 143 P. Martin, D. Spagnoli, A. Marmier, S. C. Parker, D. C. Sayle and G. Watson, Mol. Simul., 2006, 32, 1079. 144 I. Todorov and W. Smith, Philos. Trans. R. Soc. London, A, 2004, 362, 1835: for further information see www.cse.scitech.ac.uk/ccg/software/DL_POLY. 145 S. Kerisit and K. M. Rosso, J. Chem. Phys., 2005, 123, 224712. 146 C. Dellago, P. G. Bolhuis, F. S. Csajka and D. Chandler, J. Chem. Phys., 1998, 108, 1964. 147 C. Dellago, P. G. Bolhuis and D. Chandler, J. Chem. Phys., 1998, 108, 9236.

Chem. Modell., 2008, 5, 119–149 | 145 This journal is

c

The Royal Society of Chemistry 2008

148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191

P. G. Bolhuis, C. Dellago and D. Chandler, Faraday Discuss., 1998, 110, 421. C. Dellago, P. G. Bolhuis and D. Chandler, J. Chem. Phys., 1999, 110, 6617. T. S. van Erp, D. Moroni and P. G. Bolhuis, J. Chem. Phys., 2003, 118, 7762. T. S. van Erp, PhD Thesis, Universiteit van Amsterdam, 2003. T. S. van Erp and P. G. Bolhuis, J. Comput. Phys., 2005, 205, 157. C. E. Mohn and S. Stølen, J. Chem. Phys., 2005, 123, 114104. J. P. Valleaus and D. N. Card, J. Chem. Phys., 1972, 57, 5457. J. A. Purton, G. D. Barrera, N. L. Allan and J. D. Blundy, J. Phys. Chem. B, 1998, 102, 5202. N. L. Allan, G. D. Barrera, J. A. Purton, C. E. Sims and M. B. Taylor, Phys. Chem. Chem. Phys., 2000, 2, 1099. P. Erhart, A. Klein, R. G. Edgell and K. Albe, Phys. Rev. B: Condens. Matter, 2007, 75, 153205. M. V. Berry, Proc. Roy. Soc. London, A, 1984, 392, 45. A. Ferretti, A. Calzoari, B. Bonferroni and R. Di Felice, J. Phys.: Condens. Matter, 2007, 19, 036215. M. Gajdosˇ , K. Hummer, G. Kresse, J. Furthmu¨ller and F. Bechstedt, Phys. Rev. B: Condens. Matter, 2006, 73, 045112. F. Mauri, B. G. Pfrommer and S. G. Louie, Phys. Rev. Lett., 1996, 77, 5300. C. J. Pickard and F. Mauri, Phys. Rev. B: Condens. Matter, 2001, 63, 245101. C. J. Pickard and F. Mauri, Phys. Rev. Lett., 2003, 91, 196401. D. Sebastiani and M. Parrinello, J. Phys. Chem., A, 2001, 105, 1951. CPMD, copyright IBM corp., 1990–2006, copyright MPI fu¨r Fetho¨rperforschung Stuttgart, 1997–2001, http://www.cpmd.org. T. Charpentier, S. Ispas, M. Profeta, F. Mauri and C. J. Pickard, J. Phys. Chem. B, 2004, 108, 4147. S. Rossano, F. Mauri, C. J. Pickard and I. Farnan, J. Phys. Chem. B, 2005, 109, 7245. J. R. Yates, C. J. Pickard and F. Mauri, Phys. Rev. B: Condens. Matter, 2007, 76, 024401. L. Truflandier, M. Paris, C. Payer and F. Boucher, J. Phys. Chem. B, 2006, 110, 21403. H. M. Weng, Y. Kawazoe and J. M. Dong, Phys. Rev. B: Condens. Matter, 2006, 74, 085205. N. Singh, S. M. Saini, T. Nautiyal and S. Auluck, Physica B, 2007, 388, 99. A. H. Reshak and S. Auluck, Physica B, 2007, 388, 34. F. Giustino, M. L. Cohen and S. G. Louie, Phys. Rev. B: Condens. Matter, 2007, 76, 165108. C. Baerlocher, W. M. Meier and D. H. Olson, ‘Atlas of Zeolite Framework types’, Elsevier, Amsterdam, 2001, www.iza-structure.org. C. R. A. Catlow, R. A. van Senten and B. Smit, ‘Modelling of Microporous Materials’, Academic Press, London, 2003. S. Yip, ‘Handbook of Materials Modelling’, Springer, Dordrecht, 2005. E. Apra, R. Dovesi, C. Freyriafava, C. Pisani, C. Roetti and V. R. Saunders, Modelling and Simulation in Materials Science and Engineering, 1993, 1, 297. C. Corminboeuf, F. Tran and J. Weber, J. Mol. Struct. THEOCHEM, 2006, 762, 1. P. Nachtigall, in ‘Zeolites and ordered mesoporous materials: progress and prospects’, ed. Studies in Surface Science and Catalysis, New York, 2005, vol. 157, pp. 243–262. G. Poulet, P. Sautet and E. Artacho, Phys. Rev. B: Condens. Matter, 2003, 68, 075118. O. Delgado-Friedrichs, M. O’Keeffe and O. M. Yaghi, Phys. Chem. Chem. Phys., 2007, 9, 1035. M. D. Foster, A. Simperler, O. D. Friedrichs, R. G. Bell, F. A. A. Paz and J. Klinowski, Acta Crystallog. B Struct. Sci., 2005, 61, 263. G. Sastre and A. Corma, J. Phys. Chem. B, 2006, 110, 17949. B. Vessal, M. Leslie and C. R. A. Catlow, Mol. Simul., 1989, 3, 123. P. Kamakoti and T. A. Barckholtz, J. Phys. Chem. C, 2007, 111, 3575. S. M. Auerbach, M. H. Ford and P. A. Monson, Curr. Opin. Colloid Interface Sci., 2005, 10, 220. M. J. Mora-Fonz, C. R. A. Catlow and D. W. Lewis, The role of solvation and pH in the nucleation of pure silica zeolites in Studies in Surface Science and Catalysis: Molecular Sieves: From basic research to Industrial Applications parts A and B, 2005, 158, 295. J. R. Agger, M. Shoaee, M. Mistry and B. Slater, J. Cryst. Growth, 2006, 294, 78. T. S. van Erp, T. P. Caremans, C. E. A. Kirschhock and J. A. Martens, Phys. Chem. Chem. Phys., 2007, 9, 1044. J. D. Gale, Solid State Sci., 2006, 8, 234. J. To, P. Sherwood, A. A. Sokol, I. J. Bush, C. R. A. Catlow, H. J. J. van Dam, S. A. French and M. F. Guest, J. Mater. Chem., 2006, 16, 1919.

146 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

192 R. Bulanek, H. Drobna, P. Nachtingall, M. Rubes and O. Bludsky, Phys. Chem. Chem. Phys., 2007, 8, 5535. 193 J. L. Lowe, A. L. Rohl, J. D. Gale, P. G. Smith and G. M. Parkinson, Mol. Simul., 2006, 32, 35. 194 L. Benco, T. Bucko, R. Grybos, J. Hafner, Z. Sobalik, J. Dedecek and J. Hrusak, J. Phys. Chem. C, 2007, 111, 586. 195 P. Nachtigall and R. Bulanek, Appl. Catal., A, 2006, 307, 118. 196 P. K. Ghoral, M. Sluiter, S. Yashonath and Y. Kawazoe, Solid State Sci., 2006, 8, 248. 197 H. Takaba, A. Yamamoto and A. Nakao, Desalination, 2006, 192, 82. 198 F. J. Llopis, G. Sastre and A. Corma, J. Catal., 2006, 242, 195. 199 R. Babarao, Z. Q. Hu, J. W. Jiang, S. Chempath and S. I. Sandler, Langmuir, 2007, 23, 659. 200 R. R. Kotdawala, A. A. Ozgur Yazaydin, N. Kazantzis and R. W. Thompson, Mol. Simul., 2007, 33, 843. 201 C. Tuma and J. Sauer, Phys. Chem. Chem. Phys., 2006, 8, 3955. 202 D. S. Sholl, Acc. Chem. Res., 2006, 39, 403. 203 D. Bougeard and K. S. Smirnov, Phys. Chem. Chem. Phys., 2007, 9, 226. 204 A. R. Ruiz-Salvador, N. Almora-Barrios, A. Gomez and D. W. Lewis, Phys. Chem. Chem. Phys., 2007, 9, 521. 205 N. H. d. Leeuw and S. C. Parker, Phys. Rev. B: Condens. Matter, 1998, 58, 13901. 206 N. H. d. Leeuw and S. C. Parker, J. Am. Ceram. Soc., 1999, 82, 3209. 207 Y. X. Huang, O. Hochrein, D. Zahn, Y. Prots, H. Bormann and R. Kniep, Chem.- Eur. J., 2007, 13, 1737. 208 L. Gomez-Hortiguela, F. Cora, C. R. A. Catlow, T. Blasco and J. Pariez-Pariente, Fluorinated dibenzyldimethylammonium as structure directing agent for the synthesis of ALPO and SAPO-5. Effect of fluorine, in Studies in Surface Science: Molecular Sieves: from basic research to industrial applications. A and B, 2005, pp. 327–334. 209 G. Sastre, Phys. Chem. Chem. Phys., 2007, 9, 1052. 210 A. Schleife, F. Fuchs, J. Furthmueller and F. Bechstedt, Phys. Rev., B: Condens. Matter, 2006, 73, 245212. 211 N. V. Skorodumova, K. Hermansson and B. Johansson, Phys. Rev. B: Condens. Matter, 2005, 72, 125414. 212 A. A. Sokol, S. A. French, S. T. Bromley, C. R. A. Catlow, H. J. J. van Dam and P. Sherwood, Faraday Discuss., 2007, 134, 267. 213 F. Claeyssens, C. L. Freeman, N. L. Allan, Y. Sun, M. N. R. Ashfold and J. H. Harding, J. Mater. Chem., 2005, 15, 139. 214 J. L. Zhao, W. Q. Zhang, X. M. Li, J. W. Feng and X. Shi, J. Phys.: Condens. Matter, 2006, 18, 1495. 215 A. Janotti and C. G. van de Walle, Phys. Rev. B: Condens. Matter, 2007, 76, 165202. 216 R. A. Jackson and M. E. G. Valerio, J. Phys.: Condensed Matter, 2005, 17, 837. 217 R. M. Araujo, K. Lengyel, R. A. Jackson, L. Kovacs and M. E. G. Valerio, J. Phys.: Condensed Matter, 2007, 19, 046211. 218 B. S. Thomas, N. A. Marks and P. Harrowell, Phys. Rev. B: Condens. Matter, 2006, 74, 214109. 219 B. S. Thomas, N. A. Marks and B. D. Begg, Nucl. Instrum. Methods Phys. Res. Sect. B, 2007, 254, 211. 220 E. M. Diniz and C. W. A. Paschoal, J. Phys. Chem. Solids, 2007, 68, 153. 221 C. A. J. Fisher, M. Yoshiya, Y. Iwamoto, J. Ishii, M. Asanuma and K. Yabuta, Solid State Ionics, 2007, 177, 3425. 222 M. Alfredsson, J. P. Brodholt, D. P. Dobson, A. R. Oganov, C. R. A. Catlow, S. C. Parker and G. D. Price, Phys. Chem. Miner., 2005, 31, 671. 223 S. M. Hosseini, T. Movlarooy and A. Kompany, Physica B, 2007, 391, 316. 224 S. Piskunov, E. A. Kotomin, D. f*cks and S. Dorfman, Mater. Sci. Eng. B, 2005, 118, 15. 225 S. J. Liu, R. Taylor, N. S. Petrovic, L. Budd, M. van Schilfgaarde and N. Newman, J. Appl. Phys., 2005, 97, 014105. 226 H. Wang, B. Wang, R. Wang and Q. K. Li, Physica B, 2007, 390, 96. 227 C. Li, B. Wang, R. Wang, H. Wang and X. Lu, Physica B, 2008, 403, 539. 228 J. M. Henriques, E. W. S. Caetano, V. N. Freire, J. A. P. da Costa and E. L. Albuquerque, J. Solid State Chem., 2007, 180, 974. 229 X. G. Wan and J. M. Dong, Mod. Phys. Lett. B, 2006, 20, 557. 230 S. Kabuk, H. Akkus and A. M. Mamedov, Physica B, 2007, 394, 81. 231 Y. Uratani, T. Shishidou, F. Ishil and T. Oguchi, Physica B, 2006, 383, 9. 232 Z. Y. Li, J. L. Yang, J. G. Hou and Q. S. Zhu, Phys. Rev. B: Condens. Matter, 2005, 71, 024502. 233 Y. Xu and Z. Zeng, Physica B, 2007, 391, 389.

Chem. Modell., 2008, 5, 119–149 | 147 This journal is

c

The Royal Society of Chemistry 2008

234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281

K. P. Ong, K. Bai, P. Blaha and P. Wu, Chem. Mater., 2007, 19, 634. M. Ogata, J. Phys.: Condensed Matter, 2007, 19, 145282. C. A. Marianetti and G. Kotliar, Phys. Rev. Lett., 2007, 98, 176405. Y. Idemoto, D. Shimizu and N. Koura, Electrochemistry, 2006, 74, 815. Y. Idemoto, D. Shimizu, N. Kours, Y. Takahashi and J. Akimoto, J. Ceram. Soc. Jpn., 2006, 114, 849. M. S. Park, S. H. Rhim and A. J. Freeman, Phys. Rev. B: Condens. Matter, 2006, 74, 205416. I. I. Mazin, Phys. Rev. B: Condens. Matter, 2007, 75, 094407. M. S. Islam, J. Mater. Chem., 2000, 10, 1027. M. S. Islam, Solid State Ionics, 2002, 154, 75. M. S. Islam, J. R. Tolchard and P. R. Slater, Chem. Comm., 2003, 1486. J. R. Tolchard, M. S. Islam and P. R. Slater, J. Mater. Chem., 2003, 13, 1956. A. Najib, J. E. H. Sansom, J. R. Tolchard, P. R. Slater and M. S. Islam, Dalton Trans., 2004, 3106. E. Kendrick, M. S. Islam and P. R. Slater, Solid State Ionics, 2007, 177, 3411. E. Kendrick, J. E. H. Sansom, J. R. Tolchard, M. S. Islam and P. R. Slater, Faraday Discuss., 2007, 134, 181. D. J. Driscoll, M. S. Islam and P. R. Slater, Solid State Ionics, 2005, 176, 539. M. S. Islam, D. J. Driscoll, C. A. J. Fisher and P. R. Slater, Chem. Mater., 2005, 17, 5085. G. C. Mather, M. S. Islam and F. M. Figueiredo, Adv. Func. Mater., 2007, 17, 905. C. A. J. Fisher and M. S. Islam, J. Mater. Chem., 2005, 15, 3200. M. Nolan, J. E. Fearon and G. W. Watson, Solid State Ionics, 2006, 177, 3069. G. Balducci, J. Kaspar, P. Fornasiero, M. Graziani, M. S. Islam and J. D. Gale, J. Phys. Chem. B, 1997, 101, 1750. A. Gotte, D. Spa˚ngberg, K. Hermansson and M. Baudin, Solid State Ionics, 2007, 178, 1421. C. E. Mohn, N. L. Allan and S. Stølen, Solid State Ionics, 2006, 177, 223. N. L. Allan, J. M. Lawton and W. C. Mackrodt, Philos. Mag., B, 1989, 59, 191. M. S. Islam and L. J. Winch, Phys. Rev. B: Condens. Matter, 1995, 52, 10510. A. Chroneos, I. L. Goulatis and R. V. Vovk, Acta Chim. Slov., 2007, 54, 179. M. J. Akhtar, C. R. A. Catlow, B. Slater, A. M. Walker and S. M. Woodley, Chem. Mater., 2006, 18, 1552. W. E. Pickett, Rev. Mod. Phys., 1989, 61, 433. K. Larbaoui, A. Tadjer, B. Abbar, H. Aourag, B. Khelifa and C. Mathieu, J. Alloys Compd., 2005, 403, 1. P. Blaha, K. Schwarz and P. Novak, Int. J. Quantum Chem., 2005, 101, 550. P. H. Zhang, S. G. Louie and M. L. Cohen, Phys. Rev. Lett., 2007, 98, 067005. F. J. Ohkawa, Phys. Rev. B: Condens. Matter, 2007, 75, 064503. E. Z. Kushinskii, I. A. Nekrasov, Z. V. Pchelkina and M. V. Sadovskii, J. E. T. P., 2007, 104, 792. T. V. Ramakrishnan, J. Phys.: Condens. Matter, 2007, 19, 125211. F. L. Tang and X. Zhang, J. Phys.: Condensed Matter, 2007, 19, 106216. E. Birsan, J. Magn. Magn. Mater., 2008, 320, 646. I. R. Shein, K. I. Shein, V. L. Kozhevnikov and A. L. Ivanovskii, Phys. Solid State, 2005, 47, 2082. K. I. Shein, I. R. Shein, V. L. Kozhevnikov and A. L. Ivanovsky, Metallofiz. Noveishie Tekhnol., 2006, 28, 123. I. R. Shein, V. L. Kozhevnikov and A. L. Ivanovskii, J. Phys. Chem. Solids, 2006, 67, 1436. I. R. Shein, V. L. Kozhevnikov and A. L. Ivanovskii, Semiconductors, 2006, 40, 1261. V. V. Bannikov, I. R. Shein, V. L. Khozevnikov and A. L. Ivanovskii, J. Magn. Magn. Mater., 2007, in press, doi:10:1016/j.jmmm.2007.09.012. D. Stoeffler and S. Colis, Mater. Sci. Eng. B, 2006, 126, 133. T. C. Gibb, J. Mater. Chem., 2001, 11, 456. T. C. Gibb, J. Mater. Chem., 2005, 15, 4015. S. C. Parker, D. J. Cooke, S. Kerisit, A. S. Marmier, S. L. Taylor and S. N. Taylor, J. Phys.: Condens. Matter, 2004, 16, S2735. A. Jones, D. Palmer, M. S. Islam and M. Mortimer, Phys. Chem. Miner., 2001, 28, 28. K. F. Austen, K. Wright, B. Slater and J. D. Gale, Phys. Chem. Chem. Phys., 2005, 7, 4150. A. M. L. Smith, K. A. Hudson-Edwards, W. E. Dubbin and K. Wright, Geochim. Cosmochim. Acta, 2006, 70, 608. A. L. Auzende, R. J. M. Pellenq, B. Devouard, A. Baronnet and O. Grauby, Phys. Chem. Miner., 2006, 33, 266.

148 | Chem. Modell., 2008, 5, 119–149 This journal is

c

The Royal Society of Chemistry 2008

282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301

D. Mkhonto and N. H. de Leeuw, J. Mater. Chem., 2002, 12, 2633. N. H. deLeeuw, J. R. Bowe and J. A. L. Rabone, Faraday Discuss., 2007, 134, 195. J. A. L. Rabone and N. H. de Leeuw, J. Comput. Chem., 2006, 27, 253. P. E. Ngoepe and C. R. A. Catlow, S. Afr. J. Sci., 2005, 101, 475. P. E. Ngoepe, P. S. Ntoahae, S. S. Mangwejane, H. M. Sithole, S. C. Parker, K. V. Wright and N. H. de Leeuw, S. Afr. J. Sci., 2005, 101, 480. V. L. Vinograd, B. Winkler, A. Putnis, J. D. Gale and M. H. F. Sluiter, Chem. Geol., 2006, 225, 304. J. Breu and C. R. A. Catlow, Inorg. Chem., 1995, 34, 4504. N. L. Allan, G. D. Barrera, M. Y. Lavrentiev, C. L. Freeman, I. T. Todorov and J. A. Purton, Comp. Mater. Sci., 2006, 36, 42. M. Y. Lavrentiev, W. van Westrenen, N. L. Allan, C. L. Freeman and J. A. Purton, Chem. Geol., 2006, 225, 336. J. A. Purton, N. L. Allan, M. Y. Lavrentiev, I. T. Todorov and C. L. Freeman, Chem. Geol., 2006, 225, 278. M. Reich and U. Becker, Chem. Geol., 2006, 225, 291. S. Tosoni, K. Doll and P. Ugliengo, Chem. Mater., 2006, 18, 2135. R. Orlando, F. J. Torres, F. Pascale, P. Ugliengo, C. Zicovich-Wilson and R. Dovesi, J. Phys. Chem. B, 2006, 110, 692. M. J. Gillan, D. Alfe, J. Brodholt, L. Vocadlo and G. D. Price, Rep. Prog. Phys., 2006, 69, 2365. A. R. Oganov, R. Martonak, A. Laie, P. Raiteri and M. Parrinello, Nature, 2005, 438, 1142. P. Carrez, D. Ferre and P. Condier, Philos. Mag., 2007, 87, 3229. M. Blanchard, K. Wright and J. D. Gale, Phys. Chem. Miner., 2005, 32, 332. M. Blanchard, K. Wright and J. D. Gale, Phys. Chem. Miner., 2005, 32, 585. D. Y. Jung and A. R. Oganov, Phys. Chem. Miner., 2005, 32, 146. Issue 5, Mol. Simul., 2005, 31.

Chem. Modell., 2008, 5, 119–149 | 149 This journal is

c

The Royal Society of Chemistry 2008

Density functional theory studies of alloys in heterogeneous catalysis John R. Kitchin,*ab Spencer D. Millera and David S. Shollzab DOI: 10.1039/b608782p

1. Introduction Heterogeneous catalysis is used broadly today in industry. Catalysts crack crude oils into the fuels we use daily. They are used in catalytic converters to minimize dangerous gas emissions from internal combustion engines. Many chemicals are made in large quantities using heterogeneous catalysts. Ethylene oxide is one well known example of a commodity chemical made in this way. Heterogeneous catalysts can be made of many materials, including carbon, metal oxides, transition metals, and a variety of promoters such as alkalis or halogens. The catalytic properties of these materials depend on their detailed morphological structure and in particular on the properties of their surfaces. These properties may additionally depend on the reaction environment. These complexities can make it difficult to understand how changes in catalyst composition and structure are related to the catalyst activity or selectivity. This situation is exacerbated by limitations in our ability to characterize catalysts at atomic scales under reaction conditions. Computational studies of heterogeneous catalysts can often provide additional insight into them, because the detailed atomic structure in these simulations is completely specified. Decades of heterogeneous catalysis research have found empirically that combinations of transition metals are often better catalysts than the pure metal components by themselves.1 It remains challenging in many cases, however, to understand why these multi-metallic materials are better catalysts than the pure components. We focus in this review on combinations of transition metals where mixing of the components has occurred on atomic length-scales (as opposed to the conceptually simpler situation where the two components co-exist in pure but separated forms). We will refer to these combinations generically as alloys, although below we define a more precise nomenclature for the range of possible materials that exist. The composition of an alloy surface is often very different than the alloy’s bulk composition due to segregation effects. The overall activity of a catalyst is determined by the distribution of active sites. This distribution may be very heterogeneous both in terms of the local environments that define each site and their chemical reactivities. The reactivity of any specific active site can be affected by contributions from strain, ligand and ensemble effects. Computational methods are well suited to exploring these effects because one can simulate model systems where only one effect dominates as well as model systems where multiple effects are important. Quantum chemical simulations based on density functional theory (DFT) are widely regarded as reaching the appropriate compromise between chemical accuracy and the need to study structurally complex extended materials in order to tackle problems associate with heterogeneous catalysis involving alloys. A review of DFT and heterogeneous catalysis can be found in the previous SPR2 and that review also listed several general reviews of applications and foundations of DFT. Experimental and theoretical studies of monolayer bimetallic surfaces were recently reviewed.3 In a b

Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh PA 15213, USA. E-mail: [emailprotected]; Fax: +1 412 268 7139; Tel: +1 412 268 7803 National Energy Technology Laboratory, Pittsburgh PA 15236, USA

{ Address from January 2008: School of Chemical and Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA

150 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

this review, we examine recent applications of density functional theory in the study of alloy heterogeneous catalysis, limiting our attention to papers published between approximately mid-2005 and mid-2007. 1.1 Nomenclature conventions in this review We use the term alloy or bimetallic to mean any combination of two or more metals that are mixed to some degree on atomic length-scales. In bulk materials, this term encompasses a broad range of structures, including disordered alloys in which the chemical species form a solid solution by being randomly distributed on the lattice sites of the material’s crystal structure, ordered alloys in which the chemical species have a well defined ordering, and of course many intermediate structures with greater or lesser degrees of short and/or long range ordering. As emphasized above, heterogeneous catalysis is largely a surface effect. There are several examples mentioned later in the review where behavior of surfaces or nanoparticles is completely different than what is expected from bulk behaviors. It is therefore useful to use terms that distinguish between various types of alloy surfaces. These bimetallic structures include sandwiches, pseudomorphic monolayers and near surface alloys. These structures are illustrated and briefly defined in Fig. 1. In describing the catalytic properties of an alloy there are three primary mechanisms responsible for the modification of an alloy surface’s catalytic properties at the atomistic scale: strain, ensemble and ligand effects. An alloy typically has a lattice constant that differs from that of the pure metal components, resulting in a different average metal-metal distance in the alloy compared to that in the pure metal components. The electronic structure of an alloy is largely determined by the overlap of neighboring atomic orbitals. The change in metal-metal distances results in a change in orbital overlap and consequently a change in the electronic structure and the chemical properties. This effect is termed a strain effect, with strain defined relative to some reference system, usually one of the pure components in a similar structure. One way to probe strain effects computationally is to change the lattice constant of the system, or to examine pseudomorphic monolayers as illustrated in Fig. 1b. It can often happen that the active site in an alloy is composed of more than one species of metal atom. Adsorbates then bond to different metals simultaneously, and consequently the adsorption energy or reaction barriers are different than if the active site was composed of only one type of metal. This effect is called an ensemble effect. Fig. 1c and Fig. 2b shows one example where ensemble effects can be probed in a DFT calculation. We note that another somewhat different use of the term ensemble effect is to describe scenarios where a critical number or geometry of

Fig. 1 Side view of slab models of various bimetallic structures often used in computational studies. In each case, the bottom layers of the material are defined using the structure of a specified bulk material. The number of surface and bulk layers varies in different studies. (a) In the sandwich structure the surface is one component, often the same component as the bulk material and the second layer is another component. This structure is often used to determine ligand effects. (b) The pseudomorphic monolayer structure combines strain and ligand effects in one structure by placing a second component on top of a bulk material. (c) The near surface alloy combines strain, ligand and ensemble effects in one structure by considering an alloy film defined by just a few atomic layers on top of an ordered bulk material.

Chem. Modell., 2008, 5, 150–181 | 151 This journal is

c

The Royal Society of Chemistry 2008

Fig. 2 Illustration of strain, ensemble and ligand effects. (a) Adsorbate bonding is affected by a change in metal-metal bond distances. (b) Adsorbate bonding is affected by bonding to two different metals instead of one type of metal. (c) Adsorbate bonding is affected by heterometallic bonding of the active site atoms with neighboring atoms.

adsorption sites is required for a reaction to occur, and the critical number and geometry of sites is termed the ensemble. Finally, the electronic structure of a metal atom that is bonded to unlike metals is typically different than the electronic structure of the metal atom in its pure state. The change in electronic structure is again related to overlap of neighboring orbitals; the presence of different atomic orbitals changes the orbital overlap of the reference atom and thus, its electronic structure. This effect is termed a ligand effect and it is an indirect but significant effect. This effect is illustrated in Fig. 2c. In reality, all three effects usually take place on any practical catalyst and they are not independent. Strain and ligand effects in particular are both related to orbital overlaps of neighboring metal atoms. As a result, it can be difficult to disentangle the significance of each effect. Efforts aimed at separating out the effects usually rely on model surfaces such as those in Fig. 1. Additional effects that can be significant in alloy catalysis include site distributions and bifunctional mechanisms. Since a catalyst may be comprised of a distribution of active site compositions and geometries, there will be a distribution of adsorption energies and reaction rates. In nanoparticles, edge and corner sites may form a significant fraction of the active sites. These sites will have different properties than more highly coordinated sites on the terraces. The true measure of catalytic activity is the average of these rates, weighted by the distribution of sites. This average rate can be substantially different than the rates associated with an individual site. In 152 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

bifunctional mechanisms, one process occurs on one type of site, and another process (perhaps involving the products of the first process) occurs on a different type of site. The electrocatalysis reactions reviewed in section 6 include several examples of bifunctional mechanisms. With regards to the sign of adsorption energies we have used the convention that an exothermic adsorption energy has a negative sign. That is, more negative adsorption energies correspond to more energetically favored states. In some papers the opposite convention was used, and in those cases we changed the sign of their energies when reporting results in this review. We used the units presented in each paper throughout the review. 1.2 Review organization The review is organized in the following manner. We first discuss modeling studies of segregation in alloys including the effects of reactive environments in section 2. We follow that with a review of the adsorption of small molecules on alloy surfaces in section 3. Neither of these topics alone offer a complete description of the catalytic properties of bimetallic catalysts, but a thorough understanding of these phenomena is a prerequisite to serious efforts to model these catalysts. Simulations of reactions on alloy surfaces are presented in section 4. Section 5 briefly examines two topics closely related to bimetallic catalysts, namely metals supported on metal oxides and the magnetic properties of alloys. Studies of alloy surfaces that are related to electrocatalysis are reviewed in section 6. We conclude in section 7 with some comments about future directions and needs in the application of theoretical methods to bimetallic heterogeneous catalysis.

2. Segregation It is difficult to understand the catalytic behavior of a surface if the actual composition and structure of that surface is unknown. Segregation is crucial to understanding what the surface of a bimetallic material actually looks like, because a simple termination of the bulk composition is generally not the most energetically favorable configuration. In fact, very dramatic differences can be seen in the composition and organization of the first few layers of a surface of a material compared to its bulk configuration, and this can be further complicated by the presence of adsorbates on the surface. In section 2.1 we discuss how DFT has been used to help understand experimental alloy surface characterization. We then move on to using DFT to study segregation in nanoparticles in section 2.2, and alloy bulk surfaces in section 2.3. The potentially powerful effect of adsorbate induced segregation is discussed in section 2.4, and statistical mechanical simulations of segregation in section 2.5. 2.1 Alloy surface characterization Although the focus of this article is catalysis on alloy surfaces, we recognize the importance of characterizing alloy surfaces as an essential aspect of research in this area. Here we briefly mention papers that focus solely on using DFT to help interpret experimental characterization of alloy surfaces. X-ray photoelectron spectroscopy (XPS) is commonly used to examine alloy structures because the binding energy of core electrons is sensitive to the chemical environment around the atom. Thus, formation of alloys may cause shifts in the XPS spectra that allow one to say an alloy has been formed. However, many factors in the alloy affect these shifts and some of them may cancel each other so a shift that is near zero does not necessarily imply no change in the chemical environment. In a disordered alloy the wide range of chemical environments can cause disorder broadening in the signals, further complicating the analysis. Computational tools based on DFT can simulate Chem. Modell., 2008, 5, 150–181 | 153 This journal is

c

The Royal Society of Chemistry 2008

these spectra for different alloy models and facilitate the interpretation of these experiments. Olovsson and co-workers have been most active in this area. They have presented a broad and detailed framework for calculating core level shifts including an extension to Auger spectroscopy.4 Their approach has been applied to many alloy systems including the following random fcc alloys: CuPd, AgPd, CuNi, NiPd, CuAu, PdAu, CuPt, and NiPt.5 They have used their methods to follow the layer composition profile in film growth of PdCu and PdAg alloy films on Ru(0001).6 In a related work they studied how the electronic structure of embedded alloy layers is related to the bulk alloys6 and developed a method to characterize deep interfaces using computed core-level shifts.7 The effect of disorder broadening and the contributions from initial and final states was examined.8 Significantly they found that the importance of these effects seems to be metal dependent. 2.2 Nanoparticles Nanoclusters are of great interest because of their high surface area to mass ratio, which allows an exposure of a large catalytic surface per volume of catalyst. Because many of these catalytic compounds are costly noble metals, such as platinum, it may be advantageous to create bimetallic nanoclusters which expose an even higher number of catalytically active noble metal atoms per volume of the more expensive component. Ferna´ndez, Balba´s, et al. studied two such nanoclusters, CuAu and PtPd, finding the lowest energy isomers of clusters containing 5–22 pairs of atoms using the many body Gupta potential with parameters from the bulk alloys, followed by energy minimization using the DFT GGA method.9 They found that the ordering of these lowest energy isomers by the Gupta potential and DFT were not in good agreement, however some structural trends found for the nanoclusters using the Gupta potential were the same as those found via DFT. Specifically they found CuAu structural properties corresponding to the existence of icosahedral patterns in the geometry of the lowest lying isomers of the clusters using both methodologies. Additionally, the segregation of PtPd nanoclusters, which form Pt cores with Pdenriched surfaces, were reproduced using both methodologies despite the disagreement in energy ordering of the lowest level isomers. This suggests that although the Gupta potential parameterized for the bulk materials may not definitively identify the lowest energy isomers, it may be capable of identifying overall trends in the structure of nanoclusters. Following on this work, Paz-Borbo´n, Johnston, Barcaro, and Fortunelli performed similar analysis for 34 atom Pd-Pt clusters using a genetic search algorithm and the many body Gupta potential followed by energy minimization by DFT.10 The general conclusions of their work were the same: the ordering of the lowestenergy configurations of the clusters at a given composition was not consistent between the Gupta potential minimization and DFT calculations. They concluded that although the Gupta potential predicted several nearly degenerate lowest energy configurations, reoptimization by DFT showed that the configuration known as Dhcp(DT), which for the 20–14 Pd-Pt composition involves a core that is a double tetrahedron of Pt atoms, is the lowest energy state for all compositions. They additionally confirmed that the results of the Gupta potential and DFT calculations are in agreement that Pd segregates to the surface, forming Pt cores at the center of the nanoclusters. Also considering bimetallic nanoclusters, Sahoo, Rollman and Entel have studied segregation and ordering in Fe-Ni nanoclusters of 13 and 55 atoms using DFT.11 They studied every different composition of 13 atom nanoclusters and determined that in every case the lowest energy geometry is a distorted icosahedra where the Ni atoms occupy surface sites. They also studied the mixing energy of these nanoclusters, a measurement of the configuration’s stability. They found that the most stable composition in the 13 atom nanocluster was Fe3Ni10. For the 55 atom cluster they 154 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

only considered two compositions: Fe12Ni43 and Fe43Ni12. In both cases they found that Ni atoms are found on the surface in the lowest energy geometry with an Fe core. In the Fe-rich case the Ni atoms on the surface formed a very particular configuration with no Ni–Ni bonds. An examination of the mixing energy vs. the number of Fe atoms on the surface confirmed that the most stable configurations are those where the surface is dominated by Ni atoms. 2.3 Bulk surfaces Although nanoclusters are of great interest, due largely to their high surface area, bimetallic surfaces remain an active field of catalysis research. Similar to nanoclusters the inclusion of multiple metals can serve to lower the cost of the catalyst while maintaining its activity, to improve the activity of the catalyst, or to protect it from corrosion and poisoning. Key to understanding how a bimetallic catalyst will operate in a chemical environment is understanding what surface is actually present for a given bulk composition. Ropo presented an ab initio study of AgPd surface segregation and how it depends on the surface orientation.12 DFT calculations were performed using the exact muffin-tin orbitals method using the coherent potential approximation on 8 layer slabs, the bottom 6 of which were fixed at bulk alloy compositions. He found that the surface layer relaxation played a minimal role in surface segregation for (111), (110), and (100) surfaces, but that this surface relaxation did play an important role in segregation in the second layer (the first subsurface layer) for the (110) and (100) surfaces, with an impact of up to 9% in Ag concentration from the non-relaxed state for a 4% inward movement on the (100) surface. He also found results that matched with previous theoretical and experimental results which show that Ag strongly segregates to the surface, with Pd only appearing on the surface for Pd-rich alloys. Yuge et al. explored the Cu75Pt25 (111) surface using DFT calculations and the cluster expansion technique, with Monte Carlo simulations to study segregation behavior at finite temperature.13 DFT calculations were carried out on 9 layer slabs, selected to insure that bulk behavior would be reproduced by the lower layers. Great care was taken in the selection of the cluster expansion parameters from the bulk alloy, which resulted in an estimated predictive error of 4.0 meV/atom. The cluster expansion results were then used in a Monte Carlo simulation to study the segregation of the alloy surface at 800 K, near a phase transition to a L12-ordered alloy. MC showed that the segregation on the alloy surface is driven by a competition between segregation energies and a slight ordering tendency, with the end result of platinum segregation to the first and third layers, copper segregation to the second layer (with bulk-like behavior on the 4th and lower layers) and no long range order in the top layer, despite the presence of some short-range order. Investigation of surface mixing enthalpies identified two surface ground-state atomic arrangements (for layers 1–3) for the bulk L12-ordered structure for Pt compositions of 0.167 and 0.333, suggesting short-range order. They studied the electronic structure of these configurations to produce their density of states (DOS) band structures, and the top layer for both cases shows a significant peak around 3.0 eV. This matches with experimental results, showing significant ordering in the first layer surface electronic states of the alloy. Løvvik studied the segregation of Pd based alloys by performing DFT and bandstructure calculations on Pd(111) slabs alloyed with 5% of Ag, Au, Cd, Cu, Fe, Mn, Ni, Pb, Pt, Rh, Ru, and Sb.14 The goal of these calculations was to find the segregation energy of the alloy atoms, and how this varies with the depth of the substituted atom. The results of these calculations showed that Mn, Ru, Fe, Rh, Ni, Cu, and Pt all have positive segregation energies suggesting that they tend to segregate into the lower levels leaving Pd on the surface while Ag, Cd, Au, Sn, and Pb all had negative segregation energies suggesting that they segregate to the surface. This matches well with experimental results, except in the case of Cu which Chem. Modell., 2008, 5, 150–181 | 155 This journal is

c

The Royal Society of Chemistry 2008

has been seen to segregate to the surface in Pd–Cu nanoparticles. Cu and Ni were also found to have oscillatory depth profiles, a feature that has also been observed experimentally for the Ni alloy. Lastly the relationship between atom size (based on the covalent radii) and experimental surface energies with segregation energies were tested, showing that although there are trends (larger atoms tend to segregate to the surface as do those with lower surface energies) they are unfortunately not strong enough to be of predictive value. Ni3Al is an alloy of interest due to its high temperature strength, high hardness, wear resistance, low density, high melting point, and high temperature corrosion resistance. Because of these properties Ni3Al is used in gas turbine and cutting tool applications. In order to improve its oxidation-resistance it can be alloyed with Pt, and it is believed that the surface segregation of the Pt may serve an important function in this improvement. Jiang and Gleeson studied this segregation using wellconverged DFT results for the low-index (100), (110), and (111) planes with one platinum per (2 2) surface unit cell.15 The results confirmed what is known from experiments: namely that Pt strongly segregates to the surface layer. They also found that Ni segregates to the second layer for both the (100) and (111) surfaces, but for the (110) surfaces with its narrower spacing they found that Pt segregates to both the first and second layers. These results were confirmed by Qin et al.16 with DFT calculations and experimental methods. Additionally Qin added evidence that size effects are responsible for this segregation behavior. Pt segregation to the surface helps to explain why Pt improves the oxidation-resistance of the alloy: by displacing Ni to the lower levels it increases the surface Al:Ni ratio, allowing the formation of Al2O3 to be more favorable to the NiO oxidation. By forming a scale exclusively composed of Al2O3, further oxidation is prevented. 2.4 Adsorbate-induced segregation Segregation, in addition to being caused by the properties of the alloy surface, can be strongly influenced by the presence of adsorbate molecules on the surface. This can have a powerful effect on the surface composition, which can thus vary depending on the conditions in which the alloy is manufactured. Han, Van der Ven, Ceder, and Hwang studied these effects for oxygen adsorption on PtRu alloys (111) surfaces using DFT and the cluster expansion method, as well as Monte Carlo simulations.17 Standard cluster expansion calculations could account for either the alloy composition at the surface or for the presence of adsorbates, but not for both simultaneously, so a special approach was required to perform these calculations. A ‘‘coupled cluster-expansion’’ was therefore used which was composed of two interwoven lattices, one representing the surface composition, and the other representing the empty or occupied adsorption sites. The results of the coupled cluster expansion were then utilized in Monte Carlo simulations to calculate the surface properties at finite temperatures from 100–1100 K and different Ru chemical potentials in the bulk beneath the surface (corresponding to different alloy compositions). What they found was that although in a vacuum Pt would be expected to fully saturate the surface, oxygen adsorption on the surface could dramatically increase the amount of Ru on the surface (because the strong binding between Ru and O easily overcomes the higher surface energy of Ru), where it forms island structures. The size and structure of these Ru islands could be crucial in the effectiveness of the alloy as a catalyst in fuel-cell applications, confirming that these adsorbate-driven segregation effects must be seriously considered. Adsorption driven segregation is not necessarily only an issue in the manufacture of an alloy, but also in understanding how operating conditions may effect segregation compared to what is seen by studies performed in ultra-high vacuum (UHV) conditions. Vestergaard et al. examined this issue for the Au/Ni (111) surface by using both experimental techniques and DFT calculations.18 They found that when the AuNi alloy system is exposed to high pressures of CO, the surface is heavily 156 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

modified, exposing Au clusters on the surface. By studying STM movies of the process occurring on the surface under these high CO pressure conditions, and utilizing relevant DFT calculations to help illuminate the underlying physical driving forces, they found that CO selectively adsorbs to the Ni atoms on the surface, and hypothesize that it forms Ni(CO)3 at step edges which subsequently desorb from the surface. This desorption causes the step edges to move, and Au atoms diffuse along the step edge to form clusters that are left behind on the resulting terraces as the step edge moves away. Although this is not traditional segregation behavior, this chemical attack on the bimetallic surface under operating conditions helps to highlight the importance of understanding the limitations of UHV approaches and the utility of computational and experimental methods to better understand surface behaviors under more complicated conditions. We have already seen how Pt alloyed with Ni3Al can strengthen the alloy against oxidation, and how studying the segregation of the surface can help explain why the alloy surface is more oxidation-resistant. Could the presence of oxygen have an effect on this surface segregation? A similar oxidation-resistance benefit has been postulated for silver addition to copper. Kangas et al. addressed this issue with a study of Ag segregation in the Ag/Cu system when exposed to oxygen adsorbates.19 They used a DFT approach, calculating segregation energies for silver on the first, second, and third layers with different oxygen coverage on the surface. It was found that at 0.25 monolayer (ML) coverage, segregation is not strongly favored by the energies, but at 0.5ML and higher O coverage the segregation energies suggest that Cu strongly segregates to the surface. This suggests that what may form under actual conditions is a Cu/O phase on the surface with an unoxidized subsurface Cu/Ag alloy covering the remaining bulk Cu. The underlying Cu/Ag alloy layer prevents further oxidation of the Cu bulk. 2.5 Statistical mechanical simulations Ruban et al. used the generalized perturbation method (GPM) based on the coherent potential approximation (CPA) to study segregation on the surfaces of Ag-Pd and isoelectronic alloys (Cu-Pd, Au-Pd, Cu-Pt, Ag-Pt, and Au-Pt).20 Although the longperiod superstructures they identified are unlikely to actually occur, they concluded that the ordering behavior of Ag-Pd that they predicted can be partly observed on alloy surfaces. They found that these alloys tend toward a formation of (111)-type ordered structures, a tendency which originates from a relatively strong ordering interaction at the second coordination shell. In the case of Pd-rich Ag-Pd alloys this leads to an oscillating concentration profile for the (111) surface. Other tendencies were also identified such as the hindrance of long range order caused by Pt-Pt interactions in Ag(Au)-Pt alloys. Mu¨ller approached the problem of surface segregation and interface properties of bimetallic alloys using a combination of DFT, a cluster expansion, and Monte Carlo. The two systems studied were the Zn-Al solid solution21 and the Pt-Rh (111) surface.22 DFT calculations were performed to parameterize the cluster expansion (CE) that was then used to produce a predicted ground state line. Configurations near this line that had not yet had DFT calculations performed for them were then used in new DFT calculations to produce a new CE until the entire ground state line was characterized by DFT calculations. This CE was then used in Monte Carlo simulations to simulate the segregation behavior of the surface at finite temperatures. For PtRh(111) the results of MC showed that for temperatures of roughly 1000 K Pt composed 70% of the surface, with no long range order present. These results are in excellent agreement with experimental results showing 69% Pt on the surface, and no long range order. For the Zn-Al solid solution he studied the formation of precipitates in the Al rich alloy, reproducing experimental results and identifying the correct precipitate formations ([111]-soft precipitates of Zn embedded in a matrix of Al that has an elastically soft [100] direction). These calculations also Chem. Modell., 2008, 5, 150–181 | 157 This journal is

c

The Royal Society of Chemistry 2008

allowed for the identification of the driving forces behind these behaviors with interfacial energies stabilizing the (111) oriented interfaces in the Al-Zn solid solution, while the Pt-Rh segregation was explained by the on-site energies of the near-surface layers.

3. Adsorption properties on alloy surfaces Heterogeneous catalysis involves adsorption of intermediates on surfaces. Alloy surfaces pose particular challenges in modeling. The bonding to alloy surfaces is different than that on the pure metal surfaces due to heterometallic bond formation (ligand effects), multifunctional adsorption sites (ensemble effects), lattice constant changes (strain effects) and coverage effects. Disentangling these effects is non-trivial and can typically only be done clearly in model systems. In this section we review studies of atomic and molecular adsorbates on alloy surfaces. The section is organized from simplest adsorbates to most complex. Within each subsection we have organized the papers reviewed from most general to most specific. We begin with hydrogen in section 3.1 and oxgyen in 3.2. Carbon monoxide, carbon dioxide and other small molecules such as NO are presented in 3.3. Hydrocarbons such as ethylene and acetylene are discussed in 3.4. Finally, carbon and sulfur are presented in section 3.5. 3.1 Hydrogen Greeley and Mavrikakis have investigated the adsorption of surface and subsurface hydrogen and the diffusion of hydrogen on a wide range of near-surface alloys (NSAs).23 Near-surface alloys are defined simply as alloys where the surface composition differs from the bulk composition, even if the compositions are unstable. Remarkably, they found that hydrogen adsorption on some NSAs was as weak as the adsorption on noble metals, while simultaneously dissociating hydrogen more easily (i.e. lower dissociation barriers). Subsurface hydrogen was generally found to be unstable in NSAs with respect to gas phase hydrogen,23 with the exception of Pd. The ability of some NSAs to simultaneously bind hydrogen weakly and easily dissociate hydrogen makes them interesting candidates for fuel cell anode electrocatalysts.24 Carbon monoxide poisoning is a serious problem in hydrogen fuel cells because the hydrogen fuel contains trace amounts of CO. One strategy for mitigating the effect of CO poisoning is to design new electrocatalysts that bind CO more weakly. Surfaces that bind CO weakly tend to bind hydrogen weakly too, and weak binding of H suggests high dissociation barriers. If that were always true, there would be little hope for solving this problem. The discovery of new NSAs that avoid this problem is remarkable. In particular, Pd and Pt-terminated NSAs were shown to have weak CO binding energies and facile hydrogen dissociation barriers. These types of surfaces are among the best known anode electrocatalysts to date. Substrate interactions and strain effects are two of the primary mechanisms that modify the adsorption properties in alloy surfaces. In pseudomorphic Pd layers on Cu(111) the Pd atoms are compressed by the smaller Cu lattice constant. In addition, strong Cu-Pd bonds cause rehybridization of the Pd d-band, resulting in weaker H adsorption energies on the Pd monolayer than on Pd(111).25 In contrast, for a Cu monolayer on Pd(111) the tensile strain of the Cu monolayer should increase the H adsorption energy, while the Pd-Cu bonds should reduce the adsorption energy. The net result is coincidentally that the Cu monolayer has the same H adsorption energy as Cu(111).25 The adsorption energies of hydrogen were changed by less than 60 meV when a water bilayer was added to the surface compared to the vacuum reference. This suggests that trends in gas-phase calculations may be sufficiently representative of electrochemical environments to identify trends in those environments. 158 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

The adsorption energy of hydrogen can play a crucial role in the selective hydrogenation of unsaturated bonds.26 However, hydrogen can induce changes in the composition of an alloy surface. Ag-Pd alloys have been used to tune the adsorption energy of hydrogen. Under vacuum conditions, Ag segregation is observed, but with increasing coverage of hydrogen atoms Pd segregation gradually occurs. This work highlights one of the challenges in interpreting experimental results on alloy surfaces: the composition of alloy surfaces can typically only be measured on clean surfaces, but the composition under reaction conditions may be different than that of the clean surface. In addition to developing alloy surfaces with tunable adsorption energies to modify selectivity, another approach is to modify the diffusion of hydrogen on alloy surfaces. Fearon and Watson have found that differences in the diffusion of hydrogen on Pt and PtSn(111) surfaces may be responsible for changes in the catalytic activity of these surfaces.27 Hydrogen adsorption and vibrational features on three-fold adsorption sites on Pt and PtSn surfaces are nearly identical. The diffusion of hydrogen on these surfaces is very different. On Pt, the diffusion barriers are typically 4–5 kJ/mol. In contrast, on the PtSn surfaces the barriers for diffusion are at least five times higher than those on Pt due to the very weak interactions of the Sn atoms with the hydrogen atoms. These differences are likely to be important in catalytic hydrogenations. 3.2 Oxygen Computational screening studies are often expected to be more efficient than an equivalent experimental study. Still, the expense of running large numbers (thousands) of DFT calculations is not trivial and the combinatorial space for computational work is still just as large. Consequently, there is a significant need for simpler methods to calculate surface properties to help narrow the focus of computational studies. Greeley and Nørskov have developed a simple way to rapidly estimate the oxygen adsorption energy on alloy surfaces based on DFT calculations.28 Their scheme relies on the fact that oxygen adsorption energies tend to correlate strongly with the d-band center of the clean metal surface. They found that the d-band center of an alloy surface can be reasonably approximated from linear combinations of pure overlayer data from their clean surface electronic structure database. Thus, with a reasonable estimate of the d-band center, they could use the correlation of oxygen binding energy to estimate the binding energy on an alloy surface without performing additional DFT calculations. With their simple scheme they predicted the oxygen binding energy on several hundred binary alloy surfaces and compared them directly to DFT calculations. They found that in the majority of cases the prediction error was less than 0.2 eV. The model had large prediction errors for a few systems involving Pt-group overlayers on noble metals. This work points to the utility of simple chemical descriptors of alloy surfaces such as the ‘‘effective’’ d-band center of a multi-component adsorption site in making reasonable correlations for narrowing the scope of computational studies. Oxygen bonds to transition metals strongly. In alloy systems, this may have an unintended consequence of inducing segregation of an undesirable alloy component under reaction conditions. The stability of Pt-3d-Pt(111) sandwich structures (3d = Ti, V, Cr, Mn, Fe, Co or Ni) in the presence of oxygen was investigated using DFT to determine whether oxygen could induce a segregation of the subsurface 3d metals to the surface.29 It was found that as little as 12 a monolayer of oxygen could destabilize the sandwich structure and cause segregation of the more reactive 3d metal to the surface. These results were confirmed experimentally for Pt-Ni-Pt(111) and Pt-Co-Pt(111) structures, where the activation barrier to segregation of subsurface Ni was found to be approximately 15 kcal/mol and for Co to be 7 kcal/mol. The authors noted that the apparent activation energy was temperature dependent, Chem. Modell., 2008, 5, 150–181 | 159 This journal is

c

The Royal Society of Chemistry 2008

indicating the possibility there are two barriers: one for inward diffusion and one for outward segregation. Surface composition heterogeneity can also strongly affect the adsorption properties of alloy surfaces. Ji and co-workers have investigated the adsorption and diffusion of OH on Pt(111) and PtMo(111) surfaces.30 They approximated the PtMo(111) surface in a 7 layer, 3 3 Pt surface supercell with a single Mo atom in each layer, yielding a Mo concentration of 11%. The presence of Mo in the surface layer caused a bigger relaxation of the surface layers than that found in Pt(111), probably due to the strong Pt-Mo bond strength. The Mo atom tends to sink deeper in the surface than Pt atoms do. OH preferentially adsorbs on the top site of both surfaces with the most stable adsorption energy of 2.35 eV on Pt(111) and 3.32 eV on the top Mo site of the PtMo(111) surface. On Mo(110) (the most close-packed bcc surface) OH prefers a three-fold site with an adsorption energy of 4.43 eV. Consequently, the dramatic increase in adsorption energy of OH on the alloy surface is due primarily to the bonding of OH to the Mo atom. The stronger adsorption energy of OH on the Mo atom has consequences on reactions that have OH as a product, for example, water dissociation. In the same work the authors found that the dissociation barrier for water on Pt(111) to be 0.89 eV, whereas on the alloy surface with OH ending on the Mo atom the barrier is only 0.63 eV.30 Thus, water dissociates more readily on the alloy surface than the Pt(111) surface. Of course, if a catalytic cycle is to continue, the OH fragment must be able to diffuse away from the Mo atom. On Pt(111) they calculate a diffusion barrier of only 0.13 eV from a top site to a top site through a bridge site transition state. In contrast, on the PtMo surface the barrier of OH leaving a Mo top site is around 1.2–1.3 eV, suggesting that once OH forms on the site, it will stay there a long time.30 There could still be important repulsive effects that weaken OH adsorption energies on neighboring sites, but this is an area of research that has not been fully explored. In the oxygen reduction reaction, OOH is postulated to be one of the main products in the first step of some mechanisms. Seminario et al. have investigated the adsorption and dissociation of OOH on three atom Pt-X-X and Pt-Pt-X (X = Co, Cr, and Ni) bimetallic clusters.31 They found these X atoms are better adsorption sites for OOH than Pt, but they hypothesize that the strong adsorption may not facilitate dissociation. Instead, the X atoms appear to transfer electrons to the Pt atom, making the Pt more negatively charged, and more able to transfer electrons to oxygenated species. Thus, the authors speculate that subsurface X atoms may contribute favorably to facilitating OOH dissociation. However, the authors did find exceptions to that suggestion; clusters with two Cr atoms were able to dissociate OOH with no barrier.31 Larger alloy clusters have also been investigated. Lacaze-Dufaure et al. have investigated the adsorption and dissociation of oxygen on Cu-doped Al clusters up to 31 metal atoms.32 They found that the addition of Cu stabilizes the aluminum clusters differently depending on where the Cu atom is. If the Cu atom is at the surface, the cluster is more stable by 0.31 eV, but if it is in the center of the cluster the cluster is stabilized by 1.18 eV. This indicates that Cu anti-segregates in these clusters, preferring to be in the bulk, more highly coordinated, environment rather than at the low-coordinated surface. Furthermore, electron transfer from the Al cluster to the Cu atom is significant (0.7 e) at the surface, and negligible in the bulk. Interestingly, the Cu 3d orbital filling remains nearly constant, and all the electron transfer changes are observed in the 4s and 4p orbitals. The authors found that incorporation of Cu atoms into the oxygen adsorption site on the Al clusters results in a weakening of the oxygen adsorption energy. They obtained an oxygen adsorption energy of 5.99 eV on a pure Al31 cluster and 5.64 eV on the same site with a Cu atom in it for an Al30Cu cluster. On another type of site they calculated an adsorption energy of 5.58 eV on a pure Al31 cluster, and 5.32 and 5.28 eV for the same site depending on where the Cu atom was placed for an Al30Cu cluster.32 Cu incorporation into the active site reduces the oxygen adsorption energy 160 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

by reducing the charge transfer from the cluster to the oxygen atom. The authors note a large sensitivity of their results to relaxation effects. 3.3 Carbon monoxide and other small molecules 3.3.1 Carbon monoxide. Carbon monoxide is one of the most commonly used probe molecules in the study of the chemical properties of metal surfaces. CO represents a step in the direction of complexity compared to atomic adsorbates and diatomic molecules. On one hand, the bonding involves molecular orbitals and it is sensitive to the detailed electronic structure of the metal surface. This allows one to use the CO bonding properties as a probe of changes in surface electronic structure. Yet at the same time, in many cases CO retains aspects of the simplicity that atomic adsorbates have. The effect of alloy formation within a supported monolayer on the adsorption of CO on these surfaces was investigated by Gonzalez and Illas.33 In that work, the authors used model monolayers of Pd, Rh, and Cu as well as mixed monolayers of these metals supported on Ru(0001) to determine the role of alloy formation in modifying the CO adsorption properties. Generally, they found that the variations in adsorption energies were largely explained by changes in the ability of a metal to back donate charge into the CO(2 p*) orbital, as indicated by changes in the d-band center and density of d-states at the Fermi level. Although the adsorption energies varied from 21 kcal/mol (Cu/Ru(0001)) to 44 kcal/mol (Rh/Ru(0001)), the adsorption geometry was practically the same on these surfaces. In the previous work, the authors found that CO bonds more weakly to Pd monolayers supported on Ru(0001) than on Pd(111).33 Remarkably, incorporating Cu atoms into the supported Pd monolayer was found to increase the CO adsorption energy on the Pd atoms by up to 12 kcal/mol, depending on the Cu concentration. In contrast, CO bonds slightly more strongly to Cu/Ru(0001), and incorporation of Pd into the Cu monolayer tends to decrease the CO adsorption energy on the Cu atoms by up to 6 kcal/mol. These effects are attributed to an increase in the back-donation to CO on these surfaces. In related work, Sakong et al. have used CO adsorption on ordered Cu-Pd alloys and surface alloys to help distinguish between these ligand, ensemble and strain effects, and in some cases show they are related.34 They attribute the increased reactivity of Cu atoms with increasing Pd content in a surface alloy as due to an ‘‘effective compressive strain’’ due to the larger size of the Pd atoms compared to the Cu atoms. Despite observed variations in the CO adsorption energy on the Cu atoms, the CO vibrational frequency was hardly changed. On Pd surfaces, in contrast, large blueshifts were observed on the compressed Pd overlayers. The interaction of CO with alloy impurities is significantly different if the impurity is a metal adatom on the surface, or a substitutional impurity in the surface. The adsorption properties of CO on Au/Ni(111) surfaces has been studied with Au in the form of adatoms35 and substitutional impurities.36 In the case of adatoms, the adsorption energy of CO in sites that include a nearest neighbor of the Au adatom is reduced by up to 1.2 eV. At larger distances from the adatom the adsorption energies are nearly unchanged. The Au adatom was found to bind CO much more strongly than a flat Au(111) surface. In contrast, the substitutional Au impurity bonds CO more weakly than a flat Au(111) surface. A substitutional Au impurity also reduces the CO adsorption energy on Au/ Ni(111) surfaces on the sites immediately surrounding the impurity.36 In this case, the incorporation of Au into the Ni(111) surfaces by exchanging with a Ni atom was found to be endothermic, and consequently surface alloy formation must be entropically driven. The Au atom center of mass sits about 0.5 A˚ above the average location of the Ni atoms due to its larger size. The adsorption energy in the threefold sites that include the Au atom are only 1.18 eV, compared to 2.16 eV in neighboring three fold sites that only include Ni. On pure Ni(111), the CO Chem. Modell., 2008, 5, 150–181 | 161 This journal is

c

The Royal Society of Chemistry 2008

adsorption energy is 2.13 eV. The effect of the Au impurity is significant, but highly localized. In the preferential oxidation of CO in the presence of hydrogen it is desirable to use an alloy catalyst with weak CO adsorption energies. One strategy for modifying Pt to weaken the CO adsorption energy is to alloy it with Sn. The adsorption properties of CO on Pt3Sn alloys have been calculated for the (111), (110) and (001) surfaces and compared to the same surfaces of pure Pt.37 In general, the alloy surface bonds CO more weakly than the corresponding Pt surface. For alloy sites that contain Sn or with neighboring Sn, the adsorption of CO is always weaker than sites that do not contain the Sn atom. In contrast, subsurface Sn atoms increase the adsorption energy of CO to the Pt atoms directly above the Sn atom. One of the main challenges in modeling alloy surfaces is finding ways to unambiguously relate the results to experimental data. The majority of comparisons between theory and experiments have used the adsorption energy directly. An alternative approach is to compare computed and experimental vibrational fingerprints of CO on well-defined alloy surfaces. Dupont et al. have used DFT to calculate the anharmonic vibrational frequencies and intensities and systematically compared them to high-resolution electron energy loss spectroscopy measurements (HREELS).38 Their approach to calculating the anharmonic frequencies is to first calculate the harmonic frequencies and modes and then to use these to sample the potential energy curve of each vibrational mode. They fit a Morse potential to each of these curves to calculate the anharmonic vibrational frequency. The average corrective anharmonic frequency was 17 cm1. Finally, the frequencies were scaled by 1.022 (the ratio of the experimental gas-phase stretching frequency to the calculated stretching frequency). The authors are able to simulate HREELS spectra using these frequencies for direct comparison with the experiments. Using this hybrid experimental/computational approach the authors concluded that CO preferentially adsorbs on atop sites of Pt, but that multifold adsorption can coexist at room temperature, in contrast with Pt(111). CO does not bond to the Sn atoms. The presence of a second metal in the surface can actually change the reaction coordinate for some reactions. Ji and Li demonstrated that a Mo atom in a PtMo alloy surface changes the reaction coordinate for CO oxidation.39 They found the barrier for CO oxidation on Pt(111) to be 1.2 eV, but on PtMo(111) the barrier was only 1.0 eV. They observed a correlation between the reaction barrier and CO adsorption energy: stronger adsorption energies had higher oxidation barriers. This correlation is due to the desorption of the product CO2. Examination of the reaction coordinates on Pt(111) and PtMo(111) showed that the two reaction coordinates were not the same. On Pt(111) oxygen atoms prefer to adsorb in three-fold sites, where as on the PtMo(111) surface oxygen atoms prefer the atop site of the Mo atom. In the transition state the oxygen atom is practically on an atop site, which means additional energy must be used on Pt(111) to get the atom out of a three-fold site. The authors suggest that it is this geometric effect on PtMo where the oxygen atom is already near the position it occupies in the transition state that results in the lower reaction barrier. Methanation of CO is interesting for a number of applications including cleaning up hydrogen gas of CO, and synthetic fuel synthesis. Industry prefers Ni catalysts, but there is still substantial room for improvement in activity. Andersson and coworkers have applied a computational screening approach to optimize multiple objectives such as activity, selectivity and cost in finding a better catalyst.40 There are two important surface properties that determine the rate of the methanation process: the CO dissociation barrier and the stability of the C and O products. They showed these two properties are related through a Brønstead-Evans-Polanyi relation and consequently only one descriptor is needed to characterize the surface activity. Furthermore, they identified a volcano-shaped dependence of the activity of pure metal surfaces on the CO dissociation energies on those surfaces. This allowed them to identify a narrow range of desirable CO dissociation energies that have activities 162 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

at least as good as the best pure metals. By developing a simple interpolative scheme to estimate the CO dissociation energies on alloy surfaces based on combinations of pure metal surface properties and mixed overlayer bimetallic structures they were able to rapidly screen 117 potential alloy catalyst candidates for activity. It is not simply the most active catalyst that is necessarily the best in an application, the authors showed that the most active catalysts are the most expensive ones, and that there can be a compromise in cost and activity. They identified a series of Ni-Fe surfaces that were ‘‘Pareto-optimal’’ in cost and activity. These surfaces were examined in a set of experiments and were shown to be better than pure Ni catalysts.41,42 Nanoparticles have different morphologies than flat, bulk surfaces. Perez et al. have considered the activation of water and COads + OHads reactions on Pt and PtRu clusters including the effects of solvation.43 They found that the presence of under-coordinated Ru adatoms on the Pt cluster surfaces enhances the production of OHads from water compared to Ru alloyed into the nanoparticle surfaces. More significantly, they found that the presence of an aqueous environment simulated by up to six water molecules dramatically stabilized the transition state and products of the reactions. For example, in a gas-phase environment they calculated a water dissociation barrier of 20 kcal/mol whereas in the solvated environment the barrier was reduced to 4.5 kcal/mol on the alloy surface. The barrier for water dissociation on the Ru adatom in the aqueous environment was only 0.9 kcal/mol. Although their results are for an adatom on a near flat (111) surface, they may have significance in describing the catalytic properties of undercoordinated Ru atoms at edge and corner sites on nanoparticles. The phase behavior of alloy nanoparticles may differ from that in the bulk due to the increased significance of surface effects. Ge and coworkers examined the heats of formation of Pt-Au nanoparticles up to 13 atoms and compared them to bulk heats of formation.44 For bulk systems they found the heats of formation of Pt-Au alloys were all small and positive indicating a tendency to phase separate at low temperatures. In contrast, the heats of formation of some of the alloy atom clusters were significantly negative indicating that alloy nanoparticles may readily form. The adsorption of CO on the Pt-Au nanoparticles was larger than that on the bulk surfaces, and CO preferentially adsorbs on Pt sites that are adjacent to Au atoms. The effect of coadsorption of oxygenated species on the adsorption and reactions of adsorbed CO on Pt and PtRu(111) surfaces has been investigated by Han and Ceder.45 They found that relaxation effects in the slab calculations were significant (40.1 eV) and must be considered in quantitative work. The coadsorption of O or OH weakens the CO adsorption energy on Pt(111) by about 0.2 eV. They found the effect of alloying Ru in the surface to be minimal on the adsorption energy of CO on neighboring Pt, unless the Ru is oxidized by adsorption of O or OH, and then it significantly reduces the CO adsorption energy. They note that OH bonds more strongly to Ru than CO does, and in an aqueous environment one expects Ru to be oxidized.

3.3.2 Other small molecules. Carbon dioxide interacts weakly with most surfaces, and promoters are typically required to activate the molecule to induce chemisorption. Vines et al. examined the role of sodium promoters in CO2 adsorption on W(011) surfaces in a combined experimental and computational effort.46 On clean W(011) CO2 physisorbs at temperatures below 120 K. In the presence of at least 0.5 ML of Na, CO2 chemisorbs in a mechanism that involves charge transfer from the Na atoms, converting them to Na+. At low coverages the authors found the adsorption energies of Na decreases in the range of 0 to 0.5 ML due to repulsive interactions between the positively charged alkali adsorbates (the valence electron on Na is ‘‘lost’’ by incorporation into the W slab valence band). Above 0.5 ML metallic bonding of the Na to W was observed. They observed very little difference in the Chem. Modell., 2008, 5, 150–181 | 163 This journal is

c

The Royal Society of Chemistry 2008

adsorption energy (o0.1 eV) of Na on atop, bridge and 4-fold hollow sites at low coverages. The charge donated to the W valence band from Na is ‘‘back donated’’ to adsorbing CO2 molecules which stabilizes the adsorption.46 On clean W(011) the most stable adsorption energy of CO2 is only 0.06 eV, and the CO2 molecule retains its linear geometry. In the presence of Na, the adsorption energy increases dramatically to 5.23 eV and the geometry of the chemisorbed CO2 is bent, closely resembling a gas-phase CO2 anion. Predicting the adsorption energy of molecules in heterogeneous adsorption sites remains a challenge. Tang and Trout illustrated some of the short-comings in simple adsorption models that only utilize the d-band center and introduced a new weighting scheme that improves predictions of NO adsorption on Pt(111), Rh/ Pt(111) and Pd/Pt(111).47 The main issue they identified is that simple models tend to fail when adsorbates bond unequally to multiple surface atoms. In this model, in addition to weighting the d-band center by a d-band coupling matrix element which partially accounts for ligand effects on the electronic structure of the atom due to neighboring atoms, they also weight the bond strength contributions of each metal atom in the active site according to an electrostatic bond model. Although the simpler model and previous d-band weighting schemes48 qualitatively showed the same predictive trends the prediction errors were typically greater than 5 kJ/mol. The more sophisticated weighting scheme developed in this paper had a significantly lower prediction error than the simpler models with an average error on the order of 5 kJ/mol or less. 3.4 Hydrocarbon molecules As molecules get physically larger and chemically multifunctional the challenges in modeling their adsorption and reactions increase rapidly. Zellner et al. have used DFT in conjunction with surface science measurements to investigate the adsorption properties of ethylene on Pd/Mo(110) and Pd/W(110) monolayer bimetallic surfaces.49 Hydrogen and ethylene both adsorb more weakly to these bimetallic surfaces than to the corresponding pure metal surfaces. A simple d-band correlation was unsuccessful in predicting the adsorption energy of ethylene on the metal surfaces used in this work (Pd, Pd, Mo, W, and combinations thereof). This occurs because one must consider the Pauli repulsion between the p electrons in the ethylene molecule and the metal surface. The amount of repulsion depends on the spatial extent of the d-orbitals. Since 5d orbitals are larger than 4d orbitals which are larger than 3d orbitals, the authors observed separate correlations between the d-band center and adsorption energy depending on surface metal. Their experiments showed that ethylene is p-bonded to the bimetallic surfaces, but di-s bonded to the parent metal surfaces. Goda et al. examined reaction pathways of C2 hydrocarbons on bimetallic surfaces using a hybrid DFT/bond order conservation (BOC approach).50 They examined hydrogen, ethylene, acetylene, ethyl and vinyl on a variety of monometallic and bimetallic surfaces. DFT was used to examine the adsorption energies and correlations with the electronic structure. BOC theory was then used with these adsorption energies to calculate activation energies for ethyl and vinyl dehydrogenation. The authors observed separate adsorption energy/d-band center correlations for 3d, 4d and 5d metal surfaces as seen by Zellner et al.49 Acetylene tends to bond more strongly to the surfaces than ethylene does. They also found that the dehydrogenation reaction barriers correlate reasonably with the surface d-band centers. This is consistent with a linear free energy relationship between an initial state and transition state. CuPt clusters have been found to have high ethylene selectivity in the hydrogen assisted dechlorination of 1,2-dichloroethane. Avdeev et al. have used DFT to investigate the adsorption of ethylene on Cu12Pt2 clusters to help explain these 164 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

observations.51 They examined three types of clusters with different symmetry. Each cluster had an active site for ethylene adsorption that consisted of a Pt atom surrounded by Cu atoms. Ethylene bonds in a p configuration on all of the clusters. Ethlylene was found to bond more strongly to the bimetallic cluster than a pure Cu cluster, but more weakly than to a pure Pt cluster. The cluster with C3v symmetry had the strongest adsorption energy, likely because the Pt atom was the most undercoordinated. The weaker bonds to the CuPt bimetallic cluster are due to the lack of Pt-Pt adsorption sites where ehthylene could form a di-s bond. The authors also calculated vibrational frequencies of the adsorbed ethylene molecules and observed shifts in the features that could be used to identify p-bonded ethylene species on real surfaces. In the partial hydrogenation of multifunctional adsorbates such as acrolein, which contains unsaturated CQO and CQC bonds, the selectivity with which each functional group is hydrogenated can be important. Murillo and coworkers investigated the partial hydrogenation of acrolein on two Pt/Ni bimetallic surfaces in ultra high vacuum and using DFT.52 The authors use pseudomorphic overlayers of Ni on Pt(111), denoted Ni-Pt-Pt(111) and sandwich structures with a subsurface monolayer of Ni in Pt(111), denoted Pt-Ni-P(111) as model surfaces to perform calculations on. On Pt(111) and Ni-Pt-Pt(111) they only observed hydrogenation of the CQC bond. On the Pt-Ni-Pt(111) surface, however, they saw a higher activity and production of 2-propenol and fully hydrogenated 1-propanol. DFT calculations suggest that on Pt(111) and Ni-Pt-Pt(111) the acrolein can only adsorb in a di-s C-C geometry which leads to hydrogenation of the CQC bond. On Pt-Ni-Pt(111), however, the di-s C–C and di-s C–O geometries are close in energy. That suggests either bond could be hydrogenated under reaction conditions. Finally, the authors showed a correlation between different adsorption geometry energies and the surface d-band center. It may be possible to identify more selective bimetallic surfaces using this correlation. 3.5 Carbon and Sulfur Carbon poisoning, or co*king, occurs at the anodes of solid oxide fuel cells that utilize Ni as electrocatalysts. In a series of papers Nikolla et al. analyze the origin of this effect and propose a novel alloy that is shown to suppress co*king.53,54 Using DFT calculations they showed that C–C bond formation and C–O bond formation activation energies are comparable on Ni surfaces. They further found that C atoms are quite mobile on these surfaces at temperatures relevant to fuel cell operation. By formulating the problem of co*king in terms of C–C bond and C–O bond selectivity they identified that a Ni-Sn surface has a higher selectivity towards C–O bond formation. Consequently, Ni-Sn is resistant to co*king. The mechanism for the enhanced selectivity is in the relative rates of diffusion of C and O atoms. They found that C atom diffusion was substantially more hindered by Sn surface alloys than O was. They further analyzed the stability of Ni-Sn surfaces using DFT and thermodynamic arguments and found that Sn was stable in the surface under typical operating conditions. Their findings were confirmed experimentally by synthesizing Ni-Sn catalysts and running them under steam reforming conditions where co*king was not observed. Sulfur is typically a poison in transition metal catalysts, either by site-blocking or formation of inactive bulk sulfides. Alloys are one route to improving the sulfur tolerance of catalysts. Sulfur can be present in many forms including H2S, sulfurcontaining hydrocarbons and SO2. Zhao et al. investigated the chemisorption of SO2 on Cu/Au(111) surfaces using synchrotron techniques and DFT.55 They found using DFT that Cu preferentially segregates to the second layer down from the surface by comparing the total energies of the alloy slabs in different compositional configurations. SO2 bonds more strongly to Cu than to Au, and the more Cu atoms there were in the surface the stronger the SO2 adsorption energy was. The Cu-SO2 bonds were Chem. Modell., 2008, 5, 150–181 | 165 This journal is

c

The Royal Society of Chemistry 2008

not strong enough, however, to induce Cu segregation to the surface. On Cu(111) SO2 dissociates and the resulting atomic oxygen oxidizes the Cu surface, corroding it. In contrast, on the Cu/Au(111) surface the Cu atoms dissolve into the second layer and SO2 desorbs molecularly. Thus, a Cu-Au alloy may be protected from corrosion due to the lack of active sites for SO2 dissociation. Pd alloys are often used in dense metal membranes to separate hydrogen. However, most hydrogen streams that need purification also contain sulfur compounds such as H2S. These sulfur compounds can poison the surface, reducing the effectiveness of the membranes. Jiang et al. found that alloying a small amount of Pt into Pd improved the sulfur resistance of the alloy.56 Incorporation of Pt increased the adsorption energies of both H2 and H2S, but the adsorption energy of H2 was increased more. Furthermore, they found that H2S prefers to bond to an atop Pt site if it is available, while H2 always dissociates in three-fold sites. Therefore, in addition to changing the adsorption energies differently the alloy surface also changes the preferred sites for the different adsorbates. The interactions of S and S-compounds with transition metal surfaces and alloy surfaces are not trivial. Whereas simple correlations between the adsorption energy of H and O and the surface d-band center of many alloy structures have been observed in many cases, Hyman and co-workers report that the Fermi energy and density of states at the Fermi level must also be considered to understand and predict trends in H2S reaction energetics on alloy surfaces where strain and ligand effects are important.57 In the absence of ligand effects, they found that the d-band center and density of states at the Fermi level were strongly correlated and only one of them was necessary to predict adsorption properties. However, ligand effects in many alloys destroy the correlation and then it requires both parameters to understand trends in adsorption properties. The density of states at the Fermi level is an indicator in bonding because these electrons participate directly in bonding, and they indicate the density of states just above the Fermi level which are necessary to reduce antibonding repulsion. The authors also found that correlations between changes in the adsorption properties of H2S were well correlated with changes in the adsorption properties of S on all the surfaces. Thus, correlations between adsorbate properties may be more useful for predicting changes in other adsorbates than electronic structure correlations. In a solid oxide fuel cell, fuel is electrochemically oxidized at the anode. The fuel is typically hydrogen or a hydrocarbon, and very often the fuel contains trace amounts of sulfur-containing compounds. The highly active Ni-based catalysts that are preferred are not sulfur tolerant due to nickel sulfide formation, and the more sulfur-tolerant Cu catalysts are not sufficiently active. Choi and coworkers have used DFT to investigate the sulfur tolerance of Ni-Cu alloy surfaces.58 They found that S bonds more strongly to Ni than to Cu, and that alloying Cu into the Ni surface reduces the adsorption energy of the S making the alloy surface more sulfur tolerant than a pure Ni surface but not as tolerant as a Cu surface.

4. Reaction kinetics The ultimate test of whether a bimetallic catalyst is practically interesting is of course how the catalyst performs a chemical reaction of interest. Characterizing the complete mechanism and rate of a surface catalyzed reaction is certainly possible with DFT calculations, but this process remains time consuming even for structurally simple surfaces and chemically simple reactions. In this section, we review DFT calculations that have approached this task for a variety of reactions. Perhaps the key quantity of interest in most catalyzed reactions is the activation energy of the rate determining step(s), so we have only included work in this section that explicitly computed activation energies by finding transition states on the potential energy surface of the reacting species. The section is organized according to the reactions that were studied in roughly increasing order of chemical complexity. 166 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

4.1 CO oxidation It is probably reasonable to say that there have been more DFT studies of CO oxidation on metal surfaces than for all other surface catalyzed reactions combined. The popularity of this reaction is driven in part by the simplicity of the chemical species it involves, but it is also driven by technological demands. In past years, much activity focused on developing and understanding catalysts for use in automobile catalytic converters. More recently, attention has shifted to catalysts for fuel cells, where catalyst poisoning by CO can be a severe issue. Because bimetallic catalysts can potentially reduce the cost of fuel cell catalysts relative to the ‘‘standard’’ material, Pt, and also mitigate the effects of CO poisoning, a larger number of experimental and theoretical groups are actively studying CO chemistry on bimetallic surfaces. Ji and Li studied CO dissociation on Pt(111) and in Mo-doped Pt(111) where 1/4 of the surface atoms are Mo.39 The bimetallic surface weakens the chemisorption of CO to the surface and also reduces the activation energy for CO dissociation. Jiang and Carter reported similar calculations for CO dissociation on a monolayer of Al on Fe(100).59 CO was found to bind weakly on this surface and to dissociate with a barrier of B0.35 eV, compared to the barrier of B1 eV on Fe(100). Wang, Jiao, and Bu analyzed several possible mechanisms for CO oxidation on ordered Au3Ni(111) surfaces.60 When both CO and O2 are chemisorbed on the surface, a pathway to formation of CO2 with a barrier of only 0.13 eV was found. In this reaction, the chemisorbed O2 has a di-s configuration on the surface with an adsorption energy of 1.3 eV. The existence of this pathway is interesting, but it gives an incomplete description of steady state formation of CO2 for at least two reasons. First, Wang et al. also showed that chemisorbed O2 can dissociate into two adsorbed O atoms with a negligible barrier. This process is highly energetically favored, indicating that the population of chemisorbed O2 on the surface will be extremely low. Moreover, the pathway outlined above for CO2 formation also creates strongly bound atomic O on the surface. This work therefore gives interesting hints regarding the overall activity of ordered Au3Ni(111) surfaces, but doesn’t give a complete picture of the steady state reaction kinetics of this surface. Dupont, Jugnet and Loffreda carefully characterized the kinetics of CO oxidation on O-precovered PtSn surfaces, comparing their results with pure Pt.61 Two kinds of Pt-Sn surfaces were considered, one in which the top layer of a Pt(111) surface is replaced with Pt3Sn, and separately the (111) termination of an ordered Pt3Sn crystal. Using previous results for Pt(111), the authors make the plausible argument that the dissociation of O2 into adsorbed atomic oxygen is not rate limiting, so understanding the reaction between adsorbed CO and O is sufficient to characterize steady state reaction kinetics. An especially useful feature of this work was that the vibrational frequencies (including surface phonons) were computed for each energy minimum and transition state within the harmonic approximation. These frequencies allow zero point energy corrections and transition state theory prefactors to be calculated. The existence of a surface layer of Pt3Sn on Pt(111) was found to lower the barrier to CO oxidation by 0.18 eV relative to Pt(111) and also to increase the transition state theory prefactor by an order of magnitude. The activity of Pt3Sn(111) was also shown to be higher than that of Pt(111), but this material is predicted to be less active for CO oxidation than the material with a single surface layer of Pt3Sn. Analysis of CO oxidation on surfaces precovered with O was also performed by Gonza´lez, Sousa, and Illas for bimetallic RhCu(111) surfaces.62 Because Cu has very low miscibility in Rh, the Rh-rich bimetallic surface was modeled using a slab with one Cu atom in a (3 3) surface unit cell and no Cu atoms in the subsurface layers. Chem. Modell., 2008, 5, 150–181 | 167 This journal is

c

The Royal Society of Chemistry 2008

A moderate reduction in the activation energy for CO oxidation from B1.4 eV on Rh(111) to B1.1 eV on some sites on the bimetallic surface was found. Other paths with considerably lower activation energies were also found, but the authors were careful to point out that these paths are unlikely to be thermodynamically relevant. These authors also examined NO dissociation on the same surface and an analogous Cu-rich surface.63 Similar to CO, the activation energy for NO dissociation is lower on the Rh-rich bimetallic surface than on Rh(111), while the opposite was found for the Cu-rich bimetallic. Wang and Hammer approached the challenging problem of describing catalytic oxidation on a metal surface in a low temperature fuel cell by considering CO oxidation on Pt2Mo(111) covered by a bilayer of water.64 Materials that readily oxidize CO in this environment are desirable because they would help make a fuel cell tolerant to CO. The binding energy of CO on the bimetallic surface is lower than on Pt(111) in the presence of adsorbed water. The oxidation of CO in the presence of water can occur via formation of surface hydroxyl species and subsequent oxidation of CO via a COOH intermediate. The bimetallic surface was calculated to be much more favorable for OH formation than Pt(111), and the overall formation of CO2 from the water bilayer and adsorbed CO can take place with an overall barrier of B0.3 eV. It is especially interesting to note that the activation energies relevant to COOH intermediates in the absence of the water layers are considerably higher than when the water is present. This trend is also observed in studies of electrocatalytic systems which are reviewed later in section 6.

4.2 Water gas shift (WGS) catalysis The water gas shift (WGS) reaction, conversion of CO and H2O to CO2 and H2, is of enormous importance in future scenarios for generating large amounts of H2 from fossil fuel sources. This reaction is closely related to CO oxidation—in fact the work discussed above regarding CO oxidation in an aqueous layer could also be thought of as an example of WGS chemistry. Knudsen et al. performed a clever combination of experiments and DFT calculations to identify a near-surface alloy of Cu/Pt as a promising bimetallic WGS catalyst.65 First, they used STM and XPS measurements and DFT calculations to show that Cu deposited on a Pt(111) crystal preferentially segregates to the subsurface layer, with much smaller amounts of Cu appearing in layers further from the surface. Having established the stability of this NSA, DFT calculations were used66,67 to probe three key characteristics of the surface of a WGS catalyst: (i) chemisorption of CO, (ii) chemisorption of formate, and (iii) activation of H2O. The first two species are known to block sites on WGS catalysts, so reductions in these bond strengths are desirable. The calculations indicate that both CO and formate are destabilized on the NSA relative to Pt(111). This effect was confirmed directly in experimental measurements of CO desorption. Water activation is useful as a proxy for WGS activity because for a number of well characterized catalysts this is the rate limiting step. DFT calculations indicate that the activation energy for H2O dissociation is lower on Pt(111) than on the Cu/Pt NSA or Cu(111). Superficially, this may appear to make the NSA less attractive. The bond strength of OH on the NSA, however, is considerably weaker than on Pt(111), which would reduce poisoning of the catalyst by OH groups. An obvious direction for future work motivated by this study would be the synthesis and testing of a high surface area catalyst based on the Cu/Pt NSA. Another study that is relevant for WGS chemistry used a DFT cluster approach to describe water dissociation on bimetallic Pt/Ru clusters.68 A cluster in which a single Ru atom sites on top of a Pt(111) plane was found to be more active for water dissociation and CO oxidation than a flat alloy surface, but it is not clear that this type of material is thermodynamically stable. 168 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

4.3 Methanol steam reforming Catalytic reforming of methanol with water, CH3OH + H2O - 3 H2 + CO2, is an attractive candidate reaction for fuel cells because methanol is relatively inexpensive and is easy to store and transport. One kind of catalyst that holds promise for this reaction is a combination of Pd and ZnO. Several lines of evidence point to the 1:1 PdZn alloy as being the active component in this catalyst. Ro¨sch and co-workers have performed detailed DFT calculations to understand a series of relevant processes on this alloy.69–71 Calculations of the surface energies of various surfaces of PdZn (in the L10 structure) indicated that the (111) surface has the lowest surface energy and that at low temperatures surface segregation is not a strong effect.71 Activation energies for the dehydrogenation of formaldehyde were calculated on PdZn(111) and also PdZn(211), a stepped surface.69 For comparison, similar calculations were performed on Cu(111) and Pd(111). This dehydrogenation reaction was calculated to be favorable on Pd(111) (relative to formaldehyde desorption), but not on the flat Cu or PdZn surfaces. The reaction is more favored on the stepped PdZn surface than the flat bimetallic surface, but not as strongly as on Pd(111). Lim, Moskaleva, and Ro¨sch also tackled the important task of assessing the surface coverage of adsorbed species under conditions relevant for practical catalysis.70,71. Specifically, they used ab initio thermodynamics to describe PdZn(111) in the presence of H2O and H2 at 500 K. The general principles underlying ab initio thermodynamics have been reviewed in an earlier SPR2 and elsewhere72–77. These methods have limitations in systems where the coverages of adsorbed species are coupled via reactions, Lim et al. also derived a kinetic model for the surface coverages of the relevant surface species. The main outcome of this analysis is that the surface is expected to be free of all water-related adsorbates, including surface oxygen, at 500 K whenever the partial pressure of H2 exceeds a very small value, estimated to be B104 atm. This result is a strong indication that the previous calculations that had been performed to understand the catalytic chemistry of formaldehyde and related species on this surface are relevant to realistic catalysts.

4.4 Other reactions Albenze and Shamsi used DFT to characterize H2S dissociation on Ni(111) and on two bimetallic surfaces that mimic Ni-Mo solid solutions with o20% Mo.78 The existence of Mo atoms in the surface layer was found to reduce the energy barriers that exist to H2S dissociation, but this reaction is predicted to be facile on all three surfaces at elevated temperatures. Although some calculations on the effect of surface S coverage were performed, it is not clear how relevant these calculations are to experimental observations with Ni/Mo catalysts, which tend to sulfide rapidly. Most of the papers we have discussed apply DFT to a small number of adsorbed species on one or two specific surfaces. An ambitious effort to explore a larger variety of surfaces with a reduced level of rigor was developed by Kieken, Neurock, and Mei.79 In this work, extensive DFT calculations were performed for the energies and activation barriers for NO decomposition in the presence of excess O2 on Pt(111). These results were then used to parameterize a bond order conservation (BOC) description of each intermediate and reaction. The important advantage of this description is that it could be extended with relative ease to a large number of PtAu alloy surfaces without performing detailed DFT calculations on these alloys. Kinetic Monte Carlo simulations were used to simulate the steady state kinetics on each surface considered by defining a lattice gas model of the reaction kinetics from the BOC description. By considering a large variety of Pt-Au(100) surfaces with random distributions of Pt and Au atoms, surface compositions that increased the turnover frequency relative to pure Pt by up to a factor of two were found. Chem. Modell., 2008, 5, 150–181 | 169 This journal is

c

The Royal Society of Chemistry 2008

Independent DFT calculations of the key steps on surfaces with this composition would have provided a useful test of the BOC approach, but at least in this paper no calculations of this kind were reported. This general approach of a limited dataset of DFT results with more empirical descriptions that give rapid access to a large number of material compositions should receive more consideration. In general, it is of course desirable to examine the most promising materials found in a screening exercise of this kind through more rigorous means.

5. Miscellaneous In this section, we discuss two topics that are not direct applications of bimetallic catalysts but for which a number of theoretical studies have appeared that also give useful insights into the properties of bimetallic catalysts. 5.1 Metals supported on metal oxides The striking catalytic properties of small Au clusters on metal oxide supports has attracted enormous amounts of experimental and theoretical attention. This area was reviewed in detail in an earlier RSC SPR.2 Here, we simply mention several recent theoretical studies of small metal clusters on metal oxides to give the interested reader an entry into this literature. Most of this work has focused on MgO substrates, since the structural simplicity of this material is advantageous for both theoretical and experimental studies. A key observation that is now well established is that the catalytic activity of metal clusters is often quite different on defect-free and defective MgO surfaces. A particularly interesting example of this phenomenon was analyzed by Yoon et al., who performed experiments and calculations for CO oxidation by Au8 clusters on MgO.80 In this instance, the presence of a surface defect, specifically, an oxygen vacancy, underneath the cluster is critical to the cluster being catalytically active. The combination of experimental and theoretical analysis given by Yoon et al. indicates that charging of the clusters adsorbed on these defects is crucial. The catalytic activity of Au particles supported on metal oxide supports was reviewed in depth by Molina and Hammer.81 They were able to conclude that lowcoordinated edge and corner sites are crucial to understanding the catalytic activity of Au particles. Similarly, the Au/oxide interface periphery can be highly active, with the precise type of interface having an additional effect on activity. The exact nature of the CO oxidation reaction that is catalyzed by Au nanoparticles supported on TiO2 was not well understood at this time, but later work by Wang and Hammer was able to reveal a mechanism by which atomic oxygen adsorbs at perimeter sites near the support and CO further away from the support, and are then able to react with small energy barriers.82 A number of papers have examined the adsorption of metal atoms other than Au on MgO. Fernandez et al. systematically studied all first row transition metals (KZn) on defect-free MgO(100), at coverages corresponding to epitaxial overlayers and at coverages representative of isolated atoms on the surface.83 The trends in adsorption energy for these species were dominated by the interaction between the adsorbed atoms and O atoms in the surface, although interactions with adjacent surface Mg atoms also contribute for the larger adsorbed species (e.g. K and V). It is not clear from these results whether these trends would transfer in a simple way to defects on the oxide surface. Two studies using cluster models for MgO(100) and localized basis sets have given insight into the complexities of understanding small clusters of metal atoms on metal oxide supports. Giordano et al. examined the stability of Pd dimers on a range of defects on MgO(100), including neutral vacancies (also known as F centers), monoatomic steps, and surface OH groups.84 To characterize the stability of these adsorbed dimers, the energy of the dimer relative to having a single Pd atom 170 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

adsorbed at the defect and another Pd atom adsorbed on a defect-free MgO(100) terrace was calculated for each type of defect. For each kind of defect, this comparison showed that it was energetically favorable to move the Pd atom from the defect-free surface to form a dimer at the defect site. More interestingly, the F center showed the weakest energy for dimer formation of the three kinds of defect sites. As Giordano et al. point out, however, this does not necessarily mean that step or OH sites will dominate nucleation of Pd clusters on MgO(100), since in many situations the density of these sites will be far lower than the density of F centers. Barcaro and Fortunelli have tackled the difficult problem of probing the structure of bimetallic clusters adsorbed on defect sites on MgO(100).85 In particular, they studied Pd1AgN clusters for N = 1–8 adsorbed on F centers on MgO(100) using a cluster model of MgO(100) and localized basis sets. A basin-hopping algorithm was used to search for the lowest energy structure of each cluster. Multiple local minima among the cluster configurations found with this method that lie within 0.3 eV of the lowest energy structures. One striking conclusion from these calculations is the prediction that Pd1Ag6 is a magic cluster on the surface, with a large hom*o–LUMO gap. In the preceding studies, the metal oxide defines the substrate upon which metals are deposited. Ferrari, Roetti, and Pisani studied the complementary situation of thin MgO islands deposited on Ag(100) using B3LYP DFT in a periodic slab geometry.86 The dissociation of water on these surfaces was found to be dominated by the edge sites of the MgO islands. An additional complication in understanding the properties of small metal clusters on metal oxide supports is that controlled growth of these clusters is often achieved in practice by creating a thin metal oxide layer on top of another substrate. This situation can in some cases give the resulting metal clusters properties that are different from those adsorbed on a bulk metal oxide. A recent example that provides a good overview of this interesting area is the work of Honkala and Ha¨kkinen, who reported calculations for adsorption of isolated Au atoms on thin MgO(100) films supported on Mo.87 The adsorption energy of an Au atom on a defect-free film is significantly different from the outcome for a bulk MgO substrate, even when the film is five layers thick. In contrast, if the Au atom was located at a defect on the MgO film, its properties were very similar to those of the same defect on a bulk MgO substrate. Other aspects of this problem have been reported by Pacchioni and co-workers.88–90 Similar calculations were performed by Wang, Fan, and Liu for Au clusters on the (101) surface of anatase TiO2 that had been modified by depositing either AlOOH or Al2O3.91 These calculations included assessments of the oxidation of CO on the surface. The binding energy of Au atoms on the Al2O3-modified anatase was computed to be far larger than the equivalent energy on bare anatase, but the boundary between the adsorbed Au clusters and anatase was determined to be the active site for CO oxidation in both cases. Taniike et al. used DFT calculations to probe NO-CO reactions on a dimer of Co atoms on a g-alumina support.92 These dimers had been shown to exist in previous experiments that also suggested an unusual mechanism in which NO is reduced to N2O via interaction with gas phase CO molecules. The calculations indicated that Co dimers allow the formation of an adsorbed cis-(NO)2 species that has an unusually large reactive cross section for reaction with gaseous CO via an Eley-Rideal mechanism. The dynamics and growth of small metal clusters on substrates is of course important for realistic catalysts, but is also relevant in other areas. One recent example is the work of Wu et al., who examined Cu atoms and small Cu clusters (up to 4 atoms) on WN(001) as a model for Cu agglomeration in barrier materials for semiconductor devices.93 A noteworthy aspect of this work is that it explicitly considered the pathways for diffusion and growth of small clusters. Chem. Modell., 2008, 5, 150–181 | 171 This journal is

c

The Royal Society of Chemistry 2008

5.2 Magnetic properties Several studies have examined the magnetic properties of bimetallic surfaces. Canto and Ordejo´n performed GGA calculations for the (001) surface of TiFe, an ordered intermetallic compound with the CsCl structure.94 If this surface is defect-free it has two possible terminations, one with a Fe layer and another with a Ti layer. The calculations indicate a strong enhancement in the surface magnetic moment for the Fe-terminated surface, but a much weaker effect for the Ti-termination. Achilli et al. used Green’s function-based DFT calculations to study a perfect monolayer of Fe on a Cu(100) substrate.95 To understand the relative roles of bonding of this monolayer to the substrate and the strain induced by the substrate, calculations were also performed for an unsupported Fe layer. The magnetic moment of the Fe atoms was larger than the bulk value for both the supported and unsupported layers, with the effects of the substrate being fairly weak. Pick and Demangeat reported DFT calculations of thin Mn films on Co(001) substrates.96 In this system, earlier experiments had suggested that surface oxidation could play a decisive role in the surface magnetic ordering, so calculations for several possible states of adsorbed oxygen were also performed. These calculations confirm that the precise Mn film thickness and organization of O atoms can alter the characteristics of the surface magnetism, while at the same time suggesting that the atomic configurations that have been observed experimentally are more complex than the ones examined theoretically. Qian used a composite elastic description to assess the surface stress in W(001) with and without an overlayer of Fe.97 This approach uses DFT calculations at several different lattice constants as input into a description of the surface stress based on linear elasticity. This framework may provide useful insight into the roles of surface stresses in other similar situations.

6. Electrocatalysis Electrocatalysis is receiving increasing levels of attention in the computational community due to the recent interest in fuel cells and electrochemical energy conversion technology. The presence of an electrolyte and electric potential substantially complicates modeling efforts of these systems. Simple models to account for these phenomena were developed by Nørskov and co-workers,98,99 while more sophisticated approaches were developed by Neurock and co-workers.100–106 There are other approaches as well, but these two are the ones most often used in the work reviewed here. Other notable approaches include that by Alavi and co-workers107– 109 , and the Anderson group’s approach.110 In this section, we review approaches to predicting dissolution potentials in 6.1. Next we discuss the hydrogen evolution reaction and studies to understand it and predict new electrocatalysts in 6.2. Electrochemical oxidation reactions that are typical in fuel cells are discussed in 6.3. Finally, the studies on the increasingly important oxygen reduction reaction are reviewed in 6.4. 6.1 Dissolution Alloy stability is always of concern in heterogeneous catalysis, but in electrocatalysis there are new mechanisms for destabilizing alloys, namely electrochemical dissolution or corrosion. Greeley and Nørskov developed an intuitive and simple thermodynamic framework for estimating the stability of alloy surfaces in electrochemical environments.111 Their scheme is essentially an extension of an atomistic thermodynamic approach that uses chemical potentials to determine stability to one that uses electrochemical potentials to determine stability. They estimate the electrochemical potentials using total energies calculated within DFT and ideal solution behavior of the ions to consider concentration and pH effects. Within this formalism they are able to estimate the dissolution potential of metals in alloys. They further compared the trends in dissolution behavior to trends in segregation behavior and 172 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

found they tend to follow each other. The simple and intuitive reasoning for this is related to the stability of the surfaces. Solutes that are more stable in the surface are more resistant to dissolution. The most significant results in this work are the prediction that Pt-skins from Pt3X (X = Fe, Co and Ni) are more stable to dissolution than pure Pt surfaces are. However, when water splitting also occurs, the dissolution potentials of Pt are reduced because the OH adsorption energy is greater on pure Pt than it is on the alloy surface. Consequently, there is a thermodynamic driving force to form pure Pt surfaces. Gu and Balbuena also predict that most solutes dissolved in Pt reduce its stability towards dissolution.112 They investigated the role of oxygenated intermediates in the oxygen reduction reaction on the dissolution mechanisms of Pt surfaces. They found that the most favorable mechanism for dissolution is an electrochemical mechanism (as opposed to a purely chemical mechanism) that involves oxygenated metal surface atoms. Their investigation focused on single metal atoms or dimers, however, and used small clusters of water to include some solvation effects. They compared the relative stabilities of Ir, Pd, Rh, Ni and Co and found that Pt-Ir was the most stable. 6.2 Hydrogen evolution In order to predict the electrocatalytic properties of an alloy surface, for instance, its hydrogen evolution activity, we must understand the factors that contribute to electrocatalytic activity. Greeley and coworkers developed a simple view of activity trends in hydrogen evolution on Pd-monolayers supported on other metals based on the free energy of hydrogen adsorption.113 With this descriptor they observe a volcano curve in activity on a series of Pd monolayer bimetallic systems with the maximum activity occurring at a DGH* of 0 eV, where H* represents an adsorbed H atom. For metals with greater DGH* hydrogen evolution is limited by a high barrier to get H* on the surface, whereas for bimetallic surfaces with DGH* much less than zero the reaction is limited by getting H* off the surface. The variations they observed in DGH* are largely explained by changes in the electronic structure (the dband center) in the Pd monolayer due to interactions with the support metal.114 Once a good chemical descriptor has been identified, such as DGH* or the d-band center in the previous work, then large-scale screening approaches to identifying new alloy electrocatalysts can be attempted. Greeley et al. have used DFT to screen over 700 binary alloy surfaces to identify candidate surfaces with DGH* near 0 eV.115,116 They further screen the candidate structures with the right hydrogen adsorption properties to exclude candidates that are not stable with respect to segregation, dissolution or that would be expected to be poisoned by hydroxide intermediates. Using this approach they identified PtBi as a promising hydrogen evolution electrocatalyst and demonstrated experimentally that it was indeed more active than pure Pt. The significance of their approach is that Bi is known to be a bad hydrogen evolution catalyst on its own, and the primary reason it was tried experimentally is that the computational approach ‘‘found’’ it. Finally, there is a substantial need for non-Pt based electrocatalysts. Pt is very expensive and not particularly poison tolerant. Jaramillo et al. used the concepts described above to identify MoS2 as a potential hydrogen evolution electrocatalyst.117 They observed that the DGH* on MoS2 was only 0.08 eV. This is sufficiently close to 0 eV that they hypothesized it could be a good hydrogen evolution electrocatalyst. They demonstrated this by synthesizing MoS2 nanoparticles on a Au(111) surfaces and found indeed MoS2 is a reasonable electrocatalyst for this reaction that is better than pure Mo, but not as high in activity as Pt. 6.3 Anode reactions At the anode in a polymer electrolyte membrane (PEM) fuel cell, the fuel is typically oxidized to create protons and electrons and CO2 in the case of carbon containing Chem. Modell., 2008, 5, 150–181 | 173 This journal is

c

The Royal Society of Chemistry 2008

fuels. One major concern in hydrogen fuel cells is that residual CO in the fuel stream could poison the anode electrocatalyst. One approach to solving this problem is to develop alloy anode materials that either weaken the CO adsorption energy or make it easier to oxidize, and preferably without lowering the hydrogen oxidation activity. Shimodaira et al. have used DFT to screen a large number of Pt5M5 alloy clusters with these goals in mind.118 Specifically, they sought alloy surfaces where the CO adsorption energy was weaker than that on Pt, and where the activation energy of hydrogen scission was not higher than that on Pt. Among the systems they examined Co-Pt, Pt-Ni, Pt-Au, Pt-Fe, Pt-Mo and Pt-Ru were identified as meeting these criteria. They also investigated the stability of these surfaces with respect to segregation and found their results were in reasonable agreement with what is expected from the difference in experimental surface tension data. That is to say, the metal with the lowest surface tension tends to segregate to the surface. CO oxidation is critically important at the anode. If the CO can not be removed from the anode during operation it will block all the sites and poison the surface. Davies et al. have examined the ligand effect in Pt/Ru catalysts on the desorption of CO using DFT and steady state isotopic transient kinetic analysis.119 They found the primary effect of the alloy surface is to reduce the CO coverage due to the reduced adsorption energy of CO on the alloy surface. The exchange rate of CO on the surface was unaffected by the alloy surface. However, in the presence of hydrogen gas the exchange rate is influenced on the alloy because hydrogen can compete for adsorption sites, resulting in lower desorption/exchange rates for CO in the presence of hydrogen gas. 6.4 Oxygen reduction From a catalysis standpoint the oxygen reduction reaction at the PEM fuel cell cathode is one of the biggest challenges for fuel cells. The main problem is the reaction rate is too slow, resulting in high overpotentials and low performance in fuel cells. One solution to this problem has been the use of alloy surfaces to increase the reaction rates. Wang and Balbuena developed thermodynamic guidelines for designing oxygen reduction catalysts based on DFT calculations of elementary reactions steps on seventeen transition metal atoms from groups V to XII and on clusters of three atoms for ten of those metals.120 In this highly simplified model system they found a significant correlation between the reaction energy in the formation of OOH by hydrogenation and the reductions of O and OH. This correlation showed that metals that were good at formation of OOH were not good at reduction of the intermediates and vice versa. Furthermore, these reactivities correlated well with the number of vacancies in the d orbitals. Metals with unfilled dorbitals were good at forming OOH, and metals with filled d-orbitals were better at reducing the intermediates. They propose that coupling metals from these two groups would result in a superior bifunctional electrocatalyst. Calvo and Balbuena examined the structure and reactivity of Pd-Pt nanoclusters with 10 atoms in the oxygen reduction reaction.121 In contrast with what is expected in a periodic slab calculation, they found that ‘‘mixed’’ states with randomly distributed Pd atoms in a Pt7Pd3 cluster was more stable than an ‘‘ordered’’ cluster structure due to more effective charge transfer in the ‘‘mixed’’ state. They found that increasing the concentration of Pd in the surface favors formation of the OOH species in the first step of the reaction, but Pt atoms were needed to promote the second stage of the oxygen reduction reaction. They report that due to charge transfer effects the Pd atoms have an intermediate reactivity between pure Pd and Pt, and in the mixed cluster the Pd atoms the Pd atoms act more similarly to Pt than in the ordered cluster. There are two mechanisms that are considered in the oxygen reduction reaction. In the series mechanism oxygen molecules are protonated twice generating hydrogen peroxide as an intermediate which can then be further reduced. In the direct 174 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

mechanism the O–O bond is broken and then the atomic adsorbates are each protonated twice to form water. Lamas and Balbuena investigated these two mechanisms on Pd-Co alloy surfaces by calculating the binding energies of oxygen reduction intermediates on Pt, Pd, Pd0.75Co0.25, and Pt0.75Co0.25.122 In this work they showed that both mechanisms are likely to be operating in parallel and that the highest thermodynamic barriers were in the first hydrogenation step in both mechanisms. Although they used an electrochemical potential formalism to estimate the thermochemical barriers of electrochemical reactions, they did not include solvent effects and their results are for oxygen coverages of 1/4 ML. Hyman and Medlin considered relationships between the adsorption energies of oxygen reduction intermediates and the surface electronic structure of Pt surface alloys.123 In this work they used model surface structures to disentangle strain and ligand effects in the alloys. They found that compressive strain destabilized the adsorption energies of all intermediates due to the broadening of the surface d-band. For the OH, O2 and OOH species they found strong correlations between the changes in adsorption energies of these species on all the surfaces including both strain and ligand effects. In contrast, they found poor correlations between these intermediates and the O adsorption energy. The d-band center was only an effective descriptor of adsorption energies for the O atom, and there were separate correlations for the 3d, 4d and 5d alloy components. They found that they could not predict the changes in adsorption energy due to strain and ligand effects by separately estimating the strain effect and ligand effect and then summing them. These effects are not independent of each other; the magnitude of the ligand effect depends on the amount of strain.124 Peroxide formation at the cathode is problematic for two reasons. First it is only a two electron process and not as thermodynamically favorable as formation of water, which results in a lower fuel cell potential. Second, it may decompose into oxidizing radicals that can degrade the membrane and catalyst support. Balbuena and coworkers used three atom clusters of Pt with Ni, Co or Cr to investigate the catalytic decomposition of hydrogen peroxide.125 H2O2 binds weakly to the Pt top positions in all of the bimetallic clusters and much stronger on the 3d metals. H2O2 dissociates in the hollow sites of these clusters. The three atom clusters appear substantially more reactive than a 10 atom cluster and more reactive than periodic surfaces of similar composition. Solvation effects are not negligible in this system, and the authors showed that water molecules destabilize adsorbed H2O2 by hydrogen bonding. Roques and Anderson used a different approach to understanding the increased reactivity of Pt-Co electrocatalysts in the oxygen reduction reaction.126 Based on the observation that OH species adsorb strongly to the surface and block sites for further reactions they set out to determine the electrochemical stability of these adsorbates on different alloy surfaces. They developed a model where the potential at which the intermediate is stable on the surface, and more importantly the change in that potential due to alloying, is directly related to the change in adsorption energy of the OH species. This allows them to compare potential shifts on surfaces with different compositions. Relative to Pt(111) they found that incorporation of Co into the subsurface layers weakens the OH adsorption energy and increases the potential at which adsorbed OH is stable. The effect stabilizes above 75% Co in the second layer. The changes in adsorption energies are again largely explained by shifts in the d-band center, but only for Pt atoms directly bonded to sub-surface Co. They reported a good correlation between the Pt surface charge and the OH adsorption energy for the alloys (not including a pure Co surface). Fernandez et al. used a combined high throughput electrocatalyst study and DFT study to examine Pd-Co electrocatalysts for oxygen reduction.127 They screened an array of electrocatalysts using a scanning electrochemical microscope to assess the activity of each element. The DFT study was performed to help identify the role of Co in the system. They examined oxygen molecule adsorption, dissociation and Chem. Modell., 2008, 5, 150–181 | 175 This journal is

c

The Royal Society of Chemistry 2008

oxygen atom diffusion at and around a Co impurity in a Pd surface. They found that Co atoms in the surface substantially reduce the dissociation energy of O2 on the surface, but that the Co atom is likely poisoned by the strong Co-O bond. However, their preliminary calculations suggest O2 can still dissociate more easily near a poisoned Co atom than on a pure Pd surface. They estimated that on a clean surface Co would prefer to anti-segregate to the second layer, but in the presence of oxygen the strong O-Co bonds make it stable in the top surface layer. Ternary alloy surfaces offer another dimension in tuning the catalytic properties of oxygen reduction electrocatalysts. Zhang et al. used a novel synthetic approach combined with DFT studies to identify active electrocatalysts and explain trends in ternary alloy electrocatalysts.128 They used galvanic displacement of underpotential deposited Cu on Pd surfaces to create mixed Pt-M alloy surfaces supported on Pd nanoparticles as electrocatalysts. They chose M based on DFT calculations so that M would strongly bond O or OH resulting in lateral repulsion of other O and OH species near the Pt surface atoms. They are able to tune the Pt properties both by the interactions with Pd in the subsurface and by ligand effects with M in the surface. Re and Os were found to destabilize the OH on Pt the most, and had the highest activity. In contrast, Au and Pd were not effective at destabilizing OH on Pt and they had very low activity. Stamenkovic et al. used the adsorption energy of O on alloy surfaces as a catalytic activity indicator.129 They use this approach to develop a volcano model of activity for the oxygen reduction reactivity and compare their computed predictions of activity to experimental results. They showed that Pt-3d alloy surfaces are near the top of the volcano, but more significantly identified the chemical descriptor (d-band center or O adsorption energy) that could be used to identify new alloy surfaces that could have higher activity.

7. Conclusions The last 5 years have seen considerable advances in the use of quantitative computational models to understand (and in a limited number of cases, predict) the reactivity of active sites with relevance for practical bimetallic heterogeneous catalysts. The bibliography below is an indication of the high level of activity in this area by research groups around the world. There are multiple issues that are still challenging to address computationally in this area, so we conclude by outlining these areas and pointing to topics with the greatest probability for making large impacts in the field. Perhaps the central issue that runs through all of the areas we identify below is the need to sample a large range of active sites in order for theoretical methods to make physically meaningful predictions about the behavior of bimetallic materials. Significantly more work needs to be done in identifying the relevance or statistical probability of the range of sites that can exist in real materials and in relating atomistic scale properties to meso- and macro-scale properties of alloy surfaces with site heterogeneity. This is a very challenging problem that will require linking computational approaches that address different length and time scales as well as discrete to continuum descriptions of the physics at these scales. This problem will require considerable methodological development; a ‘‘business as usual’’ model in which individual surface sites are probed with plane wave DFT calculations cannot be expected to completely solve this problem. The general problem we just outlined can be reiterated for almost every property that controls the catalytic performance of bimetallic systems. Quantitatively describing segregation behavior in alloy surfaces remains challenging. On one hand it is relatively easy to compare the relative stabilities of two surface alloy configurations using DFT. However, the phase space is so rich that we cannot yet compare the relative stabilities of enough configurations to adequately sample the free energy landscape. Furthermore, the poor scaling of DFT calculation time with system size 176 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

limits the size of the calculations to only a few hundred, perhaps up to a thousand transition metal atoms. Methods that use simple models (potentials) to identify structures to examine with DFT9,10 or use DFT to parameterize simpler and faster models are under development in several groups9,10,17,45,130 to address this problem. These models can be used to evaluate the role of entropy in determining composition profiles because they are substantially faster to calculate with than DFT. In other fields of computational materials design, such as bulk alloys, new methods based on genetic programming are being developed for sampling complex potential energy surfaces to build simple potentials.131–133 Another approach used in the bulk alloy community is to use datamining techniques on large databases to extract trends and correlations that can be used to predict new behavior.134–138 These machine learning approaches could certainly result in new understanding and insights into alloy catalysis simply by expanding our ability to analyze complex systems and identify non-trivial correlations between them. Some of these approaches are starting to be used in studies of alloy catalysis. The presence of a reactive environment can also influence the segregation profile in an alloy surface, i.e. the types and numbers of active sites. The complexity of the interactions of adsorbates with alloy surfaces makes it difficult to predict the effects of the environment on the alloy surface. By implication, it is difficult to predict a meaningful or relevant model of the alloy surface and its active sites under reaction conditions for use in simulating reactions. Future developments in this area will certainly advance the field. Modeling kinetic phenomena on alloy surfaces is an even more challenging area of research. The topics listed above can be largely described by thermodynamic approaches, and while not at all trivial, the concepts are essentially well understood. Reaction conditions, in contrast, often operate away from equilibrium and consequently thermodynamic approaches are only an approximation to real systems. Direct inclusion of reactive environments, such as bilayers of water, is increasingly common and is shown in many cases to significantly impact reaction barriers and adsorption energies on alloy surfaces. This adds additional challenges in identifying relevant configurations because these solvent configurations may have many local minima, making it difficult to get an ‘‘average’’ solvent effect. We did not see any examples of alloy calculations with an effective solvent medium while preparing this review, although this approach is very common in quantum chemistry calculations for other types of physical systems. Kinetic simulations that involve calculating reaction barriers or adsorbate vibrational modes are substantially more computationally time consuming than ‘‘simple’’ geometry optimizations. The high probability of heterogeneity in site composition and distribution further add to the challenges of choosing relevant active sites and reaction pathways that can be related to experimental results. New algorithms for exploring the manifold of possible reactions and identifying the most kinetically relevant reactions in alloy systems are critically needed to advance the field beyond the current situation where a ‘‘best guess’’ for reaction coordinates is often used. It seems likely to us that DFT will be used to parameterize simpler and faster models that can more rapidly search the phase space for relevant reaction pathways to study in more detail with DFT. While all of these challenges point to the need for new methodological developments, it is also important to remember that quantitative modeling of complex materials such as bimetallic catalysts must continually make contact with experimental reality. Innovative ways to combine theoretical and experimental observations that can jointly speed up the search for new catalysts are in the long run likely to make the largest impacts in the field. Heterogeneous catalysis will play a central role in the development of new technologies to address many of the pressing energy concerns facing society. Computational modeling is poised to make important contributions to this area, Chem. Modell., 2008, 5, 150–181 | 177 This journal is

c

The Royal Society of Chemistry 2008

and we expect that this will be a vibrant area for both fundamental and applied research for the foreseeable future.

References 1 J. H. Sinfelt, Bimetallic catalysts: discoveries, concepts, and applications, Wiley, New York, 1983. 2 D. S. Sholl, Applications of Density Functional Theory to Heterogeneous Catalysis, in Chemical Modelling: Applications and Theory, edited by A. Hinchliffe, The Royal Society of Chemistry, 2006, vol. 4, pp. 108–160. 3 J. G. Chen, C. A. Menning and M. B. Zellner, Submitted to Surface Science Reports, 2007. 4 W. Olovsson, C. Goransson, T. Marten and I. A. Abrikosov, Physica Status Solidi BBasic Solid State Physics, 2006, 243, 2447–2464. 5 W. Olovsson, C. Goransson, L. V. Pourovskii, B. Johansson and I. A. Abrikosov, Physical Review B, 2005, 72, 064203. 6 W. Olovsson, E. Holmstrom, J. Wills, P. James, I. A. Abrikosov and A. M. N. Niklasson, Physical Review B, 2005, 72, 155419. 7 E. Holmstrom, W. Olovsson, I. A. Abrikosov, A. M. N. Niklasson, B. Johansson, M. Gorgoi, O. Karis, S. Svensson, F. Schafers, W. Braun, G. Ohrwall, G. Andersson, M. Marcellini and W. Eberhardt, Physical Review Letters, 2006, 97, 266106. 8 T. Marten, W. Olovsson, S. I. Simak and I. A. Abrikosov, Physical Review B, 2005, 72, 054210. 9 E. M. Fernandez and L. C. Balbas, International Journal of Modern Physics B, 2005, 19, 2339–2344. 10 L. O. Paz-Borbon, R. L. Johnston, G. Barcaro and A. Fortunelli, Journal of Physical Chemistry C, 2007, 111, 2936–2941. 11 S. Sahoo, G. Rollmann and P. Entel, Phase Transitions, 2006, 79, 693–700. 12 M. Ropo, Physical Review B, 2006, 74, 195401. 13 K. Yuge, A. Seko, A. Kuwabara, F. Oba and I. Tanaka, Physical Review B, 2007, 76, 045407. 14 O. M. Lovvik, Surface Science, 2005, 583, 100–106. 15 C. Jiang and B. Gleeson, Acta Materialia, 2007, 55, 1641–1647. 16 F. Qin, C. Jiang, J. W. Anderegg, C. J. Jenks, B. Gleeson, D. J. Sordelet and P. A. Thiel, Surface Science, 2007, 601, 376–380. 17 B. C. Han, A. Van der Ven, G. Ceder and B. J. Hwang, Physical Review B, 2005, 72, 205409. 18 E. K. Vestergaard, R. T. Vang, J. Knudsen, T. M. Pedersen, T. An, E. Laegsgaard, I. Stensgaard, B. Hammer and F. Besenbacher, Physical Review Letters, 2005, 95, 126101. 19 T. Kangas, N. Nivalainen, H. Pitkanen, A. Puisto, M. Alatalo and K. Laasonen, Surface Science, 2006, 600, 4103–4107. 20 A. V. Ruban, S. I. Simak, P. A. Korzhavyi and B. Johansson, Physical Review B, 2007, 75, 054113. 21 S. Muller, Surface and Interface Analysis, 2006, 38, 1158–1163. 22 S. Muller, M. Stohr and O. Wieckhorst, Applied Physics A-Materials Science & Processing, 2006, 82, 415–419. 23 J. Greeley and M. Mavrikakis, Journal Of Physical Chemistry B, 2005, 109, 3460–3471. 24 J. Greeley and M. Mavrikakis, Catalysis Today, 2006, 111, 52–58. 25 A. Roudgar and A. Gross, Surface Science, 2005, 597, 42–50. 26 S. Gonzalez, K. M. Neyman, S. Shaikhutdinov, H. J. Freund and F. Illas, Journal of Physical Chemistry C, 2007, 111, 6852–6856. 27 J. Fearon and G. W. Watson, Journal Of Materials Chemistry, 2006, 16, 1989–1996. 28 J. Greeley and J. K. Nørskov, Surface Science, 2005, 592, 104–111. 29 C. A. Menning, H. H. Hwu and J. G. G. Chen, Journal Of Physical Chemistry B, 2006, 110, 15471–15477. 30 Z. Ji, A. F. Jalbout and J. Q. Li, Solid State Communications, 2007, 142, 148–153. 31 J. M. Seminario, L. A. Agapito, L. Yan and P. B. Balbuena, Chemical Physics Letters, 2005, 410, 275–281. 32 C. Lacaze-Dufaure, C. Blanc, G. Mankowski and C. Mijoule, Surface Science, 2007, 601, 1544–1553. 33 S. Gonzalez and F. Illas, Surface Science, 2005, 598, 144–155. 34 S. Sakong, C. Mosch and A. Gross, Physical Chemistry Chemical Physics, 2007, 9, 2216– 2225.

178 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

35 K. Termentzidis, J. Hafner and F. Mittendorfer, Journal of Physics-Condensed Matter, 2006, 18, 10825–10835. 36 K. Termentzidis and J. Hafner, Journal of Physics-Condensed Matter, 2007, 19, 246219. 37 M. A. Gulmen, A. Sumer and A. E. Aksoylu, Surface Science, 2006, 600, 4909–4921. 38 C. Dupont, D. Loffreda, F. Delbecq and Y. Jugnet, Journal of Physical Chemistry C, 2007, 111, 8524–8531. 39 Z. Ji and J. Q. Li, Chemical Physics Letters, 2006, 424, 111–114. 40 M. P. Andersson, T. Bligaard, A. Kustov, K. E. Larsen, J. Greeley, T. Johannessen, C. H. Christensen and J. K. Nørskov, Journal of Catalysis, 2006, 239, 501–506. 41 A. L. Kustov, A. M. Frey, K. E. Larsen, T. Johannessen, J. K. Nørskov and C. H. Christensen, Applied Catalysis A-General, 2007, 320, 98–104. 42 J. Sehested, K. E. Larsen, A. L. Kustov, A. M. Frey, T. Johannessen, T. Bligaard, M. P. Andersson, J. K. Nørskov and C. H. Christensen, Topics in Catalysis, 2007, 45, 9–13. 43 A. Perez, M. J. Vilkas, C. R. Cabrera and Y. Ishikawa, Journal Of Physical Chemistry B, 2005, 109, 23571–23578. 44 Q. Ge, C. Song and L. Wang, Computational Materials Science, 2006, 35, 247–253. 45 B. C. Han and G. Ceder, Physical Review B, 2006, 74, 205418. 46 F. Vin˜es, A. Borodin, O. Ho¨fft, V. Kempter and F. Illas, Physical Chemistry Chemical Physics, 2005, 7, 3866–3873. 47 H. R. Tang and B. L. Trout, Journal Of Physical Chemistry B, 2005, 109, 17630–17634. 48 V. Pallassana, M. Neurock, L. B. Hansen and J. K. Nørskov, Journal of Chemical Physics, 2000, 112, 5435–5439. 49 M. B. Zellner, A. M. Goda, O. Skoplyak, M. A. Barteau and J. G. Chen, Surface Science, 2005, 583, 281–296. 50 A. M. Goda, M. A. Barteau and J. G. Chen, Journal Of Physical Chemistry B, 2006, 110, 11823–11831. 51 V. I. Avdeev, V. I. Kovalchuk, G. M. Zhidomirov and J. L. d’Itri, Surface Science, 2005, 583, 46–59. 52 L. E. Murillo, A. M. Goda and J. G. Chen, Journal of the American Chemical Society, 2007, 129, 7101–7105. 53 E. Nikolla, A. Holewinski, J. Schwank and S. Linic, Journal of the American Chemical Society, 2006, 128, 11354–11355. 54 E. Nikolla, J. Schwank and S. Linic, Journal of Catalysis, 2007, 250, 85–93. 55 X. Y. Zhao, P. Liu, J. Hrbek, J. A. Rodriguez and M. Perez, Surface Science, 2005, 592, 25–36. 56 H. Jiang, H. Yang, R. Hawkins and Z. Ring, Catalysis Today, 2007, 125, 282–290. 57 M. P. Hyman, B. T. Loveless and J. W. Medlin, Surface Science, 2007, 601, 5382–5393. 58 Y. M. Choi, C. Compson, M. C. Lin and M. L. Liu, Journal of Alloys and Compounds, 2007, 427, 25–29. 59 D. E. Jiang and E. A. Carter, Journal Of Physical Chemistry B, 2006, 110, 22213–22219. 60 G. C. Wang, J. Jiao and X. H. Bu, Journal of Physical Chemistry C, 2007, 111, 12335– 12339. 61 C. Dupont, Y. Jugnet and D. Loffreda, Journal of the American Chemical Society, 2006, 128, 9129–9136. 62 S. Gonzalez, C. Sousa and F. Illas, Physical Chemistry Chemical Physics, 2007, 9, 2877– 2885. 63 S. Gonzalez, C. Sousa and F. Illas, Journal of Catalysis, 2006, 239, 431–440. 64 J. G. Wang and B. Hammer, Journal of Catalysis, 2006, 243, 192–198. 65 J. Knudsen, A. U. Nilekar, R. T. Vang, J. Schnadt, E. L. Kunkes, J. A. Dumesic, M. Mavrikakis and F. Besenbacher, Journal of the American Chemical Society, 2007, 129, 6485–6490. 66 E. Kaxiras, Y. Baryam, J. D. Joannopoulos and K. C. Pandey, Physical Review B, 1987, 35, 9625–9635. 67 E. Kaxiras, Y. Baryam, J. D. Joannopoulos and K. C. Pandey, Physical Review B, 1987, 35, 9636–9643. 68 Y. Ishikawa, R. R. Diaz-Morales, A. Perez, M. J. Vilkas and C. R. Cabrera, Chemical Physics Letters, 2005, 411, 404–410. 69 K. H. Lim, Z. X. Chen, K. M. Neyman and N. Rosch, Journal Of Physical Chemistry B, 2006, 110, 14890–14897. 70 K. H. Lim, L. V. Moskaleva and N. Rosch, Chemphyschem, 2006, 7, 1802–1812. 71 K. M. Neyman, K. H. Lim, Z. X. Chen, L. V. Moskaleva, A. Bayer, A. Reindl, D. Borgmann, R. Denecke, H. P. Steinruck and N. Rosch, Physical Chemistry Chemical Physics, 2007, 9, 3470–3482. 72 X. G. Wang, A. Chaka and M. Scheffler, Physical Review Letters, 2000, 84, 3650–3653.

Chem. Modell., 2008, 5, 150–181 | 179 This journal is

c

The Royal Society of Chemistry 2008

73 X. G. Wang, W. Weiss, S. K. Shaikhutdinov, M. Ritter, M. Petersen, F. Wagner, R. Schlogl and M. Scheffler, Physical Review Letters, 1998, 81, 1038–1041. 74 K. Reuter and M. Scheffler, Physical Review B, 2002, 65, 035406. 75 K. Reuter and M. Scheffler, Physical Review Letters, 2003, 90, 046103. 76 C. Stampfl, Catalysis Today, 2005, 105, 17–35. 77 C. Stampfl, Phase Transitions, 2007, 80, 311–332. 78 E. J. Albenze and A. Shamsi, Surface Science, 2006, 600, 3202–3216. 79 L. D. Kieken, M. Neurock and D. H. Mei, Journal Of Physical Chemistry B, 2005, 109, 2234–2244. 80 B. Yoon, H. Ha¨kkinen, U. Landman, A. S. Wo¨rz, J.-M. Antonietti, S. Abbet, K. Judai and U. Heiz, Science, 2005, 307, 403–407. 81 L. M. Molina and B. Hammer, Applied Catalysis A–General, 2005, 291, 21–31. 82 J. G. Wang and B. Hammer, Physical Review Letters, 2006, 97, 136107. 83 S. Fernandez, A. Markovits, F. Fuster and C. Minot, Journal of Physical Chemistry C, 2007, 111, 6781–6788. 84 L. Giordano, C. D. Valentin, G. Pacchioni and J. Goniakowski, Chemical Physics, 2005, 309, 41–47. 85 G. Barcaro and A. Fortunelli, Journal of Physical Chemistry C, 2007, 111, 11384–11389. 86 A. M. Ferrari, C. Roetti and C. Pisani, Physical Chemistry Chemical Physics, 2007, 9, 2350–2354. 87 K. Honkala and H. Ha¨kkinen, Journal of physical Chemistry C, 2007, 111, 4319–4327. 88 G. Pacchioni, L. Giordano and M. Baistrocchi, Physical Review Letters, 2005, 94, 226104. 89 L. Giordano, M. Baistrocchi and G. Pacchioni, Physical Review B, 2005, 72, 226103. 90 D. Ricci, A. Bongiorno, G. Pacchioni and U. Landman, Physical Review Letters, 2006, 97, 036106. 91 C.-M. Wang, K.-N. Fan and Z.-P. Liu, Journal of Physical Chemistry C, 2007, 111, 13539–13546. 92 T. Taniike, M. Tada, R. Coquet, O. Morikawa, T. Sasaki and Y. Iwasaw, Chemical Physics Letters, 2007, 443, 66–70. 93 J. Wu, B. Han, C. Zhou, X. Lei, T. R. Gaffney, J. A. T. Norman, Z. Li, R. Gordon and H. Cheng, Journal of Physical Chemistry C, 2007, 111, 9403–9406. 94 G. Canto and P. Ordejon, Surface Review and Letters, 2006, 13, 495–501. 95 S. Achilli, S. Caravati and M. I. Trioni, Journal of Physics-Condensed Matter, 2007, 19, 305021. 96 S. Pick and C. Demangeat, Surface Science, 2005, 584, 146–152. 97 X. Qian, Journal of Physics-Condensed Matter, 2006, 18, 1157–1163. 98 J. K. Nørskov, T. Bligaard, A. Logadottir, J. R. Kitchin, J. G. Chen and S. Pandelov, Journal of the Electrochemical Society, 2005, 152, J23–J26. 99 J. K. Nørskov, J. Rossmeisl, A. Logadottir, L. Lindqvist, J. R. Kitchin, T. Bligaard and H. Jonsson, Journal of Physical Chemistry B, 2004, 108, 17886–17892. 100 C. D. Taylor and M. Neurock, Current Opinion in Solid State & Materials Science, 2005, 9, 49–65. 101 J. S. Filhol and M. Neurock, Angewandte Chemie-International Edition, 2006, 45, 402– 406. 102 M. J. Janik and M. Neurock, Electrochimica Acta, 2007, 52, 5517–5528. 103 C. D. Taylor, M. J. Janik, M. Neurock and R. G. Kelly, Molecular Simulation, 2007, 33, 429–436. 104 C. Taylor, R. G. Kelly and M. Neurock, Journal of the Electrochemical Society, 2007, 154, F55–F64. 105 C. D. Taylor, S. A. Wasileski, J. S. Filhol and M. Neurock, Physical Review B, 2006, 73, 165402. 106 C. D. Taylor, S. A. Wasileski, J. S. Filhol and M. Neurock, Physical Review B, 2006, 73, 169904. 107 A. Y. Lozovoi and A. Alavi, Physical Review B, 2003, 68, 245416. 108 A. Y. Lozovoi, A. Alavi, J. Kohanoff and R. M. Lynden-Bell, Journal of Chemical Physics, 2001, 115, 1661–1669. 109 C. G. Sanchez, A. Y. Lozovoi and A. Alavi, Molecular Physics, 2004, 102, 1045–1055. 110 A. B. Anderson, Electrochimica Acta, 2003, 48, 3743–3749. 111 J. Greeley and J. K. Nørskov, Electrochimica Acta, 2007, 52, 5829–5836. 112 Z. H. Gu and P. B. Balbuena, Journal of Physical Chemistry A, 2006, 110, 9783–9787. 113 J. Greeley, J. K. Nørskov, L. A. Kibler, A. M. El-Aziz and D. M. Kolb, Chemphyschem, 2006, 7, 1032–1035. 114 T. Bligaard and J. K. Nørskov, Electrochimica Acta, 2007, 52, 5512–5516. 115 J. Greeley, T. F. Jaramillo, J. Bonde, I. B. Chorkendorff and J. K. Nørskov, Nature Materials, 2006, 5, 909–913.

180 | Chem. Modell., 2008, 5, 150–181 This journal is

c

The Royal Society of Chemistry 2008

116 J. Greeley and J. K. Nørskov, Surface Science, 2007, 601, 1590–1598. 117 T. F. Jaramillo, K. P. Jørgensen, J. Bonde, J. H. Nielsen, S. Horch and I. Chorkendorff, Science, 2007, 317, 100–102. 118 Y. Shimodaira, T. Tanaka, T. Miura, A. Kudo and H. Kobayashi, Journal of Physical Chemistry C, 2007, 111, 272–279. 119 J. C. Davies, J. Bonde, A. Logadottir, J. K. Nørskov and I. Chorkendorff, Fuel Cells, 2005, 5, 429–435. 120 Y. X. Wang and P. B. Balbuena, Journal Of Physical Chemistry B, 2005, 109, 18902– 18906. 121 S. R. Calvo and P. B. Balbuena, Surface Science, 2007, 601, 165–171. 122 E. J. Lamas and P. B. Balbuena, Journal of Chemical Theory and Computation, 2006, 2, 1388–1394. 123 M. P. Hyman and J. W. Medlin, Journal of Physical Chemistry C, 2007, 111, 17052– 17060. 124 J. R. Kitchin, J. K. Nørskov, M. A. Barteau and J. G. Chen, Physical Review Letters, 2004, 93, 156801. 125 P. B. Balbuena, S. R. Calvo, E. J. Lamas, P. F. Salazar and J. M. Seminario, Journal of Physical Chemistry B, 2006, 110, 17452–17459. 126 J. Roques and A. B. Anderson, Surface Science, 2005, 581, 105–117. 127 J. L. Fernandez, J. M. White, Y. M. Sun, W. J. Tang, G. Henkelman and A. J. Bard, Langmuir, 2006, 22, 10426–10431. 128 J. Zhang, M. B. Vukmirovic, K. Sasaki, A. U. Nilekar, M. Mavrikakis and R. R. Adzic, Journal of the American Chemical Society, 2005, 127, 12480–12481. 129 V. Stamenkovic, B. S. Mun, K. J. J. Mayrhofer, P. N. Ross, N. M. Markovic, J. Rossmeisl, J. Greeley and J. K. Nørskov, Angewandte Chemie-International Edition, 2006, 45, 2897–2901. 130 J. Luyten, M. Schurmans, C. Creemers, B. S. Bunnik and G. J. Kramer, Surface Science, 2007, 601, 1668–1676. 131 K. Sastry, D. D. Johnson, A. L. Thompson, D. E. Goldberg, T. J. Martinez, J. Leiding and J. Owens, Materials and Manufacturing Processes, 2007, 22, 553–561. 132 K. Sastry, D. E. Goldberg and D. D. Johnson, Materials and Manufacturing Processes, 2007, 22, 570–576. 133 K. Sastry, D. D. Johnson, D. E. Goldberg and P. Bellon, Physical Review B, 2005, 72, 085438. 134 G. Ceder, D. Morgan, C. Fischer, K. Tibbetts and S. Curtarolo, MRS Bulletin, 2006, 31, 981–985. 135 S. Curtarolo, D. Morgan and G. Ceder, Calphad-Computer Coupling of Phase Diagrams and Thermochemistry, 2005, 29, 163–211. 136 D. Morgan, G. Ceder and S. Curtarolo, Measurement Science & Technology, 2005, 16, 296–301. 137 Y. Wang, S. Curtarolo, C. Jiang, R. Arroyave, T. Wang, G. Ceder, L. Q. Chen and Z. K. Liu, Calphad-Computer Coupling of Phase Diagrams and Thermochemistry, 2004, 28, 79– 90. 138 S. Curtarolo, D. Morgan, K. Persson, J. Rodgers and G. Ceder, Physical Review Letters, 2003, 91, 135503.

Chem. Modell., 2008, 5, 150–181 | 181 This journal is

c

The Royal Society of Chemistry 2008

Fluctuation relations, free energy calculations and irreversibility Debra J. Searlesa and Denis J. Evansb DOI: 10.1039/b608839m

1. Introduction Over the past 15 years a number of important fundamental theorems in nonequilibrium statistical mechanics of many-particle systems have been proved. These proofs result in a number of important relations in nonequilibrium statistical mechanics. In this review we focus on three new relationships: the dissipation function or Evans-Searles fluctuation relation (ES FR),1–5 the Jarzynski equality (JE)6–9 and the Crooks fluctuation relation (Crooks FR).10,11 These relations not only provide a better understanding of the behaviour of nonequilibrium systems (in particular their irreversibility), but provide new practical results that apply to manyparticle systems including derivations of transport properties (viscosity, thermal conductivity and shear viscosity) and completely new ways of calculating free energy differences between states and determining potentials of mean force. The purpose of this review will be to outline these three relations, discuss their links with irreversibility and calculation of free energy changes, and to provide a critical review of some recent work on these topics (i.e. on work published BJuly-2005–June-2007). The first of these relations, the ES FR, is also referred to as the transient fluctuation relation (transient FR) or the dissipation function fluctuation relation (O-FR). It gives an analytic expression for the probability that for a finite system observed R for a finite time, the average of the so-called ‘dissipation function’, t ¼ 1 t OðGðsÞÞds, takes on a positive or negative value. That is it states: O t 0 t ¼ AÞ PðO At t ¼ AÞ ¼ e PðO

ð1:1Þ

t takes on a value between A dA and t = A) is the probability that O where P(O A + dA. Since there are few analytic relationships that describe the behaviour of many-particle, nonequilibrium systems, this is an important result in itself. However, in many cases the dissipation function can be shown to be related to an important physical property, and it leads to some simple, fundamental results. In a wide class of nonequilibrium steady states, close to equilibrium (where entropy can be defined) the average time-integral of the dissipation function becomes the entropy production,12 and therefore it expresses the probability that for a finite time the entropy will be consumed rather than produced, as is normally the case in thermodynamic (macroscopic) systems. The ES FR can thus be viewed as a generalisation of the Second Law of Thermodynamics since it applies to finite systems and finite times. As well as providing a derivation of the Second Law of Thermodynamics, it predicts fluctuations in the sign of the entropy production do occur in small systems, and (1.1) provides a relationship to described their probability. It therefore has potential utility in the study of small systems and development of technology—including those of interest in nanotechnology and the development of nanomachines—where entropy consumption is predicted for measurable times. The importance of taking thermal fluctuations into account when measuring properties of small systems is a

Nanoscale Science and Technology Centre and School of Biomolecular and Physical Sciences, Griffith University, Brisbane Qld 4111, Australia b Research School of Chemistry, Australian National University, Canberra ACT 0200, Australia

182 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

indicated by (1.1).13 The result can also be applied to Hamiltonian systems, giving relationships that are consistent with Le Chatelier’s principle for large systems.14 It can be used to derive the Green-Kubo relations for transport coefficients4,15–17 and shows how thermodynamic irreversibility emerges from time reversible equations of motion.17–19 Finally, fluctuation relations for arbitrary properties that are odd with respect to time reversal can be obtained. These expressions are more complex than (1.1), but have been verified numerically.20 The JE and the Crooks FR, in contrast, predict free energy differences between different equilibrium systems. Free energy differences provide us with crucial information on the relative stability of states and the direction of change of a process. If free energy differences are known, important information including phase diagrams and the stability of nanoscale structures for example, may be determined. Furthermore, these relations allow free energy differences to be calculated in both molecular simulations and experiments. Through simulation studies, insight into the behaviour of these systems can be obtained at the molecular level, extending our understanding of important phenomena such as phase transitions, nonequilibrium response and the behaviour of nanoscale systems. The Crooks FR has a very similar form to the transient FR, P1 ðw1!2 ¼ AÞ ¼ ebðDF1!2 AÞ P2 ðw2!1 ¼ AÞ

ð1:2Þ

where Pi(wi-j = A) is the probability that the work done along a path from an equilibrium state i to a second equilibrium state j is between A dA and A + dA, and DFi-j is the free energy difference between states i and j. Furthermore, the arguments of the two fluctuation relations (work and the dissipation function) can be shown to be closely related.11,17,21 The novelty of the Crooks FR lies in the fact that it allows calculation of differences in free energies of two equilibrium states using the work along irreversible paths between them. Integration of (1.2) gives the Jarzynski equality, hebw1-2i1 = ebDA1-2.

(1.3)

However, we note that this relation was derived prior to (1.2), in a different way.6,7 We also note that like the ES FR, the JE can be extended to consider other phase variables.22 Another important FR (the L-FR or GC FR) describes fluctuations in the phase space compression rate, L: L¼

@ _ C @C

ð1:4Þ

in a thermostatted nonequilibrium steady state, where C (q, p) is a point in phase space. It can be written, t ¼ AÞ 1 PðL lim ln ¼ A; A oAoA t PðLt ¼ AÞ

t!1

ð1:5Þ

where A* provide the bounds on the magnitude of A to which it applies. This relation was first derived heuristically by Evans, Cohen and Morriss23 in 1993 for an isoenergetic system. Indeed this was the first statement of any Fluctuation Relation. Eqn (1.5) was first rigorously proven by Gallavotti and Cohen24,25 and the FR is widely referred to as the Gallavotti-Cohen fluctuation theorem (GCFT). This relationship is described in section 2.1, along with its connection to the relationships that are the main subject of this review. Historically there has been a lot of confusion about nomenclature. Many authors have confused the GC FR with the ES FR. Because these relations refer to fluctuations there is a profound difference between the two. Nevertheless this confusion persists in the literature to the present day, as discussed in section 5. Chem. Modell., 2008, 5, 182–207 | 183 This journal is

c

The Royal Society of Chemistry 2008

In recent years, a number of reviews on these topics have appeared.4,5,17,26–41 Bustamante et al.29 published a comprehensible Physics Today article that summarises some of the main features of the fluctuation relations, discusses their practical implications for small systems and highlights experimental work in the area. We note that in order to make the paper accessible to the wide audience of Physics Today, some subtleties associated with the different arguments of the ES FR and of the GC FR (see discussion in section 5) are not addressed. In particular, it should be noted that the 2002 experiment42 discussed on page 47 of ref. 29 does not test eqn (3) of that paper (as stated), but rather the fluctuation relation where the argument is the work and not the phase space compression. Bustamante30 and Tinoco et al.31 also review experimental studies on RNA unfolding experiments that have been carried out to test fluctuation relations in their more general reviews on single-molecule unfolding experiments. Ritort28 discusses experiments designed to test the various FRs. Evans and Searles4 reviewed the transient fluctuation relations, how steady state relations can be derived from them, their implications and applications, and experimental tests available in 2002. A shorter review paper highlighted the main results, and a derivation of the second law inequality.5 Recently, Sevick et al.17 reviewed the ES FR, the Crooks FR and the Jarzynski equality, highlighting the similarities and differences between the two FRs and also discussing experimental work that has been carried out to test these results. Rondoni and Mejı´ a-Monasterio32 give a thorough, and current review of the various FRs. They discuss their theoretical basis as well as applications. Kurchan33 presents a concise review including the ES FR, Crooks FR and JE, emphasising the generality and simplicity of the relationships. Several reviews have focussed on stochastic dynamics. Harris and Schu¨tz27 discuss a range of fluctuation theorems (the Jarzynski equality, the ES FR and the GC FR) in the context of stochastic, Markov systems, but in a widely applicable context. They investigate the conditions under which they apply, including an analysis of the conditions under which the GC FR is valid. In 2006, Gaspard26 reviewed studies where a stochastic approach to the treatment of boundaries is used to obtain FRs for the current in nanosystems, with a focus on work from his group, and also published a review on Hamiltonian systems34 that includes a discussion on FRs and the JE. Astumian35 presents a pedagodic discussion on FRs and their treatment in terms of Onsager-Machlup theory close to equilibrium. Jarzynski36 provides a comprehensive review of the JE, discussing the practical application of the relationship, the importance of carefully defining the function used to calculate the work, and also its application in single molecule experiments. Kofke37 discusses different approaches to determination of free energy differences, and compares their accuracy for model systems, and in reference38 compares the use of the JE with other routes to the free energy difference. Xiong et al.39 also review the JE, discussing examples where it had been applied. Hummer and Szabo40 review work on the JE with particular emphasis on its application to the creation of free energy profiles using single-molecule pulling experiments. Meirovitch’s recent review41 on methods for calculation of free energies for biological systems, includes a discussion on the JE.

2. Fluctuation relations 2.1 Background Development of the ES FR was motivated by the seminal work by Evans, Cohen and Morriss.23 This paper focussed on constant energy dynamics and, based on results for the 2 dimensional Lorentz model, proposed that the measure of a trajectory was related to the exponential of the sum of the positive finite-time Lyapunov exponents of that trajectory. In this way they obtained a relationship for 184 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

the probability ratio of the entropy production. Evans and Searles approached the same problem from a different point of view1,2,4—using the Liouville measure to derive the fluctuation relations. The work of Evans et al.23 also motivated Gallavotti and Cohen to develop the well-known Gallavotti-Cohen Fluctuation Relation for the phase space compression (GC FR, eqn (1.5)) that is valid for a special class of dynamical systems known as ‘‘Anosov’’ systems.43 While the usual chemical and physical systems cannot be classified as ‘‘Anosov’’, it was proposed that this relation is applicable to such systems under particular conditions referred to as the ‘‘chaotic hypothesis’’24,25 and it was originally expected that it might be very widely applicable. However, recently it has been found that there are various restrictions on the system related to the boundedness of the interaction potential44 or state space;45 and the values of the phase space compression to which it can be applied,16,43,44 which seem to go to zero as O(Fe2) in some cases, and are even more restrictive in some others.46 The proof also breaks down at large fields, and a modified relation has been proposed to treat this system.47,48 It has also been found that the convergence times of the GC FR can be very large, and much larger than those of the ES FR.49–51 The types of systems to which it is expected to apply, and its convergence time are therefore difficult to characterise. Due to these difficulties, the GC FR will not be discussed in detail in this review, and only studies that make comparisons with the relations that are the main subject of this manuscript will be considered. Readers are referred to references16,44,45,48,49,52–58 for recent work in this area. When applied to systems relaxing from an equilibrium state to a nonequilibrium steady state, the ES FR takes on the form given by eqn (1.1). This equation is very general—it only requires the assumption of reversibility of the equations of motionw (and in fact this can be relaxed to the assumption that for any trajectory, the time reverse trajectory is also a solution of the equation of motion), and the assumption of ergodic consistency.60 Ergodic consistency means that the combination of dynamics and ensemble are selected so that phase points along the nonequilibrium trajectories are represented in the initial distribution of states with a nonzero probability density. The dissipation function, O, is defined through its time-integral as: Z t t t ln fðCð0Þ; 0Þ O KðCðsÞÞds: ð2:1Þ fðCðtÞ; 0Þ 0 were f(C,0) is the equilibrium phase space density at the point C and f(C(s1),s2) is the phase space distribution function at a point C(s1) according to a distribution function that has evolved from the equilibrium distribution for a period s2. Using this notation, the condition of ergodic consistency can be expressed as f(C(t),0) a 0 for all C(0) for which f(C(0),0) a 0. As mentioned above, the time-integral of the dissipation function takes on the value of the extensive generalised entropy production, St, over a period, t under suitable circ*mstances.12,17 The main requirement is that the dynamics satisfies the condition know as ‘the adiabatic incompressibility of phase space’.60 In this case, St = JtFebV, where Jt is the dissipative flux caused by the field, Fe, b = 1/(kBT) where T is the temperature of the corresponding initial system and V is the volume of the system. An example where such a relation can be applied is if a molten salt at equilibrium was exposed to a constant electric field. In that case the entropy production would be directly proportional to the current induced, and the FR would describe the probability that it would be observed to flow in the +ve or ve { By time reversibility here, we mean that the time reversed trajectory exists. This therefore includes, for example, systems subject to a sinusoidal field where the phase at t = 0 is variable. It also allows inclusion of some classes of dynamics where a stochastic term is added.59

Chem. Modell., 2008, 5, 182–207 | 185 This journal is

c

The Royal Society of Chemistry 2008

direction. The FR can then be written: PðJt ¼ AÞ ¼ eAFe bVt PðJt ¼ AÞ

ð2:2Þ

and a negative dissipative flux implies a positive current. If Fe and the current are chosen to be positive (i.e. A o 0 and Fe 4 0) then the right hand side of this equation will increase exponentially with both the volume and time. That is, in the limit V N, the probability ratio will increase exponentially and it will become impossible to observe trajectories with the corresponding negative value of the current. In the limit t - N the same effect will be observed. So in the large system or long time limits, negative currents will not be observed. This is consistent with the 2nd Law of Thermodynamics. It demonstrates how irreversibility emerges, despite the assumption of the reversibility of the equations of motion in derivation of (2.2). However, (2.2) also tells us that when V and t are small, negative fluctuations in the current are possible. The ES FR can be used to provide a simple proof of the Second Law Inequality:5 ti 4 0 hO

(2.3)

that the entropy production cannot be negative. It should be noted here that the inequality does not apply to the instantaneous values of the dissipation function or the entropy production rate, but to total entropy production over the period t.61,62 These relations apply to systems with adiabatic dynamics as well as those that are thermostatted, say using artificial, reversible thermostats such as the Gaussian thermostat or Nose´-Hoover thermostat. In consideration of steady state dynamics, it is necessary to use such thermostats or ergostats. However, Williams et al.63 have recently provided a theoretical argument showing that using thermostats that are applied remotely (i.e. so that the driven particles are Newtonian but some ‘wall’ particles, separated from the main system by other Newtonian particles, are thermostatted, as illustrated in Fig. 1), the ES FR, which provides information about fluctuations in the field-driven system, is insensitive to the details of the thermostatting mechanism. They used MD simulations to support their argument. 2.2 Recent results for deterministic dynamics In the earliest paper on FRs, a footnote remarked that it is possible to obtain the Green-Kubo relations from them if the fluctuations have a Gaussian distribution.23

Fig. 1 Schematic diagram of a unit cell in a simulation where artificial thermostatting is applied remotely to the particles that are being driven by a field.

186 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

A published derivation of the Green-Kubo or fluctuation-dissipation expressions from the combination of the FR and the central limit theorem (CLT) was finally presented in 2005.16 This issue had been addressed previously and the main arguments presented,15 but subtleties in taking limits in time and field that lead to breakdown of linear response theory at large fields, despite the fact that both the FR and CLT apply,15,64 were not fully resolved.15,17 In the derivation of the ES FR, time-reversibility of the equations of motion is required. Thus time reversibility is a sufficient condition for the ES FR. The question arises as to whether time reversibility is a necessary condition. In numerical calculations where irreversibility was introduced by employing an applied field that had no definite parity under time reversal, it was difficult to observe the breakdown of the ES FR. Careful numerical experiments have finally shown that a breakdown indeed occurs and identifies how this comes about.65–67z This work confirms that time reversibility and ergodic consistency are necessary and sufficient conditions for ES FR. In the early work of Evans and Searles, it was presumed that: (i) provided a valid transient version of the ES FR holds for a given system (i.e. where the initial distribution is an equilibrium distribution); (ii) and provided the system is able to relax to a unique steady state at long times under the application of a dissipative field and a thermostat (Note: this is certainly not always the case.); (iii) and provided negative fluctuations in the dissipative flux can be observed in the steady state, then a steady state version of the FR (with the same dissipation function) would be valid. These arguments are described in references,4,17 and the resulting steady state ES FR is, t ¼ AÞ 1 PðO lim ln ¼A t PðOt ¼ AÞ

t!1

ð2:4Þ

However, despite numerical, and recent experimental evidence,68,69 that the steady state relation does hold under these assumptions, it must be said that there are some subtleties regarding convergence times and the decay of fluctuations in nonequilibrium steady states. Recently a much more rigorous derivation of the steady state ES FR has been given. This derivation demonstrated that the underlying assumption necessary to obtain (2.4) is that time-correlations decay sufficiently quickly.32,70 As discussed in more detail in section 5 below, there has been some confusion in the literature as to the similarities and differences between the GC FR and the ES FR. In steady isoenergetic dynamics O(t) L(t), but under other conditions they are not instantaneously equal. There has therefore been some work has also been carried out to clarify the situation.16,32 An aspect of the ES FR that has not been fully exploited as yet is the fact that the dissipation function is sensitive to the choice of the distribution function. Therefore, if a system is presumed to have a particular equilibrium distribution function, with an associated dissipation function, then a field is applied, the transient ES FR should be satisfied for all time. This provides a way of testing if a system is equilibrated, for example. If the FR is not satisfied for the presumed O, then it indicates that the equilibrium state is not what was expected. This fact has recently been used to establish that domains of the nondissipative, nonequilibrium distributions of glassy systems can be described by Boltzmann weights.71 Apart from the reversibility of the dynamics, the other key assumption in the derivation of the transient ES FR is that the initial distribution and the dynamics are ergodically consistent.4,60 In the same paper,71 Williams and Evans demonstrated that away from the actual glass { We note that in some cases use of symmetry in the experiment allows what would seem to have irreversible equations of motion to be set in a reversible form or to use time-reversal mappings other than the standard (q, p) - (q, p).3

Chem. Modell., 2008, 5, 182–207 | 187 This journal is

c

The Royal Society of Chemistry 2008

transition, the domains of the glassy systems were robust—that is, they did not change when the system was subject to small changes in the pressure or temperature—by testing the FR when this change was imposed. If the FR was not satisfied, it would indicate that ergodic consistency had broken down and that the domains had changed when the system was subject to the changes in the state variables. Williams and Evans72 have tested the transient steady state ES FR on glassy systems. They verified that while the transient ES FR is obeyed at all times, the ES FR converges slowly, and the convergence time increases as the glass transition point is reached. Since the steady state ES FR in conjunction with the central limit theorem can be used to derive the linear response expressions for transport properties, this indicates that the strength of the field for which linear response theory applies becomes smaller as this point is approached. The slow convergence of the ES FR is in accord with its derivation which relies on decay of correlation times for convergence.70 It was noted in that at sufficiently large fields, the derivation of the GC FR following Gallavotti and Cohen would break down.47 A modified version of the FR was proposed in that case. This was a curious result because the steady state ES FR derived by Evans and Searles becomes equivalent to the GC FR under some circ*mstances (constant energy dynamics), and therefore this appeared to be a contradiction. It has been difficult to test this result because at large fields the probability of observing negative fluctuations decreases, and for large systems the correction is quite small. However, recently a simple model was developed where the correction term would be significant.48 The model was a 1-particle system, thermostatted using Nose´-Hoover thermostats. It could be driven to a sufficiently high field that the proposed correction term was significant, but fluctuations could still be observed. No correction term was found to apply, and in fact the GC FR appeared to describe the data better at larger fields, as opposed to the prediction. It would seem that this system did not satisfy the chaotic hypothesis on which the GallavottiCohen derivation of the GC FR was based. Giardina` et al.73 point out the difficulty of testing or numerically predicting large deviation functions because they require sampling of rare events. A similar issue arises in testing the FRs. They devise a numerical procedure that allows the rare events to be sampled in a biased manner and therefore allow more efficient determination of the probability functions in the rare event regime. We should also point out that in some recent work on nonlinear response theory, it has been shown that there is an intimate and deep connection between the dissipation function of the fluctuation relation, and the general expressions for the exact nonlinear response of systems.74 Nonlinear response theory, originally developed for hom*ogeneously thermostatted or ergostatted nonequilibrium steady states60 can now be applied to a much wider class of systems through this use of the dissipation function.74 Evans et al.74 show how the dissipation function is the generally applicable variable in the nonlinear response theory expression. Andrieux and Gaspard use stochastic FRs to obtain non-linear response coefficients,75 and Harada and Sasa have formulated an expression for the energy dissipation in the nonlinear regime for Langevin systems.76 Intricate experimental studies have also recently enabled nonlinear response expressions (formulated in a different way than that in reference60 by Speck and Seifert77), to be tested and verified.78 The Fluctuation Relation has been used by Dewar79 to assist in establishing a basis maximum entropy production principle. The difficulties with this outside the linear response regime have recently been discussed by Bruers.80 2.3 Recent results for stochastic and quantum dynamics Although the transient FR (1.1) is usually applied to systems that are initially at equilibrium, the consideration of the derivation shows that this is not necessary, and the underlying requirement is just that the initial distribution is known. For many-particle deterministic dynamics, the nonequilibrium distribution function is 188 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

not known (at least not in a closed form), and therefore the derivation is restricted and only an asymptotic relation (2.4) can be derived. However, for stochastic dynamics, initial distributions are sometimes known, so then FRs can be developed for nonequilibrium steady states that are valid at all times—these are known as the ‘exact fluctuation theoerm’.68,81 For deterministic systems, the FR takes on the form (1.1) and the dissipation function is given by (2.1). However, in the case of stochastic dynamics, the same process might be modelled at different levels with different dynamics, and for each model a different fluctuation relation may be obtained. Therefore there are more papers on stochastic systems than on deterministic dynamics, as derivations for new dynamics allow new systems to be treated. This is particularly true for the Jarzynski relation, as discussed in section 3.2. Extensive theoretical work on FR for stochastic systems has been carried out by van Zon et al.82–84 In particular they considered fluctuations in the properties work, w, and heat, Q, which can be shown to correspond to the dissipation function and phase space contraction, respectively.53 They found that FRs of the form (1.1) and (2.4) were obeyed when the argument was w, but not for Q. One of the systems that they analysed was a particle dragged through a medium at constant temperature. Baiesi,81 also considered work and heat FRs for this system, describing the behaviour by a Langevin equation with a time-dependent potential and derived a modified heat FR. This study differed from that of van Zon et al., because they did not assume a harmonic potential and used an arbitrary protocol for movement. The results were tested computationally. Dhar85 modelled the stretching of a polymer using the stochastic Rouse model, for which distributions of various definitions of the work can be obtained. Two mechanisms for the stretching were considered: one where the force on the end of the polymer was constrained and the other where its end was constrained. Dhar commented that the variable selected for the work was only clearly identified as the entropy production in the latter case. In the former case they argue that the average work is non-zero for an adiabatic process, and therefore should not be considered as an entropy production, however we note that the expression is consistent with a product of flux and field as used in linear irreversible thermodynamics. In understanding experimental studies where a particle in an optical trap could be considered as a Brownian particle, FRs based on the stochastic Langevin equations were developed.86 This allowed analytic expressions for the entropy production and its probability to be obtained, and numerical predictions to be made. A similar approach has been used to study a Brownian particle diffusing in a periodic potential under steady state conditions and useful information characterising the fluctuations have been obtained analytically and from numerical calculations.70 Recently, there has been interest in considering FR for systems coupled to heat reservoirs at different temperatures.46,87,88 The relevant FR for systems modelled using deterministic dynamics was determined and tested numerically in 2001,89 however interest in heat flow for simple models is important for studies aimed at determining the nonequilibrium temperatures and studying Fourier’s Law, and therefore consideration of FR in these systems has received some attention. Gomez-Marin and Sancho88 and were unable to verify an FR proposed by Jarzynski and Wo´jcik,87 presumably due to neglect of boundary terms in the theoretical analysis that were relevant for their numerical calculations. Visco46 has studied the GC FR in a model heat flow problem. Fluctuation relations for stochastic models of granular gases have been studied and the ES FR was found to be verified, but the GC FR was not satisfied on the timescales considered.51,56,57 Golinelli and Mallik90 have examined FRs in mathematical models that have been developed for studying general properties of nonequilibrium systems, and which are analytically tractable. Tanaiguchi and Cohen91 consider the relationship between Onsager-Machlup theory and the FRs. Chem. Modell., 2008, 5, 182–207 | 189 This journal is

c

The Royal Society of Chemistry 2008

The first quantum versions of the FR appeared in the early 2000s.92,93 Unlike the classical, deterministic FR, in quantum systems there is no clear concept of a single trajectory,94,95 and there is some flexibility in the choice of the argument that appears in the FR, and therefore a number of FR have been obtained using different approaches.93,95 For some cases the FR takes on the form of (1.1),92,94 while in others a corrected formula is obtained.93 The formulation of FRs for quantum systems, and development of experiments to test them, is therefore ongoing. Recently De Roeck and Maes96 obtained a quantum FR for a system undergoing heat conduction due to coupling with two reservoirs at different temperatures. Jennings et al.97 note that in studying photosynthetic processes, thermodynamic assumptions are assumed, preventing negative entropy changes. They then derive results which indicated that there can be a situation where negative entropy production occurs. Their arguments were based on relations based on thermodynamic (averaged) properties and were therefore rightly disputed,98,99 however the work highlighted the point that for these small systems, for finite times finite-time negative entropy producing events might occur. 2.4 Experiments The first conclusive experimental tests of the fluctuation relations were on a colloidal particle in water, held by an optical trap.19,42 The trap was either translated by movement of the optical trap relative to the system, or used to ‘capture’ the particle by increasing the strength of the optical trap.19 More recently, Carberry et al.100 were able to experimentally demonstrate that the so-called ‘‘Kawasaki function’’, that is heOtti = 1 for any time, t, when the system is initially at equilibrium. This follows directly from the fluctuation relation and is also discussed in ref. 70. This relationship suffers from similar difficulties in convergence as do the integrated fluctuation relation and Jarzynski relation (see discussion t needing to be properly below), with rare trajectories with negative values of O sampled. It therefore serves as a useful experimental control to provide an indication of the level of sampling required for the integrated fluctuation relation and Jarzynski relation to converge. Similar elegant experiments have subsequently been carried out. Wang et al.68,69 verified the steady state version of the ES FR, firstly in an integrated form:68 t o0Þ PðO t 40Þ ¼ hexpðOt tÞiO t o0 t!1 PðO lim

ð2:5Þ

t o 0, where h . . . iO to0 implies that the ensemble average is over trajectories with O 69 and then in the complete form given by eqn (2.4). This was a particularly important result due to controversy that surrounded the validity of applying this relation to steady state dynamics at the time. In all these experimental studies, the particle was in a viscous fluid and therefore the equations of motion of the particle were well approximated by a stochastic Langevin equation. In 2007, a capture experiment was carried out in a viscoelastic solvent where this approximation no longer applies. It was shown that despite this, the experiments validated the ES FR, and therefore could not be consider just a special property of Brownian dynamics. Blickle et al.101 verified the fluctuation relation for the work (or dissipation function) for a system where the trap potential was not harmonic. Narayan and Dhar102 demonstrated the importance of choosing the correct expression for the entropy production (see discussion in section 5), by demonstrating that an FR for heat (corresponding to the GC FR53) is not obeyed in their experimental studies, whereas the FR for work (corresponding to the ES FR53) is. 190 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

Garnier and Ciliberto103 verified an FR for work (equivalent to O in this paper53) of the form given by eqn (1.1), and a heat fluctuation relation as predicted by van Zon et al.82–84 They studied fluctuations in the power injected to an electrical dipole that is subject to current. Recently, Douarche et al.104 verified the transient ES FR and steady state ES FR for a harmonic oscillator (a brass pendulum in a water–glycerol solution, that is driven out of equilibrium by an applied torque). They also developed a steady state relation for a system with a sinusoidal forcing, and showed that the convergence time was considerably longer in this case. Schuler et al.105 and Tietz et al.106 excite a single defect in an diamond by a laser in a periodic manner. They test the Kawasaki function, which is obeyed, and the ES FR and find that for symmetric protocols it is obeyed. Andrieux et al.107 develop stochastic FRs from which the change in entropy can be obtained, and consider two experiments to verify the FR: a dragged Brownian particle and electric circuit with imposed mean current.107 Seitaridou et al.108 have examined a flux FR for particles undergoing diffusion. In this case the dissipation function will be proportional to the flux. They find a linear relationship, consistent with eqn (1.1), however, it is unclear if the slope is consistent with the predictions of (1.1) as the proportionality constant is not known. It would be interesting to examine this experiment further.

3. Free energy relations 3.1 Background In the past few years, development of new theories have led to completely new ways of determining free energy changes. Traditionally, the difference in the free energy of two equilibrium state is (DF1-2) and the free energy change of a process can be obtained directly from the statistical mechanical definition of the free energy, F, Rin terms of the partition function. For the canonical ensemble F = kBT ln dCebH(C) = kBT ln Z, where kB is Boltzmann’s constant, H(C) is the phase variable corresponding to the internal energy, and also the Hamiltonian of the dynamics that preserves that internal energy, T is the temperature of the system, and b = 1/(kBT). That is, DF1-2 = kBT ln (Z1) kBT ln (Z2) = kBT ln heb(H2(C)H1(C))i1

(3.1)

where the initial state has Hamiltonian H1 and the final state has Hamiltonian H2. The notation h . . . i1 implies that the equilibrium distribution is taken with respect to the distribution function of the initial state. This approach is referred to as free energy perturbation (FEP) and was introduced by Zwanzig. Due to numerical issues related to determination of the final expression directly when the free energy difference is large, it is usual to break the procedure into tractable (often nonphysical) stages that take the system step-wise from state 1 to state 2, and then to use thermodynamic integration to obtain an overall result. Bias potentials or reweighting schemes (e.g. umbrella sampling methods) have been used to improve the sampling. Alternatively, if the change from state 1 to state 2 is carried out so slowly that the process can be considered ‘reversible’ the well known relationship between the change in free energy and the reversible work done in transforming from state 1 to 2, w1-2;rev, applies: DF1-2 = w1-2;rev.

(3.2)

In simulations, the change from one state to another is achieved by slowly and continuously varying a control parameter that produces state 1 initially and state 2 finally. A completely reversible situation can only be approached, and the procedure is often referred to as a ‘slow switching’ approach. Chem. Modell., 2008, 5, 182–207 | 191 This journal is

c

The Royal Society of Chemistry 2008

From differentiation of the partition function and carrying out a reversible change in the internal energy using parameter, l, (H = H0(1 l) + H1l) that goes from 0 to 1, it can be shown that: R (3.3) DF1-2 = 10hH1 H0irev,ldl. Evaluation of free energy differences using eqn (3.3) is referred to as thermodynamic integration. Development of the Jarzynski equality (also known as the Work Relation or the Nonequilibrium Work Relation) in 1997, and the Crooks FR (or Crooks Identity or Crooks Fluctuation Theorem) have provided new approaches to the calculation of free energy differences, enabling their determination by following irreversible (and therefore possibly fast) pathways. The approaches have not yet resulted in a generally more efficient algorithm for calculating macroscopic free energy differences. Indeed it is probable that these new approaches will not lead to more efficient schemes for macroscopic systems. However for nano-scale systems these new approaches offer much promise and have already proved their worth. These new methods are often referred to as a ‘fast switching’ method. The Crooks FR, eqn (1.2), compares the probability that the work done in going from state 1 to 2 takes a value of A, with the probability that it takes a value of A in going from state 2 to 1. The ratio of these probabilities is directly related to the difference in free energy of states 1 and 2 through eqn (1.2). As mentioned above, the JE (1.3) is obtained by integrating (1.2), and it gives another expression from which the free energy difference between the states can be obtained. The change is made parametrically varying H(C,l(s)) over a period, t, with the parameter, l, changing so that H1(C) = H(C,l(0)); H2(C) = H(C,l(t)). The initial distribution is assumed to be canonical, f¼R

ebHðCÞ ebHðCÞ ¼ bHðCÞ Z e dC

ð3:4Þ

Originally the equations of motion for the process were selected to be the Hamiltonian equations of motion obtained using the time-varying H. That is, @HðC; lðsÞÞ @p @HðC; lðsÞÞ p_ ¼ @q q_ ¼

ð3:5Þ

and the work in this case is, w1!2 ¼

Z 0

t

@HðC; lðsÞÞ _ lðsÞds @l

ð3:6Þ

In contrast to relation (3.2), the path over which the work is measured does not have to be thermodynamically reversible, so the system can be out of equilibrium during the process. The time-dependence of l (often referred to as the ‘protocol’) is arbitrary from a theoretical perspective, so the nonequilibrium dynamics can have enormous variety. However, as discussed below, some protocols are more feasible from a numerical point of view. Theoretically, the only restriction is that the dynamics be selected so that if f1(C(0)) a 0, then f2(C(t)) a 0. During the ‘reverse’ trajectory from state 2 to 1, the time-evolution of l should be reversed. Recently it has been recognised that the equations of motion can be further modified and can even be non-Hamiltonian. This is discussed below. Eqns (3.1) and (3.2) can be considered as direct consequences of (1.3). Eqn (3.1) is obtained when l is a step function and the change is applied instantaneously, whereas (3.2) is obtained by taking the infinitely slow path. Furthermore, using (1.3), it can be shown that the work is maximised in the reversible process and the work 192 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

done in an irreversible process is always greater than the difference in free energy between the states:7 DF1-2 Z w1-2;irrev

(3.7)

Both the Crooks FR and JE have been verified experimentally and used to obtain free energy changes. A number of recent papers discuss the relationship of the average value of the irreversible work that appears in the Crooks and Jarzynski relations with the irreversible entropy production.109,110 In fact Ben-Amotz and Honig define an excess entropy (difference between the irreversible and reversible entropy change) in terms of averages of the irreversible work providing a relationship which they refer to as a ‘rectified 2nd law’, although they make some strong assumptions on the temperature that they choose to use in the processes. In the past there has been some discussion on the validity of the JE in real systems due to assumptions based on the coupling between the system and the bath.9,111 However, the relationship has now been derived in various settings, including deterministic, fully thermostatted systems,112,113 as well as tested experimentally. The controversy now seems to have settled and the JE it is widely accepted and applied.114 It is implemented in molecular dynamics programmes the such as the widely used NAMD software.115 A recent paper116 notes that the JE is an identity, but points out that if an adiabatic experiment is carried out that starts at one temperature, is altered parametrically over a finite period, and then is allowed to relax, the difference in free energy between the initial state and the final state will not be that predicted by the Jarzynski equality. This is because even if the final system reaches equilibrium, it will not be at the required temperature. They also note that some processes will not even relax to a new equilibrium state. However, we note that it is also possible to carry out the whole process under thermostatted conditions,112 leading to the same expression, and in that case the difference in the free energy of the state at the beginning and end will be equal to that predicted by the Jarzynski equality, provided the system relaxes to equilibrium. As recognised in ref. 116, the JE was not supposed to give the difference in free energy along such a path, but the difference in free energies between two states at the same temperature, and the free energy obtained is equal to the work done in a fictitious, reversible path between those two states. Therefore the observations in ref. 116 have no impact on the validity of the JE. Recent reviews on the Jarzynski equality and related theorems discuss the method and its application.17,27–33,36,37,39–41 In section 3.2 we describe recent developments associated with the Crooks FR and JE, citing papers where the different approaches have been used and compared. A special issue of Molecular Simulation ‘‘Challenges in Free Energy Calculations’’ published in Jan–Feb 2002 includes articles on the different approaches to free energy calculations, including the Jarzynski approach. A number of reviews consider alternative approaches to calculation of free energy differences, and the ‘traditional’ methods are continually being update and improved (see, for example, refs. 117–121.) Fowler et al. describe how calculations can be sped up using a computational grid.122 Here we will focus on the JE and Crooks FR, and for more general reviews on recent advances in the calculation of free energies refer the reader to reviews by Rodinger and Pome`s,123 Kofke,37 Adco*ck and McCammon124 and Hu and Yang.125 3.2 Recent theoretical results An instructive study of the JE for an exactly solvable model of an ideal gas under expansion (where the probability distribution of the work can be derived) verifies the relationship and shows how its predictions compare with numerical results, providing insight into its application.64 With similar motives, a model for effusion of the ideal gas is considered,126 and more recently, this group has extended this analysis to Chem. Modell., 2008, 5, 182–207 | 193 This journal is

c

The Royal Society of Chemistry 2008

include momentum transfer during the effusion.127 The effusion is considered as a stochastic process, and good agreement with simulations of hard spheres is obtained. Recently, Crooks and Jarzynski128 consider a dilute classical gas of interacting particles. They obtain the work distribution function and verify the JE and Crooks FR. They also test the relationship obtained when the distribution of the work is approximated as a Gaussian, showing the limitations of this approach. Dhar85 has considered a stochastic Rouse model for the stretching of a polymer where the distribution of the work is shown to be Gaussian and the JE is verified, and Piana studied the stretching of a DNA dodecamer to obtain the free energy curve.129 Imparato et al.130 use an Ising model for protein unfolding and generate the free energy landscape using a JE-like equality. Chvosta et al.131 consider another case where the work probability distribution function can be determined. They study a two energy-level system, modelled as a stochastic, Markovian process, where the transition rates and energies depend on time. Like the previous examples it provides an exact model that can be used to assist in identifying the accuracy of approximate, numerical studies. Ge and Qian extended the stochastic derivation for a Markovian chain to a inhom*ogeneous Markov chain.132 Recently, Kawai et al.133 define the time average of the dissipative work for systems undergoing adiabatic dynamics using a very similar setup-up to that used by Evans and Searles4 to give eqn (2.1) for the dissipation function (which is equivalent to the dissipative work, and is simplified for adiabatic dynamics where L = 0 at all times) and consequently derive (1.1). However, in determining the probability of a trajectory, rather than considering the distribution functions at the initial point of trajectories and their time-reversed conjugates, they consider them halfway along, and they also allow the initial equilibrium distribution functions to be different giving: wdiss ¼ ln

f1 ðCðt=2Þ; 0Þ f2 ðC ðt=2Þ; 0Þ

ð3:8Þ

where C* is the time-reversed map of C (q, p), C* (q, p). Futhermore in generating C*(t/2), the protocol is reversed (note that the Evans-Searles treatment D E is

restricted to protocols that are time-reversible). The property

ðCðt=2Þ;0Þ ln ff21ðC ðt=2Þ;0Þ

is

recognised as the Kullback-Leibler distance, or relative entropy, between forward and backward probability distributions, and therefore the average dissipation function or average dissipative work in this case can be considered to be a relative entropy. This can then be related to the difficulty of distinguishing forward and backward trajectories and to the breaking of reversibility or detailed balance.133 Imparato and Peliti134,135 show how the work distribution for a given protocol can be obtained using a joint probability distribution for the work and the initial state, and apply it to system represented by a coordinate in a mean field. They show that it leads to the JE, and demonstrate some numerical issues when applied to large systems where it becomes difficult to observe fluctuations. They also provide an analytical treatment of a driven Brownian particle, obtaining the probability distribution for the work and FR for this system.136 Paramore et al.22 show that the JE can be extended to consider variables other than the work, and thereby derive the expression for the determination of the free energy of a system where a reaction coordinate is fixed. Horowitz and Jarzynski21 examine the connection between a result by Bochkov and Kuzovlev137 that can be used to obtain an FR, the Crooks FR and the ES FR. They show that the relationships differ in their definition of the work and range of applicability. Recently Jayannavar and Sahoo138 have obtained an FR for a stochastic model of a charged particle that is translated or oscillated in a magnetic field and show that the free energy is not affected by the magnetic field, in agreement with the Bohr–van 194 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

Leeuwen theorem. Marathe and Dhar66 study an Ising spin in a time-varying magnetic field. They verify the Crooks FR and JE, and confirm that time reversibility is required for the transient ES FR. They also observe that the steady state ES FR is not obeyed, confirming that this is an asymptotic result and indicating that observation times were insufficient for convergence. Originally the JE treated processes that were weakly coupled with a thermostat6 implying a theory for adiabatic processes. In subsequent work Jarzynski7 developed a stochastic approach where this was not required. Recently, Baule et al. considered the isothermal expansion of a gas as a stochastic process, and showed that the JE applies.139 Evans112 also considered general deterministic processes that were directly thermostatted, deriving the Crooks FR and JE in a way that closely parallels the derivation of the ES FR.4 The details of the thermostat were somewhat arbitrary and the treatment could include Gaussian and Nose´-Hoover thermostatted systems. A very similar approach was later used by Cuendet140 but only for systems specifically thermostatted with a Nose´-Hoover thermostat. Cuendet later extended it to other thermostatting (also barostatting) mechanisms.141 Scho¨ll-Paschinger and Dellago113 also obtained an FR for systems whose dynamics preserved a canonical distribution when the parameter, l, was fixed. Adib142 determined a fluctuation relation for the free energy difference between two constrained configurations (i.e. with q fixed at qA and then at qB) by allowing the two different constrained configurations to relax to equilibrium. As noted in that work, this relationship is fundamentally different to Crooks (Jarzynski) because no work is performed during the process. The FR is given by PðqA ! q; tÞ ¼ ebDFA!B PðqB ! q; tÞ

ð3:9Þ

where P(qA - q; t) denotes the probability that a system whose configuration was fixed at qA relaxes to a configuration q at time t. The relation is considered in the context of experiments that allow single molecules to be constrained in particular configurations. The JE and Crooks fluctuation relations were originally developed to treat systems in canonical equilibrium states. Adib143 extended the JE and thermodynamic integration schemes to obtain DS of isoenergetic systems that originate in a microcanonical ensemble. The FR is obtained by designing constant energy equations of motion and is given by: heLtiE = eDS1-2/kB

(3.10)

where the subscript E is used to indicate that the ensemble average with R t is taken @ respect to the microcanonical distribution with energy E, and Lt 0 ds @C C_ is the phase space expansion. In a very general treatment for arbitrary ensembles and both deterministic and stochastic dynamics, Reid et al.144 discuss the relationship of the ES FR with the Crooks FR and a new ‘conjugate’ version of the Crooks FR. They tabulate the arguments of the these FR for the micro-canonical, canonical and isothermal–isobaric ensemble, indicating that the Crooks FR is in accord with the JE given by eqn (3.10): PE ðLt;1!2 ¼ AÞ ¼ eAþDS1!2 =kB PE ðLt;1!2 ¼ AÞ

ð3:11Þ

In a similar study by Cleuren et al.,145 samples from an initial microcanonical ensemble are considered (on an energy shell with energy E), but the dynamics in not isoenergetic, and therefore each trajectory moves off the energy shell. The Crooks Fluctuation relation is then given by: PE ðw ¼ AÞ ¼ eðSEþw SE Þ=kB ¼ eDS1!2 =kB PEþw ðw ¼ AÞ

ð3:12Þ

Chem. Modell., 2008, 5, 182–207 | 195 This journal is

c

The Royal Society of Chemistry 2008

which contrasts to other versions of the Crooks FR where the probability distribution considered in the denominator is the same for all values of w. In this paper the authors also attempt to explain how the GC FR can be obtained from this result, however the thermodynamic limit is taken and the fluctuating term (w) is equated to the average change in energy (DE) in their argument, and therefore this argument is not generally applicable. The JE has also been extended to cases where there is a change in the temperature of the system146 for Markovian systems. Recent work has shown that this is also readily derived for deterministic systems.147,148 In both cases, the relationship becomes Rt _ _ dsðbðsÞHðCðsÞ;lðsÞÞþbðsÞ HðCðsÞ;lðsÞÞ Þ 0 e ¼ eðb2 A2 b1 A1 Þ ð3:13Þ 1

and in the particular case where there is no work done on the system due to change in the Hamiltonian, and the change in b is linear

he(b2b1)Hti1 = e(b2A2b1A1)

(3.14)

In ref. 146 this approach is applied to determine the surface tension of a 3D Ising model. Other derivations for free energy differences in ensembles other than canonical NVT ensemble have been presented,141,148–150 as well as the ratio of partition functions for systems that are in different ensembles.148 The results have also been tested numerically.151 In our group, the difference in free energy has also been derived and verified in numerical simulations of realistic systems where the system of interest is thermostatted and/or barostatted externally.150 One of the interesting aspects of the FR is that the protocol used is rather arbitrary (at least in a theoretical sense). As well as varying the protocol, however, external fields can be applied during the period over which l is changing.17,21 These fields will do additional work on the system, and this must be included in the expression for work. In addition, the system may be thermostatted. A thermostat will not do work on the system, but will contribute to Q. For example the equations of motion could be written, @HðC; lðsÞÞ þ C Fe @p @HðC; lðsÞÞ þ D Fe ap p_ ¼ @q

q_ ¼

ð3:15Þ

where C and D represent the coupling of the field to the system, and a is the thermostat multiplier that extracts heat. In this case the work that appears in the Crooks FR and JE is, Z t @HðC; lðsÞÞ _ @H @H lðsÞ þ C Fe þ D Fe ds w1!2 ¼ ð3:16Þ @l @q @p 0 Use of appropriately chosen fields may improve the convergence properties of the JE, and this is currently under investigation. In deterministic systems, it is the dissipation function that is the subject of the FR (1.1). In contrast, it has been demonstrated that for stochastic systems, there can be more than one property that satisfies a fluctuation relation.86,152 There have been a number of quantum versions of the Crooks FR and JE. In 2005, Monnai153 presented a paper that clearly demonstrated how the FR, as presented by Crooks, could be readily extended to quantum systems in general. They provide a definition for the argument of the FR and show that in the quantum system there is some flexibility in this argument, depending on the choice of observables made in determining the probability ratio of paths. Also in 2005, 196 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

Allahverdyan and Nieuwenhuizen154 consider Thomson’s formulation of the second law and argue that the variable of the quantum FRs in previous studies should not be considered to be the work. They discuss different definitions of the work in quantum systems and argue for introduction of a new definition. Esposito and Mukamel95 present transient and steady state FRs as well as JE by developing a quantum mechanical trajectory, and then carrying out a derivation in a similar way to that used for stochastic dynamics. Talkner and Ha¨nggi155 recently derived a quantum version of the Crooks FR using a characteristic function for the work obtained by Talkner et al.156

3.5 Applications in simulations There have been many applications of the nonequilibrium free energy calculations in the literature, and here we outline a number of them. The Crooks FR has been using in simulations and experiments, but presumably because of the simplicity of evaluation of the free energy using JE, this relationship is much more widely used. In some particular examples, advantages of the Crooks approach have been noted.157,158 JE has shown considerable promise in steered molecular dynamics studies of the free energy landscape, and in particular it has become a popular choice for protein unfolding dynamics probably partly due to the exciting experiments carried out soon after its discovery. In this regard, the unfolding of deca-alanine has provided a relatively simple, yet realistic model that has been used to compare and test different approaches.118–120,159–161 However, like other approaches to determining free energy differences, the JE can be troubled with numerical accuracy. In slow switching techniques, thermodynamic integration and FEP, problems associated with equilibration are met. While these are largely avoided in the JE, adequately sampling the distribution to obtain a reliable ensemble average, that is also an issue in FEP methods, can be problematic. The difficultly lies in the fact that rare events when the work is large and negative, will give large contributions to the ensemble average. Depending on the shape of the distribution, this can cause errors that are difficult to eliminate. Recently several studies have considered the performance of the JE and Crooks FR with other methods. It has been observed that in some cases the new approaches are more efficient157 and in others the traditional approaches are still preferable. In order for JE to be accurate one must sample the extremely rare events which if investigated individually would appear to violate the second law (the waterfall running backwards or the jet engine running backwards generating oxygen and kerosene). If these time reversed events cannot be sampled both JE and Crooks will not yield accurate results. Moreover in the case of the JE, the results will appear converged, so some control is required to ensure that they have converged to the correct result. A number of approaches have been used to try to improve the convergence of the JE in cases where it is problematic. Some of these are analogues of processes introduced to improve sampling in the FEP approaches. For example, Adjanor et al.162 introduce path-biasing schemes to improve convergence. They consider a stochastic system, and carry out tests with simulations on clusters of LJ particles. Kofke38 has provided a useful analysis to determine when numerical problems will occur in the calculation of the JE, and considers a pedagogical example. It is demonstrated that convergence will be problematic when the probability distributions of the work for the forward and reverse protocols are well separated. This will generally increase as the number of particles, rate of change of the parameter and size of the perturbation are increased. Kofke proposes using the relative entropy to assist in the assessment of the accuracy of the results obtained. Chem. Modell., 2008, 5, 182–207 | 197 This journal is

c

The Royal Society of Chemistry 2008

Wu and Kofke163,164 discuss the significance of the overlap of the phase space domain of different states in the testing and use of FRs. They also introduce Rosenbluth sampling to assist in convergence of the JE to the correct value.165 MacFadyen and Andricioaei166 used as scheme whereby the initial distribution is modified by increasing the variance of the momentum distribution. They show that the results obtained significantly improve the accuracy of the results for a simple model of molecular isomerization. Lechner and Dellago167 compare the efficiency of various approaches for evaluation of the free energy using the JE for a particle pulled through a viscous liquid and for the expansion of an ideal gas. In both cases the analytic result for the probability distribution of the work is known. They describe and compare thermodynamic integration, fast switching methods and variations of the JE: (i) a direct method; (ii) a method based on thermodynamic integration where the sampling of important rare events is increased by introduction of a biasing parameter; (iii) a method based on umbrella sampling with a weighting function based on the path. It is found that the indirect methods can significantly increase the efficiency of the calculations. Lua and Grosberg168 also treat an ideal gas expansion and show that the number of samples required to give accurate results increases exponentially with system size. Shirts and Pande169 also compare methods, noting that for simple models there is no universally preferred approach and then examine the different methods for particle insertions. Ytreberg et al.170 consider the free energy change due to a change in the size or charge of a molecule in a solvent in their study on the efficiency and precision of various approaches. Using the fact that, in theory, the JE approach is insensitive to the path taken, Lechner et al.171 showed that the timestep in the numerical integration of the nonequilibrium trajectory can be made quite large, thus improving speed without losing accuracy. A low mean value of the work distribution seems consistent with a distribution that samples the negative fluctuations more regularly, however this is not certain as the variance and higher moments might shrink at the same time and reduce negative fluctuations. Nevertheless, assuming that the minimum mean work leads to the best sampling path, Schmiedl and Seifert172 determined the optimal path by minimising the mean work using different protocols. They found that in two cases, the optimal path involved step functions in the protocol. Oostenbrink and van Gunsteren173 study processes that are often met in biomolecular studies—redistribution of charges, creation and annihilation of neutral particles and conformational changes-comparing calculations of free energy changes using thermodynamic integration and the JE. They find that using the standard approaches, thermodynamic integration is more efficient, but that if slow conformational changes occur the JE offers an advantage. Li et al.174 compare different approaches, in the study of the conversion of ethane to methanol. They employ a l weighted histogram analysis method (WHAM) in order to improve statistics. Minh175 uses WHAM to develop a multi-dimensional potential of mean force. Lelievre et al.176 use a projection of the dynamics, rather then a constraint to follow the reaction coordinate. Bastug and Kuyucak177 compare the success of the JE for a simple and complex system for a restricted range of parameters. They showed that it was necessary to carry out the change slowly to get good results in both cases. They also carried out a simulation of double occupancy ions in a channel using the JE to determine the free energy profiles with the aim of examining biological process.178 Recently Calderon159 introduced a ‘surrogate process approximation’ (SPA) to improve the sampling in calculation of the JE. The scheme is applied to the study of the unravelling of deca-alanine at constant temperature in a steered molecular dynamics simulation. The distribution of the work is approximated by developing a model for the dynamics using a relatively small number of real trajectories in conjunction with stochastic differential equations selected to model the process. The 198 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

scheme avoids the need to make assumptions about the distribution of the work, and takes advantage of the fact that there is generally a lot of flexibility in the choice of the nonequilibrium process that takes one from the initial to the final state. Min et al.,179 consider the free energy change during a stochastic model of an enzyme reaction scheme. They consider the Crooks FR, the JE (via the Kawasaki function) and the results from a Gaussian approximation (fluctuation dissipation theorem) and show that in this case the Crooks FR is more efficient. They also consider the errors associated with the schemes for finite sampling. In order to circumvent problems associated with sampling, in some situations where the distribution is Gaussian it is possible to use limited data in conjunction with JE or the Crooks FR to obtain a simpler expression for the free energy difference.180,181 A cumulant expansion of the JE is also sometimes used, with a second order cumulant expansion: b DF1!2 ¼ hw1!2 i1 ðhw21!2 i1 hw1!2 i21 Þ 2

ð3:17Þ

being exact if the distribution is Gaussian. We would be cautious with such approaches unless it was known that the distribution is indeed Gaussian. This is because the convergence difficulties occur in the wings of the distribution where the noise is large, and the central limit theorem does not necessarily predict a Gaussian distribution. This departure from a Gaussian will be difficult to observe and erroneous assumptions on the distribution will lead to poor results. This view is supported by work on a system where an analytical expression for the work is known.128 If the distribution is approximated by a Gaussian or a truncated cumulant expansion, preliminary studies should be carried out to compare the results with other approaches or a full JE or Crooks FR evaluation to ensure that the distribution is well approximated by a Gaussian over the relevant range of the values of work. Recent work by Presse´ and Silbey discuss the application of cumulant expansions to different systems.182 Schettino et al.183 demonstrate this point in their protein-folding/unfolding simulations. They show that the use of a Gaussian approximation for distributions (in situations where a Gaussian distribution has been assumed to be a good approximation) can lead to inaccurate results. Liu et al. also carry out simulations on RNA unfolding, and create a free energy landscape for the process. They find that there is no barrier to unfolding, and question models based on a two-level system.184 Mihn185 demonstrates how data obtained using various pulling velocities can be combined to improve free energy estimates. Orzechowski and Cieplak186 carry out steered molecular dynamics studies using the JE to look at formation of a DNAdrug complex and stretching of molecules, however the number of samples used in the ensemble averages were only 10, which by far too small to give useful results. Gao et al.187 give a clear summary of the use of the JE in steered MD on proteins, discussing the reconstruction of the potential of mean force using the JE, and also problems that can occur. They provide some solutions to these problems in the case of a distribution of work that is Gaussian. West et al.188 study protein folding and point out that care needs to be taken when comparing the predictions of the JE to those of experiments, as the JE (1.3) normally considers averages over an initial equilibrium state, whereas the experiments constrain the initial configuration. In ref. 189 a potential of mean force is calculated and used to determine diffusion of ubiquinone through a light-harvesting complex determined using MD simulations. The complete JE and JE with a Gaussian approximation are used and compared.189 Bidon-Chanal et al.190 have used JE in steered MD studies of free-energy profile for ligand diffusion through a tunnel system of truncated haemoglobin, and Amaro et al.191 used steered MD simulation and calculation of free energy using JE to Chem. Modell., 2008, 5, 182–207 | 199 This journal is

c

The Royal Society of Chemistry 2008

determine the passage of ammonia through a barrel enzyme. They used a 2nd order cumulant expression for the free energy. Hwang et al.192 use the second order cumulant expression to examine the potential of mean force for ion transport in a channel—specifically the transport of Na+ and K+ ions in a cyclic peptide nanotube in water. They show how the de-solvation of the ions on entering the tube produces a free energy barrier and an attractive interaction between the ions and the functionalised tube provide free energy minimum in the tube. A second order cumulant expansion of the JE is also used to study the transport of sugar across lactose permease,193 and the potential of mean forces as the distance between dimers of cyclic peptides was decreased.194 In the latter case, a full average was also performed to check the accuracy of the assumption of the results obtained using the Gaussian approximation, and it was found that it was necessary to increase the transformation time in some cases in order to get reasonable agreement. Wang et al.195 use a Gaussian approximation to examine the potential of mean force of glycerol passing through an aquaporin. Classical simulations are used to examine the transport of di-oxygen across to the active site of the heme protein.196 An expansion was not used, and different scanning times and agreement of the profiles were used to assess the convergence of the results. Vemparala et al.197 use the JE and methods based on the JE to examine the transport of a halothane molecule across a lipid/water interface. QM-MM simulations were used to generate trajectories for the steered molecular dynamics calculation of the free energy profile for the conversion of two molecules (chorismate to prephenate) involved in the biosynthesis of amino acids.198 The results showed good agreement with those obtained from umbrella sampling. Only fair agreement with experiment was attributed to the accuracy of the density function theory calculations. The JE is used by Mu and Song199 to calculate the interfacial free energies of different crystal orientations for a Lennard-Jones system, and they find it more efficient than thermodynamic integration. Due to the changes in the dynamics, a general relationship for stochastic dynamics is not available like it is for deterministic dynamics. However, for mesoscopic systems, a mesoscopic FR is useful.200 Therefore, there has been much work on developing stochastic models with different conditions. Andrieux and Gaspard developed a stochastic fluctuation relation for nonequilibrium systems whose dynamics can be described by Schnakenberg’s network theory (e.g. mesoscopic electron transport, biophysical models of ion transport and some chemical reactions).201 Due to early experimental work on protein unfolding and related molecular motors, and their ready treatment by stochastic dynamics, a number of papers have appeared that model these systems and test the FR200,202 or JE for these.202 FR have also been obtained for stochastic models of chemical reaction networks.203,204 A number of fluctuation relations have also been developed with the view to design of experiments (see section 3.6) or devices. Recently a stochastic model of a ‘molecular refrigerator’ has been developed and the relevant expression for the FR derived.205 It is a Brownian particle with magnetic moment that is captured by an optical trap, tethered to another particle using a polymer to prevent rotation, and then subject to magnetic field that acts to counter the motion due to thermal fluctuations. The magnetic field thereby serves to ensure that the kinetic energy of the bead is less than its surroundings. The subject of the JE is no longer just the work due to the movement of the optical trap, but also includes a contribution due to the feedback mechanism. 3.6 Experiments The first tests of the JE and Crooks FR were by Liphardt et al.206 who used optical tweezers to extend a DNA-RNA hybrid chain, measuring the work required as the 200 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

extension proceeded. As well as demonstrating the ability of observing fluctuations that would allow the JE and FR to be applied, it led to the use of the JE as an experimental tool for studying protein folding and for generating free energy landscapes. More recently, Collin et al.158 carried out an experiment using the Crooks FR to determine the difference in free energies of an RNA molecule and a mutant that differs by one base pair. The Crooks FR was shown to be useful far from equilibrium where insufficient sampling hampers convergence of the JE. Hummer and Szabo40 demonstrated that in single molecule stretching experiments, the JE provides an expression for the work at different times, whereas from an experimental point of view it is of more interest to know the free energy difference between states at different extensions of the molecule. They show how this can be obtained and apply it in experiments. Blickle et al.101 carried out an experiment resembling the optical trap ‘drag’ experiment discussed in section 2.3 to verify the JE in a system with a non-harmonic optical trap where there was no free energy change. Noy has used207 the JE to benefit in interpretation of experimental results of chemical force microscopy where the probes of atomic force microscopy are functionalized, and Douarche et al.208 have verified the Crooks FR and JE for fluctuations in the work of a mechanical oscillator that is in contact with a reservoir and driven by a large external field.

4. Fluctuation relations and irreversibility Several works have discussed the implication of the FR and JE on thermodynamic irreversibility.17–19,86,144,209–213 It should be mentioned, that underlying the derivation of these relationships is the assumption of causality.18 That is, in determining the probability of sets of trajectories we assume that they are sampled from an initial known phase space density, not a final one. Should we assume the reverse, an ‘antiFR’ would be obtained which would predict that, on average, the entropy decreases. However, given this, the reversible equations of motion lead to relations such as the Second Law Inequality (2.3), and the maximum work principle (3.7) that clearly shows irreversibility. Furthermore, in the form given by eqn (2.2), the way in which irreversibility emerges with increasing observation time or volume is evident. Recent papers have noted the connection of the FR to Kullback-Leibler distance, or relative entropy133,212 which measures the irreversibility of the system and Sevick et al.17 consider a similar property—the departure of the average of the timeaveraged dissipation function from zero—as a measure of irreversibility. The effect of system on size on reversibility is discussed in ref. 213.

5. Comment on the interpretation of the fluctuation relation The arguments of the deterministic fluctuations relations are well defined (see eqns (1.4), (2.1) and (3.6) above). In the ES FR, O is the subject of the fluctuation relation, in the Crooks FR it is the work and in the GC FR it is L. However, considerable confusion has developed in the past due to the temptation of equating fluctuations in properties when they have the same mean. For example, in field driven nonequilibrium system discussed above, the argument of the transient fluctuation relation is O = J FeVb, where J is the dissipative flux, Fe is the applied field, V is the volume of the system, and b = 1/(kBT) where T is the temperature. For a thermostatted system, the ensemble average value of hOi (which can be interpreted as work multiplied by b) is equal to the average phase space contraction hLi = h3Nthermai where Ntherm is the number of thermostatting particles and a is the thermostat multiplier and which can be interpreted as heat. However, the fluctuations in these properties are very different, and if O satisfies a fluctuation relationship the fact hOi = h3Nthermai alone clearly does not imply that 3Ntherma does, even in the long time limit. Nevertheless, this is often wrongly assumed. A clear numerical Chem. Modell., 2008, 5, 182–207 | 201 This journal is

c

The Royal Society of Chemistry 2008

demonstration of the difference in the fluctuations of the two properties is given by Dolowschia´k and Kova´cs49 who study the Nose´-Hoover thermostatted periodic Lorentz gas. The problem is possibly perpetuated by the fact that the argument of the fluctuation relations is often referred to as the entropy production, although the entropy production is only defined in the thermodynamic limit and therefore many properties (with different fluctuations) will average to give an entropy production. The argument is also often referred to as ‘work’ or ‘heat’ when these properties are also not uniquely defined (see ref. 21, for example). However, another reason that this result has led to some confusion in the literature is that in the original works on the fluctuation theorems,1,2,23 isoenergetic systems were considered. In this particular case J(t) FeVb = 3Ntherma(t) instantaneously (i.e. the dissipation function is equal to the phase space contraction), and therefore the fluctuations in these properties are identical in this special case. This has led to authors mistakenly suggesting that, even in thermostatted systems, the ES FR refers to phase space contraction8 and/or that the GC FR refer to the dissipative flux when only the reverse is true in general.16 It has even resulted in the misleading adoption of the term ‘Gallavotti-Cohen Fluctuation Theorem’ for the fluctuation relations irrespective of the argument involved, although the derivation by Gallavotti and Cohen refers specifically to fluctuations in the phase space contraction. It should also be noted that different approaches or theorems can lead to FR that apply to the same property, but under different conditions. For example FR can be derived using the approach of Evans and Searles or Crooks that apply at all times, in contrast to the approach of Gallavotti and Cohen who require a long time limit (see discussion on heat flow where this distinction was important.88,214,215 The Crooks derivation does not require that the protocol or applied field be time-symmetric (or time-antisymmetric), whereas the approach of Evans and Searles does. While the arguments of the fluctuation relations are well defined theoretically, in some complex physical and chemical systems it might not be easy to identify the specific relationship for that property or to directly measure the correct property. This has led to many papers where a relationship for the property that appears in the argument is proposed (say B), and then the fluctuation relation is tested. In some PðBt ¼aÞ cases a linear relationship for the variation of ln PðB vs. a is obtained over a t ¼aÞ PðBt ¼aÞ restricted range of values of |a| r amax, but this is hardly surprising as ln PðB is t ¼aÞ an odd function of a. At best, these papers serve to test whether or not the property is appropriate and should not be interpreted as failure of the fluctuation theorems, or if a linear relationship with a slope other than the predicted unit slope is obtained, it does not provide evidence that the fluctuation relation is obeyed.

7. Conclusions We have presented brief descriptions of the relationships obtained from theorems about nonequilibrium systems: the ES FR, the Crooks FR and the JE. We also mention the GC FR in regard to its relationship to the ES FR. Over the years these relationships have increased our fundamental understanding of nonequilibrium systems and can explain how thermodynamic irreversibility emerges from underlying reversible equations of motion. This review describes recent work on the development and extensions of the relationships, tests of the relationships using simulation and experiment, and studies that have been carried out to understand physical and chemical phenomena. This area of research is growing rapidly. The relations are now well established and much work is being carried out on examining the utility and limitations. In the case of the JE, in particular, many adaptations have been proposed to improve its efficiency. Although it is highly unlikely to replace other techniques for free energy calculations in general, in some circ*mstances, particularly small systems where equilibration is slow, they will prove to be the method of choice. 202 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

As noted in several papers, the FR are particularly relevant for small systems and are therefore expected to play an important role in nanoscience and understanding biological, single-molecule processes. However, at this point the practical application in these areas has been limited,207 with the most common use being the use of the JE to calculate free energy landscapes for transport in biological systems, for which other methods are available. It will be of considerable interest to see how the FR can be used in more novel applications—predicting rare events, design of nanomachines, development of nanofluidics, etc. A new area that is starting to attract is the application of these relations and those derived from them to nonergodic systems such as glasses.71,147,216 An important spin-off from the work on fluctuation relations is that it has been clearly demonstrated that experimental techniques have developed to such an extent that we can now test nonequilibrium relations directly.30,78,217 It is anticipated that such experiments will continue to improve to provide a tool that can be used as an alternative, or complement to computer simulations in testing new relationships.

References 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41

D. J. Evans and D. J. Searles, Phys. Rev. E, 1994, 50, 1645. D. J. Evans and D. J. Searles, Phys. Rev. E, 1995, 52, 5839. D. J. Searles and D. J. Evans, J. Chem. Phys., 2000, 113, 3503. D. J. Evans and D. J. Searles, Adv. Phys., 2002, 51, 1529. D. J. Searles and D. J. Evans, Aust. J. Chem., 2004, 57, 1119. C. Jarzynski, Phys. Rev. Lett., 1997, 78, 2690. C. Jarzynski, Phys. Rev. E, 1997, 56, 5018. C. Jarzynski, J. Stat. Phys., 2000, 98, 77. C. Jarzynski, J. Stat. Mech.-Theory Exp., 2004, P09005. G. E. Crooks, J. Stat. Phys., 1998, 90, 1481. G. E. Crooks, Phys. Rev. E, 1999, 60, 2721. D. J. Searles, L. Rondoni and D. J. Evans, J. Stat. Phys., 2007, 128, 1337. J. K. Dreyer, K. Berg-Sørensen and L. Oddershede, Phys. Rev. E, 2006, 73, 051110. D. J. Evans, D. J. Searles and E. Mittag, Phys. Rev. E, 2001, 63, 051105. D. J. Searles and D. J. Evans, J. Chem. Phys., 2000, 112, 9727. D. J. Evans, D. J. Searles and L. Rondoni, Phys. Rev. E, 2005, 71, 056120. E. M. Sevick, R. Prabhakar, S. R. Williams and D. J. Searles, Annu. Rev. Phys. Chem., 2007, 59, 10.1146/annurev.physchem.58.032806.104555. D. J. Evans and D. J. Searles, Phys. Rev. E, 1996, 53, 5808. D. M. Carberry, J. C. Reid, G. M. Wang, E. M. Sevick, D. J. Searles and D. J. Evans, Phys. Rev. Lett., 2004, 92, 140601. D. J. Searles, G. Ayton and D. J. Evans, AIP Conference Series, 2000, 519, 271. J. Horowitz and C. Jarzynski, J. Stat. Mech.-Theory Exp., 2007, P11002. S. Paramore, G. S. Ayton and G. A. Voth, J. Chem. Phys., 2007, 126, 051102. D. J. Evans, E. G. D. Cohen and G. P. Morriss, Phys. Rev. Lett., 1993, 71, 2401. G. Gallavotti and E. G. D. Cohen, Phys. Rev. Lett., 1995, 74, 2694. G. Gallavotti and E. G. D. Cohen, J. Stat. Phys., 1995, 80, 931. P. Gaspard, Prog. Theor. Phys. Suppl., 2006, 165, 33. R. J. Harris and G. M. Schu¨tz, J. Stat. Mech.-Theory Exp., 2007, P07020. F. Ritort, J. Phys.-Condes. Matter, 2006, 18, R531. C. Bustamante, J. Liphardt and F. Ritort, Phys. Today, 2005, 58, 43. C. Bustamante, Q. Rev. Biophys., 2005, 38, 291. I. Tinoco, P. T. X. Li and C. Bustamante, Q. Rev. Biophys., 2006, 39, 325. L. Rondoni and C. Mejı´ a-Monasterio, Nonlinearity, 2007, 20, R1. J. Kurchan, J. Stat. Mech.-Theory Exp., 2007, P07005. P. Gaspard, Physica A, 2006, 369, 201. R. D. Astumian, Am. J. Phys., 2006, 74, 683. C. Jarzynski, Prog. Theor. Phys. Suppl., 2006, 165, 1. D. A. Kofke, Fluid Phase Equilibria, 2005, 228–229, 41. D. A. Kofke, Mol. Phys., 2006, 104, 3701. H. Xiong, A. Crespo, M. Marti, D. Estrin and A. E. Roitberg, Theor. Chem. Acc., 2006, 116, 338. G. Hummer and A. Szabo, Accounts Chem. Res., 2005, 38, 504. H. Meirovitch, Curr. Opin. Struct. Biol., 2007, 17, 181.

Chem. Modell., 2008, 5, 182–207 | 203 This journal is

c

The Royal Society of Chemistry 2008

42 G. M. Wang, E. M. Sevick, E. Mittag, D. J. Searles and D. J. Evans, Phys. Rev. Lett., 2002, 89, 050601. 43 G. Gallavotti, Mathematical Physics Electronic Journal, 1995, 1, 12pp. 44 F. Bonetto, G. Gallavotti, A. Giuliani and F. Zamponi, J. Stat. Phys., 2006, 123, 39. 45 R. J. Harris, A. Ra´kos and G. M. Schu¨tz, Europhys. Lett., 2006, 75, 227. 46 P. Visco, J. Stat. Mech.-Theory Exp., 2006, P06006. 47 F. Bonetto, G. Gallavotti and P. L. Garrido, Physica D, 1997, 105, 226. 48 S. R. Williams, D. J. Searles and D. J. Evans, J. Chem. Phys., 2006, 124, 194102. 49 M. Dolowschia´k and Z. Kova´cs, Phys. Rev. E, 2005, 71, 025202. 50 T. Gilbert, Phys. Rev. E, 2006, 73, 035102. 51 A. Puglisi, L. Rondoni and A. Vulpiani, J. Stat. Mech.-Theory Exp., 2006, P08010. 52 F. Zamponi, F. Bonetto, L. F. Cugliandolo and J. Kurchan, J. Stat. Mech.-Theory Exp., 2005, P09013. 53 D. J. Evans, Mol. Simul., 2005, 31, 389. 54 G. Gallavotti, Chaos, 2006, 16, 043114. 55 A. Giuliani, F. Zamponi and G. Gallavotti, J. Stat. Phys., 2005, 119, 909. 56 A. Puglisi, P. Visco, E. Trizac and F. van Wijland, Phys. Rev. E, 2006, 73, 021301. 57 P. Visco, A. Puglisi, A. Barrat, E. Trizac and F. van Wijland, Europhys. Lett., 2005, 72, 55. 58 P. Visco, A. Puglisi, A. Barrat, E. Trizac and F. van Wijland, J. Stat. Phys., 2006, 125, 533. 59 D. J. Searles and D. J. Evans, Phys. Rev. E, 1999, 60, 159. 60 D. J. Evans and G. P. Morriss, Statistical Mechanics of Nonequilibrium Liquids, Academic Press, New York, 1990. 61 S. R. Williams, D. J. Evans and E. Mittag, Comptes Rendus Physique, 2007, 8, 620. 62 O. Hess, C. Goddard and S. Hess, Physica A, 2006, 366, 31. 63 S. R. Williams, D. J. Searles and D. J. Evans, Phys. Rev. E, 2004, 70, 066113. 64 I. Bena, C. Van den Broeck and R. Kawai, Europhys. Lett., 2005, 71, 879. 65 E. Mittag, D. J. Evans and S.R. Williams, Pure and Applied Chemistry, 2007, 79, 1361. 66 R. Marathe and A. Dhar, Phys. Rev. E, 2005, 72, 066112. 67 T. Mai and A. Dhar, Phys. Rev. E, 2007, 75, 061101. 68 G. M. Wang, J. C. Reid, D. M. Carberry, D. R. M. Williams, E. M. Sevick and D. J. Evans, Phys. Rev. E, 2005, 71, 046142. 69 G. M. Wang, D. M. Carberry, J. C. Reid, E. M. Sevick and D. J. Evans, J. Phys.-Condes. Matter, 2005, 17, S3239. 70 A. Gomez-Marin and I. Pagonabarraga, Phys. Rev. E, 2006, 74, 061113. 71 S. R. Williams and D. J. Evans, J. Chem. Phys., 2007, 127, 184101. 72 S. R. Williams and D. J. Evans, Phys. Rev. Lett., 2006, 96, 015701. 73 C. Giardina`, J. Kurchan and L. Peliti, Phys. Rev. Lett., 2006, 96, 120603. 74 D. J. Evans, D. J. Searles and S. R. Williams, J. Chem. Phys., 2008, 128, 014504. 75 D. Andrieux and P. Gaspard, J. Stat. Mech.-Theory Exp., 2007, P02006. 76 T. Harada and S. Sasa, Phys. Rev. Lett., 2005, 95, 130602. 77 T. Speck and U. Seifert, Europhys. Lett., 2006, 74, 391. 78 V. Blickle, T. Speck, C. Lutz, U. Seifert and C. Bechinger, Phys. Rev. Lett., 2007, 98, 210601. 79 R. C. Dewar, J. Phys. A-Math. Gen., 2005, 38, L371. 80 S. Bruers, J. Phys. A-Math. Theor., 2007, 40, 7441. 81 M. Baiesi, T. Jacobs, C. Maes and N. S. Skantzos, Phys. Rev. E, 2006, 74, 021111. 82 R. van Zon, S. Ciliberto and E. G. D. Cohen, Phys. Rev. Lett., 2004, 92, 130601. 83 R. van Zon and E. G. D. Cohen, Phys. Rev. E, 2004, 69, 056121. 84 R. van Zon and E. G. D. Cohen, Physica A, 2004, 340, 66. 85 A. Dhar, Phys. Rev. E, 2005, 71, 036126. 86 J. C. Reid, D. M. Carberry, G. M. Wang, E. M. Sevick, D. J. Evans and D. J. Searles, Phys. Rev. E, 2004, 70, 016111. 87 C. Jarzynski and D. K. Wo´jcik, Phys. Rev. Lett., 2004, 92, 230602. 88 A. Gomez-Marin and J. M. Sancho, Phys. Rev. E, 2006, 73, 045101. 89 D. J. Searles and D. J. Evans, Int. J. Thermophys., 2001, 22, 123. 90 O. Golinelli and K. Mallick, J. Phys. A-Math. Gen., 2006, 39, 12679. 91 T. Taniguchi and E. G. D. Cohen, J. Stat. Phys., 2007, 126, 1. 92 J. Kurchan, arXiv:cond-mat/0007360v2, 2001. 93 T. Monnai and S. Tasaki, arXiv:cond-mat/0308337v1, 2003. 94 M. Esposito, U. Harbola and S. Mukamel, Phys. Rev. B, 2007, 75, 155316. 95 M. Esposito and S. Mukamel, Phys. Rev. E, 2006, 73, 046129. 96 W. De Roeck and C. Maes, Rev. Math. Phys., 2006, 18, 619.

204 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

97 R. C. Jennings, E. Engelmann, F. Garlaschi, A. P. Casazza and G. Zucchelli, Biochim. Biophys. Acta, 2005, 1709, 251. 98 J. Lavergne, Biochim. Biophys. Acta, 2006, 1757, 1453. 99 R. S. Knox and W. W. Parson, Biochim. Biophys. Acta, 2007, 1767, 1189. 100 D. M. Carberry, S. R. Williams, G. M. Wang, E. M. Sevick and D. J. Evans, J. Chem. Phys., 2004, 121, 8179. 101 V. Blickle, T. Speck, L. Helden, U. Seifert and C. Bechinger, Phys. Rev. Lett., 2006, 96, 070603. 102 O. Narayan and A. Dhar, J. Phys. A-Math. Gen., 2004, 37, 63. 103 N. Garnier and S. Ciliberto, Phys. Rev. E, 2005, 71, 060101. 104 F. Douarche, S. Joubaud, N. B. Garnier, A. Petrosyan and S. Ciliberto, Phys. Rev. Lett., 2006, 97, 140603. 105 S. Schuler, T. Speck, C. Tietz, J. Wrachtrup and U. Seifert, Phys. Rev. Lett., 2005, 94, 180602. 106 C. Tietz, S. Schuler, T. Speck, U. Seifert and J. Wrachtrup, Phys. Rev. Lett., 2006, 97, 050602. 107 D. Andrieux, P. Gaspard, S. Ciliberto, N. Garnier, S. Joubaud and A. Petrosyan, Phys. Rev. Lett., 2007, 98, 150601. 108 E. Seitaridou, M. M. Inamdar, R. Phillips, K. Ghosh and K. Dill, J. Phys. Chem. B, 2007, 111, 2288. 109 D. Ben-Amotz and J. M. Honig, Phys. Rev. Lett., 2006, 96, 020602. 110 D. Ben-Amotz and J. M. Honig, J. Phys. Chem. B, 2006, 110, 19966. 111 E. G. D. Cohen and D. Mauzerall, Mol. Phys., 2005, 103, 2923. 112 D. J. Evans, Mol. Phys., 2003, 101, 1551. 113 E. Scho¨ll-Paschinger and C. Dellago, J. Chem. Phys., 2006, 125, 054105. 114 J. Sung, Phys. Rev. E, 2007, 76, 012101. 115 J. C. Phillips, R. Braun, W. Wang, J. Gumbart, E. Tajkhorshid, E. Villa, C. Chipot, R. D. Skeel, L. Kale and K. Schulten, J. Comput. Chem., 2005, 26, 1781. 116 B. Palmieri and D. Ronis, Phys. Rev. E, 2007, 75, 011133. 117 J. B. Abrams, L. Rosso and M. E. Tuckerman, J. Chem. Phys., 2006, 125, 074115. 118 C. Chipot and J. He´nin, J. Chem. Phys., 2005, 123, 244906. 119 B. Ensing, M. De Vivo, Z. W. Liu, P. Moore and M. L. Klein, Accounts Chem. Res., 2006, 39, 73. 120 D. Branduardi, F. L. Gervasio and M. Parrinello, J. Chem. Phys., 2007, 126, 054103. 121 G. Bussi, A. Laio and M. Parrinello, Phys. Rev. Lett., 2006, 96, 090601. 122 P. W. Fowler, S. Geroult, S. Jha, G. Waksman and P. V. Coveney, J. Chem. Theory Comput., 2007, 3, 1193. 123 T. Rodinger and R. Pome`s, Curr. Opin. Struct. Biol., 2005, 15, 164. 124 S. A. Adco*ck and J. A. McCammon, Chem. Rev., 2006, 106, 1589. 125 H. Hu and W. Yang, Annual Review of Physical Chemistry, 2008, 59, 573. 126 B. Cleuren, C. Van den Broeck and R. Kawai, Phys. Rev. E, 2006, 74, 021117. 127 K. Wood, C. Van den Broeck, R. Kawai and K. Lindenberg, Phys. Rev. E, 2007, 75, 061116. 128 G. E. Crooks and C. Jarzynski, Phys. Rev. E, 2007, 75, 021116. 129 S. Piana, Nucleic Acids Res., 2005, 33, 7029. 130 A. Imparato, A. Pelizzola and M. Zamparo, Phys. Rev. Lett., 2007, 98, 148102. 131 P. Chvosta, P. Reineker and M. Schulz, Phys. Rev. E, 2007, 75, 041124. 132 H. Ge and M. Qian, J. Math. Phys., 2007, 48, 053302. 133 R. Kawai, J. M. R. Parrondo and C. Van den Broeck, Phys. Rev. Lett., 2007, 98, 080602. 134 A. Imparato and L. Peliti, Europhys. Lett., 2005, 70, 740. 135 A. Imparato and L. Peliti, Phys. Rev. E, 2005, 72, 046114. 136 A. Imparato and L. Peliti, Phys. Rev. E, 2006, 74, 026106. 137 G. N. Bochkov and Y. E. Kuzovlev, Zh. Eksp. Teor. Fiz., 1977, 72, 238. 138 A. M. Jayannavar and M. Sahoo, Phys. Rev. E, 2007, 75, 032102. 139 A. Baule, R. M. L. Evans and P. D. Olmsted, Phys. Rev. E, 2006, 74, 061117. 140 M. A. Cuendet, Phys. Rev. Lett., 2006, 96, 120602. 141 M. A. Cuendet, J. Chem. Phys., 2006, 125, 144109. 142 A. B. Adib, J. Chem. Phys., 2006, 124, 144111. 143 A. B. Adib, Phys. Rev. E, 2005, 71, 056128. 144 J. C. Reid, E. M. Sevick and D. J. Evans, Europhys. Lett., 2005, 72, 726. 145 B. Cleuren, C. Van den Broeck and R. Kawai, Phys. Rev. Lett., 2006, 96, 050601. 146 C. Chatelain, J. Stat. Mech.-Theory Exp., 2007, P04011. 147 S. R. Williams, D. J. Searles and D. J. Evans, in preparation, 2007. 148 R. Chelli, S. Marsili, A. Barducci and P. Procacci, Phys. Rev. E, 2007, 75, 050101. 149 R. Chelli, S. Marsili, A. Barducci and P. Procacci, J. Chem. Phys., 2007, 126, 044502.

Chem. Modell., 2008, 5, 182–207 | 205 This journal is

c

The Royal Society of Chemistry 2008

150 S. R. Williams, D. J. Searles and D. J. Evans, Mol. Phys., 2007, 105, 1059. 151 R. Chelli, S. Marsili, A. Barducci and P. Procacci, J. Chem. Phys., 2007, 127, 034110. 152 V. Y. Chernyak, M. Chertkov and C. Jarzynski, J. Stat. Mech.-Theory Exp., 2006, P08001. 153 T. Monnai, Phys. Rev. E, 2005, 72, 027102. 154 A. E. Allahverdyan and T. M. Nieuwenhuizen, Phys. Rev. E, 2005, 71, 066102. 155 P. Talkner and P. Ha¨nggi, J. Phys. A-Math. Theor., 2007, 40, F569. 156 P. Talkner, E. Lutz and P. Ha¨nggi, Phys. Rev. E, 2007, 75, 050102. 157 H. Hı´ jar, J. Quintana-H and G. Sutmann, J. Stat. Mech.-Theory Exp., 2007, P04010. 158 D. Collin, F. Ritort, C. Jarzynski, S. B. Smith, I. Tinoco and C. Bustamante, Nature, 2005, 437, 231. 159 C. P. Calderon, J. Chem. Phys., 2007, 126, 084106. 160 F. Costanzo, R. G. Della Valle and V. Barone, J. Phys. Chem. B, 2005, 109, 23016. 161 P. Procacci, S. Marsili, A. Barducci, G. F. Signorini and R. Chelli, J. Chem. Phys., 2006, 125, 164101. 162 G. Adjanor, M. Athe`nes and F. Calvo, Eur. Phys. J. B, 2006, 53, 47. 163 D. Wu and D. A. Kofke, J. Chem. Phys., 2005, 123, 054103. 164 D. Wu and D. A. Kofke, J. Chem. Phys., 2005, 123, 084109. 165 D. Wu and D. A. Kofke, J. Chem. Phys., 2005, 122, 204104. 166 J. MacFadyen and I. Andricioaei, J. Chem. Phys., 2005, 123, 074107. 167 W. Lechner and C. Dellago, J. Stat. Mech.-Theory Exp., 2007, P04001. 168 R. C. Lua and A. Y. Grosberg, J. Phys. Chem. B, 2005, 109, 6805. 169 M. R. Shirts and V. S. Pande, J. Chem. Phys., 2005, 122, 144107. 170 F. M. Ytreberg, R. H. Swendsen and D. M. Zuckerman, J. Chem. Phys., 2006, 125, 184114. 171 W. Lechner, H. Oberhofer, C. Dellago and P. L. Geissler, J. Chem. Phys., 2006, 124, 044113. 172 T. Schmiedl and U. Seifert, Phys. Rev. Lett., 2007, 98, 108301. 173 C. Oostenbrink and W. F. van Gunsteren, Chem. Phys., 2006, 323, 102. 174 H. Z. Li and W. Yang, Chem. Phys. Lett., 2007, 440, 155. 175 D. D. L. Minh, J. Phys. Chem. B, 2007, 111, 4137. 176 T. Lelie`vre, M. Rousset and G. Stoltz, J. Comput. Phys., 2007, 222, 624. 177 T. Bastug and S. Kuyucak, Chem. Phys. Lett., 2007, 436, 383. 178 T. Bastug and S. Kuyucak, J. Chem. Phys., 2007, 126, 105103. 179 W. Min, L. Jiang, J. Yu, S. C. Kou, H. Qian and X. S. Xie, Nano Lett., 2005, 5, 2373. 180 G. Hummer, J. Chem. Phys., 2001, 114, 7330. 181 M. O. Jensen, S. Park, E. Tajkhorshid and K. Schulten, Proc. Natl. Acad. Sci. USA, 2002, 99, 6731. 182 S. Presse´ and R. Silbey, J. Chem. Phys., 2006, 124, 054117. 183 V. Schettino, R. Chelli, S. Marsili, A. Barducci, C. Faralli, M. Pagliai, P. Procacci and G. Cardini, Theor. Chem. Acc., 2007, 117, 1105. 184 F. Liu, H. Tong and Z. C. Ou-Yang, Biophys. J., 2006, 90, 1895. 185 D. D. L. Minh, Phys. Rev. E, 2006, 74, 061120. 186 M. Orzechowski and P. Cieplak, J. Phys.-Condens. Matter, 2005, 17, S1627. 187 M. Gao, M. Sotomayor, E. Villa, E. H. Lee and K. Schulten, Phys. Chem. Chem. Phys., 2006, 8, 3692. 188 D. K. West, P. D. Olmsted and E. Paci, J. Chem. Phys., 2006, 125, 204910. 189 A. Aird, J. Wrachtrup, K. Schulten and C. Tietz, Biophys. J., 2007, 92, 23. 190 A. Bidon-Chanal, M. A. Marti, A. Crespo, M. Milani, M. Orozco, M. Bolognesi, F. J. Luque and D. A. Estrin, Proteins, 2006, 64, 457. 191 R. E. Amaro, R. S. Myers, V. J. Davisson and Z. A. Luthey-Schulten, Biophys. J., 2005, 89, 475. 192 H. Hwang, G. C. Schatz and M. A. Ratner, J. Phys. Chem. B, 2006, 110, 26448. 193 M. O. Jensen, Y. Yin, E. Tajkhorshid and K. Schulten, Biophys. J., 2007, 93, 92. 194 E. Khurana, S. O. Nielsen, B. Ensing and M. L. Klein, J. Phys. Chem. B, 2006, 110, 18965. 195 Y. Wang, K. Schulten and E. Tajkhorshid, Structure, 2005, 13, 1107. 196 M. A. Marti, A. Crespo, L. Capece, L. Boechi, D. E. Bikiel, D. A. Scherlis and D. A. Estrin, J. Inorg. Biochem., 2006, 100, 761. 197 S. Vemparala, L. Saiz, R. G. Eckenhoff and M. L. Klein, Biophys. J., 2006, 91, 2815. 198 A. Crespo, M. A. Marti, D. A. Estrin and A. E. Roitberg, J. Am. Chem. Soc., 2005, 127, 6940. 199 Y. Mu and X. Y. Song, J. Chem. Phys., 2006, 124, 034712. 200 D. Andrieux and P. Gaspard, J. Stat. Mech.-Theory Exp., 2006, P01011. 201 D. Andrieux and P. Gaspard, J. Stat. Phys., 2007, 127, 107.

206 | Chem. Modell., 2008, 5, 182–207 This journal is

c

The Royal Society of Chemistry 2008

202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217

D. Andrieux and P. Gaspard, Phys. Rev. E, 2006, 74, 011906. T. Schmiedl and U. Seifert, J. Chem. Phys., 2007, 126, 044101. T. Schmiedl, T. Speck and U. Seifert, J. Stat. Phys., 2007, 128, 77. K. H. Kim and H. Qian, Phys. Rev. E, 2007, 75, 022102. J. Liphardt, S. Dumont, S. B. Smith, I. Tinoco and C. Bustamante, Science, 2002, 296, 1832. A. Noy, Surf. Interface Anal., 2006, 38, 1429. F. Douarche, S. Ciliberto and A. Petrosyan, J. Stat. Mech.-Theory Exp., 2005, P09011. J. Bisquert, Phys. Rev. E, 2005, 72, 056115. P. Gaspard, New J. Phys., 2005, 7, 77. D. J. Pine, J. P. Gollub, J. F. Brady and A. M. Leshansky, Nature, 2005, 438, 997. A. Porporato, J. R. Rigby and E. Daly, Phys. Rev. Lett., 2007, 98, 094101. W. P. Wong and E. Evans, Nature, 2005, 437, 198. A. Gomez-Marin and J. M. Sancho, Phys. Rev. E, 2006, 74, 063102. F. van Wijland, Phys. Rev. E, 2006, 74, 063101. A. E. Allahverdyan and T. M. Nieuwenhuizen, Phys. Rev. E, 2007, 75, 051124. L. M. Ying, Mol. Biosyst., 2007, 3, 377.

Chem. Modell., 2008, 5, 182–207 | 207 This journal is

c

The Royal Society of Chemistry 2008

Many-body perturbation theory and its application to the molecular structure problem S. Wilsonab DOI: 10.1039/b608836h

1. Introduction At its most fundamental, chemistry is a quantum mechanical subject. The properties of molecules and interactions between molecular systems are determined by the solutions of the Schro¨dinger equation (or its relativistic generalizations) describing the component electrons and nuclei. Approximate solutions to the Schro¨dinger equation are developed by first decoupling the motions of the electrons from those of the nuclei by invoking the Born-Oppenheimer model. The development of approximations to the solutions of the resulting electronic Schro¨dinger equation are the central problem of molecular electronic structure theory. This problem is most usually attacked in two stages. In the first stage, some independent electron model is assumed in which each electron is assumed to move in some mean field. HartreeFock theory is the most widely used independent electron model. The second and most challenging stage of a molecular electronic structure study takes account of electron correlation effects, that is, the corrections to independent electron models attributable to instantaneous interactions between electrons. The ‘traditional’ approach to the electron correlation problem is the method of configuration interaction, C.I., (sometimes called configuration mixing). Over the past twenty years, the configuration interaction approach has been largely superceded by ‘manybody’ methods. This report covers applications of and developments in the theory of the many-body perturbative approach to the molecular structure problem during the period June 2005 through to May 2007. Applications of many-body perturbation theory in its lowest order form, which is often designated MP2 w, continue to make it the most widely used of the ab initio approaches to the molecular electronic structure problem which go beyond an independent particle model and take account of the effects of electron correlation. During the reporting period, applications have been reported in an ever-increasing range of research areas and the main focus of this review is on some of the emerging fields in which MP2 calculations are being carried out. Obviously, within the limited space available it is not possible to cover all of the fields of application. To some extent, the choices made reflect my personal taste, but it is to be hoped that they provide a ‘‘snapshot’’ of the range of contemporary applications of chemical modelling using many-body perturbation theory. This review continues four earlier biennial reviews published in this series1–4 in 2000, 2002, 2004 and 2006. In turn, these reviews built on my report, written almost three decades ago, for a previous Specialist Periodical Reports series (Theoretical Chemistry) and published in 1981.5 This earlier review was one of the first to survey the application of ‘many-body’ methods, and, in particular, many-body perturbation theory to the molecular structure problem. a

Physical & Theoretical Chemistry Laboratory, University of Oxford, South Parks Road, Oxford OX1 3QZ b Department of Chemical Physics, Faculty of Mathematics, Physics and Informatics, Comenius University, 84215, Bratislava, Slovakia. E-mail: [emailprotected] { Møller-Plesset theory through second order, after Møller and Plesset who first considered the application of second order Rayleigh-Schro¨dinger perturbation theory with respect to a Hartree-Fock model reference function

208 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

We begin this report with a very brief overview of my previous reports to this series since, unlike the present report, previous reports have concentrated on the development of new ‘many-body’ theory and its practical realization through computation. This overview is given in section 2. The present review is almost exclusively focussed on applications of many-body perturbative methods in chemical modelling and it is to this application-orientated research that we turn our attention in section 3 and the sections which follow. In developing new methodology, the primary focus of research is often, quite rightly, on the introduction of improved approximations. Approximations may be improvements on existing techiques because they support higher accuracy or because they display greater computational efficiency allowing the study of larger and/or more complex systems. In supporting new application areas, attention is focussed on the development of ‘‘theoretical model chemistries’’ or what may be described as a ‘‘numerical spectrometer’’. This ‘‘numerical spectrometer’’ places computational quantum chemistry along side other probes of matter on an atomic and molecular scale. It is to the development and successful exploitation of these ‘‘numerical spectrometers’’ z that we devote section 3. In section 4, we turn our attention to the many applications of second order manybody perturbation theory in chemical modelling. We provide a brief synopsis of the applications of ‘‘MP2’’ theory during the reporting period. In the following four sections, sections 5, 6, 7 and 8, we present more detailed descriptions of four specific application areas. In section 5, we describe the application of many-body perturbation theory to periodic systems. In section 6, we turn our attention to the application of many-body perturbation theory to biochemical molecular systems, in particular, DNA bases and amino acids. The study of the structure and function of biological macromolecules is, at present, dominated by molecular mechanics techniques which are invariably built on empirical force field. A third application area, which we consider in section 7, is in the ‘‘benchmarking’’ of DFT-based methods. DFT-based methods do not handle the two-electron interactions explicitly and thus have lower computational cost and therefore a wider range of applicability. However, practical forms of DFT involve a considerable amount of empirical parameterization. Furthermore, there is no clear defined procedure leading to the convergence of DFT methods to the right answer for the right reason. The final application area considered in some detail is basis set extrapolation and the calibration of general energy models. This is the focus of section 8. A summary is given in section 9 where future directions are briefly surveyed.

2. An overview of previous reports This, being the fifth in a series of biennial reports to the series Chemical ModellingApplications and Theory, it brings to ten years the period for which detailed reviews of the literature of many-body perturbation theory and its application to the molecular structure problem have been presented. (The first report, published in 2000, updated a previous report written some twenty years earlier for another series. For the first and second reports only applications of many-body perturbation theory to the problem of molecular electronic structure were considered, but with the third and subsequent reports the word ‘‘electronic’’ was dropped to indicate a broader remit). After a decade, it seems appropriate to begin this report by identifying the key advances in applications and in theory over the past decade before turning to the primary purpose of this present report which is to review the literature for the period June, 2005, to May, 2007. { The plural is used here since different ‘‘theoretical model chemistries’’ give rise to different ‘‘spectrometers’’. In rough terms, the methods and approximations which constitute different ‘‘theoretical model chemistries’’ can be liken to the different component parts of a spectrometer.

Chem. Modell., 2008, 5, 208–248 | 209 This journal is

c

The Royal Society of Chemistry 2008

The first report in this series was published in 2000 and reviewed the literature up to June 1999. The developments considered included: the analysis and evaluation of higher order terms in the many-body perturbation series. the formulation of many-body perturbation theory within the framework of relativistic quantum mechanics. the systematic implementation of the algebraic approximation with particular reference to many-body perturbation theory studies using finite basis set approximations. the exploitation of concurrent computing techniques in electron correlation calculations based on many-body perturbation theory. Published in 2002, the second report covered the period June 1999 to May 2001 and provided an opportunity to review the wide range of applications to which many-body perturbation theory in its simplest form, that is Møller-Plesset perturbation theory through second order, was being put at the turn of the millennium. The main development considered in this second report were: ‘‘MP2’’ theory and its current status in the landscape of many-body perturbation theory-based techniques. the remainder term in low-order perturbation theory the choice of zero order Hamiltonian operator in many-body perturbation theory. the status of theory for handling problems requiring a multireference formulation of the electron correlation problem based on Brillouin-Wigner perturbation theory. The period between June 2001 and May 2003 was reviewed in the third report which was published in 2004. This was the report for which the word ‘‘electronic’’ was dropped from the title and the application of ‘many-body’ methods to nuclear motion considered. The developments considered in this report were: the development of many-body perturbation theory to simultaneously describe both the electronic and the nuclear motion in molecular systems. the use of low order approximants (Pade´ and quadratic Pade´ approximants) in many-body perturbation theory studies. the development of local formulations of the electronic structure problem based on second order many-body perturbation theory for handling large molecules. A review of the wide range of applications of ‘‘MP2’’ theory-based methods was also given for the period covered by this review. The fourth report, immediately prior to this, was published in 2006 and covered the molecular many-body perturbation theory literature for the period beginning June 2003 and ending May 2005. The report focussed on two important and related areas of early twentieth century science: computation and supercomputation, and complexity. The former are becoming increasingly important in the realization of practical routes to useful chemical information from the basic equations of quantum mechanics. The latter is recognized as an important characteristic of the molecular 210 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

entities and materials which are of interest in modern chemistry. In the discussion of computation and supercomputation, we gave particular emphasis to literate programming techniques and the potential that they have for a new approach to computational quantum chemistry, providing as they do a firm bridge between theoretical developments and their practical realization. The complexity of molecular systems was considered under the headings: large systems, relativistic formulations, multireference formalisms, and the theoretical description of multicomponent systems. This fourth report also contained a synopsis of applications carried out during the period under review.

3. Applications The main emphasis of this report is on the applications of many-body perturbation theory in chemical modelling, whereas previous reports have concentrated on new theory and its practical realization through computation. Application-orientated research using many-body perturbation theory adopts a different perspective to that assumed in methodological work. As was stated in the introduction, in supporting new application areas, attention is focussed on the development of ‘‘theoretical model chemistries’’ or what may be described as a ‘‘numerical spectrometer’’. This ‘‘numerical spectrometer’’ places computational quantum chemistry alongside other probes of matter on an atomic and molecular scale. It is to the development and successful exploitation of these ‘‘numerical spectrometers’’ that we devote this section whilst in the following sections we consider the many applications of second order many-body perturbation theory in chemical modelling and some specific application areas. We begin this section by considering the concept of a ‘‘theoretical model chemistry’’ and the development of a ‘‘numerical spectrometer’’ in section 1. In section 2, we briefly consider the complementary probes of matter which are used in modern science to elucidate the structure and properties of matter on an atomic scale. In section 3, we discuss how the very concept of atoms and molecules depends on the probes employed and how in modern science, computers provide one of the most powerful probes. The complementarity of different probes of matter is briefly described in section 4. The different perspectives given by complementary probes are emphasised and the greater potency acquired when probes are used in conjunction in a ‘‘problem based’’ environment is underlined. We turn our attention to the ‘knowledge base’ of quantum chemistry and, in particular, quantum chemical many-body perturbation theory in sections 5, 6 and 7. In section 5, we review the various texts into which our knowledge of the many-body problem in molecules has been distilled. These text books are complemented by the computer software and hardware which form essential ingredients of the ‘knowledge base’, in that they translate from the printed page to practical schemes of computation. A computer program6 ‘‘often possesses a structure and is characterised by details not adequately described by conventional mathematical notation. This is largely a consequence of the fact that the execution of a program is a dynamic process, in which the sequence of operations is of prime importance.’’ In section 6, available computer software for carrying out many-body perturbation theory calculations for molecular systems is described. Whereas a printed page can be copied with little regard to the nature of the material, usually paper, on to which it is copied, the correct execution of computer software is usually highly dependent on the hardware upon which it is implemented. In section 7, we discuss the hardware requirements for computational many-body perturbation theory for molecules. 3.1 Computational quantum chemistry: numerical spectrometer The approximate solution of the electronic Schro¨dinger equation is central to the study of molecular structure. Following the advent of the electronic digital computer Chem. Modell., 2008, 5, 208–248 | 211 This journal is

c

The Royal Society of Chemistry 2008

in the mid-twentieth century, it was recognised that the computer offered a route to approximate solutions of the Schro¨dinger equation for molecular systems and a sustained effort to deveop the necessary theoretical and computational apparatus was undertaken. In a famous after dinner speech delivered at the Boulder Conference in 1959, the leading British Theoretical Chemist of the day, Professor Charles A. Coulson,7 predicted that ‘‘the whole group of theoretical chemists is on the point of splitting into two parts. . . . In its simplest form this difference was associated with the large-scale use of electronic computers, though . . . I think there is a deeper aspect of it than just this.’’ He named the two groups ‘‘group I (electronic computors) and group II (nonelectronic computors)’’ but suggested that the alternative designations ‘‘ab initio-ists and a posteriori-ists’’ might be more appropriate. Today, these ‘‘ab initio-ists and a posteriori-ists’’ have evolved into two groups who both use computers in their research. Computing is now ubiquitous in chemical modelling. The modern groups differ in their approach to the development, calibration and exploitation of quantum chemical models. In contemporary research, the study of molecular structure using ab initio methodology can be divided into two conceptually different approaches. In their well-known text ‘‘Ab inito Molecular Orbital Theory’’,8 published in 1986, J. A. Pople et al. describe the first of these two approaches as that in which ‘‘each problem is examined at the highest level of theory currently feasible for a system of its size’’ In the second approach, ‘‘a level of theory is first clearly defined after which it is applied uniformly to molecular systems of all sizes up to a maximum determined by available computational resources.’’ ‘‘Such a theory, if prescribed uniquely for any configuration of nuclei and any number of electrons, may be termed a theoretical model, within which all structures, energies, and other physical properties can be explored once the mathematical procedure has been implemented through a computer program.’’ This approach results in what Pople and his co-workers term a ‘‘theoretical model chemistry’’. Systematic comparison of the results obtained by application of this model with corresponding experimental data allows the model to acquire some predictive capability in situations where experiment is either difficult or impossible, or simply too expensive. In recent years, a range of theoretical methods have become available which provide a hierarchy of techniques for the molecular electronic structure problem. For example, full CI (configuration interaction) provides an exact solution of the electronic Schro¨dinger equation within a given basis set. The computational demands of the full CI method dictate that only basis sets of very modest size can be employed and the basis set truncation errors are very large indeed. On the other hand, calculations using systematic sequences of basis sets, an approach pioneered in the late 1970s, provides a technique for estimating the basis set limit for theoretical methods which are less demanding than full CI. The development of a theoretical model chemistry can involve the comparison of some ‘high-level’ (i.e. more accurate, but usually more computationally demanding) method with some ‘lower-level’’ technique which might be less accurate but which is also less 212 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

demanding of computer resources and can therefore be applied more widely and also to larger systems. The past twenty years have witnessed a relentless increase in the power of computing machines. It has been observedy that the processing power of computers seems to double every eighteen months. This has been dubbed ‘‘Moore’s Law’’. As J. M. Roberts9 points out ‘‘No other technology has ever improved so rapidly for so long.’’ This ever-increasing computing power has led to higher accuracy in molecular electronic structure calculations, often because larger basis sets can be utilized. It has also has opened up the possibility of applications to larger molecules and molecular systems. Moore’s ‘law’ coupled with advances in theory and computer algorithms has led to molecular modelling becoming increasingly competitive as well as providing a complementary probe of matter. The understanding of the structure, behavior, and interactions of atoms and molecules is crucial to progress in modern science. Over the past thirty years, computational quantum chemistry has become part of the broader field of computational chemistry. Computational chemistry is itself one of the computational sciences. Today’s computers and supercomputers are revealing the intricacies of matter in unprecedented detail.

3.2 Complementary probes of matter Computational science is one of a suite of powerful techniques capable of elucidating the structure and properties of matter on an atomic scale. The value of computational science lies in the unique capabilities of the computer as a non-destructive and non-invassive probe of matter. Computational science is often complementary to probes employed in laboratory experiments. When applied in a ‘‘problem based’’, as opposed to a ‘‘technique based’’, approach to a given challenge, each technique affords a different perspective. Different techniques give complementary information about the studied system; the sum of the information gained is greater than that obtained from the techniques separately. The complexity inherent in many new forms of matter suggests that a single probe is unlikely to provide a complete understanding of a given problem. This complexity carries with it the need for complementary probes. Computational science often relates information obtained from different experimental probes of matter.

3.3 Probes of matter The concept of atoms and molecules emerged in the earliest days of modern chemistry from meticulous measurements of mass and volume. These were the only probes of matter available to early workers. The reality of atoms and molecules was established both by the explanatory power of the ‘atomic’ model and by experimental evidence of microscopic entities, with Einstein’s famous interpretation of Brownian motion in 1905 providing irrefutable confirmation of the atomic hypothesis. Today, science has an armoury of powerful probes to study matter at a range of scales from viral proteins to sub-atomic quarks. In spectroscopy, these probes are electromagnetic radiation, whilst in scattering experiments they are particles. Waveparticle duality allows electrons, for example, to be used in either mode. Each probe affords a unique perspective on the nature of the sample under investigation and different probes interact with the sample in different ways. } by Mr G. E. Moore, co-founder Intel, Electronics, 38, No. 8, April 19, 1965.

Chem. Modell., 2008, 5, 208–248 | 213 This journal is

c

The Royal Society of Chemistry 2008

The the use of a computational approach provides a unique and non-invasive probe of matter. The computer is often a more cost-effective and/or less hazardous probe. However, the complexity of the systems studied coupled with the complexity of the equations which describe them always necessitates the introduction of approximations and the construction of models. This limits the applicability of the computational approach, but, as we shall demonstrate in this report, this is a limitation which is continually being push back both by developments in computational quantum chemistry itself and advances in computing technology.

3.4 Complementarity Whilst computational science on its own provides a unique and powerful probe of matter on an atomic scale, it often acquires greater potency when used in conjunction with other techniques, such as synchrotron radiation, nuclear magnetic resonance, transmission electron microscopy or neutron scattering. Complementarity between experimental techniques is well established. For example, both X ray and neutron scattering scattering are immensely valuable in the study of the structure, bonding, and function of molecules. However, there are important differences between the two methods, which make them complementary. X-ray photons interact primarily with electrons whereas neutrons interact primarily with nuclei. Heavier atoms, with more electrons, scatter X-rays more strongly than light atoms. X-ray diffraction is not, therefore, so well suited to locating light atoms, such as hydrogen. Yet the identification of hydrogen atoms and hydrogen bonds is fundamental to the understanding of many materials, for example, the folding, stability and function of biomolecules. Neutron scattering from nuclei varies almost randomly across the Periodic Table and is particularly strong for hydrogen and, in particular, its isotope deuterium. Neutrons are therefore able to ‘‘see’’ hydrogen atoms. Computational studies complement experimental methods. Complementarity’ is at the heart of a ‘problem-based’ approach to the study of matter at a microscopic level, which is increasingly seen as more productive than traditional ‘technique-based’ approaches. Complementary experimental techniques can each give a different perspective on a specific ‘problem’, since they probe the studied system in different ways. Computer simulation and modelling can provide an alternative route to information which may be difficult and/or expensive and/or hazardous to obtain experimentally. Equally, they can also help to interpret and relate results obtained from different experimental techniques. Increasingly modern science is concerned with complex systems, for example, the layered copper-oxide compounds displaying high temperature superconductivity, the folding and interaction of biomacromolecules which is influenced not only by the solvent but also by intra- and inter-molecular hydrogen bonding. With complexity, materials may gain new emergent properties which are not possessed by their component parts. However, the complexity inherent in many new forms of matter suggests that a single probe is unlikely to provide a complete understanding of a given system. Complexity carries with it the need for complementarity. Other techniques including X-ray and optical spectroscopy, NMR, and electron and proximal probe microscopies (STM-scanning tunneling microscopy, SFM-scanning force microscopy, NSOM-near-field scanning optical microscopy) are required to build up a more complete picture. With this requirement comes the need for an integrated, problemsolving approach. The great flexibility of the computational approach makes it a most powerful probe of the microscopic nature of matter which will continue to reveal key insights. 214 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

3.5 The knowledge base: text books Any application of the methods of computational quantum chemistry and the broader field of computational chemistry is based on the available knowledge and although this is contained in the primary literature it is at its most accessible in textbooks. There are a number of textbooks at various levels from undergraduate to research frontier. In reverse chronological order, these are: S. Wilson, Electron Correlation in Molecules, Dover Publications, New York, 200710 F. Jensen, Introduction to Computational Chemistry, 2nd Edition, John Wiley, Chichester, 200611 C. J. Cramer, Essentials of Computational Chemistry, 2nd Edition, John Wiley, Chichester, 200412 P. W. Atkins and R. S. Friedman, Molecular Quantum Mechanics, Oxford University Press, 200413 G. H. Grant and W. G. Richards, Computational Chemistry, Oxford University Press, 200414 A. Hinchliffe, Molecular Modelling for Beginners, John Wiley, Chichester, 200315 Handbook of Molecular Physics and Quantum Chemistry, eds. S. Wilson, P. F. Bernath and R. McWeeny, John Wiley, Chichester, 200316 D. Young, Computational Chemistry: A Practical Guide for Applying Techniques to Real World Problems, John Wiley, Chichester, 200117 T. Helgaker, P. Jorgensen and, J. Olsen, Molecular Electronic Structure Theory, John Wiley, Chichester, 200018 A. Hinchliffe, Modelling Molecular Structures, John Wiley, Chichester, 200019 D. B. Cook, Handbook of Computational Quantum Chemistry, Oxford University Press, 199820 Encyclopedia of Computational Chemistry, eds. P. von Rague´ Schleyer (Editorin-Chief), N. L. Allinger, H. F. Schaefer III, T. Clark, J. Gasteiger, P. Kollman and P. Schreiner, John Wiley, Chichester, 199821 A. Szabo and N. S. Ostlund, Modern Quantum Chemistry: Introduction to Advanced Electronic Structure Theory, 2nd Edition, Dover Publications, New York, 199622 F. E. Harris, D. L. Freeman and H. J. Monkhorst, Algebraic and Diagrammatic Methods in Many-Fermion Theory, Oxford University Press, 199223 R. McWeeny, Methods of Molecular Quantum Mechanics, 2nd Edition, Academic Press, 199224 Methods in Computational Chemistry: Electron Correlation in Atoms and Molecules, ed. S. Wilson, Kluwer Academic/Plenum Press, Dordrecht 198725 Chem. Modell., 2008, 5, 208–248 | 215 This journal is

c

The Royal Society of Chemistry 2008

I. Lindgren and J. Morrison Atomic Many-Body Theory, Springer-Verlag, Berlin, 198626 W. J. Hehre, L. Radom, P. V. Schleyer and J. A. Pople, Ab initio Molecular Orbital Theory, John Wiley, Chichester, 198627 S. Wilson, Chemistry by Computer: An Overview of the Applications of Computers in Chemistry, Kluwer Academic/Plenum Publishers, Dordrecht, 198628 A. C. Hurley, Electron Correlation in Small Molecules, Academic Press, 197729 A. C. Hurley Introduction to the Electron Theory of Small Molecules, Academic Press, 197730 The following series contain reviews of aspects of comutational quantum chemistry: Computational Chemistry: Reviews of Current Trends, ed. J. Leszczynski, World Scientific, 1996-31 Reviews in Computational Chemistry, eds. K. B. Lipkowitz, D. B. Boyd, [except vol. 21–22] R. Larter [vols. 19–21] and T. R. Cundari [vols. 19–23], V. J. Gillet [vol. 22], Wiley, 1990-32 There are also a number of volumes dealing with the problem of describing atoms and molecules within a relativistic formalism. Again in reverse chronological order these are: K. G. Dyall, and K. Faegri, Introduction to Relativistic Quantum Chemistry, Oxford University Press, 200733 I. P. Grant, Relativistic Quantum Theory of Atoms and Molecules: Theory and Computation, Springer-Verlag, New York, 200634 Relativistic Electronic Structure Theory. Part 2. Applications, ed. P. Schwerdtfeger, Elsevier, Amsterdam, 200435 Recent Advances in Relativisitic Molecular Theory, eds. K. Hirao and Y. Ishikawa, World Scientific Publishing, Singapore, 200436 Theoretical Chemistry and Physics of Heavy and Superheavy Elements, eds. U. Kaldor and S. Wilson, Kluwer Academic Publishers, Dordrecht, 200337 Relativistic Effects in Heavy Element Chemistry and Physics, ed. B. A. Hess, John Wiley, Chichester, 200238 Relativistic Electronic Structure Theory. Part 1. Fundamentals, ed. P. Schwerdtfeger, Elsevier, Amsterdam, 200239 K. Balasubramanian, Relativistic Effects in Chemistry: Theory and Techniques and Relativistic Effects in Chemistry: Theory and Techniques Pt. A, John Wiley, Chichester, 199740 K. Balasubramanian, Relativistic Effects in Chemistry: Applications: Applications Pt. B, John Wiley, Chichester, 199741 216 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

The Effects of Relativity in Atoms, Molecules and the Solid State, eds. S. Wilson, I. P.Grant and B. L. Gyorffy, Kluwer Academic/Plenum Press, Dordrecht, 199142 Methods in Computational Chemistry: Relativistic Effects in Atoms and Molecules, ed. S. Wilson, Kluwer Academic/Plenum Press, Dordrecht, 198943 Relativistic Effects in Atoms, Molecules, and Solids, ed. G. L. Malli, 198344

3.6 The knowledge base: computer software The text books, encyclopedias and handbooks listed in the previous section are complemented by the computer software and hardware through which practical schemes of computation are realized. In this section, we consider the computer software for carrying out molecular many-body perturbation theory calculations. The computer hardware appropriate for such calculations is considered in section 3.7. That computation would play a central role in practical applications of quantum chemistry was recognized in the 1950s, when, for example, in a now famous article entitled ‘‘Broken Bottlenecks and the Future of Molecular Quantum Mechanics’’, Mulliken and Roothann recognized that in ref. 45 ‘‘A major and indeed crucial step beyond the development of formulas for molecular integrals was the programming for large electronic digital computers of the otherwise still excessively time-consuming numerical computation of these integrals, and of their combination to obtain the desired molecular wave functions and related molecular properties; The pioneering work in this field was that of S. F. Boys at Cambridge, England.’’ Mulliken and Roothaan continued with the following prophecy which has largely been fulfilled45 ‘‘It can now be predicted with confidence that machine calculations will lead gradually toward a really fundamental quantitative understanding of the rules of valence theory and the exceptions to these; toward a real understanding of the dimensions and detailed structures, force constants, dipole moments, ionization potentials, and other properties of stable molecules and equally of unstable radicals, anions, and cations, and chemical reaction intermediates; toward a basic understanding of activated states in chemical reactions, and of triplet and other excited states which are important in combustion and explosion processes and in photochemistry and in radiation chemistry; also of intermolecular forces; further, of the structure and stability of metals and other solids; of those parts of molecular wave functions which are important in nuclear magnetic resonance, nuclear quadrupole coupling, and other interactions involving electrons and nuclei; and of very many other aspects of the structure of matter which are now understood only qualitatively or semiempirically. From the wave functions of ground and excited states, spectroscopic transition probabilities and lifetimes of excited states can also be computed; these and other items of information which can be obtained for molecules and radicals and are especially difficult to obtain experimentally are important in an understanding of upper atmosphere phenomena and in astrophysics.’’ In our introduction to section 3, we emphasized that a computer program6 ‘‘often possesses a structure and is characterised by details not adequately described by conventional mathematical notation.’’ The algorithms employed in quantum chemistry are central to its practical application and therefore should be regarded as an integral part of the literature of the field. In a previous report, we have considered Chem. Modell., 2008, 5, 208–248 | 217 This journal is

c

The Royal Society of Chemistry 2008

the effective communication of ideas and concepts embodied in quantum chemistry computer software. We have emphasized, following Ziman46 that ‘‘Research results do not count as scientific unless they are reported, disseminated, shared, and eventually transformed into communal property by being formally published.’’ Ziman47 recognizes that ‘‘ . . . what is distinctive about formal scientific communication is neither the medium nor the message: it is that it is published’’ the essential point being that ‘‘ . . . it is fully and freely available for open criticism and constructive use.’’ Unfortunately, as we explained in our previous report,4 ‘‘much contemporary quantum chemistry software does not satisfy these requirements’’ of transparency, accountability and, therefore, accessibility. The Quantum Chemistry Program Exchange (QCPE)48 was an early attempt to formalize the distribution of quantum chemistry software. It was created in 1962 by H. Shull ay Indiana University and offered a small collection of some 33 pieces of software. Another attempt to formalize the distribution of software for atomic and molecular physics is the journal Computer Physics Communications49 K. V. Roberts, Comput. Phys. Commun., 1969, 1, 1 which published both scientific papers and computer programs. In spite of the continued success of this journal, it has nevertheless has to be observed that few complete quantum chemical packages have been published via this mechanism, although there are exceptions. For example, the HONDO code, ‘‘The general atomic and molecular electronic structure system HONDO: version 7.0’’, was published in Computer Physics Communications in 1989.50 In some of our recent work,51–53 in our previous report4 and in the volume by Cook,54 the use of literate programming to provide a mechanism for introducing higher standards of code documentation and thus greater levels of collaboration in quantum chemistry code development and distribution is discussed. Literate programming combines the theoretical development of a particular model with the associated computer code. In 1984, D. E. Knuth published his seminal paper55 entitled ‘‘Literate programming’’ in The Computer Journal. However, this approach has not been widely adopted in spite of its widely recognized advantages in the computer science and computational science communities. In this section, we restrict our attention to the available computer software for carrying out many-body perturbation theory calculations for molecular systems. The first published quantum chemical code for carrying out molecular many-body perturbation theory calculations was due to D. M. Silver56,57 and the present author.58 The package, which was originally published in Computer Physics Communications in 1978, was written in FORTRAN code for an IBM 360/91 computer. The Computer Physics Communication library numbersz are:

} http://cpc.cs.qub.ac.uk/cpc/

218 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

Together these programs calculate the components of the correlation energy corresponding to all second and third order diagrammatic terms. The code was heavily overlaid and restricted in the size of problem that could be handled. They were:56 ‘‘ . . . restricted to non-degenerate, closed-shell ground- states of atoms and molecules. The reference wavefunction must be closed-shell matrix Hartree-Fock single-determinantal wavefunction. Program dimension statements limit the basis set size to 10 occupied spatial orbitals (20 virtual electrons) and 25 unoccupied spatial orbitals (50 virtual states): these dimensions can easily be increased if necessary.’’ Today, quantum chemical code for carrying out molecular many-body perturbation theory calculations is made available as a component of many ‘‘black box’’ computer packages. Although, it is obviously possible to build many-body perturbation theory code from routines taken from ‘‘standard libraries’’ such as the NAG Library8 or the ‘‘Numerical recipes’’59 volumes* (and this may well be done in development work), most contemporary applications exploit careful tailored packages which can handle the whole molecular electronic structure problem by means of an interfaced suite of routines for carrying out the various stages of the calculation. Often the complexity of quantum chemical calculations demands the use of code specifically tailored to the algorithm being invoked and the hardware platform being employed. We note, however, that these carefully tailored packages often make use of optimized code for certain computational kernels. For example, the Basic Linear Algebra Subroutines (BLAS) are often exploited. Some use is made of software tools such as MAPLE ‘‘an advanced, high performance mathematical computation engine with fully integrated numerics & symbolics, all accessible from a WYSIWYG technical document environment’’ is made in development work.ww Some quantum chemistry packages make using of more specialized tools such as the Tensor Contraction Engine (TCE)zz. Hirata60 describes how complex symbolic algebra, such as the manipulation of second-quantized operators, Slater determinants, Feynman diagrams, which is inevitable in quantum chemistry, can be carried out by machine. This approach faciliates the automation of the algebraic transformation and computer implementation of ‘many-body’ methods for electron correlation calculations. The development of ‘‘black box’’ computer programs began in the early 1960s. The POLYATOM system was described by M. P. Barnett in 1963 in a paper published in Rev. Mod. Phys. entitled ‘‘Mechanized Molecular Calculations The POLYATOM System’’,61 although it was revealed in an historical review62 published in 1997 by B. T. Sutcliffe, one of Barnett’s coworkers, that ‘‘the report was more about what was planned to happen rather than what by that time had happened’’

|| The NAG Library was orginally developed as collaborative project led by Dr B. Ford at the University of Nottingham, where he had worked on numerical algorithms for carrying out quantum chemistry calculations in association Professor G. G. Hall. ‘‘The Nottingham Algorithm Group’’ became ‘‘The Numerical Algorithms Group’’ providing ‘‘general purpose’’ numerical software. For further details consult the company website at http://www.nag.co.uk/. * For details consult the ‘‘Numerical recipes’’ website at http://www.nr.com/ {{ For details, see the website http://www.maplesoft.com/. See also MATHEMATICA http://www.wolfram.com/ {{ For discussion of the Tensor Contraction Engine see http://www.csc.lsu.edu/gb/TCE/ and the recent publication of S. Hirata60 entitled ‘‘Symbolic Algebra in Quantum Chemistry’’

Chem. Modell., 2008, 5, 208–248 | 219 This journal is

c

The Royal Society of Chemistry 2008

Sutcliffe continues: . . . it is nevertheless an accurate account of what eventually was to happen. By the end of 1963 the program had become widely distributed, without guarantee, and though it was not quite bug-free initially, it rapidly became so on use . . . ’’ In this paper, Barnett61 explains ‘‘The system will consist of several major blocks which are listed below. It will be possible to use these in a fully automatic manner, or under manual control, by the provision of the appropriate master programs and linking parameters. The system is being coded in FORTRAN for the IBM 709/90 computers. It is designed to allow independent modification and refinement of the individual blocks of the system, and to allow easy expansion to include further blocks as these are developed.’’ The initial structure of the POLYATOM system corrresponded to the procedure described earlier by Boys and Cook63 and in similar work by Reeves and Harrison64 entitled ‘‘Use of Gaussian Functions in the Calculation of Wavefunctions for Small Molecules’’. The use of Gaussian functions, rather than the Slater (exponential) functions favoured by Mulliken and his coworkers at that time, was to be crucial to the success of practical computational quantum chemistry and, in particular, its application to arbitrary polyatomic molecules. The main blocks of the initial POLYATOM system were as follows: 1. ‘‘an input program, to store the nuclear coordinates and orbital parameters, with their transformation properties’’ 2. ‘‘an integral list generator’’ 3. ‘‘an integral evaluation program’’ 4. ‘‘an

SCF

program’’

5. ‘‘a program to transform one- and two-electron integrals from one basis to another’’ 6. ‘‘a codetor list generator to read or generate a list of functions that are to used in a configuration interaction calculation’’ 7. ‘‘a formula generator . . . for the matrix elements between codetors’’ 8. ‘‘a matrix generator to compute the matrix elements between codetors’’ 9. ‘‘a program to solve the secular equation’’ 10. ‘‘a program to form the density matrices, dipole moments,’’ etc The POLYATOM project was not the only one of its type. Indeed, in the same issue of Rev. Mod. Phys. as Barnett’s paper, R. K. Nesbet65 of IBM Cooporation’s San Jose Laboratory describes ‘‘Computer Programmes for Electronic Wave Function calculations’’ and F. E. Harris66 discusses ‘‘Gaussian Wavefunctions for Polyatomic Molecules’’. A fuller account of the POLYATOM code was published in 1966 by Csizmadia, Harrison, Moskowitz and Sutcliffe67 in a paper entitled ‘‘Non-Empirical LCAO-MOSCF-CI Calculations on Organic Molecules with Gaussian Type Functions. Part I. Introductory Review and Mathematical Formalism’’. This was followed immediately 220 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

by one of the first applications of the POLYATOM program68 ‘‘Part II. Preliminary Investigations on Formyl Fluoride’’. In their historical review, Smith and Sutcliffe62 recount how ‘‘In 1963 the Chicago group and Enrico Clementi were taking the first steps toward what was to become IBMOL, and for a time there was a friendly rivalry between them and the developers of POLYATOM, but as IBMOL began its IBM-based development it drew steadily ahead of POLYATOM in the level of support and the facilities provided. But even today it is possible to find programs employing POLYATOM ideas and, occasionally, the vestiges of POLYATOM code can be recognised. But POLYATOM and IBMOL began a line of program systems for molecular electronic structure calculations whose current representatives are GAUSSIAN, GAMESS, and so on.’’ Initially, the POLYATOM program was coded to ‘‘limit the basis set to 50 elements’’.61 By 1966, the IBMOL program could (in principle) ‘‘handle a maximum of 800 Gaussian functions distributed on 50 centers.’’69 This program also used contracted Gaussian basis functions which had been introduced earlier by Clementi.70 Many other codes followed. The most well known of these was the GAUSSIAN code which led to the award of the Nobel Prize for Chemistry to the late Sir John Pople in 1998 ‘‘for his development of computational methods in quantum chemistry’’. GAUSSIAN 7071 was published a program number 236 by the Quantum Chemistry Program Exchange.48 Other codes included the HONDO program developed by Dupuis and his collaborators72 and also published through the Quantum Chemistry Program Exchange, and the ATMOL program by Saunders and coworkers.73 In the 1970s, development of the ATMOL code was undertaken as part of the UK’s Collaborative Computational Project (CCP1) on Electron correlation in molecules. During the 1980s and early 1990s the collaborative element of CCP1 was replaced by a more competitive approach for we saw the emergence of a number of rival electronic structure packages in the United Kingdom beginning with CADPAC and GAMESS(UK), and subsequently MOLPRO and contributions to Pople’s GAUSSIAN. There is much duplication in the functionality of these codes which will be discussed further below. Many of the ideas and methods incorporated in these codes had, as Smith and Sutcliffe pointed out62 in their 1997 review on ‘‘The development of Computational Chemistry in the United Kingdom’’, ‘‘ . . . been grown at public expense and by various hands’’ and, therefore, their ‘‘ . . . position as property, to be disposed of or traded, was deeply ambiguous.’’yy Today, many quantum chemistry program packages are available which contain code which can perform ‘many body’ calculations using a perturbative formalism. Some of these packages are freely available, some are marketed as commercial products. For some the source code is freely available, for others only the binary code is distributed. Some are well documented, others only provide documentation for the user. Our list is probably not complete, but should serve to illustrate what is available. The list is arranged alphabetically by the name of the quantum chemistry package. }} Smith and Sutcliffe continue ‘‘The obvious solution to it is to expropriate the codes and market them, using any profits to further the work. But this would undoubtedly cause tensions in the groups involved and might well call into question the basic idea of a publicly funded collaborative computational enterprise.’’

Chem. Modell., 2008, 5, 208–248 | 221 This journal is

c

The Royal Society of Chemistry 2008

ACES

II-Advanced Concepts in Electronic Structure

ACES II is a program product of the Quantum Theory Project, University of Florida. Authored by R. J. Bartlett and his co-workers,74 this program was developed for calculations using ‘many-body’ techniques to treat electron correlation. The authorship is as follows: J. F. Stanton, J. Gauss, S. A. Perera, J. D. Watts, A. D. Yau, M. Nooijen, N. Oliphant, P. G. Szalay, W. J. Lauderdale, S. R. Gwaltney, S. Beck, A. Balkov 0 a, D. E. Bernholdt, K. K. Baeck, P. Rozyczko, H. Sekino, C. Huber, J. Pittner, W. Cencek, D. Taylor, and R. J. Bartlett. Integral packages included are VMOL (J. Almlof and P. R. Taylor); VPROPS (P. Taylor); ABACUS (T. Helgaker, H. J. Aa. Jensen, P. Jørgensen, J. Olsen, and P. R. Taylor); HONDO/GAMESS (M. W. Schmidt, K. K. Baldridge, J. A. Boatz, S. T. Elbert, M. S. Gordon, J. J. Jensen, S. Koseki, N. Matsunaga, K. A. Nguyen, S. Su, T. L. Windus, M. Dupuis and J. A. Montgomery). Further details can be found at the website: http://www.qtp.ufl.edu/Aces2/ The ACES II Mainz-Austin-Budapest version was released in 2005. Details of this version can be found at the following website: http://www.aces2.de/

CADPAC

- The Cambridge Analytical Derivatives Package

This code was original developed during the 1980s under the auspices of the UK CCP1. The current version is cited as follows: R. D. Amos with contributions from I. L. Alberts, J. S. Andrews, S. M. Colwell, N. C. Handy, D. Jayatilaka, P. J. Knowles, R. Kobayashi, K. E. Laidig, G. Laming, A. M. Lee, P. E. Maslen, C. W. Murray, J. E. Rice, E. D. Simandiras, A. J. Stone, M.-D. Su and D. J. Tozer, The Cambridge Analytic Derivatives Package-A suite of quantum chemistry programs, Issue 6, Cambridge, 1995. Further details can be found at the website: http://www-theor.ch.cam.ac.uk/software/cadpac.html

COLUMBUS

originated in 1980 in the Department of Chemistry of the Ohio State University, this code was developed by I. Shavitt (Ohio State), H. Lischka (University of Vienna) and R. Shepard (Battelle Columbus Laboratories. The key publications describing the COLUMBUS package are H. Lischka, R. Shepard, F. B. Brown and I. Shavitt, Int. J. Quantum Chem., Quantum Chem. Symp., 1981, 15, 91. R. Shepard, I. Shavitt, R. M. Pitzer, D. C. Comeau, M. Pepper, H. Lischka, P. G. Szalay, R. Ahlrichs, F. B. Brown, and J. Zhao, Int. J. Quantum Chem., Quantum Chem. Symp., 1988, 22, 149. H. Lischka, R. Shepard, R. M. Pitzer, I. Shavitt, M. Dallos, Th. Mu¨ller, P. G. Szalay, M. Seth, G. S. Kedziora, S. Yabush*ta and Z. Zhang, Phys. Chem. Chem. Phys., 2001, 3, 664. The current release is cited as H. Lischka, R. Shepard, I. Shavitt, R. M. Pitzer, M. Dallos, Th. Mu¨ller, P. G. Szalay, F. B. Brown, R. Ahlrichs, H. J. Bo¨hm, A. Chang, D. C. Comeau, R. Gdanitz, H. Dachsel, C. Ehrhardt, M. Ernzerhof, P. Ho¨chtl, S. Irle, G. Kedziora, T. Kovar, V. Parasuk, M. J. M. Pepper, P. Scharf, H. Schiffer, M. Schindler, M. Schu¨ler, M. Seth, E. A. Stahlberg, J.-G. Zhao, S. Yabush*ta, Z. Zhang, 222 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

M. Barbatti, S. Matsika, M. Schuurmann, D. R. Yarkony, S. R. Brozell, E. V. Beck, and J.-P. Blaudeau, COLUMBUS, an ab initio electronic structure program, release 5.9.1, 2006 Further details can be found at the website: http://www.univie.ac.at/columbus/

DALTON

Named after John Daltonzz the father of modern atomic theory, this package is described by the authors (in the program manual) as ‘‘in many respects an ‘expert’s’ program’’ The current version is cited as follows: DALTON, a molecular electronic structure program, Release 2.0, 2005 Further details, including the list of contributors, can be found at the website: http://www.kjemi.uio.no/software/dalton/dalton.html

HONDO

This code was first developed during the 1970s and distributed by QCPE. It featured very fast integral evaluation based on the work of Dupuis, Rys and King.72 HONDO 78 was an early package containing procedures to compute the energy gradient in geometry optimization and force constant matrix evaluation.76 The current version is cited as follows: M. Dupuis, A. Marquez, and E. R. Davidson, HONDO 99.6, 1999, based on HONDO 95.3, M. Dupuis, A. Marquez, and E. R. Davidson, Quantum Chemistry Program Exchange (QCPE), Indiana University, Bloomington, IN 47405. Additional capabilities are included in the HONDOPLUS package, which is cited as follows: HONDOPLUS-v.5.1, by H. Nakamura, J. D. Xidos, A. C. Chamberlin, C. P. Kelly, R. Valero, J. D. Thompson, J. Li, G. D. Hawkins, T. Zhu, B. J. Lynch, Y. Volobuev, D. Rinaldi, D. A. Liotard, C. J. Cramer, and D. G. Truhlar, University of Minnesota, Minneapolis, 2007 based on HONDO v.99.6.

GAMESS-General

Atomic and Molecular Electron Structure System

This package is based on code originally developed by the National Resource for Computations in Chemistry The citation for the original code is: M. Dupuis, D. Spangler, and J. J. Wendoloski National Resource for Computations in Chemistry Software Catalog, University of California: Berkeley, CA, 1980, Program QG01 The current version is cited as follows: M. W. Schmidt, K. K. Baldridge, J. A. Boatz, S. T. Elbert, M. S. Gordon, J. H. Jensen, S. Koseki, N. Matsunaga, K. A. Nguyen, S. Su, T. L. Windus, M. Dupuis and J. A. Montgomery, J. Comput. Chem., 1993, 14, 1347–1363. General Atomic and Molecular Electronic Structure System M. S. Gordon, M. W. Schmidt, in ‘‘Theory and Applications of Computational Chemistry: the first forty years’’ eds. C. E. Dykstra, G. Frenking, K. S. Kim, G. E. Scuseria, Elsevier, Amsterdam, 2005, pp. 1167–1189, Advances in electronic structure theory: GAMESS a decade later Further details can be found at the website: http://www.msg.ameslab.gov/gamess/ There is also an independent user group which can be found at the website: http://groups.google.com/group/gamess }} 1766–1844

Chem. Modell., 2008, 5, 208–248 | 223 This journal is

c

The Royal Society of Chemistry 2008

There are a number of programs based on or derived from GAMESS. These include: – PC-GAMESS A freely available code derived from GAMESS which has been ‘‘developed to offer high performance on Intel-compatible x86, AMD64, and EM64T processors’’ The code contains ‘‘very efficient MP2 energy and gradient modules’’. Further details can be found at the website: http://classic.chem.msu.su/gran/gamess/ – CHEMBIODRAW This is a powerful chemistry and biology drawing application delivering drawing, publishing and analytical features on a Windows/Mac environment. GAMESS is provided in the latest release ChemBioDraw Ultra 11.0, but although the underlying code is the full version, there is no way for users to access any features not available in the graphical user interface. Further details can be found at the website: http://www.cambridgesoft.com/ – GAMESS (UK) A version of GAMESS developed and made available to the acedemic community in the UK. Further details can be found at the website: http://www.nsccs.ac.uk/

GAUSSIAN

Certainly GAUSSIAN is today the most well known and widely used quantum chemistry program. GAUSSIAN70 was distributed by QSCP as program number 23671 entitled GAUSSIAN 70: Ab initio SCF-MO Calculations on Organic Molecules and authored by W. J. Hehre, W. A. Lathan, R. Ditchfield, M. D. Newton and J. A. Pople. The current version is cited as M. J. Frisch, G. W. Trucks, H. B. Schlegel, G. E. Scuseria, M. A. Robb, J. R. Cheeseman, J. A. Montgomery, Jr, T. Vreven, K. N. Kudin, J. C. Burant, J. M. Millam, S. S. Iyengar, J. Tomasi, V. Barone, B. Mennucci, M. Cossi, G. Scalmani, N. Rega, G. A. Petersson, H. Nakatsuji, M. Hada, M. Ehara, K. Toyota, R. f*ckuda, J. Hasegawa, M. Ishida, T. Nakajima, Y. Honda, O. Kitao, H. Nakai, M. Klene, X. Li, J. E. Knox, H. P. Hratchian, J. B. Cross, V. Bakken, C. Adamo, J. Jaramillo, R. Gomperts, R. E. Stratmann, O. Yazyev, A. J. Austin, R. Cammi, C. Pomelli, J. W. Ochterski, P. Y. Ayala, K. Morokuma, G. A. Voth, P. Salvador, J. J. Dannenberg, V. G. Zakrzewski, S. Dapprich, A. D. Daniels, M. C. Strain, O. Farkas, D. K. Malick, A. D. Rabuck, K. Raghavachari, J. B. Foresman, J. V. Ortiz, Q. Cui, A. G. Baboul, S. Clifford, J. Cioslowski, B. B. Stefanov, G. Liu, A. Liashenko, P. Piskorz, I. Komaromi, R. L. Martin, D. J. Fox, T. Keith, M. A. AlLaham, C. Y. Peng, A. Nanayakkara, M. Challacombe, P. M. W. Gill, B. Johnson, W. Chen, M. W. Wong, C. Gonzalez, and J. A. Pople, GAUSSIAN 03, (Revision C.02), Gaussian, Inc., Wallingford CT, 2004. and is marketed by Gaussian Inc. Further details can be found at the Gaussian Inc. website: http://www.gaussian.com/ HYPERCHEM A PC-based computational chemistry code with very efficient routines for optimization of MP2 geometries using the method ofconjugate directions. The current version, HYPERCHEM 8.0, is marketed by HYPERCHEM Molecular Modeling Software, Hypercube, Inc., 224 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

1115 NW 4th St., Gainesville FL, USA Further details can be found at the website: http://www.hyper.com/

JAGUAR

JAGUAR is described as a ‘‘a high-performance ab initio package for both gas and solution phase simulations, with particular strength in treating metal containing systems’’. It is marketed by a company called Schro¨dinger with head office at Schro¨dinger Inc., 101 SW Main Street, Suite 1300, Portland, OR 97204, USA Further details can be found at the website: http://www.schrodinger.com/

MOLCAS

MOLCAS, like DALTON, is something of an ‘‘experts code’’. It is described as a code for the ‘‘ab initio treatment of very general electronic structure problems for molecular systems in both ground and excited states’’. It is particularly for electronic structure problems requiring a multi-reference treatment. G. Karlstrom, R. Lindh, P.-E. Malmqvist, B. O. Roos, U. Ryde, V. Veryazov, P.-O. Widmark, M. Cossi, B. Schimmelpfennig, P. Neogrady, L. Seijo, Comput. Mater. Sci. 2003, 28, 222. Further details can be found at the website: http://www.teokem.lu.se/molcas/

MOLPRO

The MOLPRO quantum chemistry package has an ‘‘emphasis is on highly accurate computations, with extensive treatment of the electron correlation problem’’. It is maintained by H.-J. Werner and P. J. Knowles, and contains contributions from a number of authors: MOLPRO is a package of ab initio programs written by H.-J. Werner and P. J. Knowles, with contributions from R. D. Amos, A. Bernhardsson, A. Berning, P. Celani, D. L. Cooper, M. J. O. Deegan, A. J. Dobbyn, F. Eckert, C. Hampel, G. Hetzer, T. Korona, R. Lindh, A. W. Lloyd, S. J. McNicholas, F. R. Mandy, W. Meyer, M. E. Mura, A. Nicklass, P. Palmieri, R. Pitzer, G. Rauhut, M. Schutz, H. Stoll, A. J. Stone, R. Tarroni, T. Thorsteinsson. Further details can be found at the website: http://www.molpro.net/

MPQC

a quantum chemistry package written for massively parallel computers and available as open-source software. The program reference is: The Massively Parallel Quantum Chemistry Program (MPQC), C. L. Janssen, I. B. Nielsen, M. L. Leininger, E. F. Valeev, E. T. Seidl, Sandia National Laboratories, Livermore, CA, USA, 2004 More details of the approach used in this package can be found in: J. P. Kenny, S. J. Benson, Y. Alexeev, J. Sarich, C. L. Janssen, L. Curfman Mcinnes, M. Krishnan, J. Nieplocha, E. Jurrus, C. Fahlstrom and T. L. Windus, Component-based integration of chemistry and optimization software, J. Comp. Chem. 2004, 25, 1717 Further details can be found at the website: http://www.mpqc.org/ The open-source software is distributed by sourceforge.net http://sourceforge.net/projects/mpqc Chem. Modell., 2008, 5, 208–248 | 225 This journal is

c

The Royal Society of Chemistry 2008

NWCHEM an ab initio quantum chemistry program written specfically for parallel computers. It was developed by the Molecular Sciences Software group of the Environmental Molecular Sciences Laboratory (EMSL) at the Pacific Northwest National Laboratory (PNNL). The current version, NMCHEM 5.0, is cited as: E. J. Bylaska, W. A. de Jong, K. Kowalski, T. P. Straatsma, M. Valiev, D. Wang, E. Apra`, T. L. Windus, S. Hirata, M. T. Hackler, Y. Zhao, P.-D. Fan, R. J. Harrison, M. Dupuis, D. M. A. Smith, J. Nieplocha, V. Tipparaju, M. Krishnan, A. A. Auer, M. Nooijen, E. Brown, G. Cisneros, G. I. Fann, H. Fru¨chtl, J. Garza, K. Hirao, R. Kendall, J. A. Nichols, K. Tsemekhman, K. Wolinski, J. Anchell, D. Bernholdt, P. Borowski, T. Clark, D. Clerc, H. Dachsel, M. Deegan, K. Dyall, D. Elwood, E. Glendening, M. Gutowski, A. Hess, J. Jaffe, B. Johnson, J. Ju, R. Kobayashi, R. Kutteh, Z. Lin, R. Littlefield, X. Long, B. Meng, T. Nakajima, S. Niu, L. Pollack, M. Rosing, G. Sandrone, M. Stave, H. Taylor, G. Thomas, J. van Lenthe, A. Wong, and Z. Zhang, NWChem, A Computational Chemistry Package for Parallel Computers, Version 5.0 Pacific Northwest National Laboratory, Richland, Washington 99352–0999, USA. 2006. An overview of NWCHEM can be found in: High Performance Computational Chemistry: An Overview of NWChem a Distributed Parallel Application, R. A. Kendall, E. Apra, D. E. Bernholdt, E. J. Bylaska, M. Dupuis, G. I. Fann, R. J. Harrison, J. Ju, J. A. Nichols, J. Nieplocha, T. P. Straatsma, T. L. Windus, A. T. Wong, Computer Phys. Comm., 2000, 128, 260–283. Further details can be found at the website: http://www.emsl.pnl.gov/docs/nwchem/nwchem.html

PQS

Parallel Quantum Solutions is a commercial organization offering ‘‘parallel computers with integrated software for high performance computational chemistry’’. The hardware offered is based on personal computer technology. The authors of the software, P. Pulay, J. Baker and K. Wolinski, argue that: ‘‘The high computational demands of most ab initio calculations requires parallel processing to achieve reasonably fast turnaround. General purpose parallel computers are very expensive and are usually shared between many users, negating most of the advantages of parallelism. Parallel computational chemistry programs are expensive and difficult to set up.’’ PQS can be found at Parallel Quantum Solutions, 2013 Green Acres Road, Suite A, Fayetteville, Arkansas 72703 USA Further details can be found at the website: http://www.pqs-chem.com

PSI

3.3.0

an open-source ab initio electronic structure package, details of which were published recently in: PSI3: An Open-Source Ab initio Electronic Structure Package, T. D. Crawford, C. D. Sherrill, E. F. Valeev, J. T. Fermann, R. A. King, M. L. Leininger, S. T. Brown, C. L. Janssen, E. T. Seidl, J. P. Kenny, and W. D. Allen, J. Comp. Chem. 2007, 28, 1610–1616. Further details can be found at the website: http://www.psicode.org/index.html The open-source software is distributed by sourceforge.net http://sourceforge.net/projects/psicode 226 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

QCHEM 3.1 ‘‘a comprehensive ab initio quantum chemistry package’’. Details can be found in: J. Kong, C. A. White, A. I. Krylov, D. Sherrill, R. D. Adamson, T. R. Furlani, M. S. Lee, A. M. Lee, S. R. Gwaltney, T. R. Adams, C. Ochsenfeld, A. T. B. Gilbert, G. S. Kedziora, V. A. Rassolov, D. R. Maurice, N. Nair, Y. Shao, N. A. Besley, P. E. Maslen, J. P. Dombrowski, H. Daschel, W. Zhang, P. P. Korambath, J. Baker, E. F. Byrd, T. van Voorhis, M. Oumi, S. Hirata, C.-P. Hsu, N. Ishikawa, J. Florian, A. Warshel, B. G. Johnson, P. M. W. Gill, M. Head-Gordon, J. A. Pople, J. Comput. Chem., 2000, 21, 1532 The latest version is QCHEM 3.1. The QCHEM package is marketed by Q-Chem, Inc., The Design Center, Suite 690 5001 Baum Blvd. Pittsburgh, PA 15213 Further details can be found at the website: http://www.q-chem.com/index.htm

SPARTAN

aims to provide ‘‘easy-to-learn and easy-to-use molecular modeling tools for chemistry researchers and educators’’. In conjunction with QCHEM, SPARTAN provides a full range of post-Hartree-Fock methods. Further details can be found at the website: http://www.wavefun.com/index.html

TURBOMOLE

a general purpose code originally designed for high efficiency on workstation by the Ahlrichs group in Karlsruhe. Useful references are Electronic Structure Calculations on Workstation Computers: the Program System TURBOMOLE, R. Ahlrichs, M. Ba¨r, M. Ha¨ser, H. Horn and Ch. Ka¨lmel, Chem. Phys. Letters, 1989, 162, 165 S. Brode, H. Horn, M. Ehrig, D. Moldrup, J. E. Rice, R. Ahlrichs, J. Comput. Chem., 1993, 14, 1142 Further details can be found at the website: http://www.cosmologic.de/QuantumChemistry/ UT-CHEM a program suite for ab initio quantum chemistry. Details can be found in: T. Yanai, H. Nakano, T. Nakajima, T. Tsuneda, S. Hirata, Y. Kawashima, Y. Nakao, M. Kamiya, H. Sekino, K. Hirao, Computational Science-International Conference on Computational Science 2003, Lecture Notes in Computer Science (Springer) 2003, pp. 84–95 Further details can be found at the website: http://utchem.qcl.t.u-tokyo.ac.jp/

3.7 The knowledge base: computer hardware In the introduction to this section, we remarked that ‘‘Whereas a printed page can be copied with little regard to the nature of the material, usually paper, on to which it is copied, the correct execution of computer software is usually highly dependent on the hardware upon which it is implemented.’’ The performance of the software packages described in the previous section is usually strongly dependent on the hardware platform on which it is implemented and this in turn determines the range of application that can be considered. Chem. Modell., 2008, 5, 208–248 | 227 This journal is

c

The Royal Society of Chemistry 2008

In this section, we turned our attention to the hardware requirements for computational many-body perturbation theory for molecules. As Hennessy and Patterson explain in their well known treatise on computer architecture77 ‘‘Computer technology has made incredible progress in the roughly 60 years since the first general-purpose electronic computer was created. Today less than $500 will purchase a personal computer that has more performance, more main memory, and more disk storage than a computer bought in 1985 for 1 million dollars. This rapid improvement has come both from advances in the technology used to build computers and form innovation in computer design.’’ Today, quantum chemistry calculations are carried out on a wide range of machines and architectures from a personal computer to a state-of-the-art supercomputer. To date, the vast majority of computational chemists have been content to use general purpose computers in their work designing software to exploit the capabilities of commercially available machines. However, as long ago as 1977 in a volume entitled Personal Computers in Chemistry, published by the American Chemical Society, Neil Ostlund of Hypercube Inc., expressed the view that chemists should be involved in the construction of the machines that they use:78 ‘‘Computational chemistry is to established and important a field to leave its only significant piece of apparatus to disinterested computer scientists. Chemists in other fields have commonly built very sophisticated instruments—molecular beam machines, electron scattering spectrometers, ion cyclotron resonance spectrometers, etc. . . . If computational chemists are to take full advantage of the current revolution in micro-electronics, which offers great prospects for cost-effective high-performance computation, it will be necessary to become involved in the design and building of highly parallel architectures specific to specific chemical applications.’’ Some computational chemists design their own systems by buying commercially available components and thereby tailoring their system to their specific needs. As we described in the previous section, P. Pulay and his coworkers have established Parallel Quantum Solutions marketing ‘‘parallel computers with integrated software for high performance computational chemistry’’. These authors write88: ‘‘The high computational demands of most ab initio calculations requires parallel processing to achieve reasonably fast turnaround. General purpose parallel computers are very expensive and are usually shared between many users, negating most of the advantages of parallelism. Parallel computational chemistry programs are expensive and difficult to set up.’’ Typically, quantum chemistry packages are suppied for a range of hardware platforms. For example, GAUSSIAN 03 is available for machines from a a range of vendors with different CPU and operating systems: AMD with the – AMD64 (Opteron, Athlon 64) CPU and the * SuSE Linux 9.0, 9.3, 10.1–10.3, * SuSE Linux Enterprise 10, or * Red Hat Enterprise Linux 4 (Updates 3,4) and RHEL 5 (initial release) operating system. |||| http://www.pqs-chem.com

228 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

Apple with the – IA32 (32-bit Intel Mac) CPU and the * OS X 10.4.9 and later Tiger releases of the operating system Apple with the – EM64T (64-bit Intel Mac) CPU and the * OS X 10.4.9 and later Tiger releases of the operating system Fujitsu with the – Prime Power CPU and the * Solaris 9 and 10 operating system HP with the – Alpha CPU and the * Tru64 5.1B operating system and so on . . . A full list can be found in the GAUSSIAN 03 documentation available on the website http://www.gaussian.com. In distributing any quantum chemistry packages, each machine specific implementation has been tailored to the given hardware platform taking account of the computer architecture. In brief, computer architecture comprises at least three main subcategories:77 Instruction Set Architecture—the abstract image of a computing system that is seen by a machine language (or assembly language) programmer. This inlcudes the instruction set, memory address modes, processor registers, and address and data formats. The Microarchitecture is a lower level, more concrete, description of the system. It involves such details as how the constituent parts of the system are interconnected and how they interoperate in order to implement the instruction set architecture. For example, the size of a computer’s cache memory is an organizational issue which, in general, has nothing to do with the instruction set architecture. System Design which includes all of the other hardware components within a computing system such as: 1. system interconnects such as computer busses and switches 2. memory controllers and hierarchies 3. CPU ‘off-load’ mechanisms such as Direct Memory Access (DMA) 4. issues such as multi-processing. The computers used in ab initio quantum chemistry range from desktop machines through to supercomuters. As an example of a state-of-the-art supercomputer, we briefly consider the NEC SX-9 lauched by NEC Corporation** in October 2007. When completed this will be the world’s fastest vector supercomputer with a peak processing performance of 839 TFLOPS (one trillion floating point operations per ** Press release from NEC Corporation, 25 October 2007, see http://www.nec.co.jp/press/en/ 0710/2501.html for details.

Chem. Modell., 2008, 5, 208–248 | 229 This journal is

c

The Royal Society of Chemistry 2008

second). The NEC SX-9 will have CPUs capable of a peak vector performance of 102.4 GFLOPS (one billion floating point operations per second) per single core. This will combine large-scale shared memory of up to 1TB (terabyte) and ultra highspeed interconnects achieving speeds up to 128GB/second. The largest NEC SX-9 can have 8,192 CPUs and will achieve peak processing performance of 839 TFLOPS, thus approaching the PFLOPS (one quadrillion floating point operations per second) range.

4. An overview of applications of second order theory 4.1 Incidence of the string ‘‘MP2’’ in titles and/or keywords and/or abstracts In previous reports to this series, the increasing use of many-body perturbation theory in molecular electronic structure studies was measured by interrogating the Institute for Scientific Information (ISI) databases. In particular, I determined the number of incidences of the string ‘‘MP2’’ in titles and/or keywords and/or abstracts. This acronym is frequently associated with the simplest form of manybody perturbation theory. This assessment of the use of second order many-body perturbation theory will undoubtedly miss many routine applications but should serve to convey both the extent and the breadth of contemporary application areas. In my report for the period up to 1999, I noted that the string ‘‘MP2’’ had occurred in the title and/or keywords of just 3 publications in 1989 but that this number had risen to 854 in 1998. In my report for the period June 1999 to May 2001, I measured 821 ‘‘hits’’ for 1999 and 883 for the year 2000. For my report covering the period June 2001 to May 2003, I found a total of 757 incidences for 2001 and 828 for 2002. For the period June 2003 to May 2005, the ISI database contained 834 and 819 ‘‘hits’’ for the years 2003 and 2004, respectively. In the present reporting period, June 2005 to May 2007, the ISI database contained 919 and 904 ‘‘hits’’ for the years 2005 and 2006, respectively, being, of course, the last two years at the time of writing for which data is available. Over the nine year period 1998–2006 inclusive, the average number of publications with the string ‘‘MP2’’ in the title and/or keywords has been 847, ranging between a minimum of 757 and a maximum of 919. Our analysis is summarized in Table 1. 4.2 Synopsis of applications of second order many-body perturbation theory As in my three most recent reviews,2–4 I have attempted to provide a snapshot of the many applications of many-body perturbation theory in its simplest form, i.e. Møller-Plesset theory designated MP2, during the period under review by performing a literature search for publications with the string ‘‘MP2’’ in the title only. During the period covered by the present report, a total of 80 papers were discovered satisfying this criterion. Table 1 Incidence of the acronym ‘‘MP2’’ in the title and/or keywords and/or abstract or publications over the period 1998 to 2006 Year

‘‘MP2’’

1998 1999 2000 2001 2002 2003 2004 2005 2006

854 821 883 757 828 834 819 919 904

230 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

We list the titles of these papers here: 1. MP2 study of substituent effects of 2-substituted alkyl ethyl methylcarbamates in hom*ogeneous, unimolecular gas phase elimination reaction79 2. Activation energies of pericyclic reactions: Performance of DFT, MP2, and CBS-QB3 methods for the prediction of activation barriers and reaction energetics of 1,3-dipolar cycloadditions, and revised activation enthalpies for a standard set of hydrocarbon pericyclic reactions80 3. G3(MP2) enthalpies of hydrogenation, isomerization, and formation of extended linear polyacetylenes81 4. Basis set superposition error in MP2 and density-functional theory: A case of methane-nitric oxide association82 5. Parametric representation of the low-energy portion of the water-carbon dioxide G3(MP2)//B3LYP electronic energy surface. Variations on the theme developed by Professor B. Widom in his PhD dissertation83 6. Structure and stability of thiourea with water, DFT and MP2 calculations84 7. G3(MP2) ring strain in bicyclic phosphorus heterocycles and their hydrocarbon analogues85 8. MP2 static first hyperpolarizability of azo-enaminone isomers86 9. The molecular structures and electron distributions of the 1,8-bis(dimethylamino)-naphthalenes, studied by density functional and ab initio MP2 calculations87 10. MP2 and DFT studies of the DNA rare base pairs: the molecular mechanism of the spontaneous substitution mutations conditioned by tautomerism of bases88 11. DFT and MP2 study on the redistributing preparation of dichlorodimethyl89 silane catalyzed by AlCl(II) 3 12. G3(MP2) study of the C3H6O+ isomers fragmented from 1,4-dioxane+90 13. First-principle DFT and MP2 modeling of infrared reflection-Absorption spectra of oriented helical ethylene glycol oligomers91 14. Energetics and bonding study of hexamethylenetetramine and fourteen related cage molecules: An ab initio G3(MP2) investigation92 15. An analysis of the vibrational spectra of N,N-dimethylformamide isotopomers and unsubstituted N,N-dimethylcabamoyl chloride with scaling the force field calculated by the MP2 method using a basis set including f functions93 16. Characterization of the conformational probability of N-acetyl-phenylalanylNH2 by RHF, DFT, and MP2 computation and AIM analyses, confirmed by jet-cooled infrared data94 17. Insights into the role of the aromatic residue in galactose-binding sites: MP2/6-311G++** study on galactose- and glucose-aromatic residue analogue complexes95 Chem. Modell., 2008, 5, 208–248 | 231 This journal is

c

The Royal Society of Chemistry 2008

18. Comparison of large basis set DFT and MP2 calculations in the study of the barrier for internal rotation of 2,3,5,6-tetrafluoroanisole96 19. DFT and local MP2 study of switching process in a pH controllable molecular ‘‘shuttle’’97 20. A combined DFT, MP2 and CASSCF study on the properties of Tropolone(H2O)n, (n = 0–2) complexes98 21. Novel 1,5-difluoropenta-1,4-diyn-3-one, its sulfur and selenium analogues: MP2 and DFT gas phase study of their molecular structures and vibrational spectra99 22. Berberine alkaloid: Quantum chemical study of different forms by the DFT and MP2 methods100 23. Modeling of the ‘hydration shell’ of uracil and thymine in small water clusters by DFT and MP2 methods101 24. Permanganate oxidation of alkenes. Substituent and solvent effects. Difficulties with MP2 calculations102 25. Treating dispersion effects in extended systems by hybrid MP2: DFT calculations—protonation of isobutene in zeolite ferrierite103 26. A DFT and MP2 study of luminescence of gold(I) complexes104 27. MP2 study of cation–(p)n–p interactions (n = 1–4)105 28. A new nonsymmetric As(OH)3 species. Comparison with the known C3 species and themochemistry at the HF, DFT(B3LYP), MP2, MP4, and CCSD(T) levels of theory106 29. Electron affinities and ionization potentials of 4d and 5d transition metal atoms by CCSD(T), MP2 and density functional theory107 30. The p-type hydrogen bond with triple C–C bond acting as a proton-acceptor. A gradient-corrected hybrid HF-DFT and MP2 study of the phenol-acetylene dimer in the neutral S-0 ground state108 31. Benchmark database of accurate (MP2 and CCSD(T) complete basis set limit) interaction energies of small model complexes, DNA base pairs, and amino acid pairs109 32. Ab initio base-pairing energies of uracil and 5-hydroxyuracil with standard DNA bases at the BSSE-free DFT and MP2 theory levels110 33. MP2 theory investigation on the halides of D6h C36:C36Xn (X = F,Cl,Br; n = 2,4,6,12)111 34. Estimated MP2 and CCSD(T) interaction energies of n-alkane dimers at the basis set limit: Comparison of the methods of Helgaker et al. and Feller112 35. The MP2 quantum chemistry study on the local minima of guanine stacked with all four nucleic acid bases in conformations corresponding to mean B-DNA113 232 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

36. Structures and properties of cytosine-BX3 (X = F, Cl) complexes: an investigation with DFT and MP2 methods114 37. B3LYP and MP2 calculations of the enthalpies of hydrogen-bonded complexes of methanol with neutral bases and anions: Comparison with experimental data115 38. Benchmark RI-MP2 database of nucleic acid base trimers: performance of different density functional models for prediction of structures and binding energies116 39. Assessment of the MP2 method, along with several basis sets, for the computation of interaction energies of biologically relevant hydrogen bonded and dispersion bound complexes117 40. Investigation of the intermolecular proton transfer in the supersystems adenine-methanol/ethanol/i-propanol: MP2 and DFT levels study118 41. New parallel algorithm for MP2 energy gradient calculations119 42. How many conformers determine the thymidine low-temperature matrix infrared spectrum? DFT and MP2 quantum chemical study120 43. MP2/6-311++G(d,p) study on galactose-aromatic residue analog complexes in different position-orientations of the saccharide relative to aromatic residue121 44. DFT and MP2 investigations on the interaction of furan hom*ologues C4H4Y (Y = O, S) with BX3 (X = H, F, Cl)122 45. Insufficient description of dispersion in B3LYP and large basis set superposition errors in MP2 calculations can hide peptide conformers123 46. Modeling of the ‘hydration shell’ of uracil and thymine in small water clusters by DFT and MP2 methods124 47. Ab initio base-pairing energies of an oxidized thymine product, 5-formyluracil, with standard DNA bases at the BSSE-free DFT and MP2 theory levels125 48. Comprehensive conformational analysis of the nucleoside analogue 2 0 -b-deoxy-6-azacytidine by DFT and MP2 calculations126 49. Density functional static dipole polarizability and first-hyperpolarizability calculations of Nan (n = 2,4,6,8) clusters using an approximate CPKS method and its comparison with MP2 calculations127 50. Structure and vibrational spectra of vinyl ether conformers. The comparison of B3LYP and MP2 predictions128 51. MP2 study of anion–p complexes of trifluoro-s-triazine with tetrahedral and octahedral anions129 52. New nonsymmetric P(OH)3 species. Comparison with the C-3 isomer and themochemistry at the DFT, MP2, and CCSD(T) levels of theory130 53. Structure and bonding differences in C3N4 and Si3N4 isomers-A comparative study of [Si-3,N-4] and [C-3,N-4] potential energy surfaces using DFT and MP2 methodologies131 Chem. Modell., 2008, 5, 208–248 | 233 This journal is

c

The Royal Society of Chemistry 2008

54. Analytical RISM-MP2 free energy gradient method: Application to the Schlenk equilibrium of Grignard reagent132 55. MP2 study of cooperative effects between cation-pi, anion-pi and pi-pi interactions133 56. Conformational study of isolated pindolol by HF, DFT and MP2 calculations134 57. Comparison of the conformational stability for several vinylhalomethanes and silanes with experiment using MP2 perturbation theory and DFT135 58. Stabilisation energy of C6H6 C6X6 (X = F, Cl, Br, I, CN) complexes: complete basis set limit calculations at MP2 and CCSD(T) levels136 59. A new non-symmetric N(OH)3 species: Comparison with the C3 species and thermochemistry at the HF, DFT, MP2, MP4 and CCSD(T) levels of theory137 60. Pt-bridges in various single-strand and double-helix DNA sequences. DFT and MP2 study of the cisplatin coordination with guanine, adenine, and cytosine138 61. Hydrogen-bonded nucleic acid base pairs containing unusual base tautomers: Complete basis set calculations at the MP2 and CCSD(T) levels139 62. Predicting potential stable isomers on the singlet surface of the [H,P,C,S] system by the MP2 and QCISD(T) methods140 63. Probing ab initio MP2 approach towards the prediction of vibrational infrared spectra of DNA base pairs141 64. C–H functionalisation through singlet chlorocarbenes insertions-MP2 and DFT investigations142 65. Alternative formulation of the matrix elements in MP2-R12 theory143 66. General orbital invariant MP2-F12 theory144 67. Optimized accurate auxiliary basis sets for RI-MP2 and RI-CC2 calculations for the atoms Rb to Rn145 68. Computational aspects of a local-MP2 treatment of electron correlation in periodic systems: SiC vs. BeS146 69. Fast local-MP2 method with density-fitting for crystals. I. Theory and algorithms147 70. Fast local-MP2 method with density-fitting for crystals. II. Test calculations and application to the carbon dioxide crystal148 71. Ab initio asymptotic-expansion coefficients for pair energies in MP2 perturbation theory for atoms149 72. MP2 basis set limit binding energy estimates of hydrogen-bonded complexes from extrapolation-oriented basis sets150 234 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

73. A general efficient implementation of the BSSE-free SCF and MP2 methods based on the Chemical Hamiltonian Approach151 74. Augmented Gaussian basis sets of triple and quadruple zeta valence quality for the atoms H and from Li to Ar: applications. in HF, MP2, and DFT calculations-of molecular dipole moment and dipole (hyper)polarizability152 75. Calculations with correlated molecular wave functions. HF, MP2 and DFT calculations on second-row diatomic hydrides153 76. HF and MP2 calculations on CN, N2, AlF, SiO, PN, SC, CIB, and P2 using correlated molecular wave functions154 77. Hybrid correlation models based on active-space partitioning: Correcting MP2 theory for bond-breaking reactions155 78. A new parallel algorithm of MP2 energy calculations156 79. DIESEL-MP2: A new program to perform large-scale multireference-MP2 computations157 80. Array files for computational chemistry: MP2 energies158 These papers were published in a total of 30 different journals which are listed alphabetically in Table 2. The number of relevant papers appearing in each journal is given in parenthesis. The largest number of papers appeared in J. Molec. Struct.Theochem, followed by J. Phys. Chem. A and Chem. Phys. Lett. 4.3 Comparison with other methods In volume 1 of this series, I compared the use of second-order many-body perturbation theory in its ‘‘MP2’’ form with that of density functional theory and coupled cluster theory. I recorded how the number of ‘‘hits’’ in a literature search on the string ‘‘MP2’’ rises from 3 in 1989 to 854 in 1998. The corresponding results for DFT, the most widely used semi-empirical method, are 7 in 1989 growing to 733 by 1998. By 1998, the number of ‘‘hits’’ recorded for CCSD stood as 244. I extended this comparison for the period 1998–2002 in volume 3 of this series, and to 2004 in volume 4. In Table 3, the comparison is continued through to 2006, the last complete year for which data is available at the time of writing. The most striking observation about this table is the growth in the use of ‘‘DFT’’ which first exceeded that of ‘‘MP2’’ in 1999, stood at roughly a factor of two greater at the time of my last report and now, according to the 2006 data, is approaching a point where its use will be a factor of four more. It is undoubtedly the demand for methods that can be deployed in the description of larger systems that is fueling the growth in the use of density functional theory. A significant number of papers with the string ‘‘MP2’’ in the title and/or keywords and/or abstract report comparative studies in which practical applications of ‘‘MP2’’ theory are compared with applications of other methods. In order to measure the number of papers of this type in recent years, literature search were carried out to determine:1. the number of papers containing both the string ‘‘MP2’’ and the string ‘‘DFT’’ in the title and/or keywords and/or abstract 2. the number of papers containing both the string ‘‘MP2’’ and the string ‘‘CCSD’’ in the title and/or keywords and/or abstract 3. the number of papers containing both the string ‘‘DFT’’ and the string ‘‘CCSD’’ in the title and/or keywords and/or abstract Chem. Modell., 2008, 5, 208–248 | 235 This journal is

c

The Royal Society of Chemistry 2008

Table 2 Journals in which papers appeared containing the acronym ‘‘MP2’’ in the title and/or keywords and/or abstract over the period June 2005 to May 2007 Abs. Am. Chem. Soc (1) Acta. Chimica Sinica (1) Biochem. (1) Bull. Korean Chem. Soc. (1) Chem. Phys. (2) Chem. Phys. Lett. (8) Collect. Czechoslovak Chem. Commun. (1) Inorganica Chimica Acta (1) Intern. J. Quantum Chem. (4) J. Am. Chem. Soc. (1) J. Biomolec. Struct. & Dynam. (1) J. Chem. Phys. (4) J. Chem. Theory & Comput. (1) J. Comput. Chem. (5) J. Molec. Modelling (2) J. Molec. Struct. (1) J. Molec. Struct.-Theochem (12) J. Organic Chem. (1) J. Phys. Chem. A (9) J. Phys. Chem. B (4) J. Serbian Chem. Soc. (1) J. Theoret. & Comput. Chem. (1) Lect. Notes Comp. Sci. (1) Molec. Phys. (4) New J. Chem. (1) Organic & Biomolec. Chem. (2) Phys. Chem. Chem. Phys. (4) Phys. Rev. B (2) Russian J. Phys. Chem. (1) Theoret. Chem. Accounts (2)

Table 3 Incidence of the acronyms ‘‘MP2’’, ‘‘DFT’’ and ‘‘CCSD’’ in the title and/or keywords and/or abstract of publications over the period 1998 to 2006 Year

‘‘MP2’’

‘‘DFT’’

‘‘CCSD’’

1998 1999 2000 2001 2002 2003 2004 2005 2006

854 821 883 757 828 834 819 919 904

738 923 1221 1528 1723 2101 2433 3393 3576

244 263 283 318 303 354 320 446 480

4. the number of papers containing the string ‘‘MP2’’, the string ‘‘DFT’’ and the string ‘‘CCSD’’ in the title and/or keywords and/or abstract The results are collected in Table 4. In 2006, the most recent year for which complete data is available at the time of writing, 228 publications referred to both ‘‘MP2’’ and ‘‘DFT’’. This represents over 25% of the 904 publications with ‘‘MP2’’ in the title and/or keywords and/or abstract of publications in 2006. 172 publications in the same year refer to ‘‘MP2’’ and ‘‘CCSD’’; about 19% of those with ‘‘MP2’’ in 236 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

Table 4 Incidence of two or more of the acronyms ‘‘MP2’’, ‘‘DFT’’ and ‘‘CCSD’’ in the title and/or keywords and/or abstract of publications appearing in 2003–2006 Methods

2003

2004

2005

2006

‘‘MP2’’ & ‘‘DFT’’ ‘‘MP2’’ & ‘‘CCSD’’ ‘‘DFT’’ & ‘‘CCSD’’ ‘‘MP2’’ & ‘‘DFT’’ & ‘‘CCSD’’

204 109 50 25

189 103 46 18

267 137 86 39

228 172 83 40

the title and/or keywords and/or abstract. 83 publications refer to ‘‘DFT ’’ and ‘‘CCSD’’. 40 publications in 2006 refer to ‘‘MP2’’, ‘‘DFT ’’ and ‘‘CCSD’’.

5. Application area 1: periodic systems Some progress has been made during the reporting period towards incorporating a description of correlation effects in periodic solids using second order many-body perturbation theory. The aim research in this area is to provide a powerful and general-purpose computational tool, which can be used to study a variety of applications in condensed matter physics and solid state chemistry. Over the past twenty years, the available software for Hartree–Fock (HF) and for density-functional calculations on crystals has undergone very significant progress in terms of effectiveness and reliability. For example, the CRYSTAL program package159,160 is today applied routinely to a wide range of problems. The incorporation of correlation effects in calculations for periodic solids requires the use of a many-body formalism. Second order many-body perturbation theory, in its MP2 form, should provide the basis of an efficient computational approach to this problem. In particular, the local MP2 methods originally developed for large molecules can be adapted for the treatment of periodic solids. In a previous report,3 we have emphasized that ‘‘ . . . the steep scaling of algorithms for describing electron correlation effects in molecular systems is often an artifact of the orthogonal canonical basis, i.e the solutions of the matrix Hartree-Fock equations, used to construct post-HartreeFock correlation theories’’ For the local MP2 algorithm described by Hetzer et al.161 in 2000 ‘‘ . . . the calculation of the MP2 energy is less expensive than the calculation of the Hartree-Fock energy for large systems’’ Pisani et al.146,162 have developed a new computer program which they call for describing correlation effects in periodic solids. This program essentially combines the accurate HF solution of the crystal structure problem in an atomic orbital (AO) representation available from the CRYSTAL code, with local correlation techniques, and, in particular, those implemented the MOLPRO program of the Stuttgart group.163 In considering the computational aspects of a local-MP2 treatment of electron correlation in periodic systems, Pisani et al.146 point to number of distinctive features including: (i) full exploitation of point symmetry; (ii) proper definition of the ‘localvirtual’ basis set; (iii) assessment of the ‘locality of excitations’ concept; (iv) proper use of different levels of treatment of 2-electron integrals; (v) extrapolation of local results to infinity. Pisani et al. examine the relative importance of different kinds of local excitations and their dependence on the prevailingly covalent or ionic character of the crystal. They also demonstrate the usefulness of a multipolar approximation for the evaluation of the majority of 2-electron integrals which arise. CRYSCOR

Chem. Modell., 2008, 5, 208–248 | 237 This journal is

c

The Royal Society of Chemistry 2008

In subsequent work, Pisani et al.147,148 demonstrate the use of density-fitting techniques in the evaluation of the two-electron Coulomb interaction integrals between product distributions. They also show that use of density fitting techniques avoids the bottleneck of the four index transformation for neighbouring distributions.

6. Application area 2: DNA bases and amino acids The structure of proteins, deoxyribonucleic acids (DNA), ribonucleic acid (RNA), proteins and other biomacromolecular species is determined by noncovalent interactions among the building blocks-the DNA and RNA bases and amino acids. The methods of ab initio quantum chemistry have a key role to play in gaining an understanding of the structure and properties of biomacromolecules since over recent years accurate calculations for molecules of the size and complexity of these building blocks have become computationally tractable. For example, models of the four nucleic acids, adenine, cytosine, guanine and thymine, are shown in Fig. 1–4. Today, systems of this size are amenable to high quality ab initio quantum chemical studies and many-body perturbation theory is often the method of choice when incorporating the effects of electron correlation in such studies. The building blocks of biomacromolecules may be either electrically neutral or charged. Among the various energy terms which contribute to their overall stabilization, hydrogen bonding and electrostatic contributions are expected to be the most important. Hydrogen-bonding is highly specific and directional and thus responsible for the stabilization of biomacromolecules. Electrostatic interactions are is especially important in proteins, and ion pairs or salt bridges are known to play an important role in stabilizing selected protein structures. (Induction and charge-transfer terms do not play a decisive role but nevertheless should be taken into account.) It is now evident that stacking plays a key role in many biomacromolecules. Stabilization energies for stacked DNA base pairs as well as for stacked amino acids can be surprisingly large.

Fig. 1 Model of the adenine molecule (7H-purin-6-amine), one of the two purine nucleobases forming the nucleotides of the nucleic acids. The molecule has the formula C5H5N5 making it a 70-electron system. Today, systems of this size are amenable to high quality ab initio quantum chemical studies.

238 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

Fig. 2 Model of the cytosine molecule (4-amino-1H-pyrimidine-2-one), one of the two pyrimidine nucleobases forming the nucleotides of the nucleic acids. The molecule has the formula C4H5N3O making it a 58-electron system. Today, systems of this size are amenable to high quality ab initio quantum chemical studies.

The size and complexity of extended biomacromolecules makes the understanding of the various energy contributions which contribute to their stabilization difficult, since only calculations using simple empirical potential calculations are tractable. Fortunately, the most importance biomacromolecules, DNA and proteins, consist of characteristic building blocks-the nucleic acid bases and amino acids-interacting through noncovalent interactions. The system can therefore be fragmented into smaller components, each of which can be described by means of ab initio quantum chemical methods.

Fig. 3 Model of the guanine molecule (2-amino-1H-purin-6(9H)-one), one of the two purine nucleobases forming the nucleotides of the nucleic acids. The molecule has the formula C5H5N5O making it a 78-electron system. Today, systems of this size are amenable to high quality ab initio quantum chemical studies.

Chem. Modell., 2008, 5, 208–248 | 239 This journal is

c

The Royal Society of Chemistry 2008

Fig. 4 Model of the thymine molecule (5-methylpyrimidine-2,4(1H,3H)-dione), one of the two pyrimidine nucleobases forming the nucleotides of the nucleic acids. The molecule has the formula C5H6N2O2 making it a 76-electron system. Today, systems of this size are amenable to high quality ab initio quantum chemical studies.

Jurecˇka et al.109 have published a database of accurate ‘‘benchmark quality’’ interaction energies of small model complexes, DNA base pairs, and amino acid pairs obtained from MP2 and CCSD(T) calculations together with estimates of the complete basis set limit. They examine interaction energies and geometries for more than 100 DNA base pairs, amino acid pairs and model complexes. Extrapolation to the complete basis set limit was carried out by these researchers using a two-point method in conjunction with correlation consistent basis sets. In related work, Riley and Hobza117 have provided an assessment of the MP2 method in conjunction with several medium and extended basis sets in the study of various hydrogen bonded and dispersion bound complexes of biological relevance. They report that the MP2/cc-pVTZ method supports the most well balanced description of noncovalent interactions. However, the MP2 method does not provide reliable results for the cyclic hydrogen bonds found in nucleic acid base pairs in combination with any of the basis sets considered by these authors. A new database for nucleic acid base trimers obtained by using RI-MP2 theory with a ‘‘triple zeta’’ quality basis set has been reported by Hobza and his coworkers.116 This database has been used to assess various density functional models. The heterogeneity of the interactions in these trimers, particularly the simultaneous hydrogen bonding and stacking interactions, makes this a demanding test of approximate methods. The authors suggests that ‘‘correct descriptions of all energy terms are unlikely to be accomplished by fortuitous cancellations of systematic errors’’. Rejnek and Hobza139 used MP2 and CCSD(T) theory together with procedures to estimate the complete basis set limits in the study of hydrogen-bonded nucleic acid base pairs containing unusual base tautomers. In particular, they focussed on the stabilization energies of adenine-thymine and guanine-cytosine base pairs containing various tautomeric forms. They found that the interaction energies for the optimized H-bonded tautomeric base pairs can be obtained with sufficient accuracy by the MP2/CBS approach, since the the CCSD(T) correction is negligible. They used a continuum model to describe bulk water. Rejnek and Hobza139 concluded that ‘‘The results clearly show how precisely the canonical building blocks of DNA molecules were chosen and how well their stability is maintained.’’ 240 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

MP2 theory and DFT was used by Danilov et al.88 to study DNA rare base pairs. In particular, these authors examined the molecular mechanism of the spontaneous substitution mutations conditioned by tautomerism of bases Of course, most biochemical reactions occur in aqueous solution. MP2 theory was used by Danilov et al.101,124 to model the hydration shell of uracil and thymine using small water clusters. The structure of the hydration shell is found to be determined by competition between the water-water and base-water interactions. It is known that exposure to reactive oxidation species and subsequent DNA damage can be linked to diseases, such as cancer and rheumatoid arthritis, and to aging.164,165 Volk et al.110 used MP2 theory and DFT to study the oxidized cytosine product, 5-hydroxyuracil. Oxidized cytosine products have been shown to be major chemical precursors in DNA for GC to AT transition mutations.166–168 In related work, Volk et al.125 studied the highly mutagenic 5-formyluracil which results from oxidation of the thymine methyl group. The choice of papers above necessarily represents a selective sample of the literature published during the period reviewed.

7. Application area 3: DFT benchmarking At the end of his 1998 Nobel Prize lecture, Professor Sir John Pople commented on the molecular applications of density functional theory. Of DFT he writes ‘‘Such methods do not handle the two-electron interactions explicitly but rather allow for them using properties of the one-electron density. This leads to lower cost and therefore a wider range of applicability. Recent forms of DFT have also introduced a considerable amount of empirical parameterization, sometimes using the same set of experimental data. At the present time, the principal limitation of DFT models is that there is no clear route for convergence of methods to the correct answer . . . ’’ He concluded that ‘‘Interaction between these two groups of theoretical chemists is a hopeful direction for future progress’’ The groups he mentions are essentially the ‘‘ab initio-ists and a posteriori-ists’’ discussed by Coulson back in 1959. The ‘‘ab initio-ists’’ demand a ‘‘clear route for convergence’’. The ‘‘a posteriori-ists’’ are willing to accept ‘‘a considerable amount of empirical parameterization’’ in order to facilitate a particular application. It is not surprising that considerable effort has been devoted to comparing ab initio methods and DFT. Second order many-body perturbation theory is the most efficient of ab initio methods and it is not surprising, as we have already noted in section 3, 228 of the 904 papers for which the string ‘‘MP2’’ occured in the title and/ or keywords and/or abstract—that is 25%—also contained the string ‘‘DFT’’. Comparisons of MP2 and DFT abound in almost all areas of application. For example, the work of Hobza et al.116 which we discussed in section 6 above, seeks to establish the validity of the DFT approach in studies of the nucleic acid base pairs. Other examples of comparative work on MP2 and DFT are: Gordon and Palmer87 who study the structure and electron distribution of the 1,8-bis-(dimethylamino)naphthalenes and Kieninger et al.96 who investigate the barrier for internal rotation around the -OCH3 bond in 2,3,5,6-tetrafluoroanisole. The switching process in a pH controllable molecular ‘‘shuttle’’ was investigated by Fomine et al.97 and Malysheva et al. studied the absorption spectra of oriented helical ethylene glycol oligomers containing self-assembled monolayers to gain a understanding of their infrared reflection absorption spectra. Four Au(I) mixed thiolate and phosphine complexes were examined by Costa and Calhorda;104 Ignatyev et al. studied the structure and vibrational spectra of vinyl ether conformers and Maron and A. Ramirez-Solis130 Chem. Modell., 2008, 5, 208–248 | 241 This journal is

c

The Royal Society of Chemistry 2008

focussed on a new nonsymmetric P(OH)3 species as well as a new nonsymmetric N(OH)3 species. This is inevitably a small sample of the many publications in which MP2 theory and DFT are compared during the reporting period.

8. Application area 4: basis set extrapolation and the calibration of general energy models The use of the algebraic approximation, that is the expansion of one-electron functions in terms of a finite analytic basis set, underpins much of the success of modern quantum chemistry particularly when the basis functions are chosen to be Gaussians. The integro-differential Hartree-Fock equations are replaced by a set of algebraic equations which can be cast in matrx form. However, the introduction of the algebraic approximation is not without consequences and molecular calculations carried out using a finite basis set necessarily suffer from basis set truncation errors. Such errors can often be as large or even larger than errors associated with the truncation errors arising in the correlation energy expansion. Quantum chemical calculations are usually defined in terms of the model used, including, except in the case of Hartree-Fock theory, the correlation treatmant used, and the basis set employed. The situation is summarized in Fig. 5. The column on the far right, headed full CI (FCI), records the exact solution within the finite space defined by the basis set. The entry at the bottom of the column on the far right represents the exact solution of the Schro¨dinger equation, which, of course, is not obtainable in practice. The importance of gaining some measure of the significance of basis set truncation effects, leads to many studies using sequences of basis sets constructed so as to approach the basis set limit. In addition, this approach also gives reliable procedures for performing extrapolations to the basis set limit from calculations employing smaller basis sets. Such extrapolation techniques necessarily introduce an empirical element into otherwise ab initio calculations. A typical example of this approach is the study of the interaction energies of n-alkane dimers using MP2 and CCSD(T) by Tsuzuki et al.,112 which compares the extrapolation method of Helgaker et al.169 with that of Feller170 using Dunning’s correlation consistent basis sets171. Similar studies using the two-point extrapolation formula of Helgaker et al. for the stabilisation energy of C6H6 C6X6 (X = F, Cl, Br, I, CN) complexes were reported by Pluhackova et al.136 The target accuracy in a given application extends beyond exploring the basis set limit for a given model to the complete basis set limit for full configuration

Fig. 5 General model table or model chart. The columns are labelled by the various methods for describing the effects of electron correlation arranged in order of increasing sophistication from left to right. HF denotes the Hartree–Fock model in which no account is taken of electron correlation. MP2 denotes second order many-body Møller–Plesset perturbation theory whilst MP3 and MP4 denote this theory taken through third and fourth order, respectively. QCI denotes the quadratic configuration interaction approach. The column on the far right which is headed FCI contains the full configuration interaction result within a given basis set. Basis sets are displayed vertically becoming more flexible down the chart. The bottom row of the chart corresponds to an infinite basis. On the right-hand side of this row we have FCI in an infinite basis set, i.e. the solution of the Schro¨dinger equation (S-eqn).

242 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

Fig. 6 The G3 model table, an example of a composite model in which some compromises are made in order to obtain a wide range of applicability. The columns are labelled by the various methods for describing the effects of electron correlation arranged in order of increasing sophistication from left to right. HF denotes the Hartree–Fock model in which no account is taken of electron correlation. MP2 denotes second order many-body Møller–Plesset perturbation theory whilst MP4 denote this theory taken through fourth order and QCI denotes the quadratic configuration interaction approach. Basis sets are displayed vertically becoming more flexible down the chart. G3large is a large and flexible basis set for which only MP2 calculations are tractable. The composite model is labelled ’’model-1//model-2’’ where calculations are carried out using model-1 using the optimized geometry determined by using model-2. In the G3 table, the geometry is optimized using an MP2/6-31G(d) level of theory. All other calculations are carried out at this geometry. An estimate of the QCI/G3large energy is obtained by assuming additivity of corrections according to: ? = 2 + (3 2) + (5 2) + (7 2) + (8 1) (4 1) (6 1). Vibrational frequencies can be determined from a HF calculation at the same geometry together with a scaling factor.

interaction. In practice, this cannot be achieved with finite computing resources, except for the very smallest systems. Some compromises have to be made to achieve a wide range of applicability. For example, geometry optimization may be carried out using some ‘lower level’ theory followed by a ‘more accurate’ calculation using a ‘higher level’ carried out at the optimized geometry. Again, this approach introduces a degree of empiricism into an otherwise ab initio calculation, but this procedure can often lead to accuracy which could not otherwise be achieved. A typical procedure of this type, the G3 composite model developed by Pople and his co-workers,172 is illustrated in Fig. 6. Many-body methods and, in particular, MP2 theory are often used in validating such models. During the reporting period, a typical application is provided by the work of Rogers et al.,81 who calculated enthalpies of hydrogenation, isomerization and formation of extended linear polyacetylenes using the G3(MP2) model.173 Rogers et al. examined the conjugation enthalpies of linear polyynes with three, four, and five alternant triple bonds. Molecules as large as 1,3,5,7,9-decapentayne were studied. Space does not allow anything more than a brief overview of the many publications in which MP2 theory is used to investigate basis set truncation errors in molecular calculations during the period covered by this report.

9. Summary and future directions The present article continues our biennial survey of ‘‘Many-body Perturbation Theory and Its Application to the Molecular Structure Problem’’ covering the reporting period assigned to this volume: June 2005 to May 2007. Many-body perturbation theory in its lowest order form, which is often designated MP2, continues to be the most widely used of the ab initio approaches to the molecular electronic structure problem which go beyond an independent particle model and take account of the effects of electron correlation. The main focus of the present review has been on some of the emerging fields in which MP2 calculations are being carried out. Obviously, within the limited space available it has not possible to cover all of the fields of application. Some selectivity has been necessary, but the choices made do provide a ‘‘snapshot’’ of the range of contemporary applications of chemical modelling using many-body perturbation theory. Chem. Modell., 2008, 5, 208–248 | 243 This journal is

c

The Royal Society of Chemistry 2008

Applications have been reported in an ever increasing range of research areas during the reporting period. Given the ‘many-body’ nature of quantum chemical problems, many-body perturbation theory will continue to play a central role in the analysis and study of molecular structure.

References 1 S. Wilson, in Specialist Periodical Reports: Chemical Modelling- Applications and Theory, ed. Senior Reporter, A. Hinchliffe, Royal Society of Chemistry, London, 2000, vol. 1. 2 S. Wilson, in Specialist Periodical Reports: Chemical Modelling- Applications and Theory, ed. Senior Reporter, A. Hinchliffe, Royal Society of Chemistry, London, 2002, vol. 2. 3 S. Wilson, in Specialist Periodical Reports: Chemical Modelling- Applications and Theory, ed. Senior Reporter, A. Hinchliffe, Royal Society of Chemistry, London, 2004, vol. 3. 4 S. Wilson, in Specialist Periodical Reports: Chemical Modelling- Applications and Theory, ed. A. Hinchliffe, Royal Society of Chemistry, London, 2006, vol. 4. 5 S. Wilson, in Specialist Periodical Reports: Theoretical Chemistry, Royal Society of Chemistry, London, 1981, vol. 4. 6 M. P. Barnett, Methods in Computational Physics, Academic Press, 1963, vol. 2, p. 95. 7 C. A. Coulson, Rev. Mod. Phys., 1960, 32, 170. 8 W. J. Hehre, L. Radom, P. V. Schleyer and J. A. Pople, Ab initio Molecular Orbital Theory, John Wiley, Chichester, 1986. 9 J. M. Roberts, Twentieth Century-A History of the World 1901 to present, Allen Lane, London, 1999, p. 562. 10 S. Wilson, Electron Correlation in Molecules, Dover Publications, New York, 2007. 11 F. Jensen, Introduction to Computational Chemistry, John Wiley, Chichester, 2nd edn., 2006. 12 C. J. Cramer, Essentials of Computational Chemistry, John Wiley, Chichester, 2nd edn., 2004. 13 P. W. Atkins and R. S. Friedman, Molecular Quantum Mechanics, Oxford University Press, 2004. 14 G. H. Grant and W. G. Richards, Computational Chemistry, Oxford University Press, 2004. 15 A. Hinchliffe, Molecular Modelling for Beginners, John Wiley, Chichester, 2003. 16 Handbook of Molecular Physics and Quantum Chemistry, eds. S. Wilson, P. F. Bernath and R. McWeeny, John Wiley, Chichester, 2003. 17 D. Young, Computational Chemistry: A Practical Guide for Applying Techniques to Real World Problems, Wiley, 2001. 18 T. Helgaker, P. Jorgensen and J. Olsen, Molecular Electronic Structure Theory, John Wiley, Chichester, 2000. 19 A. Hinchliffe, Modelling Molecular Structures, John Wiley, Chichester, 2000. 20 D. B. Cook, Handbook of Computational Quantum Chemistry, Oxford University Press, 1998. 21 Encyclopedia of Computational Chemistry, eds. P. von Rague´ Schleyer, N. L. Allinger, H. F. Schaefer III, T. Clark, J. Gasteiger, P. Kollman and P. Schreiner, John Wiley, Chichester, 1998. 22 A. Szabo and N. S. Ostlund, Modern Quantum Chemistry: Introduction to Advanced Electronic Structure Theory, Dover Publications, New York, 2nd edn., 1996. 23 F. E. Harris, D. L. Freeman and H. J. Monkhorst, Algebraic and Diagrammatic Methods in Many-Fermion Theory, Oxford University Press, 1992. 24 R. McWeeny, Methods of Molecular Quantum Mechanics, Academic Press, 2nd edn., 1992. 25 Methods in Computational Chemistry: Electron Correlation in Atoms and Molecules, ed. S. Wilson, Kluwer Academic/Plenum Press, 1987. 26 I. Lindgren and J. Morrison, Atomic Many-Body Theory, Springer-Verlag, Berlin, 1986. 27 W. J. Hehre, L. Radom, P. V. Schleyer and J. A. Pople, Ab initio Molecular Orbital Theory, John Wiley, Chichester, 1986. 28 S. Wilson, Chemistry by Computer: An Overview of the Applications of Computers in Chemistry, Kluwer Academic/Plenum Publishers, Dordrecht, 1986. 29 A. C. Hurley, Electron Correlation in Small Molecules, Academic Press, 1977. 30 A. C. Hurley, Introduction to the Electron Theory of Small Molecules, Academic Press, 1977. 31 Computational Chemistry Reviews of Current Trends, ed. J. Leszczynski, World Scientific, 1996.

244 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

32 Reviews in Computational Chemistry, eds. K. B. Lipkowitz and D. B. Boyd, [except vol. 21–22] R. Larter [vols. 19–21] and T. R. Cundari [vols. 19–23 ], V. J. Gillet [vol. 22], Wiley, 1990. 33 K. G. Dyall and K. Faegri, Introduction to Relativistic Quantum Chemistry, Oxford University Press, 2007. 34 I. P. Grant, Relativistic Quantum Theory of Atoms and Molecules: Theory and Computation, Springer-Verlag, New York, 2006. 35 Relativistic Electronic Structure Theory. Part 2. Applications, ed. P. Schwerdtfeger, Elsevier, Amsterdam, 2004. 36 Recent Advances in Relativisitic Molecular Theory, ed. K. Hirao and Y. Ishikawa, World Scientific Publishing, 2004. 37 Theoretical Chemistry and Physics of Heavy and Superheavy Elements, eds. U. Kaldor and S. Wilson, Kluwer Academic Publishers, 2003. 38 Relativistic Effects in Heavy Element Chemistry and Physics, ed. B. A. Hess, Wiley, 2002. 39 Relativistic Electronic Structure Theory. Part 1. Fundamentals, ed. P. Schwerdtfeger, Elsevier, Amsterdam, 2002. 40 K. Balasubramanian, Relativistic Effects in Chemistry: Theory and Techniques and Relativistic Effects in Chemistry: Theory and Techniques Pt. A, Wiley, 1997. 41 K. Balasubramanian, Relativistic Effects in Chemistry: Applications: Applications Pt. B, Wiley, 1997. 42 The Effects of Relativity in Atoms, Molecules and the Solid State, eds. S. Wilson, I. P. Grant and B. L. Gyorffy,Kluwer Academic/Plenum Press, 1991. 43 Methods in Computational Chemistry: Relativistic Effects in Atoms and Molecules, ed. S. Wilson, Kluwer Academic/Plenum Press, 1989. 44 Relativistic Effects in Atoms, Molecules, and Solids, ed. G. L. Malli, Kluwer Academic/ Plenum Publishers, 1983. 45 R. S. Mulliken and C. C. J. Roothaan, ‘‘Broken Bottlenecks and the Future of Molecular Quantum Mechanics’’, Proc. Natl. Acad. Sci. (US), 1959, 45, 394–398. 46 J. Ziman, Real Science. What it is, and what it means, Cambridge University Press, 2000, p. 110. 47 J. Ziman, Real Science. What it is, and what it means, Cambridge University Press, 2000, p. 111. 48 Quantum Chemistry Program Exchange, http://qcpe.chem.indiana.edu/, Indiana University. 49 K. V. Roberts, Comput. Phys. Commun., 1969, 1, 1. 50 M. Dupuis, J. D. Watts, H. O. Villar and G. J. B. Hurst, Comput. Phys. Commun., 1989, 52, 415. 51 H. M. Quiney and S. Wilson, Intern. J. Quantum Chem., 2005, 104, 430. 52 H. M. Quiney and S. Wilson, Literate programming for the molecular structure problem. I: An Introduction to Computational Methods in Quantum Chemistry (in preparation). 53 H. M. Quiney and S. Wilson, Molec. Phys., 2005, 103, 389. 54 D. B. Cook, Handbook of Computational Quantum Chemistry, Oxford University Press, Oxford, 1998. 55 D. E. Knuth, The Computer Journal, 1984, 27(2), 97. 56 D. M. Silver, Comput. Phys. Commun., 1978, 14, 71. 57 D. M. Silver, Comput. Phys. Commun., 1978, 14, 81. 58 S. Wilson, Comput. Phys. Commun., 1978, 14, 91. 59 W. H. Press, S. A. Teukolsky, W. T. Vetterling and B. P. FlanneryNumerical Recipes: The Art of Scientific Computing, Cambridge University Press, Third Edition, 2007. 60 S. Hirata, Theoretical Chemistry Accounts, 2006, 116, 2. 61 M. P. Barnett, Rev. Mod. Phys., 1963, 35, 571–572. 62 S. J. Smith and B. T. Sutcliffe, Reviews in Computational Chemistry, 1997, vol. 10. 63 S. F. Boys and G. B. Cook, Rev. Mod. Phys., 1960, 32, 285. 64 (a) C. M. Reeves, J. Chem. Phys., 1963, 39, 1; (b) C. M. Reeves and M. C. Harrison, J. Chem. Phys., 1963, 39, 11. 65 R. K. Nesbet, Rev. Mod. Phys., 1963, 35, 552–557. 66 F. E. Harris, Rev. Mod. Phys., 1963, 35, 558–569. 67 I. G. Csizmadia, M. C. Harrison, J. W. Moskowitz and B. T. Sutcliffe, Theoret. chim. Acta, 1966, 6, 191–216. 68 I. G. Csizmadia, M. C. Harrison and B. T. Sutcliffe, Theoret. chim. Acta, 1966, 6, 217–239. 69 E. Clementi and D. R. Davis, J. Comput. Phys., 1966, 1, 223. 70 E. Clementi, IBM Res. Develop., 1965, 9, 2. 71 W. J. Hehre, W. A. Lathan, R. Ditchfield, M. D. Newton and J. A. Pople, GAUSSIAN 70: Ab initio SCF-MO Calculations on Organic Molecules, QCPE Program Number 236,

Chem. Modell., 2008, 5, 208–248 | 245 This journal is

c

The Royal Society of Chemistry 2008

72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114

Quantum Chemistry Program Exchange, Department of Chemistry, Indiana University, 1970. M. Dupuis, J. Rys and H. F. King, HONDA 76: Ab initio SCF-MO Calculations Using s,p,dtype Cartesian Gaussians, QCPE Program Number, 1976, 336, 338. D. Moncrieff and V. R. Saunders, The ATMOL Manual, UMRCC, 1986. J. F. Stanton, J. Gauss, J. D. Watts, W. J. Lauderdale and R. J. Bartlett, Int. J. Quant. Chem. Symp., 1992, 26, 879. M. Dupuis, J. Rys and H. F. King, J. Chem. Phys., 1976, 65, 111. M. Dupuis and H. F. King, J. Chem. Phys., 1978, 68, 3998. J. L. Hennessy and D. A. Patterson, Computer Architecture: A Quantitative Approach, Morgan Kaufmann Publishers, Inc., Fourth Edition, 2006. N. S. Ostlund, Personal Computers in Chemistry, American Chemical Society, Washington D.C., 1977. J. A. R. Gonzalez, M. Lorono, T. Cordova and G. Chuchani, J. Molec. Struct.Theochem., 2005, 732, 55. D. H. Ess and K. N. Houk, J. Phys. Chem. A, 2005, 109, 9542. D. W. Rogers, A. A. Zavitsas and N. Matsunaga, J. Phys. Chem. A, 2005, 109, 9169. R. Crespo-Otero, L. A. Montero, W. D. Stohrer and J. M. G. de la Vega, J. Chem. Phys., 2005, 123, 134107. C. F. Wilcox and S. H. Bauer, Molec. Phys., 2005, 103, 2829. W. Q. Zhou, W. Yang and L. H. Qiu, J. Molec. Struct.-Theochem, 2005, 130, 133. M. L. Gg Borst, A. W. Ehlers and K. Lammertsma, J. Organic Chem., 2005, 70, 8110. T. L. Fonseca, H. C. B. de Oliveira, O. A. V. Amaral and M. A. Castro, Chem. Phys. Lett., 2005, 413, 356. A. J. Gordon and M. H. Palmer, Molec. Phys., 2005, 103, 2773. V. I. Danilov, V. M. Anisimov, C. Kurita and D. Hovorun, Chem. Phys. Lett., 2005, 412, 285. W. Y. Xu, Z. Y. He, Y. Chen, F. Y. Li and S. G. Hong, Acta. Chimica Sinica, 2005, 63, 1474. C. S. Lam, W. K. Li and S. W. Chiu, J. Phys. Chem. A, 2005, 109, 7296. L. Malysheva, A. Onipko, R. Valiokas and B. Liedberg, J. Phys. Chem. B, 2005, 109, 13221. H. O. Ho and W. K. Li, J. Serbian Chem. Soc., 2005, 70, 661. L. S. Khaikin, O. E. Grikina, N. F. Stepanov and A. Kovacs, Russian J. Phys. Chem., 2005, 79, 959. G. A. Chass, R. S. Mirasol, D. H. Setiadi, T. H. Tang, W. Chin, M. Mons, I. Dimicoli, J. P. Dognon, B. Viskolcz, S. Lovas, B. Penke and I. G. Csizmadia, J. Phys. Chem. A, 2005, 109, 5289. M. S. Sujatha, Y. U. Sasidhar and P. V. Balaji, Biochem., 2005, 44, 8554. M. Kieninger, R. E. Cachau, H. Oberhammer and O. N. Ventura, Intern. J. Quantum Chem., 2007, 107, 403. S. Fomine, P. Guadarrama and M. Zolotukhin, Intern. J. Quantum Chem., 2007, 107, 685. K. M. Pei, Y. F. Ma and X. M. Zheng, J. Molec. Struct.-Theochem, 2006, 774, 43. P. Ramasami, J. Molec. Struct.-Theochem, 2006, 775, 87. V. I. Danilov, V. V. Dailidonis, D. M. Hovorun, N. Kurita, Y. Murayama, T. Natsume, A. I. Potopalsky and L. A. Zaika, Chem. Phys. Lett., 2006, 430, 409. V. I. Danilov, T. van Mourik and V. I. Poltev, Chem. Phys. Lett., 2006, 429, 255. K. B. Wiberg, Y. G. Wang, S. Sklenak, C. Deutsch and G. Trucks, J. Am. Chem. Soc., 2006, 128, 11537. C. Tuma and J. Sauer, Phys. Chem. Chem. Phys., 2006, 8, 3955. P. J. Costa and M. J. Calhorda, Inorganica Chimica Acta, 2006, 359, 3617. A. Frontera, D. Quinonero, C. Garau, A. Costa, P. Ballester and P. M. Deya, J. Phys. Chem. A, 2006, 110, 9307. A. Ramirez-Solis, J. Hernandez-Cobos and C. Vargas, J. Phys. Chem. A, 2006, 110, 7637. Z. J. Wu and Y. Kawazoe, Chem. Phys. Lett., 2006, 423, 81. L. Pejov, M. Solimannejad and V. Stefov, Chem. Phys., 2006, 323, 259. P. Jurecka, J. Sponer, J. Cerny and P. Hobza, Phys. Chem. Chem. Phys., 2006, 8, 1985. D. E. Volk, V. Thiviyanathan, A. Somasunderam and D. G. Gorenstein, Organic & Biomolecular Chem., 2006, 4, 1741. B. Hong, Y. F. Chang, Y. Q. Qiu, H. Sun, Z. M. Su and R. S. Wang, J. Chem. Phys., 2006, 124, 144108. S. Tsuzuki, K. Honda, T. Uchimaru and M. Mikami, J. Chem. Phys., 2006, 124, 114304. P. Cysewski and Z. Czyznikowska-Balcerak, J. Molec. Struct.-Theochem., 2005, 757, 29. S. G. Zhang and P. Yang, J. Molec. Struct.-Theochem.

246 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

115 M. Kone, B. Illien, J. Graton and C. Laurence, J. Phys. Chem. A, 2005, 757, 77. 116 M. Kabelac, H. Valdes, E. C. Sherer, C. J. Cramer and P. Hobza, Phys. Chem. Chem. Phys., 2007, 9. 117 K. E. Riley and P. Hobza, J. Phys. Chem. A, 2007, 111, 8257. 118 V. B. Delchev, I. G. Shterev, H. Mikosch and N. T. Kochev, J. Molec. Modelling, 2007, 13, 1001. 119 K. Ishimura, P. Pulay and S. Nagase, J. Comput. Chem., 2007, 28, 2034. 120 Y. P. Yurenko, R. O. Zhurakivsky, M. Ghomi, S. P. Samijlenko and D. M. Hovorun, J. Phys. Chem. B, 2007, 111, 9655. 121 M. S. Sujatha, Y. U. Sasidhar and P. V. Balaj, J. Molec. Struct.-Theochem, 2007, 814, 11. 122 H. J. Wang, W. B. Dong, X. H. Ren and Y. Y. Shan, J. Molec. Struct.-Theochem, 2007, 814, 85. 123 L. F. Holroyd and T. van Mourik, Chem. Phys. Lett., 2007, 442, 42. 124 V. I. Danilov, T. van Mourik and V. I. Poltev, J. Biomolec. Struct. & Dynam., 2007, 24, 666. 125 D. E. Volk, V. Thiviyanathan, A. Somasunderam and D. G. Gorenstein, Organic & Biomolec. Chem., 2007, 5, 1554. 126 Y. P. Yurenko, R. O. Zhurakivsky, M. Ghomi, S. P. Samijlenko and D. M. Hovorun, J. Phys. Chem. B, 2007, 111, 6263. 127 K. B. Sophy, P. Calaminici and S. Pal, J. Chem. Theory & Comput., 2007, 3, 716. 128 I. S. Ignatyev, M. Montejo, T. Sundius, F. P. Urena and J. J. L. Gonzalez, Chem. Phys., 2007, 333, 148. 129 C. Garau, D. Quinonero, A. Frontera, D. Escudero, P. Ballester, A. Costa and P. M. Deya, Chem. Phys. Lett., 2007, 438, 104. 130 L. Maron and A. Ramirez-Solis, J. Phys.Chem. A, 2007, 111, 3173. 131 A. L. Puyad, P. Raghunath, G. K. Chaitanya, K. Ramakrishna and K. Bhanuprakash, J. Molec. Struct.-Theochem, 2007, 807, 73. 132 T. Mori and S. Kato, Chem. Phys. Lett., 2007, 437, 159. 133 A. Frontera, D. Quinonero, A. Costa, P. Ballester and P. M. Deya, New J. Chem., 2007, 31, 556. 134 S. C. C. Nunes, A. J. L. Jesus, M. T. S. Rosado and M. E. S. Eusebio, J. Molec. Struct.Theochem, 2007, 806, 231. 135 W. Forner and H. M. Badawi, Collect. Czechoslovak Chem. Commun., 2007, 72, 15. 136 K. Pluhackova, P. Jurecka and P. Hobza, Phys. Chem. Chem. Phys., 2007, 9, 755. 137 L. Maron and A. Ramirez-Solis, J. Molec. Struct.-Theochem, 2007, 802, 111. 138 M. Pavelka and J. V. Burda, J. Molec. Modelling, 2007, 13, 367. 139 J. Rejnek and P. Hobza, J. Phys. Chem. B, 2007, 111, 641. 140 H. T. Yu, Theoret. Chem. Accounts, 2007, 117, 49. 141 Y. Podolyan, M. J. Nowak, L. Lapinski and J. Leszczynski, J. Molec. Struct., 2005, 744, 19. 142 M. Ramalingam, K. Ramasami, P. Venuvanalingam and V. Sethuraman, Lecture Notes in Computer Science, 2006, 3993, 143. 143 S. Kedzuch, M. Milko and J. Noga, Intern. J. Quantum Chem., 2005, 105, 929. 144 H.-J. Werner, T. B. Adler and F. R. Manby, J. Chem. Phys., 2007, 126, 164102. 145 A. Hellweg, C. Hattig, S. Hofener and W. Klopper, Theoret. Chem. Acc., 2007, 117, 587. 146 C. Pisani, G. Capecchi, S. Casassa and L. Maschio, Molec. Phys., 2005, 103, 2527. 147 L. Maschio, D. Usvyat, F. R. Manby, S. Casassa, C. Pisani and M. Schutz, Phys. Rev. B, 2007, 76, 075101. 148 D. Usvyat, L. Maschio, F. R. Manby, S. Casassa, M. Schutz and C. Pisani, Phys. Rev. B, 2007, 76, 075102. 149 K. Jankowski, R. Slupski and J. R. Flores, Molec. Phys., 2006, 104, 2213. 150 Y. C. Park and J. S. Lee, Bull. Korean Chem. Soc., 2007, 28, 386. 151 P. Salvador, D. Asturiol and I. Mayer, J. Comput. Chem., 2006, 27, 1505. 152 P. A. Fantin, P. L. Barbieri, A. C. Neto and F. E. Jorge, J. Molec. Struct.-Theochem, 2007, 810, 103. 153 F. E. Jorge, L. M. Bernardo and E. P. Muniz, J. Theoret. & Comput. Chem., 2006, 5, 223. 154 E. P. Muniz and F. E. Jorge, Intern. J. Quantum Chem., 2006, 106, 943. 155 A. D. Bochevarov and C. D. Sherrill, Abs. Papers Amer. Chem. Soc., 2005, 230, U2788. 156 K. Ishimura, P. Pulay and S. Nagase, J. Comput. Chem., 2006, 27, 407. 157 P. Musch and B. Engels, J. Comput. Chem., 2006, 27, 1055. 158 A. R. Ford, T. Janowski and P. Pulay, J. Comput. Chem., 2007, 28, 1215. 159 R. Dovesi, V. R. Saunders, C. Roetti, R. Orlando, C. Zicovich- Wilson, F. Pascale, B. Civalleri, K. Doll, N. M. Harrison, I. J. Bush, P. D’Arco and M. Llunell, CRYSTAL06 user’s manual, 2006.

Chem. Modell., 2008, 5, 208–248 | 247 This journal is

c

The Royal Society of Chemistry 2008

160 R. Dovesi, B. Civalleri, R. Orlando, C. Roetti and V. R. Saunders, Rev. Comput. Chem., 2005, 21, 1. 161 G. Hetzer, M. Schu¨tz, H. Stoll and H.-J. Werner, J. Chem. Phys., 2000, 113, 994. 162 C. Pisani, M. Busso, G. Capecchi, S. Casassa, R. Dovesi, L. Maschio, C. Zicovich-Wilson and M. Schu¨tz, J. Chem. Phys., 2005, 122, 094113. 163 M. Schu¨tz, G. Hetzer and H.-J. Werner, J. chem. Phys., 1999, 111, 5691. 164 G. Mitra, G. T. Pauly, R. Kumar, G. K. Pei, S. H. Hughes, R. C. Moschel and M. Barbacid, Proc. Natl. Acad. Sci. USA, 1989, 86, 8650. 165 R. Goth and M. F. Rajewsky, Proc. Natl. Acad. Sci. USA, 1974, 71, 639. 166 A. A. Purmal, Y. W. Kow and S. S. Wallace, Nucleic Acids Res., 1994, 22, 3930–3935. 167 D. I. Feig, L. C. Sowers and L. A. Loeb, Proc. Natl. Acad. Sci. USA, 1994, 91, 6609–6613. 168 D. A. Kreutzer and J. M. Essigmann, Proc. Natl. Acad. Sci.USA, 1998, 95, 3578–3582. 169 T. Helgaker, J. Chem. Phys., 1997, 106, 9639. 170 D. Feller, J. Chem. Phys., 1992, 96, 6104. 171 T. H. Dunning, Jr, J. Chem. Phys., 1989, 90, 1007. 172 L. A. Curtiss, K. Raghavachari, P. C. Redfern, V. Rassolov and J. A. Pople, J. Chem. Phys., 1998, 109, 7764. 173 L. A. Curtiss, P. C. Redfern, K. Raghavachari, V. Rassolov and J. A. Pople, J. Chem. Phys., 1999, 110, 4703.

248 | Chem. Modell., 2008, 5, 208–248 This journal is

c

The Royal Society of Chemistry 2008

Experiment and theory in the determination of molecular hyperpolarizabilities in solution; pNA and MNA in dioxane David Pugh DOI: 10.1039/b609719g

1. Introduction The computation of molecular polarizabilities, hyperpolarizabilities, magnetizabilities and hypermagnetizabilities attracts a significant part of the research effort in quantum chemistry. High level ab initio and density functional calculations have progressed to the point where it is reasonable to expect realistic results to be obtained for isolated organic molecules containing, say, up to about 100 atoms. The internal consistency of these methods can be assessed by comparing the results obtained by different techniques, and in many cases it is possible at least to find a reasonable measure of agreement between the values obtained for a particular parameter as calculated by the more fundamental quantum chemistry approaches. The objective of these studies is to be able to explain and predict experimental properties and to utilize the predictive capability to help control useful processes and to assist in the design of new materials with particular required properties. One of the main incentives for calculations of molecular hyperpolarizabilites has been the desirability of having high hyperpolarizability materials available for optoelectronic applications. The relationship between the computations and experimentally determined parameters is therefore all-important. While gas phase work on the hyperpolarizability of small molecules has been relatively free of problems concerned with the definitions of measured quantities and their formal relationship to computed quantities, the same cannot be said about solution studies of rather larger organic species. It is the latter that possess the very large nonlinear response functions that are of greatest interest. The prototype system for such studies has been 4-nitroaniline (pNA) and this review is mainly concerned with the relation between the measurements, in vacuo and in solution, of the hyperpolarizabilities of pNA and the closely related molecule, MNA (2-methyl, 4-nitroaniline) to ab initio and DFT calculations of these quantities. The first problem is concerned with the definitions of the quantities calculated and measured. It has only gradually emerged over the last decades that there was considerable ambiguity about the definition of some of the reported parameters. In 1992 Willetts et al.1 (referred to as WRBS) gave a detailed account of the various conventions that have been applied by different authors in defining the molecular response functions, but more recently Reiss2 has suggested that there are still inconsistencies in the way that the experimentally measured, macroscopic, response functions are reported. A second difficulty arises from the circ*mstance that almost all the effects of experimental or technological interest involve the frequency-dependent response functions. Frequently the theoretical values are calculated using a very high level static field calculation which is then adjusted for the frequency dependence using a lower level (often time-dependent Hartree-Fock) method. The semi-empirical approach, which was often closely connected with the identification of significantly large effects in organic molecules, was usually calibrated to experimental electronic excitation energies, which are related to resonances in the response functions. These Department of Pure and Applied Chemistry, University of Strathclyde, Thomas Graham Building, 295 Cathedral Street, Glasgow, UK G1 1XL

Chem. Modell., 2008, 5, 249–278 | 249 This journal is

c

The Royal Society of Chemistry 2008

methods could therefore be said to be adapted to identifying particularly strong resonant effects, but weak on producing quantitatively accurate results. The ab initio methods are now expected to achieve quantitative accuracy for the static response but are still arguably unreliable when it comes to some of the frequency-dependent phenomena. An even more serious impediment to obtaining a quantitative correspondence of theory and experiment is a consequence of measurements on the molecules of greatest interest necessarily being made in solutions or condensed phases. Internal field corrections are usually applied in a formal way using standard continuum theories (Lorentz-Lorenz, Clausius-Mossotti, Onsager).3 In the design of optimized crystal structures these same theories have often been applied, apparently successfully, but with even less theoretical justification.4 Recently simulation studies of the solutions have been favoured, using a mixture of ab initio or DFT methods for the solute molecule and a semi-empirical or molecular mechanics method for a surrounding cluster of solvent.5,6 A more rigorous approach to the internal field corrections in crystals has been pursued by Munn and collaborators7–10 and in combination with ab initio calculations, by Munn, Papadopoulos, Reiss et al.11 Finally, it is to be expected of an all-embracing theory that it will reflect the ubiquitous nature of the polarizability–hyperpolarizability formulation. A very wide range of coherent and incoherent optical and spectroscopic effects are analysed in terms of polarizabilities or hyperpolarizabilities that differ only in the type of frequency dependence. For all these phenomena the response for a given set of input and output frequencies is determined formally through the polarizabilities by a perturbation theory expansion over the molecular eigenstates. In the development of solution studies of organic molecules and pNA in particular the perturbation theory expansion over excited states has figured prominently. Even if the hyperpolarizabilities calculated by this method are only of use in providing a general comparison of different structures and semi-quantitative results they are of interest for another reason. Very often experimental studies have been accompanied by an attempt to explain the results obtained in terms of the SOS theory. An examination of the multiplicative constants in the perturbation theory expansion at least gives an indication of how the parameters measured experimentally are being defined by the authors and may be of help in resolving some of the doubts that exist over these definitions. The problem of establishing the values of clearly determined molecular parameters for pNA and MNA in solution and interpreting their values through fundamental non-empirical calculations has received a great deal of attention in the period extending from the 1970s to the present. There have been a number of recent studies, which have often included substantial reviews, and re-interpretations of earlier works. The aim of the present review is to describe the background to the present studies and identify the origins of some of the current controversies. In particular it seems to be essential to attempt to distinguish between the macroscopic measurements and the values of the first molecular hyperpolarizability deduced from them, which are often presented as an ‘experimental’ quantity, but are strongly dependent on the nature of the intervening theory. The most commonly studied problem is the determination of the first hyperpolarizability, b(2o;o,o) for second harmonic generation for solutions of pNA or MNA in dioxane solution. Views on the nature of dioxane at a microscopic level have changed over the period of these investigations. In earlier NLO work it was assumed to be a ‘non-polar’ solvent in which the properties of the active solute would be close to that of the isolated molecule. More recently it is regarded as being decidedly ‘polar’ on a molecular scale with an effective dielectric constant of 6. Much of the present article is concerned with EFISH measurements on these solutions. The plan of the article is as follows:—In section 2 the systematic definitions of the susceptibilities and (hyper)polarizabilities are described; section 3 briefly derives the standard SOS formulae; in section 4 the general theory of the field induced second 250 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

harmonic generation experiment (EFISH), from which most information on the first and second order hyperpolarizabilities has been obtained, is derived; section 5 reviews ab initio and DFT calculations on the isolated pNA molecule; section 6 discusses the solitary gas phase experimental determination of Kaatz et al.12 and related deductions about the solution response based on hyper-Rayleigh scattering measurements; section 7 discusses the extension of the EFISH theory to two component solutions and reviews EFISH studies of pNA and MNA in dioxane; section 8 reviews recent work including computer simulation of the solution structure and section 9 briefly describes very recent related work that has not yet been incorporated into more detailed reviews.

2. General theory of the response to frequency-dependent electric fields 2.1 Macroscopic fields The derivation of formulae for the frequency-dependent nonlinear susceptibilities of nonlinear optics from the time-dependent response functions can be found in a number of sources, (Bloembergen,13 Ward and New,14 Butcher and Cotter,15 Flytzanis16). Here it is assumed that the susceptibilities can be expressed in terms of frequency-dependent quantities that connect individual (complex) Fourier components of the polarization with simple products of the Fourier components of the field. What then has to be shown is how the quantities measured in various experiments can be reduced to these simpler parameters. The applied field is expanded as, X E ¼ Eð0Þ þ Eðon Þ expðion tÞ ð2:1Þ n

and the polarisation in terms of the order of interaction, P = P(0) + P(1) + P(2) +

(2.2)

with the nth order term given by, PðnÞ ¼ PðnÞ ð0Þ þ

X

PðnÞ ðos Þ expðios tÞ

ð2:3Þ

s

In the above, quantities with no arguments are time-dependent, those with one argument, which is always a frequency, are time-independent Fourier coefficients. It is convenient to introduce one further type of symbol with two arguments, P(n)(t,os), where the quantity defined contains the full nth order time-dependent response at frequency os, which will contain both positive and negative complex frequency components. The composite quantities, P(n)(t,os), can always be expressed in terms of linear combinations of elementary complex polarizations, p(n)(t,os) = e0v(n)(os;o1,o2, . . . ,on)E(o1)E(o2) . . . E(on)exp(iost)

(2.4)

where a unique ordered product of the complex field components occurs on the right. The lower case symbol for the polarization has been introduced to indicate that this is an intermediate quantity related to the complex product field. It is nevertheless well defined since the complex product can be split into real and imaginary parts. All susceptibilities can be reduced to combinations of those defined in eqn (2.4) and the adoption of the susceptibilities defined through this equation has two advantages; firstly there is no possibility of introducing numerical factors associated with particular definitions of fields in terms of trigonometric functions (raised to the nth power), and secondly it can be shown15 that the susceptibilities defined in this way will not vary discontinuously when degeneracy occurs among the field frequencies. Chem. Modell., 2008, 5, 249–278 | 251 This journal is

c

The Royal Society of Chemistry 2008

However the real fields in the experiment may be defined, the above susceptibilities refer to the response to a product of complex fields written as Eeiot. Any additional factors that occur in the definitions of the actual real fields and their products have to be accounted for explicitly and the response expressed in terms of the susceptibilities defined as above. There may, in principle, be contributions involving several different frequency-dependent susceptibilities in eqn (2.3)—for example if the input field contained components of frequencies (0,o,2o), when the output third order term in 3o would contain contributions from the combinations (0,o,2o) and (o,o,o)—but most experimental arrangements avoid such situations. Symmetry requirements on the w’s that result from the permutation of the order of application of the fields, from causality requirements and from the requirement that real fields must give rise to real polarizations (in the absence of line-width effects) ensure that the w’s are real, that, w(n) ijk . . . (os;o1,o2, . . . on) is invariant under permutations of the pairs, (i,o1), (j,o2), (k,o3), . . . (n) and that, w(n) ijk . . . (os;o1,o2, . . . on) = wijk . . . (os;o1,o2, . . . on)

(2.5)

As an example the case of the third order contribution to frequency doubling, where the input field consists of a static field plus a field at frequency, o, is considered as in the EFISH experiment. Both the fields are applied along the same axis E = E0 + Eo cos ot = E(0) + E(o)(eiot + eiot)

(2.6)

o

where, E = 2E(o) and a superscript notation has been used to indicate that a field amplitude is not defined in the standard form of eqn (2.1). The third order polarization at the frequency 2o is P(3)(t,2o) = 3e0[w(3)(2o;o,o,0)E(o)2E(0)e2iot + w(3)(2o;o,o,0)E(o)2E(0)e2iot]

(2.7)

which, using the symmetry properties of the susceptibilities, reduces to, P(3)(t,2o) = 32e0w(3)((2o;o,o,0)(Eo)2E0 cos 2ot

(2.8)

2.2 The molecular response The induced molecular dipole replaces the polarization and the local field, E/, acting on the molecule is introduced in place of the macroscopic field. There are two conventions in use for defining the hyperpolarizability series; one is the exact analogue of the macroscopic method (B convention), the other uses a Taylor series expansion (T convention) where a factor (1/n!) is introduced into nth order terms. The notation introduced by WRBS1 as been used. For a noncentrosymmetric molecule subjected to an internal field, E 0 (t) = E 0 o cos ot the second order induced dipole at 2o is given in the two conventions by, dm(t,2o) = 12bB(2o;o,o)(E 0 o)2 cos 2ot = 14bT(2o;o,o)(E 0 o)2 cos 2ot

(2.9)

where the hyperpolarizabilities are defined in a manner exactly analogous to the macroscopic susceptibilities except for the inclusion of a factor (1/n!) in the T convention terms. From eqn (2.9) bT = 2bB

(2.10)

Many of the discussions of the hyperpolarizability series and the interpretation of EFISH and other measurements carried out in the nonlinear optics 252 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

physics community use the B convention. Phenomenological conventions defined in an ad hoc fashion for particular types of experiment have been extensively used particularly in reporting EFISH experiments. These will be discussed in later sections. Quantum Chemists and other workers in molecular theory have adhered to the T convention for defining the hyperpolarizabilities and almost all ab initio and density functional calculations have used it. The most suitable candidate for a standardised system therefore appears to be a combination of the power series expansion for the macroscopic susceptibilities and the Taylor series convention for the hyperpolarizabilities.

3. The sum over states method In the studies of the nonlinearities of organic molecules the sum over states method has played a leading part, to some extent in default of any other practicable approach. The earlier experimental EFISH studies have often been linked to analysis of the results by the semi-empirical SOS method. An examination of the exact SOS formulae used to interpret the data can provide confirmatory evidence of the convention that has been adopted. The standard SOS procedure, for the electronic contribution to the first hyperpolarizability for frequency doubling, is summarized below. Writing the perturbing field in the general form, X E¼ Eðof Þeiof t ð3:1Þ f 0

and the perturbing hamiltonian as, H = er E where r ¼

P

ra is the sum over the

a

position vectors of the electrons, leads to formulae for the first and second order corrections to the wave function, C(t) = C(0) + C(1) + C(2) + e X X 0 hnjr Ef j0i iðof o0 Þt e jni Cð1Þ ¼ h f n on0 þ of Cð2Þ ¼

e 2 X X 0 h

f ;g n;m

hnjr Ef jmihmjr Eg j0i eiðog þof o0 Þt jni ðom0 þ og Þðon0 þ og þ of Þ

(3.2) ð3:3Þ

ð3:4Þ

In deriving these formulae terms from the lower limit of the time integral from N to t can be omitted provided there are no secular terms corresponding to vanishing denominators.17,18 The prime on the summations over the molecular eigenfunctions implies that the ground state is omitted, which requires that the co-ordinate origin is taken at the electronic charge centroid of the molecule.18,19 The electric dipole is given by, l = ehC(0) + C(1) + C(2)|r|C(0) + C(1) + C(2)i

(3.5)

For a perturbing field, E ¼ E o cos ot ¼

E o iot ðe þ eiot Þ 2

ð3:6Þ

the induced dipole is given in terms of hyperpolarizabilities defined in the T convention as, o o Ej Ek 1 2iot 2iot dmi ð2oÞ ¼ fbijk ð2o; o; oÞe ð3:7Þ þ bijk ð2o; o; oÞe g 2 2 2 Chem. Modell., 2008, 5, 249–278 | 253 This journal is

c

The Royal Society of Chemistry 2008

where the tensor summation convention is implied, and comparing this expression with the formulae derived from the perturbation theory expression leads to, bTijk ð2o; o; oÞ ¼ bTijk ð2o; o; oÞ ( ) rj0n rinm rkm0 2e3 X 0 ri0n rjnm rkm0 rk0n rjnm rim0 þ þ ¼ 2 h n;m ðom0 þ oÞðon0 þ 2oÞ ðom0 oÞðon0 2oÞ ðom0 þ oÞðon0 oÞ ð3:8Þ If the matrix elements are evaluated in an arbitrary co-ordinate system then, rnm ) (hn|r|mi h0|r|0idnm).

(3.9)

In the two state model, where the entire effect is attributed to the contribution of one pp* transition polarized along the direction (z) of the ground state dipole, the hyperpolarizability becomes, bTzzz ð2o; o; oÞ ¼

6e3 o2n0 jrn0 j2 rnn 2 2 ðon0 o2 Þðo2n0 4o2 Þ h

ð3:10Þ

This can be rewritten in terms of the oscillator strength of the transition and the change in the total electric dipole moment between the ground and excited state:bTzzz ð2o; o; oÞ ¼

6e2 on0 fn0 Dm 2 hm ðon0 o2 Þðo2n0 4o2 Þ

ð3:11Þ

The two state formula of eqns (3.10) and (3.11) have frequently been used in attempts to deduce frequency-dependent results from static field calculations, or conversely to deduce the static hyperpolarizability from measurements at optical frequencies. For any convention (C) defining the hyperpolarizability, the two-state model gives, bC zzz ð2o; o; oÞ ¼

nC e3 o2n0 : jrn0 j2 rnn 2 2 2 ðon0 o Þðo2n0 4o2 Þ h

ð3:12Þ

Values of nC are included in Table 1.

4. General theory of the EFISH experiment The input field is defined as, E ¼ E0 þ Eo cos ot ¼ E0 þ

Eo iot ðe þ eiot Þ 2

ð4:1Þ

Table 1 EFISH hyperpolarizability conventions WRBS label

nC

T

6

B

3

B*

1

X

3/2

g 0 = G/Nf 20fof2o bT 1 T Z 4 g þ 5kT bB 3 B Z 2 g þ 15kT bB 3 B Z þ 5kT 2 g bX Z gX þ 5kT

The notation, T, B, B*, X was introduced by WRBS. G is as defined in eqn (4.16) and nC is the constant in the two-state formula (3.12).

254 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

In terms of the elementary susceptibilities, if the medium is isotropic in the absence of the fields so that 2nd order terms are zero, the part of the polarization that varies at the second harmonic frequency is (see eqn (2.8)), P(3) = 32e0v(3)(2o;o,o,0):E0(Eo)2 cos(2ot)

(4.2)

This quantity is the source of the second harmonic and is determined from its intensity and the macroscopic optical parameters. If the intensity of the optical input is also measured and the static field strength known then the susceptibility in the equation can be calculated. In practice the intensity of the SHG is measured relative to a known standard that for solution work has usually been quartz, occasionally lithium iodate. In the gas phase a calculated value for an inert gas has been used. The macroscopic third order susceptibility has to be related to the response functions for the active molecule in the solution. In the long-established procedure where the molecule is considered to be at the centre of a virtual cavity in a continuous medium the fields acting on the molecule, denoted by primed quantities are, at any frequency, O, g 0 E ðOÞ ¼ EO þ PO ; ð4:3Þ e0 where g is a geometric factor depending on assumptions about the cavity enclosing the molecule. For the static and o-field, the linear relationship between polarization and input field can be rewritten as, g 0 E ðOÞ ¼ EðOÞ þ e0 wð1Þ ðO; OÞ EðOÞ ¼ ½1 þ gwð1Þ ðO; OÞEðOÞ ¼ fO EðOÞ ð4:4Þ e0 where, fO = 1 + gw(1)(O), is the internal field factor. There is also an internal field at 2o given by, g 0 E ð2oÞ ¼ Pð2oÞ ð4:5Þ e0 [There has been some discussion recently about the inclusion of the 2o field; Wortmann and Bishop20 suggested it should be omitted, but Munn et al.21 have demonstrated that the term is present in an exact crystal lattice calculation, confirming the validity of the procedure used in the continuum model from the early days of nonlinear optics.] The molecular response is described in terms of polarizabilites and hyperpolarizabilities by equations analogous to the macroscopic equations. In this case the field is, E0 = E00 + E0o cos ot + E02o cos 2ot

(4.6)

and the definitions of the standard hyperpolarizabilities follow as in the macroscopic case except that an additional factor of 1/n! is included with each nth order term:dl2o=a(2o;2o):E02o+12 12b(2o;o,o):(E0o)2 + 16 32c(2o;o,o,0):E00(E0o)2 (4.7) The effect of the three internal fields, at 0, o and 2o, on one molecule is next treated, and after angular averaging the connection between the microscopic and macroscopic response functions established. The polarization of the dipole distribution by re-orientation of the molecules is induced by the static field. Introducing the Boltzmann factor and normalizing the distribution gives, for the number of molecules per unit volume with dipole moment inclined at an angle, y, to the fields, expðxÞ mE 0 cos y nðyÞ ¼ n sinh x ; where x ¼ z kT 2 x

ð4:8Þ

Chem. Modell., 2008, 5, 249–278 | 255 This journal is

c

The Royal Society of Chemistry 2008

and n is the number density of solute. Expanding and discarding terms nonlinear in the static field, mEz0 cos y nðyÞ ¼ n 1 þ ¼ nw ð4:9Þ kT When this distribution function is included the rotational averaging of the dipole moment induced in the molecule, 1 0o 2 dm(2o,t) = [azz(2o;o,o)E 0 2o z + 4bzzz(2o;o,o,0)(E z ) 00 0o 2 + gzzzz(2o;o,o,0)E z (E z ) ]cos2ot

(4.10)

2o

leads to a polarization, P cos(2ot) where, g P2o ¼ e0 n½hwazz ð2o; 2oÞi P2o þ e0 1 fhwbzzz ð2o; o; oÞifo2 ðEzo Þ2 þ hwgzzzz ð2o; o; o; 0Þif0 fo2 Ez0 ðEzo Þ2 g 4

ð4:11Þ

The rotational averages for the odd order terms in a and g come from the first 1 term in w and are a = 13aii and g ¼ 15 ð2giijj þ gijji Þ which is the scalar part of the tensor. For b the calculation is shown explicitly, since again there can be some ambiguity. The non-zero contribution to b comes from the second term in w. mEz0 hcos ybzzz ð2o; o; oÞi kT 0 mE ¼ z hazZ azI azJ azK ibIJK ð2o; o; oÞ kT mEz0 ½hcos4 yibZZZ þ hcos2 y sin2 y cos2 fifbZXX þ 2bXZX g ¼ kT þ hcos2 y sin2 y sin2 fi þ fbZYY þ 2bYZY g mE 0 1 1 bZZZZ þ fbZXX þ bZYY þ 2bXZX þ 2bYZY g ¼ z 15 kT 5 0 mE ¼ z bZ 5kT ð4:12Þ

hwbzzz ð2o; o; oÞi ¼

where, bZ = 13(bZII + 2bIZI) is the Z-component (the component in the direction of the molecular dipole) of the vector part of the b-tensor. Alternatively the rotational average, again obtained from the second term in w, can be written in terms of hbzi, the average component of the vector part of b in the direction of the applied field. Transforming the molecular quantity, bZ and averaging gives, bz = cos y bZ and, from the second term in w, b// = hbzi = 35bZ

(4.13)

so that, hwbi ¼

mEz0 b 3kT ==

ð4:14Þ

Eqn (4.11) becomes, ð1 nað2o; 2oÞÞP2o ¼

o 1n m bZ ð2o; o; oÞ þ gð2o; o; o; 0Þ f0 fo2 Ez0 ðEzo Þ2 4 5kT

256 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

The factor multiplying the polarization on the right is easily shown to be equal to (1/f2o), and, using eqn (2.8), 3 ð2o; o; o; 0ÞE 0 ðE o Þ2 Pz ð2oÞ ¼ e0 wð3Þ 2 h zzzz i n m T ¼ bz ð2o; o; oÞ þ gT ð2o; o; o; 0Þ f0 fo2 f2o E 0 ðE o Þ2 4 5kT ¼ng0T f0 fo2 f2o E 0 ðE o Þ2

ð4:15Þ

If the objective is to express both the macroscopic and molecular quantities involved in the analysis of the EFISH experiment in terms of the fundamental definitions of section 2, then eqn (4.15) can be regarded as the standard form of the EFISH equation (see for example Reiss2). In eqn (4.15) the superscript has been re-inserted to emphasise that the equation relates a susceptibility defined by the standard method of section (2) to hyperpolarizabilities in the T convention. The hyperpolarizabilities are obtained by contraction of the tensors, bTijk(2o;o,o) and gTijk(2o;o,o,0) defined in the standard form of section 2. In discussing experimental and theoretical results in this paper we have tried to reduce all reported quantities to the form given in these equations, so that susceptibilities are related to eqn (2.6) and hyperpolarizabilities to eqn (2.9). The experimental studies have, however, used other conventions. Instead of eqn (4.15) the equation that often occurs is written, Pz(2o) = GE0(Eo)2 = ng 0 f0f2of2oE0(Eo)2 G = ng 0 f0f2of2o

(4.16)

where G is a macroscopic response function defined specifically for the EFISH experiment and g 0 is an effective second hyperpolarizability, in some cases also constructed specifically for the EFISH experiment. WRBS have enumerated the different versions of g 0 that are may have been used by different authors. These are given in Table 1. The T convention is as in eqn (4.15) and the B convention is obtained when the factors of 1/n! are omitted in the hyperpolarizability series. The fourth line can be interpreted as defining an empirical convention, tailored for the EFISH experiment, where the rotational averaging factors have been retained but the front factor, derived from the general definitions of susceptibilities and hyperpolarizabilities, has been omitted. In the B* convention an additional factor of 2/3 has appeared in the definition of g 0 in comparison with the X convention. The most plausible explanation of the origin of the B* convention is that the X convention has been adapted to the case where the macroscopic quantity being employed is the standard susceptibility, w(3)(o;o,o,0) rather than the ad hoc G. In interpreting the results of the experimental work the first priority is to identify clearly the macroscopic quantity reported. The various forms of analysis used to extract the hyperpolarizability value can be applied to any such macroscopic measurement and provided the latter is clearly defined there should be no difficulty in choosing the correct formula from Table 1, to obtain the hyperpolarizability in any desired convention. In section (7), therefore, an effort is made to isolate the parameters describing the macroscopic observation from the subsequent interpretation in terms of hyperpolarizabilities.

5. Ab initio and DFT calculations of the pNA b tensor Table 2 lists the results of calculations of the static first hyperpolarizability. As the perturbation theory expansion shows, for o = 0, Kleinman symmetry22 holds exactly and the components of the tensor are invariant under permutations of the three co-ordinate indices so that bZ as defined in eqn (4.12) reduces to, bZ = bZZZ + bZXX + bZYY

(5.1)

Chem. Modell., 2008, 5, 249–278 | 257 This journal is

c

The Royal Society of Chemistry 2008

Table 2 Ab initio and DFT calculations of the electronic part of b(0;0,0) for pNA Reference

Method

Soscun (2006)25

RHF MP2 B3LYP B3LYP MP2 CC2

Reiss (2005)5

bTZZZ

bTZXX

bTZYY

bTZ

Pol D95V(p*,d)

1057 2109 1822 2186 1646 1818

186 64.8 141.1 162.5 50 ?

39.8 61.8 72 70.4 50 ?

831 1982 1609 1953 1546 ?

Basis

opt.6-31G(d*,p)

Salek (2005)26

CCSD B3LYP B3LYP B3LYP

aug-cc-PVDZ cc-pVDZ aug-cc-PVDZ Sadlej

1919 1353 1765 1782

121.5 157.4 141.2 127.3

61.3 9.25 67.1 72.9

1736 1186 1557 1582

Luo (1993)24

RHF MCSCF RHF MP2 RHF MP2

2z + pol. funs 2z + pol. funs 3s2p2d/2s1p 3s2p2d/2s1p 3s2p2d/2s 3s2p2d/2s

1181 1372

208 15.0

25.4 15.0

1269 2058

191 78

48.0 46

947 1373 1271 2019 1030 1934

Sim (1993)23

All values are in atomic units. The Z-axis is in the direction of the ground state dipole and the X-axis in the plane of the benzene ring.

where Z is in the direction of the ground state dipole. The terms on the right are the only distinct non-zero elements of the tensor at zero frequency. Where the three elements are given in Table 2 the bZ column can be completed by a simple addition. It is perhaps useful to note that in some of the original papers the tensor components are given in the T convention (used here) but the figure quoted for bZ is often half that given in Table 2; the explanation being that the authors adhere to the Taylor series convention while performing molecular quantum chemistry calculations but revert to the B convention for quantities related to SHG. It is clear that in all cases the major contribution, by a factor between 5 and 10, is the diagonal component along the dipole, bZZZ, in agreement with the simple donor/ acceptor interpretation. The entry in the ‘method’ column refers to the level at which the actual hyperpolarizability calculation has been made—the geometry of the molecule has often been determined at a different level. In these cases alternative structure determinations have had little effect, but the question will be addressed below in connection with solution studies. Four Hartree-Fock level results appear and these are consistently lower, by substantial percentages, than any of the results obtained from methods that attempt to include the effects of electron correlation. The values given by Sim et al.23 are about 10–20% higher than those of Luo et al.24 and Soscun et al.,25 but the approximate HF value, taking the average is defined to about 15% as (1020 150). In all cases the finite field method has been used. Sim et al.23 discuss the consistent inclusion of contributions from higher order terms; Soscun et al.25 employ a more recent formulation of the finite difference equations. The variation in the results is almost certainly due to the difference in basis sets. The correlated results, taking account only of the highest level values from each source, vary from about 1300 au to just over 2000 au. for bTZ and the most recent ab initio calculations of Salek et al.26 and Reiss et al.5 give respectively 1736 au and 1546 au. Fig. 1 and Table 3 refer to the frequency dependence of bSHG obtained from the Z ab initio methods. Only those that attempt to include electron correlation effects are included and the DFT methods, where the density functional has often been adjusted by reference to 258 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

Fig. 1 Frequency dependence of bTZ/au from ab initio calculation. r: Sim et al.;23 J: Salek et al.;26 K: Reiss et al.;5 &: Luo et al.24

time-dependent ab initio studies are omitted. The aim in this section is to try to define what the ab initio predictions of the gas phase hyperpolarizability have established. DFT methods can be extended to much more complex systems, but in the absence of clear experimental guidance, can only be considered to be as reliable as the ab initio methods from which the functionals have been derived. The CCSD results of Salek et al.26 (2005) and the MCSCF results of Luo et al.24 (1993) compute the frequency dependence as an intrinsic part of the correlated calculation. The work of Sim et al.23 (1993) and Reiss et al.5 (2005) takes the static value obtained at the MP2 level and scales it using the RPA method to get the frequency dependence. Luo et al.24 (1993) also report the result of an RHF/RPA calculation where the frequency dependence is the natural extension of the RHF method. The plotted points are at the four readily available laser frequencies that have been used in almost all experimental work. The most popular of these has been the YAG frequency corresponding to 1.17 eV or 1064 nm. At this frequency the spread of results ranges from about 1550 to 2600 au. If only the two fully frequencydependent correlated calculations are considered the range is from about 1700 to 2600 au. Salek et al., using the CCSD method find that as the frequency is increased from zero to 1.17 eV, bSHG increases from 1736 to 2667 au and Luo et al., using Z MCSCF, from 1373 to1898 au.

Table 3 Gas Phase ab initio calculations of bTZ(2o;o,o) Reference

l/nm

Method 24

Luo (1993) Sim (1993)23 Reiss5 Reiss5 Salek26 Luo (1993)24

RHF + RPA MP2 + RPA RPA Scaled MP2 CCSD MCSCF

1907

1064

968

833

947 1934

1042 2222

1505

1676

1546 1736 1373

1908 1509

1315 2779 1126 2091 2667 1898

3202 2171

2415

All values are in atomic units.

Chem. Modell., 2008, 5, 249–278 | 259 This journal is

c

The Royal Society of Chemistry 2008

The various authors make claims for the accuracy of their methods—in particular relating to the quality of the basis sets and the accuracy of the geometry that has been used—but it is difficult to believe that the theoretical value of bSHG (gas, Z electronic, 1.17 eV) has been settled to better than about 500 au. Provided a fixed geometry is assumed (whatever it may be) the quantity that is being calculated is clearly defined, but it excludes vibrational effects and the effects of intermolecular interactions. It has usually been assumed for frequency doubling, where two optical frequency fields have to interact, that the vibrational effects will be small and in the gas phase the internal field factors do not differ greatly from unity. Even if they are by no means the whole story, the quantities, bSHG (gas, electronic, 0 eV) and Z bSHG (gas, electronic, 1.17 eV), as calculated by reliable ab initio methods, ought to Z provide the foundation for a sound theoretical understanding of the large hyperpolarizability in pNA and similar molecules. A firmly established figure for the 1.17 eV value, accurate to about 100 au would be extremely valuable in helping to distinguish between the alternative interpretations of the experimental data.

6. Gas phase measurement Smaller molecules and inert gas atoms have been extensively studied using EFISH in the gas phase (see for example Miller and Ward,27 Ward and Miller,28,29 Shelton30). Shelton and Rice31 provide a comprehensive list of gaseous EFISH measurements on small molecules up to 1994. The only such result reported for molecules with donor/ acceptor substitution on a benzene ring appears to be that obtained by Kaatz et al.12 for pNA in 1998. In this experiment a gas mixture containing 0.075 mole fraction of pNA was used to obtain an EFISH measurement at 1064 nm at one temperature. The gT(2o;o,o,0) of eqn (4.15) was estimated from a THG experiment and taken as the intercept on a two point plot of g 0 versus 1/T. The value of bZ was calculated from the slope. The linearity of such plots has been confirmed in the work on smaller molecules. The gas phase method differs from that used for solutions in that the extrapolation to infinite dilution is not made since the molecular density in the gas is very much smaller. Also the internal field factors are close to unity. It is usually possible to make measurements over a sufficiently wide range of temperatures to obtain the quantity (mb/k) from the plot of G versus 1/T. In the case of pNA the value of the dipole was chosen as 6.87 D. The absolute calibration of these results is based on an absolute value of hgi obtained from an EFISH measurement on N2 which, in turn, has been calibrated against an ab initio calculation of the quantity for He. The latter is expected to have achieved a high degree of accuracy. The absolute value obtained for bZ is therefore independent of any solid state standard. The results are summarized below. Gas Phase EFISH Determination of bZ for pNA at 1064 nm mb G ¼ g þ z ; m ¼ 6:87D; bz ¼ 1787au 5kT It is reasonably certain that the above value does refer to bTz. The calibration is with respect to values measured in the gas phase or calculated where adherence to the standard conventions has been the norm. Kaatz and Shelton have also made Hyper-Rayleigh scattering measurements of a number of solvents and solutions32,33 including pNA in dioxane. The absolute values have been obtained by comparison with the hyper-Rayleigh scattering from carbon tetrachloride. An absolute value for CCl4 in the gas phase had been obtained34 by utilizing the N2 result as in the EFISH gas measurement described above. The same value was then taken to apply to liquid CCl4. Since these measurements were made an effective hyper-Rayleigh hyperpolarizability for liquid CCl4 has been obtained via measurements of acetonitrile as an intermediate in the gas phase, calibrated with respect to N2 and then directly with the liquid CCl4.35 If the more recent estimates 260 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

are adopted, the value of 2470 au obtained from the HRS in dioxane by Kaatz and Shelton must be multiplied by a factor of 1.88 to give 4644 au at 1064 nm for the equivalent of the EFISH parameter, bTz. This result is compared with EFISH dioxane solution measurements in section 7.

7. Solution EFISH studies of pNA and MNA in dioxane The high first hyperpolarizability of pNA was first identified in EFISH measurements in the 1970s. The experimental method itself was developed by Levine and Bethea,36,37 Oudar,38 Chemla39 and others and the inherent difficulties of the procedure and the devices used to overcome them are well documented in the some of the papers. Measurements have been made on a large range of molecules (see for example Cheng et al.40,41 where full account of the method is given and Zyss and collaborators,4 where the main objective has been to characterise compounds suitable for growth in the form of nonlinear optically useful crystals.) The solvents used were often highly polar (acetone, DMSO) and where the nonlinearity was weak (as for example in nitrobenzene and aniline) neat liquids were used. A clear solutedependent signal was, however, obtained for pNA in dilute solutions. The quantitative study on which the interpretation of a great deal of subsequent work has depended was made by Teng and Garito42,43 (1983) on solutions of pNA and MNA(2-methyl-4-nitroaniline) in dioxane. They employed a careful extrapolation procedure, previously developed by Singer and Garito44 to obtain results at infinite dilution (the method parallels the Debye method used in the measurements of dipoles in solution) and made measurements at the four non-zero frequencies listed in Table 2. 7.1 Analysis of the macroscopic measurements Experimental EFISH results have often been reported in terms of a quantity, G, as defined eqn (4.16) in terms of the real field amplitudes, E = E0 + Eo cos ot

(7.1)

P2o(t) = P2o cos(2ot) = GE0(Eo)2 cos(2ot) P2o = GE0(Eo)2

(7.2)

through the equations,

There is usually no ambiguity about the definitions of the field strengths, which are as in eqn (7.1), and it follows that in terms of the standard susceptibility definitions of eqn (2.8), G = 32e0w(3)(2o;o,o,0)

(7.3)

[The inclusion or omission of the permittivity of free space, e0 in these equations does not affect the discussion of the integer or simple fractional numerical factors, although it does change the units in which G or w(3) are reported (see Appendix).] Optical theory then relates the 2o polarization to the observed second harmonic intensity. Assuming that G has been correctly determined in terms of the fields as defined in (7.1) it has then to be expressed in terms of the molecular parameters. The standard version of the susceptibility is immediately obtained from G through eqn (4.15), and from that equation, using the standard field factor theory for a single component system, 1 T bTZ m G ¼ Nf0 fo2 f2o g þ ð7:4Þ 4 5kT or, for other conventions, G = Nf0f2of2og 0

(7.5)

Chem. Modell., 2008, 5, 249–278 | 261 This journal is

c

The Royal Society of Chemistry 2008

where the appropriate g 0 can be found from Table 1. Within the limitations of the field-factor theory uncertainty arises only if there is doubt as to the definition of the macroscopic parameter and if the data for the field factors, dipole moment and temperature are not available. If this data is provided one can choose whichever convention is preferred and insert the corresponding formula on the right of eqn (7.5). Reiss2 has questioned whether the definitions of the macroscopic electric fields used in the experiments have always been correctly identified. A more general definition of the field would be, E = E0 + mEo cos ot

(7.6)

where m = 2 occurs if the field is defined as the sum of complex exponential functions. Then, introducing the symbol, Gm, and using the T convention,

1 bT m g þ Z E 0 ðmE o Þ2 P2o ¼ Gm E 0 ðE o Þ2 ¼ Nf0 fo2 f2o 4 5kT ð7:7Þ

T b m ¼ m2 Nf0 fo2 f2o g þ Z E 0 ðE o Þ2 5kT and for the case of m = 2,

bT m G2 ¼ Nf0 fo2 f2o g þ Z 5kT

ð7:8Þ

where, G2 would be the measured value if the field were defined with m = 2 in eqn (7.6). Reiss has introduced a notation for combining the four microscopic variants of WRBS with field definitions defined by the ‘m’ value. It might be preferable to keep the macroscopic adjustments separate from the molecular definitions. The question of the definition of the fields is related to the nature of the EFISH measurement. The essential observation is of the ratio between the intensity of second harmonic light emitted from the solution to that emitted by quartz standard. However the o-field is defined the ratio will be related to G by an equation (written schematically) of the kind, !1=2 SHG ISOL GEo Eo2 ¼G ð7:9Þ SHG d11 Eo2 IQ where G is a factor that depends on the optical arrangement and depends on refractive indices (and through them on coherence lengths). The optical fields cancel out in eqn (7.9), but the definition of the field is still relevant since it occurs implicitly in the definition of d11 for quartz. The definition of d11 involves (again schematically) a relation of the type, (ISHG ) = G 0 d11E2o Q 0

(7.10)

where the optical factor, G , the SHG intensity and the field strength must be determined absolutely. If the definition of the field in this experiment is the same as the standard definition (E = Eo cos ot) used in deriving the molecular expression on the right of eqn (4.15) then it will always be correct to take m = 1. The above discussion may, however, be over-simplified in that the signal obtained from the SHG cell also includes a contribution from the glass windows. Glass is isotropic and the glass signal arises from a term, GGE0E2o. The definition of the quantity GG must also be consistent with the field definition. In the cases discussed below, however, there seems to be no doubt that the appropriate value of m is one.

262 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

The theory thus far derived has dealt with one active species. Eqn (7.5) can be extended to a two component solute/solvent system as, GL = N0F0g00 + N1F1g01

(7.11)

2 2o FI = f 0I (f o I ) fI

(7.12)

where,

The internal field factors are now in general to be adjusted to take account of the differing environments of the molecules of the two species. In particular the reaction field introduced in the Onsager field will be different, even in very dilute solution, in the cavity surrounding the polar solute as compared to that of a non-polar solvent. The Onsager and Lorenz factors for the EFISH equations are, respectively, Onsager: fIO ¼

2 eððnO I Þ þ 2Þ

2e þ

2 ðnO I Þ

Lorenz: fIO ¼

2 ðnO I Þ þ2 3

ð7:13Þ

The reaction field that is responsible for the modification of the Lorenz factors to the Onsager form is present only if the dipole of the molecule in the cavity has become partially oriented in the direction of the electric field and may reasonably be assumed to be absent at the optical frequencies. At these frequencies it is also usually assumed that the refractive index may be taken as that of the solvent for both species. For the static field, however, the Onsager factor should, in principle be chosen and the effective refractive index is related to the response through the polarizability of the molecule in the cavity. For the solvent this gives rise to the solvent refractive index, but for the solute the refractive index is a partial quantity reflecting the contribution to the solution refractive index of a solute molecule in a dilute solution. Measurements of refractive index as a function of concentration and specific volume are necessary to determine it. In many EFISH studies this procedure has not been followed and less experimentally demanding approximations have been employed. Full accounts of the theory are to be found in the work of Singer and Garito,44 Cheng et al.40 and Meredith et al.45 The simplest treatment, which might be regarded as a reasonable approximation when the solute makes a contribution to the nonlinearity that is much larger than that of the solvent, is to neglect the first term in eqn (7.11) and assume that the refractive indices and dielectric constant occurring in the field factors are simply the measured values for the solution as a whole. The composition of the solution is defined in terms of the weight fraction of the solute and to obtain the NI values the measured density is required. The unapproximated eqn (7.11) becomes,

ð1 wÞ w 0 0 ð7:14Þ F0 g0 þ F1 g1 GL ¼ NA r M0 M1 which, on introducing the above approximations, reduces to, w 0 GL ¼ NA r f ð0Þf ðoÞ2 f ð2oÞg1 M1

ð7:15Þ

where the field factors are obtained from the measured solution properties. If it is further assumed that the variation of the field factors and the density with composition has a negligible effect, then the value of g01 can be calculated from the slope of a plot of GL versus w. Putting in the standard formulae for the field factors (Onsager factor for the static field and Lorenz factors for the optical fields) gives, 27M1 ð2e þ nðoÞ2 Þ @GL 0 ð7:16Þ g1 ¼ NA r eðnðoÞ2 þ 2Þ3 ðnð2oÞ2 þ 2Þ @w 0 Variations of this equation appear in which slightly different approximations for the field factors have been used, but these variations seem to have very little effect since Chem. Modell., 2008, 5, 249–278 | 263 This journal is

c

The Royal Society of Chemistry 2008

the difference between the refractive indices of the fundamental and second harmonic is very small and the static dielectric constant is nearly equal to the square of the refractive index. The approximations used in deriving eqn (7.16) do, however, disregard some possibly important effects. The extrapolation to zero concentration, which is necessary to obtain reliable single molecule properties, needs to be treated more carefully as in the conventional Debye extrapolation procedure used in the determination of dipole moments in solution. The extension of the extrapolation to the nonlinear case has been treated in some detail by Singer and Garito.44 Their paper is concerned specifically with liquid mixtures where the introduction of separate refractive indices for the two components is an easily defined procedure. The interpretation of the parameters introduced for a solute is more questionable and in the application of the equation to the pNA/dioxane system43 these parameters are not explicitly given. Most other solution determinations have used simplified versions of their equation, neglecting terms that are thought to be small but retaining one or two larger terms that can give rise to appreciable corrections in eqn (7.14). The equation used by Teng and Garito43 is derived by differentiating eqn (7.14) with respect to w, when, substituting the solvent nonlinearity, G0 ¼ NA r0

F0 0 g M0 0

where appropriate, leads to,

0 @GL 1 @r 1 @F0 F1 g1 ¼ NA r0 þG0 1 r @w 0 F0 @w 0 @w 0 M1

ð7:17Þ

ð7:18Þ

If the term in the derivative of the field factor were negligible the expression on the left of this equation would be defined completely in terms of macroscopic measurable quantities. The specifics of the chosen cavity model enter the field factor derivative where Lorentz-Lorenz and Onsager factors may be mixed. The most commonly used procedure is to employ Onsager for the static field and Lorentz factors for the optical fields. For FI (i = 0,1), !2 ! 2 e nI þ 2 nðoÞ2 þ2 nð2oÞ2 þ 2 FI ¼ ð7:19Þ 3 3 n2I þ 2e where the optical factors, in which there will be no contribution from a reaction field, can be written in terms of the solution refractive index, while the low frequency factor has to be expressed in terms of a quantity which defines the reaction field for a central molecule of definite polarizability and dipole moment. The quantity, n0 can be determined experimentally from the pure solvent, but n1 is a construct from the polarizability of a solvated solute molecule isolated from other solute molecules as described above. FI changes with w through the variations of the two solution optical refractive indices and the dielectric constant. The formulae can be simplified, without appreciable loss of accuracy, by ignoring the frequency dependence of the solution refractive index, giving, 2 3 eðnI þ 2Þ n2 þ 2 FI ¼ ð7:20Þ 3 n2I þ 2e Using this formula for F0 and F1 in eqn (7.18) results in the equation employed by Teng and Garito43 to convert the macroscopic nonlinearity to a molecular quantity:

27M1 ðn21 þ 2e0 Þ @GL 1 @r 1 @F0 0 ¼ g1 þG 1 0 r0 @w 0 F0 @w 0 NA r0 e0 ðn20 þ 2Þ3 ðn21 þ 2Þ @w 0 ð7:21Þ 264 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

where the field factor derivative is given explicitly by, 2

1 @F0 3 @n 1 2 @e ¼ þ F0 @w 0 n20 þ 2 @w 0 e0 2e0 þ n20 @w 0

ð7:22Þ

Unfortunately Teng and Garito do not report the details of the application of the formula, no values being given for n1 or for the partial derivatives. [This is in contrast to the earlier paper of Singer and Garito on liquid mixture calculations using the same formula where full details are provided.] It is therefore difficult to assess to what extent the smaller terms, not involving (@GL/@w)0 make significant contributions. The solvent quantity, G0 is rather less than 1% of the derivative, (@GL/@w)0, which at least suggests that the other terms may not be important. If the specific nature of the reaction field is ignored the sometimes unavailable factor n1 is not required. In some of the work discussed below the simplified versions of the formula have been employed. The first term in the curly brackets in eqn (7.18) is usually available and included. A simplified version avoiding the use of n1 is obtained if Lorenz type internal field factors are used for all three frequencies so that, !2 ! 3 eþ2 nðoÞ2 þ 2 nð2oÞ2 þ 2 e þ 2 n2 þ 2 ffi F0 ¼ F1 ¼ ð7:23Þ 3 3 3 3 3 Bosshard et al.46 adopt the approximation on the extreme right of eqn (7.23) and neglect the variations of the density and refractive index, leading directly from eqn (7.21) to the formula,

81M1 @GL 1 @e 0 g1 ¼ ð7:24Þ þG 1 0 e0 þ 2 @w 0 @w 0 NA ðe0 þ 2Þðn20 þ 2Þ3 r0 They provide measured data for MNA at 1064 nm for the dielectric constant derivative. Eqn (7.24) further reduces to eqn (7.25) if the dielectric constant variation is also neglected: 81M1 @GL 0 g1 ¼ ð7:25Þ þG 0 @w 0 NA ðe0 þ 2Þðn20 þ 2Þ3 r0 and this version appears to have been used by Burland et al.47 in one of the treatments of their EFISH data on MNA in dioxane. Eqns (7.16), (7.24) and (7.25) can be applied easily to available measured macroscopic quantities and have been used in many of the reported analyses of EFISH data. There remains the question of whether the neglect of the other terms in eqn (7.21) is justified and whether a proper assessment of the parameter, n1 is required. Whatever the validity of these formulae and the underlying cavity field theories it is very desirable that, before applying them, the definition and consistency of the macroscopic quantities as measured by different groups should be assessed. Much of the discussion in the literature deals directly with the final values of the hyperpolarizability presented in the various experimental papers. In calculating these values different versions of the above procedures may have been employed and, in particular, different values of the molecular dipole moment inserted into g0 to extract the b value. It is relatively easy to identify how the microscopic parameter has been obtained and which molecular convention is being used provided the identity of the macroscopic quantity is clearly established. The symbol, G0L (and its derivative, (@G 0 L/@w)0) is introduced here to denote provisionally a reported macroscopic nonlinearity before assessing its precise definition. The unprimed symbols are defined in accordance with eqn (4.16). The most troublesome ambiguity in the Chem. Modell., 2008, 5, 249–278 | 265 This journal is

c

The Royal Society of Chemistry 2008

Table 4 Measured EFISH nonlinearities for pNA and MNA at 1064 nm

TG TG Bur(1) Bos P(1) P(2)

pNA MNA MNA MNA pNA pNA

Ref. q1 q1 q1 q1/ Li+ Li+

A

B

G0 0 (@G 0 /@w)0 0.053 0.1 10 0.3 0.053 0.1 9.4 0.3 0.039 10.8 0.046 9.5 2 — 15.7 — 15.7

G0 0 (@G 0 /@w)0 0.032 0.06 6.0 0.18 0.032 0.06 5.7 0.18 0.023 6.5 0.18 0.035 7.1 1.5 9.05 4.9

Adjustment factor 0.6 0.6 0.6 0.6 (0.5/0.4) 0.98/1.7 0.5624/1.80

G 0 0 and (@G/L/@w)0 are in units of 1012 esu2cm4. q1 is the older quartz standard, d11 = 1.2 109 esu; q1/ is referred to a quartz standard with d11 = 0.96 109 esu; Li+ is the older LiIO3 standard. These are the unmodified calibration standards in the original papers and correspond to the values in columns, A. The entries in columns B have been adjusted to the calibration standards that are currently thought to be correct. These are for quartz, d11 = 0.72 109 esu; and require an adjustment of the LiIO3 value by a factor of (0.98/1.7). The results of Paley et al.48 have also been further modified for reasons discussed in the text. TG: Teng and Garito;43 Bur: Burland et al.;47 Bos: Bosshard et al.;46 P: Paley et al.48

interpretation of the data arises from uncertainty as to whether the quantity employed is, G 0 L = GL = 32e0w(3)(2o;o,o) or G 0 L = e0w(3)(2o;o,o)

(7.26)

All the experimental studies described below have, at least in the initial analysis of the experimental results, related the macroscopic measured quantity to a molecular quantity defined by the formula referred to as the X convention in Table 1. The molecular quantity so extracted will then be in the X convention, provided the macroscopic quantity is defined by the first of the equalities in eqn (7.26); if the macroscopic quantity is defined by the second equality, then the molecular quantity will be in the B* convention. In the following section an attempt is made to identify and compare the macroscopic quantities, (@G0L/@w)0 reported by various authors. The values of G 0 0 are also usually reported and are useful in assessing consistency between different measurements. (Ancillary linear quantities, in particular (@e/@w)0 are not subject to conflicting definitions and when available are considered generally reliable.) In the subsequent section the molecular quantities evaluated by the equations given in this section are examined. The results to be considered are summarized in Table 4. In some cases their interpretation is not straightforward and is discussed in the following paragraphs. The values for the pure solvent nonlinearity and G00 in columns A are as far as possible taken directly from the original paper. In the Teng and Garito paper these values are stated explicitly. The paper includes an equation, which identifies G0 with G as defined in eqn (4.16) and the formula used for the SOS calculations also described is consistent with definition of bX z. Burland et al.47 refer to the nonlinearity as w(3)(2o;o,o) and their introduction of this quantity in the earlier part of their paper would lead to the conclusion that it is the standard susceptibility as defined in eqn (2.4), in which case the bz that they derive from it would be in the B* convention. However, the symbol, G, subsequently appears and it is stated that w(3)(2o;o,o) now has to be identified with G. A nearly linear plot of w(3)(2o;o,o) versus w is shown from which the value of (@G 0 L/@w)0 in column A has been obtained. An alternative method of analysis of the macroscopic data for G0L, which does not involve an explicit calculation of the derivative, is discussed below. 266 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

The value attributed in column A to Bosshard et al.46 is quoted in their paper and has required only a conversion of the units. Their definitions make it clear that the quantity concerned is intended to be GL, so that their hyperpolarizability is in the X convention. Paley et al.48 do not give results of the macroscopic quantity directly, but it is made clear that their bX Z has been derived through a formula of the type given in eqn (7.25) with pure Lorenz factors. They also give the dipole moment used in the calculation so that it is possible to reconstruct the (@GL/@w)0 value for comparison with the other measurements. In columns B the results of columns A have been adjusted by factors required to relate them to a uniform standard based on that currently preferred for the reference standards. In comparing ab initio and DFT calculations with experiment most authors have adopted a revised value for the quartz d11 coefficient which is reduced by a factor of 0.6 from that used in earlier work. The results of Teng and Garito and Burland et al. in column B have simply been altered by this factor. Bosshard et al. give an unusual value for quartz which is (0.4/0.5) times the older (q1) figure. This factor has been removed before converting to the new quartz standard. There is a more significant problem in relation to the work of Paley et al. The primary standard used in these measurements has been Lithium Iodate and again a revised determination of the standard has led to a large change—by a factor of (0.98/ 1.7). There is also a more serious question that arises from the use of an intermediate standard—a 1% solution of MNA in dioxane. After this solution has been calibrated against LiIO3 all subsequent measurements were referred to it, the nonlinearity of the solution being reported as 3.13 1013 esu., converting to 1.80 1013 esu with the newer LiIO3 standard. Bosshard et al. have also used this method (although with quartz as the primary standard) and their equivalent figure after conversion of their units, multiplication by 0.6(0.5/0/4) and conversion to the new quartz standard is 0.5625 1013 esu. Similar figures can be derived from the work of Burland et al. and Teng and Garito by applying the equation, ! 0 @G0 0 GL ðw ¼ 0:01Þ ¼ G0 þ 0:01 @w 0

The near linearity of, for example, the results of Burland et al. justify this procedure. There is, therefore, a large and unaccountable difference between the absolute values of Paley et al. and other groups for the 0.01% solution. If these values were consistent with each other it would demonstrate the equivalence of the quartz and lithium iodate standards but the large discrepancy leaves the issue in doubt. In the second set of adjusted results in column B for Paley et al., the values have been recalibrated using 0.5625 1013 esu as the reference for the 1% MNA solution. In the two cases the results of Paley et al. are, respectively higher and lower than the other sets of results which are in much better mutual agreement. Focussing on the first four lines of the Table 4, all of which are calibrated with the quartz standard, and can be adjusted to the same value for this standard, it is apparent that there is reasonably good agreement between the three determinations of the macroscopic nonlinearity of MNA. This consistency, taken in conjunction with the equations as written in the papers, leads to the conclusion that, with a reasonable degree of certainty, the provisionally defined quantity, G 0 L can be identified with the GL of eqn (4.16). Having made this assumption, it follows that any differences in reported bz values that are not proportional to the small percentage differences in the GL values must be the result of variations in the method employed to convert macroscopic to molecular parameters. This question is examined in the next section. The alternative interpretation, that some of the results should be taken as being in the B* convention, is also discussed. Chem. Modell., 2008, 5, 249–278 | 267 This journal is

c

The Royal Society of Chemistry 2008

Table 5 mbTZ: interpretation A

1 2 3 4 5 6 7 8 9 10

Ref.

Mol.

Eqn (7.24)

TG TG Bur(1) Bur(2) Boss Stah Wort Pal(1) Pal(2) KS

pNA MNA M NA MNA MNA pNA pNA pNA pNA pNA

G,X G,X G,X G,X G,X G,X G,X G,X G,X

eqn (7.25)

3.48 3.63 4.22

3.68 3.85 4.38

4.55

4.79 (3.17) 5.52 2.99

Published

Adjusted

2.91 2.88 4.40 3.87 4.82 3.17 3.28 5.52 2.99 3.25

2.91 2.88 3.28 (3.87) 3.61 2.50 (2.46) 4.36 2.36 (3.25)

The product mbTZ in units of au.Debye/104. Where the macroscopic data is available from Table 4 the product has been calculated from eqns (7.24) and (7.25) using the additional data from Table 7. The direct third order term, gT, has been neglected. The published value obtained from m and bTz as given in the papers is given in the seventh column. In rows 1 and 2 a dipole of 6.2D has been assumed (see text). In the final column the published values that have been obtained from simplified equations such as (7.24) and (7.25) have been reduced by the ratio of the ‘published’/’eqn (7.25)’ results of Teng and Garito. Values in brackets are unaltered, except in the case of Wortmann et al. where the published result has been multiplied by the missing 2o internal field factor. TG: Teng and Garito;43 Bur: Burland et al.;47 Boss: Bosshard et al.:46 Stah: Stahlein et al.:49 Wort: Wortman et al.;50 Pal: Paley et al.;48 KS Kaatz and Shelton,33 Hyper Rayleigh measurement. The bTz values are in atomic units and the value in the last column has been adjusted, where appropriate, by the same factors used for the macroscopic quantities in Table 4.

7.2 Calculation of the molecular bz from the experimental nonlinearities The formulae for effecting the conversion have been discussed in section (7.1). Data is not completely available to apply the general formula of Teng and Garito, but the simplified versions can readily be applied to the macroscopic measurements described in the previous section. A number of different values of the dipole moment have been adopted in extracting bTZ from g 0 . In all cases other than in refs. 42 and 43 the values of m and a form of b are given and it is possible to reconstruct the quantity mbZ. In comparing the interpretations of different studies this product, expressed in units of (au.Debye), has been used. The third order hyperpolarizability term, g of Table 6 mbTZ: interpretation B

1 2 3 4 5 6 7 8 9 10

Ref.

Mol.

TG TG Bur(1) Bur(2) Boss Stah Wort Pal(1) Pal(2) KS

pNA MNA M NA MNA MNA pNA pNA pNA pNA pNA

Published G,B* G,B* G,X G,X G,X G,B* G,X G,X G,X B*

4.37 (5.52) 4.32 (5.78) 4.40 3.87 4.82 4.75 3.28 (2.50) 5.52 2.99 4.87

The product mbTZ in units of au.Debye/104. The last column gives the mbTZ product calculated from the published values with the assumptions of the previous column. See Table 5 for key to references.

268 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

Table 7 MPNA = 138, MMNA = 152 v0 =0.967 cm3 g1 o 1064 nm e0 = 2.209 F = f 0(f o)2f 2o f0 ¼ @e

no = 1.41253,

e0 ðn2o þ2Þ , n2o þ2e0

@w 0 ¼

fo ¼

n2o þ2 3 ,

n2o = 1.422

f 2o ¼

n22o þ2 3

42, for MNA46 also assumed for pNA.

eqn (4.15) is known to be negligible. Table 5 includes results computed using eqns (7.24) and (7.25) which can be compared with values obtained from the published data. The refractive index, dielectric constant and other data required to perform the calculations are given in Table 7. Table 6 gives an alternative interpretation of the published data. In Table 5, starting with the adjusted results of column B of Table 4, the two formulae have been applied and resulting values converted to atomic units and multiplied by a factor of 4, so that the values quoted (as in the paper of Reiss2) are for the quantity, bTZ(2o;o,o). Also included is the value reported in the original paper, again adjusted by the factors used to convert column A to column B in Table 4 and to bTZ(2o;o,o) in atomic units. The table also includes pNA values reported by Stahelin et al.49 and Wortmann et al.50 (for which it has not been possible to analyse the published macroscopic data as above), the result obtained by the alternative method of Burland et al.47 and the hyper-Rayleigh value of Kaatz et al.12 The values obtained from eqn (7.24), which includes the correction for the concentration dependence of the dielectric constant (from ref. 46), are lowered by between 5 and 10% compared to the uncorrected values. The more frequently used approximation appears to be that of eqn (7.25). The fundamental study of Teng and Garito43 is the only one that treats both pNA and MNA. Referring to the first two lines of Table 5 it can be seen that both the calculated and published values for the two molecules are very similar. The values calculated from eqn (7.25) are substantially larger than the published values calculated on the assumption that the dipole moment has been taken as 6.2D. [Although not stated in the publications this assumption has been made universally in discussions of these papers. It is clear from the better documented study of Singer and Garito that the dipole moment was determined from measurements of the dielectric constant in solution extrapolated to infinite dilution and it is known from other studies that this procedure leads to a value of 6.2D.] For pNA the ratio of the value given in the paper to the value obtained from the widely used uncorrected approximation of eqn (7.25) is rp = 0.79, for MNA it is rM = 0.74. It is possible that this discrepancy is the source of the assumption made by some authors that the reported bZ value referred to the B* convention, in which case the ratio would have been 0.67, the difference being attributed to the uncertain values of the correction terms used in applying eqn (7.21). The alternative, and probably more plausible explanation, is that the modified field factors and additional correction terms in eqn (7.21) which was used in ref. 43, account for the differences. If this is so then Teng and Garito’s results can be assigned to the (G, X) convention; and if it is accepted that the full equation produces a more accurate version of the microscopic response function than the approximate equations, then there is a case for adjusting other results that have been obtained through the approximate equations by the ratios, rp or rM. This method has been used, where appropriate, to produce the quantities in the final column of Table 5. As previously stated there is some ambiguity in the definition of the macroscopic quantity used in the work of Burland et al., but if this is taken to be w(3) the discrepancy with the results of ref. 43, assumed to be in the (G, X) convention, would Chem. Modell., 2008, 5, 249–278 | 269 This journal is

c

The Royal Society of Chemistry 2008

amount to a factor of almost 2. In Table 5 the results of Burland et al. have been assigned to the (G, X) convention. The hyperpolarizability, calculated from eqn (7.25) and the data in Table 4, reproduce the published value obtained by the first method employed in the paper and this value has been adjusted by the factor rM in the final column. In ref. 47 a second method of obtaining the hyperpolarizability based on a numerical fit over the whole range of concentrations, rather than on an extrapolation to zero concentration, has also been reported. There is no justification for applying the above correction factor in this case and the value in the final column is unchanged. It is possible that this second method, which gives a lower hyperpolarizability, implicitly includes some of the additional corrections of eqn (7.21). Bosshard et al.46 work in the (G, X) convention. From Table 4 their value can be recalculated and adjusted by the factor, rM. The interpretation of the hyperpolarizability reported by Stahelin et al.49 is more difficult. Their publication is closely associated with that of Burland et al.47 and it might be assumed that they would employ the same convention. The issue again turns on their definition of a quantity referred to as w(3). Making the same assumptions as in the case of Burland et al. would lead to the (G, X) convention and values calculated via eqn (7.25) to be adjusted as before. This interpretation has been used in Table 5. Some of the authors of this publication have, however, in other work concluded that the Teng and Garito hyperpolarizabilities are in the B* convention and they compare their value with that of Teng and Garito. The close agreement found is rather illusory since Stahelin et al. use a dipole moment of 7.0. There is nevertheless a strong possibility that their quoted value is intended to refer to the B* convention. This alternative view is adopted in Table 6. The molecular quantities derived by Wortmann et al.50 are obtained through a very detailed analysis that includes, although in a different formalism, all the smaller terms treated in the full equation of Teng and Garito. It is reasonably clear that his values of the hyperpolarizability refer to the X convention. The adjustment by rp is not required, but division by the missing field factor, f2o [see section 3, and ref. 21] is necessary. The last three rows of the table exhibit the EFISH results of Paley et al.,48 the second version, corrected as described above probably being the more acceptable, and the Hyper-Rayleigh value calculated from the dipole moment and hyperpolarizability given in ref. 12. In Table 6 an alternative analysis is presented. The essential assumptions are that the hyperpolarizabilities reported by Teng and Garito and by Stahelin et al. and possibly even by Wortmann et al. and Kaatz et al. relate to the B* convention. The figure in brackets for the Teng and Garito results has been calculated from eqn (7.25), assuming that G/ = w(3). To maintain any reasonable degree of consistency in the results it still has to be assumed that refs. 46 and 47 have used the X convention. Since most of these results have been calculated with the approximate formulae they should be compared with the bracketed values for the Teng and Garito results. The agreement is then poor. If the general bias towards an interpretation in terms of the B* convention reflected in Table 6 has any validity then the average values of the product would be substantially higher than predicted from the results of Table 5. From Tables 5 and 6 it is possible to make an assessment of an average result. From rows 1, 6, 7, 9 and 10 of Table 5, for the case of pNA, and rows 2, 3 and 5 for MNA, pNA: mbTZ = 2.63 0.2 au Debye

MNA: mbTZ = 3.26 0.2 au Debye

where the second value of Burland et al.46 has been omitted on the grounds that the extrapolation method with the adjustment to the final column is more accurate. From Table 6 the uncertainties are greater but it can reasonably concluded that from both pNA and MNA the values of the product are in the range (4 to 5) au.Debye. There are clearly a number of uncertainties that make it impossible to 270 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

Table 8 Estimates of bTZ for pNA and MNA in dioxane

pNA pNA pNA pNA pNA pNA MNA MNA MNA MNA

A A KS(X) (gas) B B KS(B*) (gas) A A B B

mbTZ/Debye.au/104

m/Debye

bTZ/au

2.63 2.63

6.2 7.0 6.87 6.2 7.0 6.87 6.2 7.4 6.2 7.4

4242 3757 1781 7256 6429 2671 5258 4405 7258 6081

4.5 4.5 3.26 3.26 4.5 4.5

The figure of 4.5 for the mbTZ product from interpretation B is an arbitrary value within the group of higher values found if the results of Pal (2) and Wort are omitted. See Table 5 for key to references.

reach completely reliable conclusions, but a consideration of the internal consistency of the analyses make it seem probable that the version given in Table 5 is the more likely to be correct. This is in broad agreement with the views put forward by Reiss for pNA. The inclusion of the MNA data draws attention to the major variations that arise when different procedures are used to convert the macroscopic measured nonlinearities (even when unambiguously defined) to molecular quantities. To extract the hyperpolarizability from the product data a value for the dipole moment has to be adopted. Again there is considerable variation between the values used by different groups. It has been argued that the dipole moment of the molecule in vacuo should be used, all other effects presumed to be subsumed into the cavity field formalism. If the cavity method were strictly accurate the analysis from the dielectric measurements should lead to the dipole moment of the isolated molecule. Gaseous dipole moments for pNA and MNA have not been reliably measured in the gas phase. Calculations tend to give higher values (B47 Debye) for both molecules, whereas the solution extrapolation for pNA in dioxane gives 6.2 Debye. In the case of pNA Teng and Garito (almost certainly!) used the value 6.2 Debye while all other authors have adopted higher values near to 7 Debye. For MNA it can again be assumed that the lower solution value was used by Teng and Garito while Burland et al. and Bosshard et al. have used a value of 7.4 Debye. In Table 8. The results obtained from the above averages of Table 5 are given for one low and one high value of the dipole and compared with the gas phase value from Kaatz et al.12 for pNA.

8. Theoretical approaches to the calculation of the EFISH nonlinearity of pNA in solution There is reasonable agreement between the bTZ values obtained from the gas-phase measurement of Kaatz et al.12 and the higher level theoretical calculations on the isolated molecule. If effects occurring in solution were completely accounted for by the introduction of the standard internal field factors in eqn (4.15) then the bTZ values derived from the solution measurements should also be in agreement with the gas phase values. It is clear from a perusal of the above tables that bTZ extracted from solution measurements is always substantially greater than the value established for the isolated molecule. This is true irrespective of whether the experimental data is interpreted as pertaining to X or to the B* conventions. A more detailed analysis of the solution behaviour is therefore required and, since applications exploiting the high nonlinearities possessed by pNA and similar molecules will inevitably use condensed phases, such an analysis should be of practical importance. Chem. Modell., 2008, 5, 249–278 | 271 This journal is

c

The Royal Society of Chemistry 2008

In assessing recent work it might be useful to classify models used for calculating the response functions of liquids or solutions in general terms as follows:A: Continuum Models (I) Lorentz-Lorenz, Onsager internal field factors (QM(g), Continuum) (II) Self-Consistent Reaction Field (SCRF) Models (QM/Continuum) B: Discrete Models (III) Discrete SCRF Models (QM/MM/Continuum) In case (I), the theory that has been used to describe the EFISH analysis in earlier sections, the isolated molecule calculation provides the bZ value and the internal field factors adjust the fields to allow for the polarization on the cavity surface. The effect of reaction fields due to the additional polarization on the cavity walls induced by the permanent and induced dipoles on the central molecule is implicitly included in the low frequency Onsager field factor through the dielectric constant values. The choice of the high frequency dielectric constant (eN) in this formulation is rather ill defined and no account is taken of changes in the static or dynamic polarizabilities of the molecule as a result of the surrounding fields. In (II) the reaction field of the dipole is included in the molecular Hamiltonian, so that the QM calculation, at whatever level, is modified to give a new molecular wavefunction for one molecule at the centre of a cavity. This calculation can be carried out in the absence of an applied macroscopic field and would give the unperturbed properties (dipole moment, energy states etc.) of a solvated molecule. The macroscopic field has then usually been applied in a finite field calculation of the hyperpolarizability. One source of uncertainty in this procedure arises from the fact that when the reaction field is introduced into the hamiltonian it appears in a specific form, 1 2ðe 1Þ l a3 2e þ 1

ð8:1Þ

where e is the dielectric constant and l the dipole moment, which is very sensitive to the choice of the cavity radius, a. Method (III) attempts to include a discrete molecular description of the solvent structure around a central solute molecule. The solute molecule is described, again, through a QM calculation while the spatial distribution, charge distribution, polarizabilities etc. of the adjacent part of the solvent is represented by a parametrized molecular mechanics (MM) model. The parametrization may be achieved through high level calculations on the isolated solvent molecules. To relate this type of calculation to the macroscopic response it is still necessary to include long range effects through the continuum model. The basis of the procedure followed is justified by the theory that underlies the derivations of the Lorentz and Onsager approximations. In its simplest form a sphere is defined around a solute molecule and the calculation split into two parts. The sphere is a ‘virtual’ cavity. It is not implied that the material in the sphere has been removed. In two cases, a simple cubic array of dipoles and a hom*ogeneous isotropic continuum, it can be shown that for cavities that are large on a molecular scale but much smaller than the macroscopic dimensions, the field arising from the molecules in the cavity tends to zero. The material outside the cavity modifies the macroscopic field through the usual field factors. There are slight variations in the methods used by different authors, but typically (following Jensen et al.51 and Wortmann and Bishop20) the field at the solute molecule is written, EINT = EMAC + EPOL ECAV POL + EDIS

(8.2)

where the first two terms on the right are the macroscopic field and the continuum polarization field that gives rise to the usual field factors. The part of the macroscopic polarization field that due to the material in the cavity is then removed and 272 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

replaced by the discrete contributions, EDIS of the molecules in the cavity. It is possible to write, PERM EDIS = EIND DIS + EDIS

(8.3)

where the first term is the field due to the molecules in the cavity which has been induced by the macroscopic field, while the second term is the average field produced by these molecules in the absence of a macroscopic field. The separation is not always convenient. The macroscopic polarisation can always be written as, P = NhmINDi

(8.4)

where hmINDi is an average induced dipole contribution from a solute molecule. The polarization can now be expanded in several different ways. For example, if the expansion is in terms of powers of the first two terms on the right of eqn (8.2) which together comprise the effective fields as defined by the conventional field factors then effective polarizabilities and hyperpolarizabilities will be defined which incorporate the effects of the discrete terms that have been omitted from the fields in the power expansion. If the expansion is in terms of all contributions except EPERM DISC then the polarizabilities and hyperpolarizabilities so defined will represent effective properties of a solvated molecule. Since pNA and most of the chromophores of interest have large dipole moments an important feature of the continuum models is the introduction of the reaction field. The pNA molecule at the centre of the cavity in the continuum induces a polarization on the surface of the cavity, which produces the reaction field acting on the central molecule. This reaction field changes the dipole moment of the pNA molecule via the linear polarizability. A self consistent procedure is required in which the effects of the reaction field and also the effects of the applied macroscopic fields modified by the internal field factors are included in a self-consistent determination of the molecular response within a specified quantum mechanical model. Willetts and Rice52 attempted to develop a continuum model (type II) to calculate the EFISH response of liquid acetonitrile. Their results were compared with accurate calculations and measurements of the hyperpolarizability of the isolated molecule and measured solution for two different cavity radii. One ‘natural’ choice of cavity radius is based on the van der Waals radius of the compound and with this value of a the discrepancy between the gas and solution measurements is not resolved. A rather smaller value of a can be selected to give much better agreement. The arbitrariness of the choice of cavity radius places a serious limitation on the usefulness of the continuum theories and, in recent years, there has been a considerable effort to develop hyperpolarizability calculations based on discrete molecular simulations of liquid and solution structure. Several groups have developed models of type III based on a combination of quantum chemical calculations (QM) at various levels for a central model with the effect of the surrounding medium represented by an ensemble of molecules treated by molecular mechanics (MM). Mikkelsen et al.53 have employed a continuum reaction field response model using reference wave functions at SCF and MCSCF levels. The basis set selection is developed from work on RPA calculations on pNA by Agren et al.54 A planar C2v geometry has been employed, adapted from the crystallographically determined structure where deviations from planarity are small.55,56 In the MCSCF work a full p-orbital space is included with complete correlation constituting a CASSCF p-orbital calculation. The electronic excitation energies, transition moments and ground and CT dipole moments have been calculated in this approximation. These results are applied to a subsequent analysis of the two-state model. Effective solute hyperpolarizabilities are obtained over a range of continuum dielectric constants including those of the usual solvents. Chem. Modell., 2008, 5, 249–278 | 273 This journal is

c

The Royal Society of Chemistry 2008

Jensen et al. use a discrete QM/MM model in which the quantum mechanical part of the calculation employs a DFT method. Using procedures such as those outlined above they calculate vacuum, solute and effective values for pNA at CCSD/B3LYP level. Reiss et al.5,6 have recently carried out an extensive theoretical study of the linear and nonlinear properties of pNA in the gas phase, of three solvents, cyclohexane (CH), 1,4-dioxane (DI) and tetrahydrofuran (THF) and of solutions of pNA in these solvents. The aim of the work is to provide a treatment in which the liquid phase molecules are treated discretely via a statistical simulation of the liquid and solution structures. The general strategy for attaining this goal can be summarized as follows:Firstly, the geometry of the pNA molecule in the gas and solution phases has been closely scrutinized using, for the solution, several self-consistent reaction field formalisms, the solute molecule being described at reasonably high levels of ab initio and DFT theory with averaging over conformations of the molecule. The question of the structure of pNA in gas, liquid and crystal phases has been previously addressed by several authors and the main features, confirmed and extended by this study, are that in the gas phase the C–NH2 group is pyramidal while under the influence of local fields in solutions or crystals the arrangement tends to become more planar. The crystal structure has been determined experimentally and is planar but with the C–NH2 plane twisted slightly with respect to the plane of the molecule. The computations in this work establish that there is a similar effect in a local field, represented by a self consistent reaction field, provided a rather high dielectric constant (B7) is used. Only THF of the three solvents has such a large permittivity, the observed (static) value for DI being 2.2. However the authors draw attention to ‘the dioxane effect’ where, probably because of high quadrupole interactions, this solvent behaves like a more polar solvent on a microscopic scale, as is demonstrated by high solvatochromic shifts. Torsional rotations of the amino group are also treated. The electronic structures of the three solvent molecules have been examined at the same level and the dipole and quadrupole moments evaluated so that the dipole, dipole gradient and quadrupole fields can be included in the discrete simulation. Calculations of the dipole moment of the isolated pNA molecule have also been made for use in the EFISH formula. Calculations of the vibrational contributions to the static polarizability and hyperpolarizability have also been attempted. As far as the EFISH experiment is concerned, which depends on the square of an optical frequency field, it is assumed that there will be no direct contribution to b(2o,o,o) from the vibrations although the static contribution is comparable with the static electronic contribution to b(0;0,0). An indirect vibrational effect through the linear polarizability of the solvent molecules is more important. Calculations of the vibrational effects in pNA cannot be carried out reliably even for the static case since the second term in the perturbation theory is much larger then the first and there is no evidence of convergence. The preliminary calculations are essentially of the properties of single molecules in the self-consistent continuum reaction field, taking account of the effect of the field on the structure of the single molecule. The data is then used in the discrete simulation, where the effective parameters (a,b,g) of the solute pNA molecule averaged over solution conformations are obtained. In this stage of the computation the averaged geometries from the continuum studies are used. The response function calculated in the simulation is the molecular response function of the solvated molecule. When the finite field procedure is applied the polarization of the solvent molecules by the macroscopic field is not included and this (substantial) effect has still to be allowed for when relating the results to the 274 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

measured macroscopic susceptibility by including the usual Onsager-Lorentz-Lorenz factors. This feature might be considered a systematic weakness of the procedure in comparison with that of Jensen et al., where the macroscopic susceptibility is calculated directly from a finite field that interacts with both solvent and solute. The level of description in the latter calculation is much lower than that in the Reiss study, where it would not be computationally feasible at present to use the Jensen procedure. However, in the Reiss study, despite the detailed simulation of the solution for part of the calculation, the relation of the results to experiment still depends on the validity of the conventional internal field factors. The computed quantity is therefore,

mb g ¼ gel þ ; ð8:5Þ 5kT related to the standard macroscopic susceptibility by, 3 (3) 2wzzzz(2o;o,o,0)

= 14Nf0f2of2og,

(8.6)

Since Reiss et al. consistently use the T convention. In discussing the work of the above groups it is desirable to separate the comparison of their theoretical results from the comparison of theoretical results with experiment. As far as the second comparison is concerned a simplified version of the extensive discussion and comparison of models presented in the paper would be that, when all the enhancements associated with non-uniform fields and quadrupole effects have been included, there is reasonable agreement between the effective g 0 or first hyperpolarizability extracted from the analysis and the lower of the two estimates described in section 7. In other words, the agreement depends on identifying the results of Teng and Garito and Stahelin et al. as being correctly deduced in the X convention. In comparing solution calculations, however, there are still differences in the choice of the quantity that is used to characterise the response of the solution. A distinction is made between the property of a solvated pNA molecule and the effective hyperpolarizability of pNA in solution. In the former case the modifications of the pNA molecule have been calculated in the solution without any applied electric field present. In the second case the applied electric field has been present while the structure of the pNA and its immediate surroundings have been optimized. In both cases the calculated molecular quantity has to be used with internal field factors to recover the macroscopic response—but the choice of field factors may be different in the two cases.

9. Recent work on pNA/MNA The development of the discrete solution simulation methods and their application to other systems (the pNA/MNA/dioxane solutions have been used here as the most exhaustively studied exemplar, but similar problems are inherent in all studies of organic molecules of a comparable or greater size in solution) is well described in recent publications which can be located, in particular, via the recent series by Reiss and collaborators. Here attention is drawn to a few more recent publications that are relevant to the subject. Whitten et al.57 have critically re-examined the molecular dipole moment of MNA in the crystal. MNA, as opposed to pNA, crystallizes in a noncentrosymmetric group where the molecules are all closely aligned, and it would be expected that there would be a very large internal dipolar field acting on each molecule arising from the dipoles of its neighbours. The additional induced dipole in the ground state of the molecule under these circ*mstances would, if known, give an indication of an upper limit to the changes that might be expected Chem. Modell., 2008, 5, 249–278 | 275 This journal is

c

The Royal Society of Chemistry 2008

in solution. An earlier experimental charge density analysis58 had concluded that the ground state dipole was enhanced by a factor of almost 3, to 23 Debye due to a combination of strong intermolecular interactions and crystal field effects. Whitten et al.57 have examined X-ray and neutron diffraction data carefully measured at 100 K and supplemented their analysis with ab initio Hartree-Fock calculations to re-assess the problem and have concluded that the degree of enhancement is only of the order of 30–40%. Their analysis includes anisotropic modelling of the motion of hydrogen atoms, integral use of periodic ab initio calculations and improved data quality, all of which represent advances over the earlier studies. Yang et al.59 have made an extensive study of the effects of basis sets in a TDDFT-SOS approach to the calculation of the hyperpolarizability of pNA and a squairaine molecule. They find that the method is not very sensitive to the choice of basis set. The convergence behaviour and efficiencies of various functionals are discussed. Calculations of the excited state energies of pNA in vacuo and in solution— through a polarized continuum model—have been reported using a TDDFT model by Scalmani et al.60 [See also Faustino et al.,61 who have attempted to relate calculations of the first hyperpolarizability of excited states of pNA to experimental hyper-Rayleigh scattering data].

Appendix I. Conversion of units 1.

Hyperpolarizabilities

Four systems occur: 2 n (1) ESU: dm(n)/esu.cm = a(n) esu(E/esu.cm ) 1 n (2) SI(1): dm(n)/Cm = a(n) (E/Vm ) SI(1) 1 n (3) SI(2): dm(n)/Cm = e0a(n) esu(E/Vm ) (n) (n) 2 n (4) AU: dm /e.a0 = aau (E/ea0 ) where, a(n) is the nth order polarizability [(n 1)th order hyperpolarizability]. The omission of the (1/n!) factors does not affect the conversion of units. The AU equation has been written in the form suitable for conversion to the esu system. If the velocity of light is written, c = B 108 ms1, where B = 2.99792458, then using, (Coulomb/esu) = (B 109)1, e0 = (4pB2 109)1, leads to, ðnÞ

aSIð1Þ =CV n mnþ1 ¼ ðnÞ

aSIð2Þ =mnþ2 V 1n ¼

10ð4nþ11Þ ðnÞ aesu =esu1n cm2nþ1 Bnþ1 4p 10ð4nþ2Þ ðnÞ aesu =esu1n cm2nþ1 Bn1

n1 a(n) (a0/cm)(2n+1)a(n) AU = (e/esu) esu,

where e = 4.8029750 1010 esu, a0 = 5.29167 109 cm. The numerical equivalents of 1 esu unit are given in the table for n = 1,2,3:

a b g

SI(1) 1.1126501 (16) 3.7224011 (21) 1.2379901 (25)

SI(2) 1.2566371 (5) 4.1916900 (10) 1.3981973 (14)

276 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

(AU) 6.74872532(24) 1.15756813(32) 1.98550683(39)

2.

Susceptibilities

There are only two systems in use: 2 n (1) ESU: P(n)/esu.cm2 = w(n) ESU(E/esu.cm ) (n) 2 (n) 1 n (2) SI: P /Cm = e0wSI (E/Vm ) ðnÞ

wSI =CV n mn2 ¼

4p ðnÞ w =esu1n cm2n2 104ðn1Þ Bn1 ESU

The numerical equivalents of 1 esu are given in the table for n = 1,2,3: w(1) 4p

3.

w(2) 4.1916900 (4)

w(3) 1.39811973 (8)

Dipole moments

1 Debye Unit = 1018esu cm = 3.3356410 1030 Cm.

References 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33

A. Willetts, J. E. Rice, D. M. Burland and D. P. Shelton, J. Chem. Phys., 1992, 97, 7590. H. Reiss, J. Chem. Phys., 2006, 125, 014506. C. J. F. Bottcher, Theory of Electric Polarization, Elsevier, New York, 1973. J. Zyss and D. S. Chemla, in: ‘Nonlinear Optical Properties of Molecules and Crystals’, eds. D. S. Chemla and J. Zyss, Academic press, New York, 1987, vol. 1, p. 1. H. Reiss, A. Grzybowski and M. G. Papadopoulos, J. Phys. Chem. A, 2005, 109, 10106. H. Reiss, M. G. Papadopoulos and A. Grzybowski, J. Phys. Chem. B, 2006, 110, 18537. R. W. Munn, Synth. Met., 1994, 64, 117. D. A. Dunmur and R. W. Munn, Chem. Phys., 1983, 76, 249. P. J. Bounds and R. W. Munn, Chem. Phys., 1981, 59, 47. M. Malagoli and R. W. Munn, J. Chem. Phys., 1997, 107, 7926. H. Reiss, M. G. Papadopoulos and R. W. Munn, J. Chem. Phys., 1998, 109, 6828. P. Kaatz, E. A. Donley and D. P. Shelton, J. Chem. Phys., 1998, 108, 849. N. Blombergen, Nonlinear Optics, Benjamin, New York, 1965. J. F. Ward and G. H. C. New, Phys. Rev., 1969, 185, 57. P. N. Butcher and D. Cotter, The Elements of Nonlinear Optics, CUP, Cambridge, 1990. C. Flytzanis, in Quantum Electronics: A Treatise, eds. H. Rabin and C. L. Tang, Academic press, New York, 1975, pp. 9–207. J. F. Ward, Rev. Mod. Phys., 1965, 37, 1. B. J. Orr and J. F. Ward, Mol. Phys., 1971, 20, 513. J. O. Morley and D. Pugh, in: ‘Nonlinear Optical Properties of Molecules and Crystals’, eds. D. S. Chemla and J. Zyss, Academic press, New York, 1987, vol. 1, p. 193. R. Wortmann and D. M. Bishop, J. Chem. Phys., 1998, 108, 1001. R. W. Munn, Y. Luo, P. Macak and H. Agren, J. Chem. Phys., 2001, 114, 3105. D. A. Kleinman, Phys. Rev., 1962, 126, 1977. F. Sim, S. Chin, M. Dupuis and J. Rice, J. Phys. Chem., 1993, 97, 1158. Y. Luo, H. Agren, O. Vahtras and P. Jorgensen, Chem. Phys. Lett., 1993, 207, 190. H. Soscun, O. Castellano, Y. Bermudez, C. Toro, N. Cubillan, A. Hinchliffe and N. Phu, Int. J. Quant. Chem., 2006, 106, 1130. P. Salek, T. Helgaker, O. Vahtras, H. Agren, D. Jonsson and J. Gauss, Mol. Phys., 2005, 103, 439. C. K. Miller and J. F. Ward, Phys. Rev. A, 1977, 16, 1179. J. F. Ward and C. K. Miller, Phys. Rev. A, 1979, 19, 826. J. F. Ward and D. S. Elliot, J. Chem. Phys., 1978, 69, 5438. D. P. Shelton, Phys. Rev. A, 1990, 42, 2578. D. P. Shelton and J. E. Rice, Chem. Rev., 1994, 94, 3. P. Kaatz and D. P. Shelton, Rev. Sci. Instrum., 1996, 67, 1438. P. Kaatz and D. P. Shelton, J. Chem. Phys., 1996, 105, 3918.

Chem. Modell., 2008, 5, 249–278 | 277 This journal is

c

The Royal Society of Chemistry 2008

34 P. Kaatz and D. P. Shelton, Mol. Phys., 1996, 88, 683. 35 R. D. Pyatt and D. P. Shelton, J. Chem. Phys., 2001, 114, 9538. 36 B. F. Levine and C. G. Bethea, J. Chem. Phys., 1976, 65, 2429; B. F. Levine and C. G. Bethea, J. Chem. Phys., 1978, 69, 5240. 37 B. F. Levine, Chem. Phys. Lett., 1977, 37, 516. 38 J. L. Oudar, J. Chem. Phys., 1977, 67, 446. 39 J. L Oudar and D. S. Chemla, Opt. Commun., 1975, 13, 10. 40 L.-T. Cheng, W. Tam, S. H. Stevenson, G. R. Meredith, G. Rikken and S. R. Marder, J. Phys. Chem., 1991, 95, 10631. 41 L.-T. Cheng, W. Tam, S. R. Marder, A. E. Stiegman, G. Rikken and C. W. Spangler, J. Phys. Chem., 1991, 95, 10643. 42 C. C. Teng and A. F. Garito, Phys. Rev. Letters, 1983, 50, 350. 43 C. C. Teng and A. F. Garito, Phys. Rev. B, 1983, 28, 6766. 44 K. D. Singer and A. F. Garito, J. Chem. Phys., 1981, 75, 3572. 45 G. R. Meredith and B. Buchalter, J. Chem. Phys., 1983, 78, 1938. 46 Ch. Bosshard, G. Knopfle, P. Pretre and P. Gunter, J. Appl. Phys., 1992, 71, 1594. 47 D. M. Burland, C. A. Walsh, F. Kajzar and C. Sentein, J. Opt. Soc. Am. B, 1991, 8, 2269. 48 M. S. Paley, J. M. Harris, H. Looser, J. C. Baumert, G. C. Bjorklund, D. Jundt and R. J. Tweig, J. Org. Chem., 1989, 54, 3774. 49 M. Stahelin, C. R. Moylan, D. M. Burland, A. Willetts, J. E. Rice, D. P. Shelton and E. A. Donley, J. Chem. Phys., 1993, 98, 5595. 50 R. Wortmann, P. Kramer, C. Glania, S. Lebus and N. Detzer, Chem. Phys., 1993, 173, 99. 51 L. Jensen and P. van Duijnen, J. Chem. Phys., 2005, 123, 074307. 52 A. Willets and J. E. Rice, J. Chem. Phys., 1993, 99, 426. 53 K. V. Mikkelsen, Y. Luo, H. Agren and P. Jorgensen, J. Chem. Phys., 1994, 100, 8240. 54 H. Agren, O. Vahtras, H. Koch, P. Jorgensen and T. Helgaker, J. Chem. Phys., 993, 98, 6417. 55 F. Bertinelli, P. Palmieri, A. Brillante and C. Talliani, Chem. Phys., 1997, 25, 335. 56 S. P. Karna, P. N. Prasad and M. Dupuis, J. Chem. Phys., 1991, 94, 1171. 57 A. E. Whitten, P. Turner, W. T. Klooster, R. O. Piltz and M. A. Spackman, J. Phys. Chem. A, 2006, 110, 8763. 58 S. T. Howard, M. B. Hursthouse, C. W. Lehmann, P. R. Mallinson and C. S. Frampton, J. Chem. Phys., 1992, 97, 5616. 59 G. C. Yang, S. Q. Shi, W. Guan, L. Fang and Z. M. Su, J. Mol. Struct. (Theochem), 2006, 773, 9. 60 G. Scalmani, M. J. Frisch, B. Mennucci, J. Tomasi, R. Cammi and V. Barone, J. Chem. Phys., 2006, 124, 094107. 61 W. M. Faustino and D. V. Petrov, Chem. Phys. Lett., 2002, 365, 170.

278 | Chem. Modell., 2008, 5, 249–278 This journal is

c

The Royal Society of Chemistry 2008

The floating spherical Gaussian orbital (FSGO) methodw A. H. Pakiari DOI: 10.1039/b801727c

Introduction This review is dedicated to my PhD supervisor Professor J. W. Linnett, to Professor N. C. Handy who taught me Configuration Interaction (CI) during my postdoctoral period, and to Professor M. J. S. Dewar, who taught me theoretical physical organic chemistry. The review includes the most important articles on the FSGO method dating from 1967, when the original idea of the FSGO method was introduced by A. A. Frost1 until the last FSGO paper to appear in the literature in 2004. Therefore, this is a review of 37 years of FSGO. One difficulty is that authors have applied FSGO with other methods such as Monte Carlo and the phrase ‘‘FSGO’’ has not been used in the title or abstract of their articles. Such articles cannot be found by searching databases. The date of finishing this review is July 2007, which is also the date of the final search for finding FSGO articles in databases. What we are trying to achieve in this article is to give a review of the subject for those who are interested, in such a way that they can start their research. Obviously, one is not able to go into excessive detail, but references cited in this review can help such people to peruse the matter. The references are given in such a way as to make them succinct due to the limitation of the length of the review article, but hopefully in an effective manner. Therefore, you may find the name of some authors in the article but no mention of them in the references. For example, Frost used Huzinaga’s pseudopotential, but reference to Huzinaga is not given in our reference list. Presumably, those people who are interested in this particular reference can read Frost’s paper, and find it. This review consists of four parts. Part I includes the original theory of FSGO and its characteristics. Part II describes various developments of the FSGO original theory; for example, describing molecular orbitals by multi-Gaussian expansions, open shell methodology, multi-determinant calculations, how to describe lone pairs, how to describe p-systems, force constants and pseudopotential etc. Part III contains a collection of useful papers, which may be used by readers for different purposes; this part is called ‘Application of FSGO’. Part IV contains other methods which FSGO has been applied to such as FSGO fragments, natural orbitals, Monte Carlo, DFT, perturbation theory etc. Obviously, it is not possible to cover all papers. It may be that some important papers have been neglected, partly due to the space limitation of review article and partly due to ignorance on the part of author, and I apologize for any major omission. However, the authors named in the review have many publications in this subject; perhaps their names form good keywords for more FSGO articles, for those readers who are interested. Frost last reviewed this subject in 1977.2 Chemists are always interested to see where the electrons are in molecule, which electrons from what valence shell of which atom participate in different part of molecules and why, and what is the of role of each electron in determining molecular properties. These are difficult questions to be answered, but such considerations enable us to study chemical reactivity, which is what chemistry all about. Perhaps, Chemistry Department, College of Sciences, Shiraz University, Shiraz 71454, Iran. E-mail: [emailprotected]; [emailprotected] { I have to acknowledge Dr A. Mohajeri faculty member of Chemistry Dept. Shiraz University for editing and scientifically correction of this manuscript. [emailprotected]

Chem. Modell., 2008, 5, 279–311 | 279 This journal is

c

The Royal Society of Chemistry 2008

the motivation for introducing the Floating Spherical Gaussian Orbital (FSGO) method was the above considerations. Computer programming and working with the FSGO method is simple, and you will find in the Appendix various useful expressions for evaluation of the electronic energy.

1- Part I: 1-A

Original theory

The FSGO method, which was introduced by Frost1 in 1967, is briefly as follows. The spherical Gaussian, which is the simplest Gaussian function, and looks like the simplest hydrogenic 1s orbital, is used as a constructing function for the basis set, ~i) = (2ai/p)3/4 exp[ai(~ ~i)2] Gi(~ rR rR

(1-A-1)

~i are the variational The orbital exponent ai and the location of the orbital center R parameters. These Gaussians are positioned in the molecule in the same way as in Lewis’s concept of valence theory. A single Slater determinant wave function for the non-orthogonal basis set of doubly occupied orbitals, GI is constructed, and the total electronic energy is given by the well-known formula X XX hijh1 jiiþ ð2hiijh2 jjji hijjh2 jijiÞ ð1-A-2Þ Eel ¼ 2 i

j

i

Here, h1 is the operator for kinetic and electron nuclear attraction energies, and h2 is the two electron operator (1/rij). Since the Gaussian orbitals in eqn (1-A-1) form a non-orthogonal basis set, the following transformation can be used in order to convert it to an orthogonal basis set; X ji ¼ ðS1=2 Þin Gn ð1-A-3Þ n

where S is the overlap matrix. The total electronic energy for a single-determinantal wave function (Slater determinant) for describing one spherical Gaussian per molecular orbital is given by: X X Eel ¼ 2 hij Tjk þ Rijkl ½2Tij Tkl Til Tjk ð1-A-4Þ j;k

k;l;p;q

where hij = hi|h1|ji, Rijkl = hij|h2|kli. T = S1 is the inverse of the overlap matrix. The total energy is obtained by adding the inter-nuclear repulsion energy. E ¼ Eel þ

N X Zm Zn mon

ð1-A-5Þ

rmn

where rmn and Z are inter-atomic distance and nuclear charge, respectively. The dipole moment can be written as follows, X X X hmi ¼ Rn Zn 2 Ri Tii 4 Rij Tij Sij ð1-A-6Þ n

i

j;k

where Rn, Ri and Rij are locations of the nuclei, the orbital centers, and the overlap centers, respectively. The force on each nucleus due to the electron distribution can also be calculated,

fn ¼ 2Zn

n X j;k

Tjk qn;jk

ðRn Rjk Þ s3

280 | Chem. Modell., 2008, 5, 279–311 This journal is

c

The Royal Society of Chemistry 2008

ð1-A-7Þ

where s = |Rn Rjk|, and qn,jk = 8(2/p)1/2Sjk(s/pjk)3F1[2(s/pjk)2]

(1-A-8)

F1(t) = (1/2t)[Fo(t)et]

(1-A-9)

where Fo is the error-type function, which is required for energy integrals and finally pjk=f2ik Sik

(1-A-10)

Technically, the nuclear coordinates and orbital parameters, which consist of the Cartesian coordinates of the orbital center and its exponent, are all variational parameters and have to be optimized until the viral theorem is satisfied as a necessary condition (at least the total kinetic energy is equal and opposite to the total energy). For a sufficient condition, it is required that the obtained electronic and geometrical structure must make sense chemically and experimentally. The above formulas have been programmed in FORTRAN by Frost3 and Liu et al.,4 and there are integral forms for kinetic energy, electron-nuclear attraction and electron-repulsion with a modified routine for the error function eqn (1-A-9) in that paper. We have also provided for above integral forms using the ordinary error function (ERF available in the most FORTRAN compilers), in the Appendix. Frost1 presented the results for LiH by single Gaussian FSGO (SG-FSGO) in his first paper; they are 6.57269 au (in error by 18.6%), 6.56 Debyes (error 11.6%), 3.2260 au (error 7.0%) for the total energy, dipole moment and bond length, respectively. 1-B Characteristics of the SG-FSGO Frost applied the SG-FSGO method to one- and two-electron-pair systems in part II,5 first row of atom hybrids and hydrocarbons in part III,6 hydrocarbons in Part IV7 and diatomic molecules of first-row and second-row atoms,8 where he obtained about 1.7% to 6% error in bond length for hydrocarbons. In addition, the total energies evaluated by SG-FSGO have about 15–17% error. The percentage errors for bond length and bond angle for a molecule with lone pairs such as H2O are 14.5% and 8.0%, respectively. Despite these shortcomings SG-FSGO is rather good in molecule without lone pairs such as hydrocarbons; it cannot however predict correct properties for molecules with lone pairs. For example, a SG-FSGO calculation on F2 with six lone pairs shows dissociation.9 There are several major shortcomings in the original FSGO method (i) Lack of cusp conditions, which can be cured by using at least six concentric Gaussians at nuclei. (ii) Lack of suitable description of lone pair orbitals, which will be discussed later in section 2-H. (iii) Lack of general and suitable orbital description for s-bond. (iv) p-Bond will be given a rather general description in section 2-I. (v) A large number of parameters have to be optimized, which problem has not yet been solved. (vi) Description molecular orbital FSGO needs experience and a lot of chemical intuition, which is managed automatically in the SCF procedure by symmetry adapted concepts. Description of molecular orbitals is important in FSGO, it is used to employ description of orbitals instead of basis set. The results of chemical properties have low percentage of error when the description of MO orbitals is relevant. Therefore, all attempts are to choose a better description of MO orbitals. To get a better orbital description, the multi Gaussians orbital description section 2-A or using open shell and multi-determinantal wave function section 2-C and 2-D may help. 1-B-ii

Localized nature of FSGO wave-function

A localization procedure similar to that of Edmiston and Ruedenberg10 has been proposed by Suthers and Linnett11 for FSGO wave functions. The two electron Chem. Modell., 2008, 5, 279–311 | 281 This journal is

c

The Royal Society of Chemistry 2008

integrals part in eqn (1-A-2) can be written as follows; ER = D + C 0 X 0 X D¼ ½iijii Diagonal terms

(1-B-1) ð1-B-2Þ

i

C0 ¼ 2

XX

X0 ¼ 2

½iijjj; i 6¼ j off diagonal Coulomb terms

ð1-B-3Þ

½ijjji; i 6¼ j off diagonal exchange terms

ð1-B-4Þ

j

i

XX j

i

The equivalent expressions in the non-orthogonal basis set can be written as: X XXXX ½mnjpq ðT1=2 Þim ðT1=2 Þin ðT1=2 Þip ðT1=2 Þiq ð1-B-5Þ D¼ m

C0 ¼ 2

n

p

q

XXXX m

n

p

i

½mnjpq

XX

q

ðT1=2 Þim ðT1=2 Þin ðT1=2 Þjp ðT1=2 Þjq ; i 6¼ j

j

i

ð1-B-6Þ X0 ¼

XXXX m

n

p

½mnjpq

XX

q

i

ðT1=2 Þim ðT1=2 Þjn ðT1=2 Þjp ðT1=2 Þiq ; i 6¼ j ð1-B-7Þ

j

Now, the set of localized orbitals can be obtained by the procedure of Lennard-Jones and Pople, or by the Reudenberg method. Sum of self repulsion terms D will be maximized by this procedure. This implies the simultaneous minimization of C0 and X0 .

2- Part II, Development of original theory 2-A

Multi-Gaussian FSGO

In 1969 Frost12 also developed the double Gaussian FSGO model (DG-FSGO) and improved his previous results. The constructing function is now a linear combination of two spherical Gaussians, ~i)2] + C2 exp[a2(~ ~i)2] Fi = C1 exp[a1(~ rR rR

(2-A-1)

where the Ci are linear coefficients. As shown in Table 1, the improvements for DG-FSGO in comparison with SG-FSGO, are about 10.514.0% in energy, 1.0–4.0% in bond length and 4.05.8% in inter-bond angle. The percentage of error will increase with increasing the number of lone pairs in molecule. A general form of linear combination of spherical Gaussians has been used for describing each molecular orbital.13 The multi-Gaussian function Ith molecular orbital is defined as X ~j Þ w I ¼ NI cj Gj ð~ rR ð2-A-2Þ j

where cj is the coefficient of Gaussian, and NI is the normalization factor which can be evaluated as; !1=2 X ci cj Sij ð2-A-3Þ NI ¼ i;j

The linear coefficients of the multi-Gaussian ci are obtained by direct optimization 282 | Chem. Modell., 2008, 5, 279–311 This journal is

c

The Royal Society of Chemistry 2008

Table 1 The comparison of results of single and double Gaussian FSGO with Hartree-Fock and experimental results SG-FSGO

LiH CH4 NH3 H2O HF

Calc. error% Calc. Error% Calc error% Calc. error% Calc. error%

DG-FSGO

Energy

BL

Angle

Energy

6.572 17.7 33.992 14.7 47.568 15.0 64.288 15.3 84.571 16.0

1.712 7.3 1.115 2.0 1.011 0.06 0.881 7.9 0.784 14.5

7.6905 1.620 3.6 1.6 38.4851 1.038 3.5 5.0 53.81409 0.950 3 6.0 72.7921 0.861 4.1 10.0 94.571 0.794 5.5 13.4

— 87.6 17.8 88.4 15.4 —

BL

Experimental Angle Energya

BL

Angle

1.595 — 1.093 — 1.012 — 0.957 — 0.917 —

— 91.9 13.8 94.5 9.6 —

7.9851 — 39.8660 — 55.9748 — 75.9224 — 100.058 —

— 106.6 — 104.5 — —

a Hartree Fock energy, total energy is negative, BL is bond length in angstrom, and angle is in degree.

similar to other parameters. We use lower case letter i, j, k, l for representation of single Gaussian orbital description, and upper case letter I, J, K, L for representation of multi-Gaussian orbital description. To obtain total electronic energy of multiGaussian orbital description, it must be considered in eqn (2-A-4) that W = D1, and the lower case letters should be changed to upper case one. X X Eel ¼ 2 hIJ WIJ þ RIJKL ½2WIJ WKL WIL WJK ð2-A-4Þ I;J

I;J;K;L

where hIJ and RIJKL were explained for eqn (1-A-4). The MG-FSGO can be used in three ways. First, each Gaussian can only be used in one specific orbital description similar to eqn (1-A-4), which Frost used for single and double Gaussian calculations on LiH. This type is called Localized FSGO. The second type of application is that each Gaussian can be used in different combinations (orbital description) with identical exponent, position and coefficient. This method was applied to ethylene, vinyl lithium, diazene and formaldehyde.13 This type is called Semi-delocalized FSGO. Third, each Gaussian with identical exponent and position but different coefficient can make contributions in different orbital descriptions; therefore, it has more flexibility in our basis. This type of application can be called delocalized MG-FSGO (see section 2-G). This is similar to the SCF procedure. This method was applied to cyclobutadiene.13 2-B Floating ellipsoidal Gaussian orbitals (FEGO) It was thought in the early 1970s that it might be possible to improve the results of FSGO by using Floating Elliptic Gaussian Orbitals, FEGO. In a short article in 1974, Frost14 presented disappointing results for FEGO, which are for LiH; a total energy of 6.57277 au (compared with 6.57270 au for SG-FSGO) and the bond length is 3.212 au (compared with 3.226 au by SG-FSGO). The result for NH3 was also very disappointing, namely 1.96 au for the bond length, and an angle of 91.11. Therefore, there were no further articles about FEGO in the literature. 2-C

Open-shell wave-function

There are two types of open shell wave-function in the FSGO method. I- The first type of open-shell is called restricted open-shell, denoted RO-FSGO, and it was introduced by Blustin and Linnett.15 There are Chem. Modell., 2008, 5, 279–311 | 283 This journal is

c

The Royal Society of Chemistry 2008

n-electrons in doubly occupied orbitals, and there is an extra orbital fm that is singly occupied, n.fm| C = det|f1f1 . . . fnf

(2-C-1)

One and two electron electronic energies are given, respectively by " # n n n X X X 2 E1 ¼ 2 hij Tij þ Am hmm 2 him Sjm Tij þ hij Uij i;j¼1

"

n X

E2 ¼

i;j¼1

Rijkl ð2Tij Tkl Tik Tjl Þ þ A2m

n X

ð2Rijmm Rimjm ÞTij

i;j¼1

i;j;k;l¼1

2

ð2-C-2Þ

i;j¼1

n X

Rijml Smk ð2Tij Tkl Tik Tjl Þ þ

i;j;k;l¼1

n X

#

ð2-C-3Þ

Rijkl ð2Uij Tkl Uik Tjl Þ

i;j;k;l¼1

P P in which Uij ¼ nk;l¼1 Tik Tjl Skm Slm , A2m ¼ 1=ð1 ni;j¼1 Tij Sim Sjm Þ. The unpaired electron density is, " # n n X X 2 rz ðrÞ ¼ Am pmm ðr Rmm Þ 2 Tij Sim pjm ðr Rjm Þ þ Uij pij ðr Rij Þ ð2-C-4Þ i;j¼1

i;j¼1

The dipole moment is expressed as, for closed shell part, X

mc ¼

rn Z n 2

And for open shell part is, " mo ¼

Rin Tii 4

n X

i¼1

nuclei

A2m

n X

Rm 2

n X

Rim Sim Sjm Tij þ 4

i;j¼1

ð2-C-5Þ

Rij Sij Tij

i4j

n X

# ð2-C-6Þ

Rij Sij Uij

i;j¼1

They applied their method to radicals such as HeH, LiH+, H4+, He2+ as a p-type lobe function, constructed from two equivalent spherical Gaussians, fm = Nm(fa fb), Nm = 1/(2 2hfa|fbi)1/2. II- The second type of open-shell wave-function is called unrestricted FSGO, or UFSGO, which was introduced by Pakiari and Linnett.16 When there are p electrons with a-spin and q electrons with b-spin, then, the wave-function is, C = det|f1f2 . . . fpfp+1fp+2 . . . fp+q|

(2-C-7)

The electronic energy is, Eel ¼

pþq X i;j¼1 i4j

hij 3

2

pþq p X 16 X þ 6 Rijkl Tij Tkl Rijkl Til Tjk 4 2 i;j;k;l¼1 i4j;k4l

i;j;k;l¼1 i4j;k4l

q X i;j;k;l¼pþ1 i4j;k4l

7 Rijkl Til Tjk 7 5

ð2-C-8Þ

The factor half is due to avoiding the evaluation of the same integral twice, since each integral must be evaluated once for single occupied orbital. This can be taken into account in computer programming in order to save computational time. It should be noted that the electronic energy in both types can easily be extended to

284 | Chem. Modell., 2008, 5, 279–311 This journal is

c

The Royal Society of Chemistry 2008

cover the MG-FSGO formalism. Obviously, these types of wave functions (2-C-1) and (2-C-7) are not eigenfunction of S2. 2-D

FSGO calculation with many determinantal wave function

It is possible to achieve an eigenfunction of S2 or some type of configuration interaction with nonorthogonal basis sets17 by using a combination of Slater determinants. The element Hmn between two determinants of the electronic energy matrix is; X X 1 Hmn ¼ Tij hmn þ ð2-D-1Þ ðwim wjm jwkn wln ÞðTjl Tik Tjk Til Þ D mn ij ijkl ioj; k 6¼ l where Dmn is the determinant formed from the elements of the overlap matrix between the two determinants m and n, and the elements of T-matrix belong to the inverse of Dmn. 2-E First and second derivative of FSGO energy integrals There are difficulties for optimization in FSGO methods, when using subroutines such as the Harwell Library VA04A, since it is a direct search optimization procedure. Therefore, the optimization is very time consuming. If we want to use optimization routine using gradient or second derivative, first and second derivatives are needed. Numerical first and second derivatives (finite difference method) have large error. Analytical derivatives help to speed up the optimization. Additionally, those derivatives are vital for force constant and frequency calculations. The formulation for analytical first18 and second derivatives19 has been provided by Pakiari and Ofadeh. These derivatives are with respect to exponent, and Cartesian coordinates. Since first and second derivative by finite difference contains large error specially the second one, this makes large error in calculation of force constant and optimization. Therefore first and second analytical derivative of FSGO energy integrals, which can also used in SCF procedure, can quite help for more accurate force constant calculation and also fast optimization. A typical formula for the analytical first derivative of overlap integrals Sij with respect to coordinates of orbitals is, dSij ai aj drij dSij ¼ 2 rij Sij if k ¼ i ¼ j then ¼0 dqk ai þ aj dqij dqk Where q represents the Cartesian coordinates of orbitals x, y, z, and 8 drij drij xi xj þ yi yj þ zi zj < k ¼ i ¼ j then dqk ¼ 0 ¼m if k ¼ i then m ¼ 1 : dqk rij k ¼ j then m ¼ 1

ð2-E-1Þ

ð2-E-2Þ

For analytical first derivative of overlap integral Sij with respect to exponent ak, dSij 3ðai aj Þ a2n Sij ¼ ð1Þm Sij r2ij 4ak ðai þ ak Þ dak ðai þ aj Þ2

ð2-E-3Þ

8 ij < k ¼ i ¼ j then dS dak ¼ 0 if k ¼ i then m ¼ 1 and n ¼ j : k ¼ j then m ¼ 2 and n ¼ i

ð2-E-4Þ

Typical second analytical derivatives of overlap integral Sij are as follows. Chem. Modell., 2008, 5, 279–311 | 285 This journal is

c

The Royal Society of Chemistry 2008

The second derivative with respect to exponents ak1 and ak2, if k1 = k2 = i, " # @ 2 Sij 3 ða2i a2j Þ þ 2ai aj ai aj @Sij ¼ Sij þ 4 @ak2 @ak1 ai ðai þ aj Þ @ak2 a2i ðai þ aj Þ2 ð2-E-5Þ 2 2 2aj aj 2 2 @Sij þ Sij rij rij ðai þ aj Þ3 ðai þ aj Þ2 @ak2 if k1 = i and k2 = j, @ 2 Sij 3 2ai aj ¼ Sij Sij r2ij @ak2 @ak1 2ðai þ aj Þ2 ðai þ aj Þ3 3 ai aj @Sij ai aj @Sij r2 4 ai ðai þ aj Þ @ak2 ai ðai þ aj Þ2 ij @ak2

ð2-E-6Þ

The second derivative of overlap integral with respect to ak1 and qk2, a2j @ 2 Sij 3 ai aj @Sij @rij 2 @Sij ¼ 2S r þ r ij ij ij 4 ai ðai þ aj Þ @qk2 ðai þ aj Þ2 @qk2 @ak1 @qk2 @qk2 The second derivative of overlap integral with respect to qk1 and qk2, @ 2 Sij 2ai aj @Sij @rij @rij @rij @ 2 rij rij ¼ þ Sij þ Sij rij @qk2 @qk1 ðai þ aj Þ @qk2 @qk1 @qk2 @qk1 @qk2 @qk1

ð2-E-7Þ

ð2-E-8Þ

where @ 2 rij @rij @rij 1 ¼ ð1Þm 3 @qk2 @qk1 @qk2 @qk1 rij

ð2-E-9Þ

8 < k2 ¼ i then m ¼ 2 k1 ¼ i and or : k2 ¼ j then m ¼ 1

ð2-E-10Þ

If

2-F Reconciliation of FSGO and SCF methods There have been some arguments that FSGO cannot provide total energy with the same accuracy as SCF results. If the calculation is carried out for instance for optimized LiH with a single Gaussian (one Gaussian for k-shell and one Gaussian for bonding positioned at inter-atomic distance) by FSGO method, and then taking the optimized results as input for a standard SCF procedure, we will obtained almost the same results as FSGO. But, when you repeat the calculation with the multiGaussian FSGO for LiH, the results of SCF is more stable than FSGO. The reason is that orbital description of any MO in FSGO is truncated with respect to SCF molecular orbital. Now, if we put the same linear combination as SCF for orbital description of FSGO method, we will obtain the identical results from both methods SCF and FSGO. In other words, FSGO molecular orbitals are a truncated linear combination of basis set of SCF method.20 Therefore, it is possible to reach to Hartree-Fock limit. For example for LiH energy 7.9867349 au with bond length 3.035 au obtained by FSGO, and 7.987315 au with bond length 3.034 au by Cade21 obtained by SCF method. 286 | Chem. Modell., 2008, 5, 279–311 This journal is

c

The Royal Society of Chemistry 2008

2-G

Delocalized FSGO

The argument in sections 2-A and 2-F means that we let any orbitals in basis set contribute to all molecular orbital in molecule, which depend on their symmetry similar to SCF, the results of FSGO and SCF are identical. This way of thinking can be called delocalized orbital in FSGO method. Delocalized and localized FSGO have been demonstrated for cyclobutadiene for p system.12 2-H

Lone pair description

There is a shortcoming in FSGO for predicting inter-bond angle and bond length for molecules with lone pair such as H2O and NH3 molecules6 or F2 molecule.8 The angles and bond lengths are quite below the experimental expectation, as mentioned in section 1-B. Tan and Linnett22,23 tried to spread the lone pair orbital of NH3 and CH2 around the atom having lone pair (delocalized idea such as SCF), and they have obtained rather more correct results as in Table 2. Since this idea is difficult to generalize for two or three lone pairs on one atom, another later development is to include an axial lone pair.24 The last improvements for lone pair description were the linear combination of 1s + single p-orbital and 2s + double p-orbital.25 2-I

p-Description

The description of p-bonds is another important component of chemical valence, which chemists must be concerned about. Frost7 describes s- and p-bonds with one Gaussian for each side of plane of ethylene (top and bottom), and the results are given in Table 3. The model is called banana type, which was introduced by Pauling and Slater. This type description for p-bond makes it indistinguishable from s-bond, and Hoffman and Woodward rule can not be applied to it. Pakiari and Keshavarz26 proposed a s–p model of p-bond description, which has been introduced by Hu¨ckel. In this model, Gaussian orbitals for describing a s-bond are positioned on the interatomic vector, and a p-bond is made of two p-type Gaussians (positive and negative lobes), whose centers are optimized very near to nuclei. The above procedures are called the localization p-system. Let us see how delocalized p-orbital description works. Delocalized FSGO (section 2-A and 2-G) have been used for p-system of cyclobutadiene in both square Table 2 Different results for lone pair descriptions in FSGO calculations Lone pair CH2

NH3

H2O

F2

Axial Axial Delocalized Axial 1p Axial 2p 1s + 1p 2s + 2p

SG-FSGO BL (%) Angle (%) BL (%) Angle (%) BL (%) Angle (%) BL (%)

2.17b (3.0) 91.5b (11.0) 1.91a (0.0) 87.9a (18.0)

2.14b (2.0) 103.5b (0.01) 1.90a (0.0) 106.3a (0.0) — —

Dissociates unstable —

2.054 (2.2) 102.05 (0.34) 1.834 (4.1) 105.27 (1.34) 1.667 (7.9) 104.15 (0.33) —

2.135 (1.67) 102.3 (0.14) 1.872 (2.2) 107.9 (1.11) 1.724 (4.65) 104.22 (0.27) —

2.101 (0.05) 100.23 (2.11) 1.909 (0.21) 96.47 (9.59) 1.808 (0.0) 95.82 (8.31) 2.681 (0.4)

2.124 (1.14) 102.26 (0.14) 1.893 (1.04) 106.74 (0.04) 1.723 (4.70) 104.31 (0.18) —

Tan and Linnett,22 b Tan and Linnett.23 The rest of results are references.24,25b-(Angles in deg. and bond length, BL, in au). a

Chem. Modell., 2008, 5, 279–311 | 287 This journal is

c

The Royal Society of Chemistry 2008

Table 3 The results of bond length and bond angle for ethylene by different methods (au) Method

C–C BL

Error%

C–H BL

Error%

HCC BA

Error%

Banana model SG-FSGO7 Banana model DG-FSGOa s–p model FSGO26 s–p model FSGO13 MP2/6-31Ga13 QCISD/6-31Ga13 Experiment

2.554 2.500 2.555 2.547 2.524 2.527 2.527

1.31 0.83 1.35 1.03 0.12 0.24 —

2.081 1.945 2.072 2.028 2.051 2.056 2.053

1.86 4.8 1.42 0.73 0.39 0.64 —

120.7 120.6 120.5 121.8 121.7 121.8 121.3

0.49 0.58 0.66 0.43 0.33 0.45 —

a DG-FSGO stands for double Gaussian FSGO calculation. BL is for bond length, and BA is for bond angle.

and rectangular geometry.13 The total energy for localized description for square and rectangular geometries are 151.81688 au and 151.84776 au, respectively, while for delocalized form it is 151.8524 au. 2-J Force constant The first calculation of force constants in the FSGO method was done by Easterfield and Linnett.27 They studied BH4, CH4 and NH+4, and their results are shown in Table 4. Harmonic force fields for several polyatomic hydrides of 1st-row atoms were also calculated by Semkow28 with the FSGO method. Some of their results are in Table 5. Pakiari and Noorizadeh29 have suggested an ab initio determination of a rather accurate force constant for given FSGO wavefunctions for LiH, BH, H2 and H2O. In this procedure, an nth degree polynomial must be fitted to different calculated potential points from ab initio wave function. Brief results are collected in Table 6. Solimannejad and Pakiari developed ab initio analytical force constant for FSGO, and applied their theory to H2, LiH and BH; the results are 5.936 (3.21%), 1.208 (17.73%) and 3.168 (4.48%) mdyn A˚1, respectively.30 Ab initio analytic cubic force constants have also been formulated by Solimannejad and Pakiari31 in FSGO and applied to H2O and CH2. 2-K

Orbital energy

Orbital energy is well-known in SCF procedure, which is ei in FCi = eiSCi. Pakiari and Solimannejad have obtained an expression for e in FSGO method.32 One must consider multi-Gaussian (section 2-A) and open shell FSGO (section 2-C). Table 4 The experimental and calculated force constant by FSGO27 BH4 Force constant F11/mdyn F22/mdyn F33/mdyn F44/mdyn F34/mdyn

A˚ A˚1 A˚1 A˚1 A˚1 1

NH3+

CH4

Calc.

Expt.

Calc.

Expt.

Calc.

Expt.

3.80 0.36 2.58 0.44 0.27

3.05 0.29 2.71 0.29 0.15

8.15 0.71 8.67 0.66 0.38<