Researcher profile

Jan M. L. Martin

Jan M. L. Martin contributes to research discovery and scholarly infrastructure.

Trust snapshot

Quick read

Trust 21 (Emerging) · Verification L1 · Unclaimed author
17 works
0 followers
7 topics
4 close collaborators

Published work

17 published items

preprint · 2022 · arXiv

Automatic generation of complementary auxiliary basis sets (CABS) for explicitly correlated methods

Explicitly correlated calculations, aside from the orbital basis set, typically require three auxiliary basis sets: JK (Coulomb-exchange fitting), RI-MP2 (resolution of the identity MP2), and CABS (complementary auxiliary basis set). If unavailable for the orbital basis set and chemical elements of interest, the first two can be auto-generated on the fly using existing algorithms, but not the third. In this paper, we present a quite simple algorithm named autoCABS; a Python implementation under a free software license is offered on GitHub. For the cc-pVnZ-F12 (n=D,T,Q,5) basis sets and the W4-08 thermochemical benchmark, we demonstrate that autoCABS-generated CABS basis sets are comparable in quality to purpose-optimized OptRI basis sets from the literature, and that the quality difference becomes entirely negligible as n increases.

preprint · 2022 · arXiv

Benefits of Range-separated Hybrid and Double-Hybrid Functionals for a Large and Diverse Dataset of Reaction Energies and Barrier Heights

To better understand the thermochemical kinetics and mechanism of a specific chemical reaction, accurate estimates of the barrier heights (forward and reverse) and the reaction energy are vital. Due to the large size of reactants and transition state structures involved in real-life mechanistic studies (e.g., enzymatically catalyzed reactions), DFT remains the workhorse for such calculations. In this paper, we have assessed the performance of 88 density functionals for modeling the reaction energies and barrier heights on a large and chemically diverse dataset (BH9) composed of 449 organic chemistry reactions. We have shown that range-separated hybrid functionals perform better than the global hybrids for BH9 barrier heights and reaction energies. Except for the PBE-based range-separated nonempirical double hybrids, the exchange term's range separation helps improve the performance for barrier heights and reaction energies. The sixteen-parameter Berkeley double hybrid, ωB97M(2), performs remarkably well for both properties. However, our minimally empirical range-separated double hybrid functionals offer marginally better accuracy than ωB97M(2) for BH9 barrier heights and reaction energies.

preprint · 2022 · arXiv

Electron Correlation: Nature's Weird and Wonderful Chemical Glue

It can be argued that electron correlation, as a concept, deserves the same prominence in general chemistry as molecular orbital theory. We show how it acts as Nature's "chemical glue" at both the molecular and supramolecular levels. Electron correlation can be presented in a general chemistry course in an at least somewhat intuitive manner. We also propose a simple classification of correlation effects based on their length scales and the size of the orbital gap (relative to the two-electron integrals). In the discussion, we also show how DFT can shed light on wavefunction theory, and conversely. We discuss two types of "honorary valence orbitals", one related to small core-valence gaps, the other to the ability of empty 3d orbitals in 2nd row elements to act as backbonding acceptors. Finally, we show why the pursuit of absolute total energies for their own sake becomes a sterile exercise, and why atomization energies are a more realistic "fix point".

preprint · 2022 · arXiv

MP2-F12 basis set convergence near the complete basis set limit: are $h$ functions sufficient?

We have investigated the title question for the W4-08 thermochemical benchmark using partial-wave truncations of a large reference (REF) basis set, as well as for standard F12-optimized basis sets. With the REF basis set, the root mean square (RMS) contribution of i functions to the total atomization energies (TAEs) is about 0.01 kcal/mol, the largest individual contributions being 0.04 kcal/mol for \ce{P2} and \ce{P4}. However, even for these cases, basis set extrapolation from \{g,h\} basis sets adequately addresses the problem. Using basis sets insufficiently saturated in the $spdfgh$ angular momenta may lead to exaggerated $i$ function contributions. For extrapolation from $spdfg$ and $spdfgh$ basis sets, basis set convergence appears to be quite close to the theoretical asymptotic $\propto L^{-7}$ behavior. We hence conclude that $h$ functions are sufficient even for highly demanding F12 applications. With one-parameter extrapolation, $spdf$ and $spdfg$ basis sets are adequate, with aug-cc-pV\{T,Q\}Z-F12 yielding RMSD=0.03 kcal/mol. A limited exploration of CCSD(F12*) and CCSD-F12b suggests our conclusions are applicable to higher-level F12 methods as well.
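As a worked illustration of the two-point extrapolation the abstract refers to, the block below assumes the correlation energy follows $E(L) = E_\infty + A\,L^{-\alpha}$ with $\alpha = 7$ (the asymptotic exponent quoted above) and takes $L$ to be the highest angular momentum in the basis set ($g$: $L=4$, $h$: $L=5$). This is a generic sketch under those assumptions; the paper itself may use fitted or effective exponents that differ from the asymptotic value.

```latex
% Two-point extrapolation, assuming E(L) = E_\infty + A L^{-\alpha}
\begin{align}
E(L) - E(L-1) &= A\left[L^{-\alpha} - (L-1)^{-\alpha}\right] \\
E_\infty &= E(L) + \frac{E(L) - E(L-1)}{\left(\frac{L}{L-1}\right)^{\alpha} - 1}
\end{align}
% Example for the {g,h} pair (L = 5, \alpha = 7):
% (5/4)^7 \approx 4.768, so
% E_\infty \approx E(h) + 0.265\,[E(h) - E(g)]
```

With a decay this steep, only about a quarter of the last basis set increment is added back, which fits the abstract's observation that residual $i$ function contributions are small.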

preprint · 2022 · arXiv

The MOBH35 metal-organic barrier heights reconsidered: performance of local-orbital coupled cluster approaches in different static correlation regimes

We have revisited the MOBH35 (Metal-Organic Barrier Heights, 35 reactions) benchmark [Iron, M. A.; Janes, T. J. Phys. Chem. A 2019, 123 (17), 3761-3781; ibid. 2019, 123, 6379-6380] for realistic organometallic catalytic reactions, using both canonical CCSD(T) and localized orbital approximations to it. For low levels of static correlation, all of DLPNO-CCSD(T), PNO-LCCSD(T), and LNO-CCSD(T) perform well; for moderately strong levels of static correlation, DLPNO-CCSD(T) and (T1) may break down catastrophically, and PNO-LCCSD(T) is vulnerable as well. In contrast, LNO-CCSD(T) converges smoothly to the canonical CCSD(T) answer with increasingly tight convergence settings. The only two reactions for which our revised MOBH35 reference values differ substantially from the original ones are reaction 9 and to a lesser extent 8, both involving iron. For the purpose of evaluating DFT methods for MOBH35, it would be best to excise reaction 9 entirely as its severe level of static correlation is just too demanding a test. The magnitude of the difference between DLPNO-CCSD(T) and DLPNO-CCSD(T1) is a reasonably good predictor for errors in DLPNO-CCSD(T1) compared to canonical CCSD(T); [...]

preprint · 2020 · arXiv

Canonical and DLPNO-based composite wavefunction methods parametrized against large and chemically diverse training sets. 2. Correlation consistent basis sets, core-valence correlation, and F12 alternatives

A hierarchy of wavefunction composite methods (cWFT), based on G4-type cWFT methods available for elements H through Rn, was recently reported by Semidalas and Martin [J. Chem. Theor. Comput. 2020, 16, 4238]. We extend this hierarchy by considering the inner-shell correlation energy in the second-order Møller-Plesset correction and replacing the Weigend-Ahlrichs def2-mZVPP(D) basis sets used in the aforementioned paper with complete basis set extrapolation from augmented correlation consistent core-valence triple-zeta, aug-cc-pwCVTZ(-PP), and quadruple-zeta, aug-cc-pwCVQZ(-PP), basis sets, thus creating cc-G4-type methods. For the large and chemically diverse GMTKN55 benchmark suite, they represent a substantial further improvement and bring WTMAD2 (weighted mean absolute deviation) down below 1 kcal/mol. Intriguingly, the lion's share of the improvement comes from better capture of valence correlation; the inclusion of core-valence correlation is almost an order of magnitude less important. These robust correlation consistent cWFT methods approach the CCSD(T) complete basis limit with just one or a few fitted parameters. Particularly the DLPNO variants such as cc-G4-T-DLPNO are applicable to fairly large molecules at modest computational cost, as is (for a reduced range of elements) a different variant using MP2-F12/cc-pVTZ-F12 for the MP2 component.

preprint · 2018 · arXiv

A Simple 'Range Extender' for Basis Set Extrapolation Methods for MP2 and Coupled Cluster Correlation Energies

We discuss the interrelations between various basis set extrapolation formulas and show that for the nZaPa and aug-cc-pVnZ basis set formulas, for n=4--6 their behavior closely resembles the Petersson $(L+a)^{-3}$ formula with a shift $a$ specific to the basis set family and level of theory. This is functionally equivalent to the Pansini-Varandas extrapolation for large L. This naturally leads to a simple way to extend these extrapolations to n=7 and higher. The formula is validated by comparison with newly optimized extrapolation factors for the AV{6,7}Z basis set pairs and literature values for {6,7}ZaPa. For $L\geq5$, the CCSD extrapolations of both the Schwenke and Varandas type are functionally equivalent to $E(L)=E_\infty+A\cdot(L-0.30)^{-3}$, i.e., $E_\infty=E(L)+[E(L)-E(L-1)]/([(L-0.30)/(L-1.30)]^3-1)$.
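The closed-form expression at the end of this abstract is simple enough to sketch in a few lines of Python. The function and variable names below are illustrative and not taken from the paper; the default shift of 0.30 is the value the abstract quotes for CCSD with L ≥ 5, and other basis set families or levels of theory would need their own shift.

```python
def extrapolate_shifted_cubic(e_high: float, e_low: float, l_high: int,
                              shift: float = 0.30) -> float:
    """Two-point extrapolation assuming E(L) = E_inf + A * (L - shift)**-3.

    e_high is the energy at cardinal number l_high, e_low the energy at
    l_high - 1. The names and the default shift are illustrative assumptions.
    """
    if l_high - 1 - shift <= 0:
        raise ValueError("l_high is too small for the chosen shift")
    ratio = ((l_high - shift) / (l_high - 1 - shift)) ** 3
    return e_high + (e_high - e_low) / (ratio - 1.0)


def model_energy(l: int, e_inf: float = -1.0, a: float = 0.5,
                 shift: float = 0.30) -> float:
    """Synthetic energies that follow the assumed form exactly (test data only)."""
    return e_inf + a * (l - shift) ** -3


if __name__ == "__main__":
    # Synthetic {5,6} pair: the extrapolation should recover e_inf = -1.0.
    print(extrapolate_shifted_cubic(model_energy(6), model_energy(5), 6))
```

Because the synthetic energies obey the assumed functional form exactly, the extrapolated value reproduces the chosen limit to rounding error; real correlation energies follow the form only approximately.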

preprint · 2015 · arXiv

Comment on: Doubly hybrid density functional xDH-PBE0 from a parameter-free global hybrid model PBE0 (J. Chem. Phys. 136, 174103 (2012))

We have compared the performance of Grimme-style DH/DSD and Zhang-Xu-Goddard type xDH/xDSD forms for double hybrids. In the DH and DSD forms, KS orbitals with elevated HF exchange and damped DFT correlation are used, while in the xDH and xDSD forms, the KS orbitals are obtained from a conventional hybrid functional with undamped DFT correlation. Generally, the difference in performance between DSD and xDSD functionals is small, slightly favoring xDSD. Augmentation of the xDH form with either same-spin MP2 correlation or a dispersion correction markedly improves performance. Best xDSD results appear to be obtained for orbitals obtained with 'exact exchange' fractions in the 50-70% range. The orbitals for xDSD appear to be fairly transferable between different correlation functionals.

preprint · 2014 · arXiv

The cc-pV5Z-F12 basis set: reaching the basis set limit in explicitly correlated calculations

We have developed and benchmarked a new extended basis set for explicitly correlated calculations, namely cc-pV5Z-F12. It is offered in two variants, cc-pV5Z-F12 and cc-pV5Z-F12(rev2), the latter of which has additional basis functions on hydrogen not present in the cc-pVnZ-F12 (n=D,T,Q) sequence. A large uncontracted 'reference' basis set is used for benchmarking. cc-pVnZ-F12 (n=D, T, Q, 5) is shown to be a convergent hierarchy. Especially the cc-pV5Z-F12(rev2) basis set can yield the valence CCSD component of total atomization energies (TAEs), without any extrapolation, to an accuracy normally associated with aug-cc-pV{5,6}Z extrapolations. SCF components are functionally at the basis set limit, while the MP2 limit can be approached to as little as 0.01 kcal/mol without extrapolation. The determination of (T) appears to be the most difficult of the three components and cannot presently be accomplished without extrapolation or scaling. (T) extrapolation from cc-pV{T,Q}Z-F12 basis sets, combined with CCSD-F12b/cc-pV5Z-F12 calculations, appears to be an accurate combination for explicitly correlated thermochemistry. For accurate work on noncovalent interactions, basis set superposition error with the cc-pV5Z-F12 basis set is shown to be so small that counterpoise corrections can be neglected for all but the most exacting purposes.

preprint · 2010 · arXiv

Performance of W4 theory for spectroscopic constants and electrical properties of small molecules

Accurate spectroscopic constants and electrical properties of small molecules are determined by means of W4 and post-W4 theories. For a set of 28 first- and second-row diatomic molecules for which very accurate experimental spectroscopic constants are available, W4 theory affords near-spectroscopic or better predictions. Specifically, the root-mean-square deviations (RMSD) from experiment are 0.04 pm for the equilibrium bond distances (r_e), 1.03 cm^{-1} for the harmonic frequencies (ω_e), 0.20 cm^{-1} for the first anharmonicity constants (ω_e x_e), 0.10 cm^{-1} for the second anharmonicity constants (ω_e y_e), and 0.001 cm^{-1} for the vibration-rotation coupling constants (α_e). Higher-order connected triples, \hat{T}_3-(T), improve agreement with experiment for the hydride systems, but their inclusion (in the absence of \hat{T}_4) tends to worsen agreement with experiment for the nonhydride systems. Connected quadruple excitations, \hat{T}_4, have significant and systematic effects on r_e, ω_e, and ω_e x_e; in particular, they universally increase r_e (by up to 0.5 pm), universally reduce ω_e (by up to 32 cm^{-1}), and universally increase ω_e x_e (by up to 1 cm^{-1}). Connected quintuple excitations, \hat{T}_5, are spectroscopically significant for ω_e of the nonhydride systems, affecting ω_e by up to 4 cm^{-1}. The triatomic molecules H_2O, CO_2, and O_3, as well as the pathologically multireference BN and BeO diatomics, are also considered. The asymmetric stretch of ozone represents a severe challenge to W4 theory; in particular, the connected quadruple contribution converges very slowly with the basis set size. Finally, the importance of post-CCSD(T) correlation effects for electrical properties, namely dipole moments (μ), polarizabilities (α), and first hyperpolarizabilities (β), is evaluated.

preprint · 2009 · arXiv

Benchmark thermochemistry of the C_nH_{2n+2} alkane isomers (n=2--8) and performance of DFT and composite ab initio methods for dispersion-driven isomeric equilibria

The thermochemistry of linear and branched alkanes with up to eight carbons has been reexamined by means of W4, W3.2lite and W1h theories. `Quasi-W4' atomization energies have been obtained via isodesmic and hypohomodesmotic reactions. Our best atomization energies at 0 K (in kcal/mol) are: 1220.04 n-butane, 1497.01 n-pentane, 1774.15 n-hexane, 2051.17 n-heptane, 2328.30 n-octane, 1221.73 isobutane, 1498.27 isopentane, 1501.01 neopentane, 1775.22 isohexane, 1774.61 3-methylpentane, 1775.67 diisopropyl, 1777.27 neohexane, 2052.43 isoheptane, 2054.41 neoheptane, 2330.67 isooctane, and 2330.81 hexamethylethane. Our best estimates for $ΔH^\circ_{f,298K}$ are: -30.00 n-butane, -34.84 n-pentane, -39.84 n-hexane, -44.74 n-heptane, -49.71 n-octane, -32.01 isobutane, -36.49 isopentane, -39.69 neopentane, -41.42 isohexane, -40.72 3-methylpentane, -42.08 diisopropyl, -43.77 neohexane, -46.43 isoheptane, -48.84 neoheptane, -53.29 isooctane, and -53.68 hexamethylethane. These are in excellent agreement (typically better than 1 kJ/mol) with the experimental heats of formation at 298 K obtained from the CCCBDB and/or NIST Chemistry WebBook databases. However, at 0 K a large discrepancy between theory and experiment (1.1 kcal/mol) is observed for only neopentane. This deviation is mainly due to the erroneous heat content function for neopentane used in calculating the 0 K CCCBDB value. The thermochemistry of these systems, especially of the larger alkanes, is an extremely difficult test for density functional methods. A posteriori corrections for dispersion are essential. Particularly for the atomization energies, the B2GP-PLYP and B2K-PLYP double-hybrids, and the PW6B95 hybrid-meta GGA clearly outperform other DFT functionals.
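The tabulated 0 K atomization energies translate directly into isomerization energies, since the atoms cancel between isomers. The short Python sketch below uses a handful of values copied from the abstract; the dictionary layout, the function name, and the choice of the thermochemical calorie (1 kcal = 4.184 kJ) are illustrative assumptions, not part of the paper.

```python
KCAL_TO_KJ = 4.184  # thermochemical calorie; assumed conversion convention

# Best 0 K total atomization energies (kcal/mol), copied from the abstract.
TAE0 = {
    "n-butane": 1220.04,
    "isobutane": 1221.73,
    "n-pentane": 1497.01,
    "isopentane": 1498.27,
    "neopentane": 1501.01,
}


def isomerization_energy(reactant: str, product: str, tae=TAE0) -> float:
    """0 K energy change (kcal/mol) for reactant -> product.

    Isomers atomize to the same atoms, so the reaction energy is
    TAE(reactant) - TAE(product); a negative value means the product
    is the more stable isomer.
    """
    return tae[reactant] - tae[product]


if __name__ == "__main__":
    for linear, branched in [("n-butane", "isobutane"),
                             ("n-pentane", "isopentane"),
                             ("n-pentane", "neopentane")]:
        de = isomerization_energy(linear, branched)
        print(f"{linear} -> {branched}: {de:.2f} kcal/mol "
              f"({de * KCAL_TO_KJ:.2f} kJ/mol)")
```

On these numbers, branching stabilizes isobutane by 1.69 kcal/mol and neopentane by 4.00 kcal/mol relative to the linear isomers, which is the small, dispersion-sensitive energy scale the abstract describes as a hard test for density functional methods.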

preprint · 2009 · arXiv

Performance of ab initio and density functional methods for conformational equilibria of CnH2n+2 alkane isomers (n=2-8)

Conformational energies of n-butane, n-pentane, and n-hexane have been calculated at the CCSD(T) level and at or near the basis set limit. Post-CCSD(T) contributions were considered and found to be unimportant. The data thus obtained were used to assess the performance of a variety of density functional methods. Double-hybrid functionals like B2GP-PLYP and B2K-PLYP, especially with a small Grimme-type empirical dispersion correction, are capable of rendering conformational energies of CCSD(T) quality. These were then used as a 'secondary standard' for a larger sample of alkanes, including isopentane and the branched hexanes as well as key isomers of heptane and octane. Popular DFT functionals like B3LYP, B3PW91, BLYP, PBE, and PBE0 tend to overestimate conformer energies without dispersion correction, while the M06 family severely underestimates GG interaction energies. Grimme-type dispersion corrections for these overcorrect and lead to qualitatively wrong conformer orderings. All of these functionals also exhibit deficiencies in the conformer geometries, particularly the backbone torsion angles. The PW6B95 and, to a lesser extent, BMK functionals are relatively free of these deficiencies. Performance of these methods is further investigated to derive conformer ensemble corrections to the enthalpy function, $H_{298}-H_0$, and the Gibbs energy function, ${\rm gef}(T)\equiv - [G(T)-H_0]/T$, of these alkanes. While $H_{298}-H_0$ is only moderately sensitive to the level of theory, ${\rm gef}(T)$ exhibits more pronounced sensitivity. Once again, double hybrids acquit themselves very well.

preprint · 2008 · arXiv

Atomization energies of the carbon clusters Cn (n=2--10) revisited by means of W4 theory as well as density functional, Gn, and CBS methods

The thermochemistry of the carbon clusters C$_n$ (n=2--10) has been revisited by means of W4 theory and W3.2lite theory. Particularly the larger clusters exhibit very pronounced post-CCSD(T) correlation effects. Despite this, our best calculated total atomization energies agree surprisingly well with 1991 estimates obtained from scaled CCD(ST)/6-31G* data. Accurately reproducing the small singlet-triplet splitting in C$_2$ requires inclusion of connected quintuple and sextuple excitations. Post-CCSD(T) correlation effects in C$_4$ stabilize the linear form. Linear/cyclic equilibria in C$_6$, C$_8$, and C$_{10}$ are not strongly affected by connected quadruples, but they are affected by higher-order triples, which favor polyacetylenic rings but disfavor cumulenic ones. Near the CCSD(T) basis set limit, C$_{10}$ does undergo bond angle alternation in the bottom-of-the-well structure, although it is expected to be absent in the vibrationally averaged structure. The thermochemistry of these systems, and particularly the longer linear chains, is a particularly difficult test for density functional methods. Particularly for the smaller chains and the rings, double-hybrid functionals clearly outperform conventional DFT functionals for these systems. Among compound thermochemistry schemes, G4 clearly outperforms the other members of the G$n$ family. Our best estimates for total atomization energies at 0 K should be reliable to 1 kJ/mol up to C$_5$ inclusive, and to better than 1 kcal/mol up to C$_9$ inclusive.

preprint · 2007 · arXiv

W4 thermochemistry of P_2 and P_4. Is the CODATA heat of formation of phosphorus atom correct?

The high-accuracy W4 computational thermochemistry protocol, and several post-W4 methods, have been applied to the P$_2$ and P$_4$ molecules. Contrary to previous studies, we find the experimental thermochemistry to be fundamentally sound. The reaction enthalpy for P$_4\to 2$P$_2$ has a very significant contribution from post-CCSD(T) correlation effects. We derive a gas-phase heat of formation for the phosphorus atom of $ΔH^\circ_{f,0}$[P(g)]=75.54$\pm$0.1 kcal/mol and $ΔH^\circ_{f,298}$[P(g)]=75.74$\pm$0.1 kcal/mol, in the upper half of the CODATA uncertainty interval.

preprint · 2000 · arXiv

A fully ab initio potential curve of near-spectroscopic quality for the OH^- anion: importance of connected quadruple excitations and scalar relativistic effects

A benchmark study has been carried out on the ground-state potential curve of the hydroxyl anion, OH^{-}, including detailed calibration of both the 1-particle and n-particle basis sets. The CCSD(T) basis set limit overestimates $ω_e$ by about 10 cm^{-1}, which is only remedied by inclusion of connected quadruple excitations in the coupled cluster expansion --- or, equivalently, the inclusion of the $2π$ orbitals in the active space of a multireference calculation. Upon inclusion of scalar relativistic effects (-3 cm^{-1} on $ω_e$), a potential curve of spectroscopic quality (sub-cm^{-1} accuracy) is obtained. Our best computed EA(OH), 1.828 eV, agrees to three decimal places with the best available experimental value. Our best computed dissociation energies, D_0(OH^-)=4.7796 eV and D_0(OH)=4.4124 eV, suggest that the experimental D_0(OH)=4.392 eV may possibly be about 0.02 eV too low.
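The three computed quantities quoted in this abstract are linked by a simple thermodynamic cycle, which the worked equation below makes explicit. It assumes that OH^- dissociates to O^- + H and OH to O + H, and it uses a literature electron affinity for the oxygen atom of about 1.461 eV; that value is not given in the abstract and is supplied here only for illustration.

```latex
% Cycle: OH^- -> O^- + H  (D_0(OH^-)),   OH -> O + H  (D_0(OH))
\begin{align}
\mathrm{EA(OH)} &= \mathrm{EA(O)} + D_0(\mathrm{OH}^-) - D_0(\mathrm{OH}) \\
                &\approx 1.461 + 4.7796 - 4.4124 \ \mathrm{eV} \\
                &\approx 1.828 \ \mathrm{eV}
\end{align}
```

Under these assumptions the cycle reproduces the quoted EA(OH) of 1.828 eV, so the two computed dissociation energies and the electron affinity are mutually consistent.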

preprint · 2000 · arXiv

On the integration accuracy in molecular density functional theory calculations using Gaussian basis sets

The sensitivity of computed DFT (Density Functional Theory) molecular properties (including energetics, geometries, vibrational frequencies, and infrared intensities) to the radial and angular numerical integration grid meshes, as well as to the partitioning scheme, is discussed for a number of molecules using the Gaussian 98 program system. Problems with typical production grid sizes are particularly acute for third-row transition metal systems, but may still result in qualitatively incorrect results for a molecule as simple as CCH. Practical recommendations are made with respect to grid choices for the energy(+gradient) steps, as well as for the solution of the CPKS (Coupled Perturbed Kohn-Sham) equations.

preprint · 1998 · arXiv

A fully ab initio quartic force field of spectroscopic quality for SO_3

The quartic force field of SO$_3$ was computed fully ab initio using coupled cluster (CCSD(T)) methods and basis sets of up to $spdfgh$ quality. The effect of inner-shell correlation was taken into account. The addition of tight $d$ functions is found to be essential for accurate geometries and harmonic frequencies. The equilibrium geometry and vibrational fundamentals are reproduced to within 0.0003 Å and (on average) 1.15 cm^{-1}, respectively. We recommend the following revised values for the harmonic frequencies: $ω_1 = 1082.7$, $ω_2 = 502.6$, $ω_3 = 1415.4$, $ω_4 = 534.0$ cm^{-1}. In addition, we have shown that the addition of inner polarization functions to second-row elements is highly desirable even with more approximate methods like B3LYP, and greatly improves the quality of computed geometries and harmonic frequencies of second-row compounds at negligible extra computational cost. For larger such molecules, the B3LYP/VTZ+1 level of theory should be a very good compromise between accuracy and computational cost.