Towards Chemical Accuracy in Atomic Ionization Energies: The Case of H, C, N, O, F, P, and S Atoms

Ştefan Stan; Cora Crăciun; Vasile Chiș

doi:10.20944/preprints202604.2170.v1

Submitted:

30 April 2026

Posted:

30 April 2026

You are already at the latest version

Abstract

Accurate ionization energies are essential for understanding electronic structures of atoms and molecules, benchmarking quantum-chemical methods, and modeling ioni-zation processes in chemical and biological systems. In this work, we report calculated ionization energies of the H, C, N, O, P, and S atoms using a range of quan-tum-chemical approaches, aiming at reproducing the experimental values within the chemical accuracy. The methods include the electron propagator approximations OVGF and P3+, the coupled-cluster methods CCSD(T), CCSDT, and IP-EOM-CCSD, and the composite methods G3 and CBS-QB3. The CCSD(T), CCSDT, G3, and CBS-QB3 methods, together with the DFT method with B2PLYP density functional and several post-Hartree-Fock methods, were used in conjunction with the energy-difference (ΔSCF) approach. The coupled-cluster calculations were combined with the aug-cc-pVXZ-DK, aug-cc-pVXZ, and ANO-RCC basis sets, all-electron correlation, DKH2 scalar relativistic corrections, and extrapolation to the complete basis set (CBS) limit. The OVGF and P3+ methods do not reach chemical accuracy on average, while CCSD(T) and CCSDT combined with the aug-cc-pVXZ-DK basis set and CBS extrapolation achieve chemical accuracy for all atoms. CCSD(T)/aug-cc-pVXZ-DK with CBS extrapolation provides the best compromise between accuracy and computational cost, and can be used as a reference for these atomic ionization energies.

Keywords:

atomic ionization

;

OVGF

;

P3+

;

CCSD(T)

;

IP-EOM-CCSD

;

CBS extrapolation

;

DKH2

;

chemical accuracy

Subject:

Physical Sciences - Applied Physics

1. Introduction

The calculation of ionization energies of atoms may be useful for understanding their oxidation mechanism, for benchmarking quantum-chemical methods, and for modeling ionization processes in chemical and biological systems.

Two types of ionization energy (i.e., vertical and adiabatic) can be defined for atomic or molecular systems. The vertical ionization energy assumes that, during the removal of an electron, the orbitals of the system remain unchanged. By contrast, in the case of the adiabatic ionization energy, the orbitals relax during the ionization process as a result of the redistribution of the electronic density. Experimentally, ionization energies can be determined, for example, by photoelectron spectroscopy [1], from the difference between the energy of the incident photons and the kinetic energy of the emitted photoelectrons.

We computed the ionization energies of several atoms commonly found in proteins, namely H, C, N, O, and S, as well as two additional atoms, F and P. The atoms H, C, N, O, and S are present in essential amino acids, and the calculation of their ionization potentials may contribute to a better understanding of ionization processes in proteins. The atoms F and P may occur in synthetic amino acids and exhibit properties that can be exploited in the pharmaceutical industry [2,3]. Because fluorine is the most electronegative element, it forms strong bonds with carbon and can therefore be used in protein engineering for the synthesis of highly stable proteins [4]. Phosphorus-containing amino acids also have a variety of applications. For example, they may act as protecting groups, serve as antiviral agents incorporated into prodrugs, or be used for ¹⁸F labeling [5,6].

The methods employed in this work to compute ionization energies include the energy-difference approach and the electron propagator theory (EPT) methods, namely the Outer Valence Green’s Function (OVGF) and the renormalized partial third-order quasiparticle (P3+) methods. The energy-difference approach was combined with the B2PLYP DFT method and various post-Hartree–Fock methods. In addition, the coupled-cluster methods CCSD(T), CCSDT, and IP-EOM-CCSD were employed, in conjunction with aug-cc-pVXZ-DK, aug-cc-pVXZ, and ANO-RCC basis sets families, with all-electron correlation treatment, the second-order Douglas-Kroll-Hess (DKH2) scalar-relativistic Hamiltonian, and extrapolation to the complete basis set (CBS) limit using the two-point Helgaker formula. The aim of these calculations was to achieve chemical accuracy, defined here as a maximum deviation of 1 kcal/mol between the experimental and calculated ionization energies (1 kcal/mol = 4.184 kJ/mol = 0.0434 eV) [7].

2. Computational Methods

In the energy-difference approach, the ionization potential is obtained as the energy difference between the cation, generated by removing one electron, and the corresponding neutral system [8]:

I = E_{c a t i o n} - E_{n e u t r a l}

(1)

To obtain the adiabatic ionization energy, both the neutral system and the cation are optimized, and the difference between their total energies is then evaluated. By contrast, the vertical ionization energy is obtained by calculating the energy of the cation at the geometry of the neutral system. The approach was previously used by Jursic [9] to compute the ionization potentials of several second-row elements (C, N, O, F) employing DFT and ab initio methods.

The Electron Propagator Theory (EPT) [10] is a useful framework for calculation of ionization energies, electron affinities, one-electron properties, or spectral intensities. EPT is based on the concept of propagator. For an electron in a quantum system, the propagator is given by the expectation value of the time-evolution operator between the initial and final states,

G ({\vec{r}}_{i}, {\vec{r}}_{f}, t, t^{'}) = ⟨{\vec{r}}_{f} |\hat{U} (t, t^{'})| {\vec{r}}_{i}⟩

(2)

where

\hat{U} (t, t^{'})

is the time evolution operator. The one-particle Green’s function [11] can be used to calculate electron propagators. EPT is based on the solution of the Dyson equation [12,13,14],

G = G^{0} + G^{0} Σ G

(3)

where

G

is the one-particle Green’s function,

G^{0}

is the free Green’s function, and

Σ

is the self-energy which contains the correlation and final-state relaxation effects [15,16]. The Dyson equation may be transformed in the following one-electron equation [16,17,18]:

[F + Σ (E)] ϕ^{D y s o n} = E ϕ^{D y s o n}

(4)

where

F

is the one-electron Fock operator,

E

is the energy, and

ϕ^{D y s o n}

is the Dyson orbital [19]. The Dyson orbital corresponding to the ionization from the initial state

Ψ_{N} (x_{1}, \dots, x_{N})

to the final state

Ψ_{s, N - 1} (x_{2}, \dots, x_{N})

is defined as [17]:

ϕ_{s}^{D y s o n} (x_{1}) = \sqrt{N} \int Ψ_{N} (x_{1}, x_{2}, \dots, x_{N}) Ψ_{s, N - 1}^{*} (x_{2}, x_{3}, \dots, x_{N}) d x_{2} d x_{3} \dots d x_{N}

(5)

where

x_{i}

stands for the space-spin coordinate of electron i.

Based on the Dyson orbital, the pole strength, which takes values in the [0,1] interval, can be computed as follows [18,19]:

p_{s} = \int {|ϕ_{s}^{D y s o n} (x)|}^{2} d x

(6)

For strong correlation the pole strength is close to zero, whereas for weak correlation it approaches unity [16]. When the Koopmans picture provides a qualitatively valid description of the

Ψ_{N}

and

Ψ_{s, N - 1}

states, the pole strength is greater than 0.85 [16].

In this paper we employ two diagonal approximations of the EPT, namely OVGF and P3+, in which the off-diagonal elements of the self-energy matrix are neglected [20]. The OVGF method is designed for outer valence electrons and is based on a perturbative expansion of the self-energy that is exact through third order, while higher order contributions are treated approximately [14]. There are three variants of the OVGF method, denoted A, B, and C, which differ in the form of the self-energy expression. The corresponding expressions for the self-energy are given in ref. [10]. Method A is applied when the ionization energy is greater than 15 eV, whereas method B is used when the ionization energy is smaller than 15 eV [20]. Approximations A and B correspond to the cases in which the second-order terms in the self-energy exceed the third-order terms, whereas approximation C is used when the second order terms are small [20]. An algorithm for selecting among methods A, B, and C is listed in ref. [20]. The computational costs of the OVGF method scales as OV⁴, where O is the number of occupied orbitals and V is the number of virtual orbitals [15,16,21]. For closed-shell systems, the absolute error in computed vertical ionization potential below 20 eV is about 0.25 eV [20,22].

The P3 method [23] neglects some terms from the self-energy expression used in OVGF. A renormalized version of P3, denoted P3+, has proven to be very effective for computing the ionization energies of challenging anions [17]. The computational cost of both P3 and P3+ scales as O³V², which represents an improvement over OVGF method because the number of occupied orbitals is usually much smaller than the number of virtual orbitals [16]. For P3+, mean absolute deviations of less than 0.2 eV between calculated and experimental ionization energies have been reported for anions with ionization potentials below 20 eV [17].

For the seven atoms considered in this work, the energy-difference approach was used in conjunction with the DFT functional B2PLYP [24], as well as the post-Hartree–Fock methods MP2 [25], CCSD(T) [26,27], QCISD(T) [27], G3 [28], and CBS-QB3 [29]. These methods were combined with the aug-cc-pVQZ basis set. The OVGF and P3+ calculations were performed with the 6-311+G(2df,p), cc-pVQZ, and aug-cc-pVQZ basis sets. All these calculations were carried out with Gaussian 16 Revision C.01 [30] on the High-Performance Computing Center of Babeș-Bolyai University [31].

In addition to the calculations described above, the coupled-cluster methods CCSD(T) [26,27], CCSDT [32], and IP-EOM-CCSD [33,34] were used for the calculation of the ionization energies of the C, N, O, F, P, and S atoms, as implemented in ORCA 6.1.0 [35]. The CCSD(T) method combines the iterative coupled-cluster treatment of single and double excitations with a perturbative correction of the triple excitations. The CCSDT method provides an iterative treatment of the triple excitations and was employed in order to evaluate the quality of the perturbative correction in the CCSD(T). The CCSDT calculations were performed using the AUTOCI module implemented in ORCA. The IP-EOM-CCSD method is an equation-of-motion (EOM) coupled-cluster approach for ionization energies. It is based on a neutral CCSD reference state, while the ionized states are described through electron removal operators in the EOM space. Therefore, unlike the energy-difference approach, IP-EOM-CCSD gives the ionization energies directly. For the hydrogen atom, which is a one-electron system, only Hartree-Fock (HF) calculations were performed. Very tight criterion was used for the convergence of SCF calculations.

Three families of basis sets were used: the aug-cc-pVXZ basis sets (X = Q, 5) [36,37], the corresponding relativistically recontracted aug-cc-pVXZ-DK basis sets (X = Q, 5) [38], and the relativistically contracted atomic natural orbitals basis sets (ANO-RCC) at the TZP, QZP, and Full contraction levels [39]. The aug-cc-pVXZ-DK and ANO-RCC calculations were performed in conjunction with the second order Douglas-Kroll-Hess scalar relativistic Hamiltonian (DKH2) [40,41] and the finite nucleus model, while the aug-cc-pVXZ calculations were done without relativistic corrections. All electrons were included in the correlation treatment through the NoFrozenCore option. For the CCSD(T) and CCSDT calculations, the ionization energies were obtained as the energy differences between the neutral and cationic atoms. For the IP-EOM-CCSD calculations, the ionization energies were obtained directly from the ionized roots of the neutral reference state. For the six atoms with more than one electron, the α and β electron removal roots were treated separately, and the first ionization energy was considered to be the lower value.

In order to reduce the basis set error, the ionization energies obtained with CCSD(T), CCSDT, and IP-EOM-CCSD were extrapolated to the complete basis set (CBS) limit using the two-point formula of Helgaker et al. [42,43]:

E_{X} = E_{C B S} + \frac{A}{X^{3}}

(7)

which results in

E_{C B S} = \frac{X^{3} E_{X} - Y^{3} E_{Y}}{X^{3} - Y^{3}}

(8)

where X and Y are the cardinal numbers of the basis sets used in the extrapolation. For the aug-cc-pVXZ-DK and aug-cc-pVXZ basis sets, the (Q,5) pair was used, while for the ANO-RCC basis sets the (T,Q) pair was employed based on the TZP and QZP contractions. For the CCSD(T) and CCSDT methods, a dual-level CBS extrapolation scheme was used, in which the total energy at each basis set level was separated into a self-consistent field (SCF) component and a correlation component:

E_{t o t} (X) = E_{S C F} (X) + E_{c o r r} (X)

(9)

This separation was justified by the fact that the two contributions exhibit different convergence patterns with increasing the basis set size [43,44]. The SCF energy converges exponentially with the cardinal number and is already close to the CBS limit at the 5Z level, while the correlation energy converges asymptotically with X^-3. Therefore, the ionization energy at the CBS limit was obtained as:

{I E}_{S C F} (Y) = E_{S C F}^{c a t} (Y) - E_{S C F}^{n e u} (Y)

(10)

{I E}_{c o r r} (X) = E_{c o r r}^{c a t} (X) - E_{c o r r}^{n e u} (X)

(11)

{I E}_{c o r r, C B S} = \frac{Y^{3} {I E}_{c o r r} (Y) - X^{3} {I E}_{c o r r} (X)}{Y^{3} - X^{3}}

(12)

{I E}_{C B S} = {I E}_{S C F} (Y) + {I E}_{c o r r, C B S}

(13)

where Y = X+1 and the SCF contribution is taken from the larger basis set, namely aug-cc-pV5Z, aug-cc-pV5Z-DK, and ANO-RCC-QZP. This scheme was used by Richard et al. [44] for the calculation of ionization potentials and electron affinities at the CCSD(T)/CBS limit. For the IP-EOM-CCSD method, which computes the ionization energy directly, the extrapolation was applied directly to the α and β electron removal ionization energies [45]:

{I P}_{s, C B S} = \frac{Y^{3} {I E}_{s} (Y) - X^{3} {I E}_{s} (X)}{Y^{3} - X^{3}}, s = α, β

(14)

The first ionization energy at the CBS limit was then obtained as the lower of the two extrapolated values.

3. Results and Discussion

3.1. Ionization Energies for the C, N, O, F, P, and S Atoms Obtained with the OVGF and P3+ Methods

Table 1 summarizes the ionization energies of the six atoms C, N, O, F, P, and S calculated using the OVGF and P3+ methods. The hydrogen atom was not considered within these approaches because, having only one electron, electron correlation is absent. Three basis sets were used: cc-pVQZ, aug-cc-pVQZ, and 6-311+G(2df,p). The table reports both the experimental ionization energies and the values computed with the OVGF and P3+ methods. For OVGF, the A, B, and C variants were evaluated according to the algorithm implemented in Gaussian package, and the recommended values were used in the discussion. Beside these quantities, the table includes the spin multiplicities of the neutral atoms, the indexes of the HOMO and LUMO orbitals, and the orbital window employed in the calculations.

Figure 1 sketch the atomic orbital levels of the C, N, O, F, P, and S atoms used in the OVGF calculations, while Figure 2 exemplifies the orbital indexing for the OVGF method in case of the sulfur atom.

Table 2 contains the MAE and MSE values averaged over the six atoms for the three basis sets with the OVGF and P3+ methods. Chemical accuracy is not achieved for any combination of basis set and method on average. For OVGF, the smallest MAE is obtained with the aug-cc-pVQZ basis set, followed by cc-pVQZ and then by 6-311+G(2df,p), indicating that the inclusion of diffuse functions improves the description of the ionized state. Higher errors are obtained for the P3+ method, with the same order of the basis sets as for OVGF.

Figure 3 shows the deviations of the calculated ionization energy with respect to the experimental values for the six atoms for the OVGF and P3+ methods. For carbon atom, the OVGF overestimates and P3+ underestimates the experimental value. For nitrogen, the OVGF deviations are positive, while the P3+ underestimates the IE. None of the calculated values achieves chemical accuracy for C and N atoms. For oxygen, OVGF/aug-cc-pVQZ yields a deviation of only 0.0083 eV, within chemical accuracy, while the other two basis sets give larger negative deviations. Moreover, the P3+ method strongly underestimates the IE for oxygen, with deviations from the experiment in the range of -0.5191 eV to -0.3431 eV. For fluorine, OVGF/aug-cc-pVQZ achieves chemical accuracy with a deviation of -0.0346, while the P3+ underestimates the IE. For phosphorus, the OVGF deviations are below 0.0434 eV for all three basis sets, reaching chemical accuracy, while P3+ is close to the limit only with the aug-cc-pVQZ basis set. For sulfur, the OVGF underestimates the IE, while the P3+ method yields large deviations. In all cases, the 6-311+G(2df,p) basis set yields the largest deviations, while aug-cc-pVQZ gives the smallest, indicating that the inclusion of diffuse functions improves the description of the ionized state. The P3+ method systematically underestimates the experimental ionization energies for all atoms and basis sets, as also reflected in the negative MSE values reported in Table 2.

3.2. Ionization Energies of the H, C, N, O, F, P, and S Atoms Obtained with the Energy-Difference Approach and IP-EOM-CCSD

Since the OVGF and P3+ methods do not achieve chemical accuracy for the considered atoms (Table 1), higher level coupled-cluster calculations were performed using CCSD(T), CCSDT, and IP-EOM-CCSD methods as implemented in ORCA 6.1.0, combined with the basis set extrapolation to the complete basis set (CBS) limit and scalar relativistic corrections through the DKH2 Hamiltonian. In addition, a series of calculations were carried out with Gaussian 16 using B2PLYP, MP2, CCSD(T), QCISD(T), G3, and CBS-QB3 methods with the default frozen-core approximation, in which only the valence electrons are included in the correlation treatment.

Table 3, Table 4, Table 5, Table 6, Table 7, Table 8 and Table 9 summarize the ionization energies of the H, C, N, O, F, P, and S atoms. The upper part of each table reports the results obtained with ORCA 6.1.0 using the CCSD(T), CCSDT, and IP-EOM-CCSD methods with three families of basis sets (aug-cc-pVXZ-DK, aug-cc-pVXZ, and ANO-RCC) together with the corresponding CBS extrapolations. For CCSD(T) and CCSDT the ionization energies were obtained as total energy differences between the neutral and cationic species, while for the IP-EOM-CCSD method they were obtained directly from the ionized roots of the neutral reference state. The lower part of the tables shows the results obtained with Gaussian 16 using the frozen-core default approximation and the energy-difference approach for the B2PLYP, MP2, CCSD(T), and QCISD(T) methods combined with aug-cc-pVQZ basis set, as well as the composite methods G3 and CBS-QB3. All Gaussian calculations employed the default frozen-core approximation. All deviations (ΔIE) are reported relative to the experimental values from NIST Atomic Spectra Database (ver. 5.12) [46]. In addition, the tables provide the spin multiplicities of both the neutral and cationic species. For each atom, the corresponding deviations are also presented graphically in Figure 4, Figure 5, Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10.

For the hydrogen atom (Table 3), which is a single-electron system, there is no electron correlation, and the ionization energy corresponds to the Hartree-Fock (HF) value. HF calculations were performed with three families of basis sets: the aug-cc-pVXZ (X = Q, 5), the relativistic aug-cc-pVXZ-DK (X = Q, 5) including the DKH2 Hamiltonian, and the ANO-RCC basis sets at the TZP, QZP, and Full contraction levels. All computed IE values are within the range of 13.6043 – 13.6057 eV, with deviations ranging from 0.0058 to 0.0073 eV relative to the experimental value of 13.5984 eV, which are clearly within chemical accuracy. The effect of scalar relativistic corrections (DKH2) is minimal for H, corresponding to only 0.0002 eV, as seen from the comparison between the values obtained with aug-cc-pVQZ-DK and aug-cc-pVQZ. The ANO-RCC basis sets already converge at the TZP contraction level (13.6051 eV), QZP and Full giving identical values (13.6054 eV). Moreover, the G3 and CBS-QB3 yield 13.6330 eV (ΔIE = 0.0345 eV) and 13.6007 eV (ΔIE = 0.0023 eV), respectively, CBS-QB3 providing the closest agreement with the experiment among all methods considered for this atom.

These results are also presented in Figure 4, which shows that IE values for the hydrogen atom are well within the chemical accuracy interval, with only minor differences between the basis sets.

In the case of carbon atom (Table 4), the CCSD(T)/aug-cc-pVXZ-DK ionization energy values converge from 11.2312 eV (QZ, ΔIE = -0.0291 eV) to 11.2476 eV (5Z, ΔIE = -0.0127 eV), while the CBS extrapolation yields 11.2635 eV (ΔIE = 0.0032 eV), remaining within the limits of chemical accuracy. CCSDT produces a CBS value of 11.2671 eV (ΔIE = 0.0068 eV), with a difference of only 0.0036 eV relative to CCSD(T), showing that the perturbative triples correction in CCSD(T) is enough for an accurate description of the ionization energy. For IP-EOM-CCSD, the first ionization is attributed to the α electron removal. This method already overestimates the IE at the QZ level with 0.0325 eV, and the CBS extrapolation further increases this deviation to 0.0778 eV, outside chemical accuracy, due to the missing triple excitations.

The aug-cc-pVXZ results are slightly higher, with the CCSD(T) CBS value at 11.2672 eV (ΔIE = 0.0069 eV) and CCSDT at 11.2708 eV (ΔIE = 0.0105 eV), both within chemical accuracy. The non-relativistic IP-EOM-CCSD CBS value of 11.3419 eV (ΔIE = 0.0816 eV) follows the same overestimation pattern. The DKH2 contribution is approximately -0.0037 eV across all methods and basis sets.

With ANO-RCC basis sets, the CCSD(T) values increase from 11.1962 eV (TZP) to 11.2163 eV (QZP) and 11.2274 eV (Full), the last value being within chemical accuracy (Full, ΔIE = -0.0329 eV). The CBS(T,Q) extrapolation yields 11.2337 eV (ΔIE = -0.0266 eV), also within chemical accuracy but further from experiment than the aug-cc CBS result. CCSDT CBS(T,Q) extrapolation value is 11.2384 eV (ΔIE = -0.0219 eV), while IP-EOM-CCSD CBS(T,Q) reaches 11.3029 eV (ΔIE = 0.0426 eV), at the chemical accuracy limit.

Among the Gaussian 16 results with the aug-cc-pVQZ basis set, B2PLYP overestimates the IE with 0.0893 eV, while MP2 gives a deviation of 0.0198 eV. QCISD(T) yields a deviation of -0.0531 eV. The results of the composite methods G3 and CBS-QB3 are 11.2114 eV (ΔIE = -0.0489 eV) and 11.1925 eV (ΔIE = -0.0678 eV), respectively.

A graphical summary of the deviations from the experimental value corresponding to the C atom is depicted in Figure 5. The four panels clearly show the good agreement with the experiment obtained for CCSD(T) and CCSDT, particularly with the CBS extrapolation, the systematic overestimation by the IP-EOM-CCSD method with the aug-cc basis sets, and the large negative deviations observed with the frozen-core methods.

For the nitrogen atom (Table 5), the CCSD(T)/aug-cc-pVXZ-DK results show a particularly fast convergence. At the 5Z level, the ionization energy of 14.5346 eV (ΔIE = 0.0005 eV), is in excellent agreement with the experimental value of 14.5341 eV. The CBS value of 14.5506 eV (ΔIE = 0.0165 eV) slightly overestimates the IE_exp value, but it remains within chemical accuracy. CCSDT yields identical results, with a CBS value of 14.5502 eV (ΔIE = 0.0161 eV) and a difference of only 0.0004 eV relative to CCSD(T). For IP-EOM-CCSD, the first ionization corresponds to the lowest α electron removal root. As observed for carbon, the method already overestimates the IE at the QZ level (ΔIE = 0.0100 eV), and the CBS extrapolation increases this deviation to 0.0588 eV, outside chemical accuracy. The aug-cc-pVXZ calculations follow the same pattern, with CCSD(T) CBS at 14.5567 eV (ΔIE = 0.0226 eV) and CCSDT CBS at 14.5563 eV (ΔIE = 0.0222 eV), both within chemical accuracy. The IP-EOM-CCSD CBS value of 14.5990 eV (ΔIE = 0.0649 eV) remains outside the threshold. The DKH2 contribution is -0.0061 eV, larger than for carbon (-0.0037 eV), consistent with the higher nuclear charge.

With the ANO-RCC basis sets, the CCSD(T) values increase from 14.4838 eV (TZP, ΔIE = -0.0504 eV) to 14.4976 eV (QZP, ΔIE = -0.0365 eV) and 14.5173 eV (Full, ΔIE = -0.0168 eV), the last two being within chemical accuracy. The CBS(T,Q) extrapolation yields 14.5114 eV (ΔIE = -0.0227 eV), also within chemical accuracy. CCSDT and IP-EOM-CCSD follow the same pattern, with ANO-RCC-Full values of 14.5176 eV (ΔIE = -0.0166 eV) and 14.5487 eV (ΔIE = 0.0146 eV), respectively.

Furthermore, B2PLYP exhibits a deviation of -0.0171 eV, lower than 0.0434 eV, while MP2 overestimates the value of IE with 0.0684 eV. The QCISD(T) yields 14.5001 eV (ΔIE = -0.0340 eV), whereas the G3 and CBS-QB3 exhibit deviations of -0.0273 eV and -0.0415 eV, respectively, all of them being within chemical accuracy.

Figure 6 presents the comparison of the ΔIE values for the nitrogen atom. The compact range of the CCSD(T) and CCSDT results around zero confirms the fast basis set convergence observed for this atom, while the IP-EOM-CCSD panel shows the progressive overestimation with increasing the basis set size.

In the case of the oxygen atom (Table 6), the CCSD(T) method with the aug-cc-pVXZ-DK basis sets yields IE values that converge systematically from 13.5204 eV at the QZ level (ΔIE = -0.0977 eV) to 13.5620 eV at the 5Z level (ΔIE = -0.0560 eV). The CBS value obtained with the SCF(5Z) + CBS(Q,5)-extrapolated correlation energy scheme is 13.6066 eV (ΔIE = -0.0115 eV), achieving chemical accuracy. The CCSDT results follow the same convergence pattern, with the CBS value of 13.6094 eV (ΔIE = -0.0087 eV), confirming that the perturbative triples in CCSD(T) is enough for an accurate description of the ionization energy, since the difference between the two methods at the CBS limit is only 0.0028 eV. For the IP-EOM-CCSD, the first ionization corresponds to the lowest β electron removal. The CBS value of 13.6483 eV (ΔIE = 0.0302 eV) remains within chemical accuracy, although the positive deviation indicates a slight overestimation of the experimental value. The best agreement for this method is obtained at the 5Z level (ΔIE = -0.0226 eV), where the basis set incompleteness error partially compensates for the overestimation associated with the absence of triple excitations.

The non-relativistic calculations with the aug-cc-pVXZ basis sets show a similar convergence but with higher ionization energy values. For CCSD(T), the CBS value is 13.6155 eV (ΔIE = -0.0026 eV), closer to the experiment than the DKH2 result. For CCSDT, the CBS value of 13.6183 eV (ΔIE = 0.0002 eV) is in excellent agreement with the experimental value. The IP-EOM-CCSD CBS value of 13.6571 eV (ΔIE = 0.0390 eV) follows the same trend and remains under the 0.0434 eV value. The contribution of DKH2 scalar relativistic correction to the IE of oxygen is constant, with a value of -0.0090 eV across all methods and basis sets.

The ANO-RCC basis sets show a significantly slower convergence. For CCSD(T), the ionization energy increases from 13.3830 eV (TZP, ΔIE = -0.2350 eV) to 13.4440 eV (QZP, ΔIE = -0.1741 eV) and 13.5149 eV (Full, ΔIE = -0.1031 eV), although none of them achieve chemical accuracy. The CBS(T,Q) extrapolation yields 13.4996 eV (ΔIE = -0.1185 eV), which is worse than the uncontracted Full result, indicating the absence of diffuse functions in the ANO-RCC family. The same trend is observed for CCSDT and IP-EOM-CCSD, with ANO-RCC-Full values of 13.5183 eV (ΔIE = -0.0998 eV) and 13.5454 eV (ΔIE = -0.0726 eV), respectively.

For the results obtained with Gaussian 16 using the aug-cc-pVQZ basis set, B2PLYP overestimates the IE with 0.1965 eV, while MP2 underestimates it with 0.1570 eV. The QCISD(T) method yields a deviation from the experimental value of -0.1032 eV. None of these methods achieves chemical accuracy. Moreover, the G3 and CBS-QB3 methods yield values of 13.5478 eV (ΔIE = -0.0702 eV) and 13.5869 eV (ΔIE = -0.0311 eV), respectively. Among these composite methods, only CBS-QB3 achieves chemical accuracy.

The corresponding deviations are illustrated in Figure 7, where the convergence from negative ΔIE values at the QZ and 5Z levels toward the experimental value at the CBS limit is visible for both CCSD(T) and CCSDT. The large underestimation with the ANO-RCC basis sets is also visible.

For the fluorine atom (Table 7), the CCSD(T)/aug-cc-pVXZ-DK converges from 17.3488 eV (QZ, ΔIE = -0.0740 eV) to 17.3863 eV (5Z, ΔIE = -0.0365 eV), and the CBS value of 17.4283 eV (ΔIE = 0.0055 eV) achieves chemical accuracy. CCSDT produces a CBS value of 17.4275 eV (ΔIE = 0.0047 eV), following the same trend with CCSD(T). For IP-EOM-CCSD, the first ionization is attributed to the lowest β electron removal root. Unlike the other atoms where IP-EOM-CCSD overestimates IE, here both QZ (ΔIE = -0.1017 eV) and 5Z (ΔIE = -0.0498 eV) underestimate it, and the CBS extrapolation yields a result of 17.4276 eV (ΔIE = 0.0047 eV), in excellent agreement with the experiment and within chemical accuracy.

The aug-cc-pVXZ results have higher ionization energy values. The CCSD(T) CBS is 17.4410 eV (ΔIE = 0.0182 eV) and the CCSDT CBS is 17.4403 eV (ΔIE = 0.0175 eV), both within chemical accuracy. The IP-EOM-CCSD CBS value of 17.4402 eV (ΔIE = 0.0174 eV) also remains within the threshold. The DKH2 contribution is -0.0127 eV, the largest among the period 2 atoms, consistent with fluorine having the highest nuclear charge in this group.

The ANO-RCC basis sets show a slow convergence also for fluorine. CCSD(T) values range from 17.1979 eV (TZP, ΔIE = -0.2249 eV) to 17.2638 eV (QZP, ΔIE = -0.1591 eV) and 17.3443 eV (Full, ΔIE = -0.0785 eV), none achieving chemical accuracy. The CBS(T,Q) extrapolation yields 17.3251 eV (ΔIE = -0.0977 eV), again worse than the Full result. Moreover, the CCSDT values follow the same trend, underestimating the ionization energy with 0.2238 eV (TZP), 0.1586 (QZP), 0.0787 (Full), and 0.0977 (CBS(T,Q)). For IP-EOM-CCSD, the ANO-RCC-Full and CBS (T,Q) deviations of -0.0987 eV and -0.1309 eV are both far from chemical accuracy.

Furthermore, B2PLYP strongly overestimates IE with 1.4022 eV, the largest error observed for any method across all atoms studied. MP2 gives a deviation of -0.0270 eV, within chemical accuracy. QCISD(T) yields a value of 17.3486 eV, while G3 and CBS-QB3 methods exhibit deviations of -0.0344 eV and 0.0513 eV, respectively, only the G3 result being within chemical accuracy.

As shown in Figure 8, for the fluorine atom, the CCSD(T), CCSDT, and IP-EOM-CCSD values improve with the basis set, and the CBS extrapolations achieve chemical accuracy. The anomalous B2PLYP overestimation is clearly visible in the frozen-core panel. For clarity, the vertical axis was limited to 0.3 eV, even though the B2PLYP deviation is significantly higher, at 1.4022 eV.

In the case of the phosphorus atom (Table 8), the CCSD(T)/aug-cc-pVXZ-DK ionization energy of 10.4796 eV at the QZ level (ΔIE = -0.0071 eV) is already very close to the experimental value of 10.4867 eV. At the 5Z level, the value increases to 10.5003 eV (ΔIE = 0.0136 eV), and the CBS extrapolation further raises the overestimation to 10.5242 eV (ΔIE = 0.0375 eV), approaching the chemical accuracy limit. The CCSDT exhibits the same behavior, with a CBS value of 10.5245 eV (ΔIE = 0.0378 eV) and a difference of only 0.0003 eV relative to CCSD(T). For IP-EOM-CCSD, the first ionization corresponds to the lowest root associated with the α electron removal. The method overestimates the IE already at the QZ level (ΔIE = 0.0192 eV), and the CBS value of 10.5817 eV (ΔIE = 0.0951 eV) represents the largest IP-EOM-CCSD deviation among all atoms studied with the aug-cc-pVXZ-DK basis sets.

The aug-cc-pVXZ calculations yield systematically higher values. The CCSD(T) CBS of 10.5365 eV (ΔIE = 0.0498 eV) exceeds the chemical accuracy threshold, in contrast to the DKH2 result, which remains within the limit. This makes phosphorus the only atom for which the inclusion of DKH2 relativistic corrections improves the CCSD(T) CBS result from outside to within the chemical accuracy. The CCSDT CBS value of 10.5368 eV (ΔIE = 0.0501 eV) exhibits the same behavior. The IP-EOM-CCSD CBS value of 10.5939 eV (ΔIE = 0.1072 eV) confirms the strong overestimation pattern. The DKH2 contribution is -0.0121 eV, comparable to that of fluorine (-0.0127 eV), indicating that the scalar relativistic correction remains significant for the period 3 atom.

With the ANO-RCC basis sets, the CCSD(T) energy increase from 10.4497 eV (TZP, ΔIE = -0.0370 eV) to 10.4594 eV (QZP, ΔIE = -0.0273 eV) and 10.4890 eV (Full, ΔIE = 0.0023 eV), the latter being in excellent agreement with experiment. The CBS(T,Q) extrapolation yields 10.4672 eV (ΔIE = -0.0195 eV), within chemical accuracy but further from experiment than the Full value. CCSDT with ANO-RCC-Full basis set yields 10.4909 eV (ΔIE = 0.0043 eV), also within chemical accuracy. For IP-EOM-CCSD with the ANO-RCC basis sets, the ionization energy increases from 10.4708 eV (TZP, ΔIE = -0.0159 eV) to 10.4865 eV (QZP, ΔIE = -0.0002 eV), the latter being in excellent agreement with the experiment. However, the Full value of 10.5337 eV (ΔIE = 0.0471 eV) overestimates the experimental value and slightly exceeds the chemical accuracy. The CBS(T,Q) yields a value of 10.4980 eV (ΔIE = 0.0113 eV), within chemical accuracy.

For the Gaussian 16 results, B2PLYP underestimates the ionization energy with 0.1445 eV and MP2 yields a deviation of only 0.0029 eV, the closest to experiment among all Gaussian methods with the aug-cc-pVQZ basis set. QCISD(T) result overestimates the value with 0.0125 eV, whereas the G3 and CBS-QB3 yield deviations of -0.0230 eV and -0.0394 eV, respectively, both within chemical accuracy.

The performance of the tested methods for the phosphorus atom is illustrated in Figure 9. The figure highlights that the QZ values are already close to the experiment and the CBS extrapolation leads to a slight overestimation for CCSD(T) and CCSDT, while the IP-EOM-CCSD CBS values exceed the chemical accuracy limit.

For the sulfur atom (Table 9), the IE results obtained with the CCSD(T) and the aug-cc-pVXZ-DK basis sets show the largest basis set dependence among all atoms studied. The QZ value of 10.2414 eV (ΔIE = -0.1186 eV) and the 5Z value of 10.2842 (ΔIE = -0.0758 eV) significantly underestimate the experimental value of 10.3600 eV, while the CBS value of 10.3297 eV (ΔIE = -0.0304 eV) achieves chemical accuracy. CCSDT yields a CBS value of 10.3332 eV (ΔIE = -0.0268 eV), with a difference of 0.0035 eV relative to CCSD(T). For IP-EOM-CCSD, the first ionization corresponds to the lowest root associated with the β electron removal. The QZ value (ΔIE = -0.0415 eV) underestimates the IE, while the 5Z value (ΔIE = 0.0169 eV) shifts to the opposite side. The CBS extrapolation increases the overestimation to 0.0782 eV, outside chemical accuracy, a behavior similar to that observed for oxygen but with a larger final deviation.

The aug-cc-pVXZ results yield higher ionization energy values, with the CCSD(T) CBS result being 10.3441 eV (ΔIE = -0.0159 eV), closer to the experiment than the DKH2 result. The CCSDT CBS is 10.3476 eV (ΔIE = -0.0124 eV), also within chemical accuracy and providing the best overall agreement with the experiment among the ORCA methods. The IP-EOM-CCSD CBS of 10.4530 eV (ΔIE = 0.0930 eV) remains far from the threshold. The DKH2 contribution is -0.0142 eV, the largest among all atoms studied, consistent with sulfur having the highest nuclear charge.

The ANO-RCC basis sets show the weakest convergence for sulfur compared to all other atoms. The CCSD(T) values range from 10.1113 eV (TZP, ΔIE = -0.2487 eV) to 10.1488 eV (QZP, ΔIE = -0.2112 eV) and 10.2542 eV (Full, ΔIE = -0.1058 eV), all being far from chemical accuracy. The CBS(T,Q) extrapolation yields 10.1778 eV (ΔIE = -0.1823 eV), significantly worse than the Full result. CCSDT follows the same pattern, with ANO-RCC-Full at 10.2581 eV (ΔIE = -0.1019 eV) and CBS(T,Q) at 10.1778 eV (ΔIE = -0.1823 eV). For IP-EOM-CCSD, the ANO-RCC-Full value of 10.3493 eV (ΔIE = -0.0107 eV) is within chemical accuracy, although this result likely reflects the cancelation of errors between the missing triple excitations and basis set incompleteness.

Finally, the B2PLYP computation yields a deviation of only -0.0047 eV, the closest to the experimental value for this atom, while the MP2 underestimates the ionization energy with 0.2204 eV, the largest MP2 deviation among all atoms studied. QCISD(T) yields a deviation of -0.1051 eV, while G3 and CBS-QB3 underestimate the experimental value with 0.0909 eV and 0.1156 eV, respectively. Thus, sulfur and carbon are the only atoms for which none of the Gaussian composite methods achieve chemical accuracy.

Figure 10 shows the ΔIE values for the sulfur atom. The ANO-RCC basis sets show significant negative deviations and IP-EOM-CCSD goes from negative to positive ΔIE from the QZ to 5Z levels.

It should be noted that the CCSD(T) results obtained with Gaussian 16 using the aug-cc-pVQZ basis set are not directly comparable to ORCA CCSD(T)/aug-cc-pVQZ values, as in the Gaussian calculations were performed with the default frozen-core approximation, while all ORCA computations employed NoFrozenCore. The resulting differences range from 0.002 eV for sulfur to 0.093 eV for carbon, depending on the relative contribution of core-valence correlation to the ionization energy of each atom. Moreover, the same frozen-core approximation was also used in all Gaussian calculations considered here.

The CCSDT calculations are considerably more demanding than CCSD(T), when the same computational settings are used. For the fluorine atom with the aug-cc-pV5Z-DK basis set, employing the same setup, the CCSDT calculation took approximately 10 hours, while the corresponding CCSD(T) calculation required only 8 minutes and 27 seconds. This difference shows the significantly higher cost associated with the iterative treatment of triple excitations in CCSDT.

Table 10 summarizes the energies, types and degeneracies of the HOMO and HOMO – 1 orbitals for the C, N, O, F, P, and S atoms, together with the HOMO – HOMO-1 energy gap, calculated at QCISD(T)/aug-cc-pVQZ level of theory. The spin character of HOMO confirms the behavior in the IP–EOM–CCSD calculations, indicating that for C, N, and P, the lowest ionization energy is associated with the α electron removal, while for O, F, and S, it comes from the β electron removal. The smallest HOMO – HOMO-1 gap is obtained for sulfur (0.98 eV), while the largest gap corresponds to the phosphorus (4.47 eV).

The overall performance of the computational methods is summarized in Table 11, which reports the mean absolute error (MAE) and mean signed error (MSE) for each method and basis set combination, averaged over the C, N, O, F, P, and S atoms.

Among all combinations of method and basis set, the lowest MAE values are obtained with the CCSDT CBS extrapolation using the aug-cc-pVXZ-DK basis sets (MAE = 0.0168 eV, MSE = 0.0050 eV), closely followed by CCSD(T) CBS extrapolated correlation (Q,5) with the same basis sets (MAE = 0.0174 eV, MSE = 0.0035 eV). The nearly identical MAE values confirm that the perturbative triples treatment in CCSD(T) is enough for all atoms studied. The small positive MSE indicates a slight overestimation at the CBS limit.

The aug-cc-pVXZ CBS results show slightly larger MAE values of 0.0194 eV for CCSD(T) and 0.0188 eV for CCSDT, with MSE values of 0.0132 eV and 0.0147 eV, respectively. The increase in both MAE and MSE relative to the DKH2 results reflects the absence of scalar relativistic corrections, which systematically raise the computed ionization energies. Nevertheless, the CBS results obtained with these basis sets remain within chemical accuracy on average.

For IP-EOM-CCSD, the CBS extrapolation yields MAE = 0.0575 eV (MSE= 0.0575 eV) with DKH2 and MAE = 0.0672 (MSE = 0.0672 eV) without DKH2. In both cases, MAE equals MSE, indicating that the method systematically overestimates the ionization energy for all atoms at the CBS limit. At the finite basis set level, the performance of IP-EOM-CCSD is comparable to CCSD(T), with MAE values of 0.0463 eV (QZ-DK) and 0.0390 eV (5Z-DK), suggesting that the basis set incompleteness error partially compensates for the missing triple excitations.

The ANO-RCC basis sets have larger MAE values, ranging from 0.1433 eV (TZP) to 0.0566 eV (Full) for CCSD(T). The MSE values are consistently negative, indicating an underestimation due to the absence of diffuse functions. The CBS(T,Q) extrapolation (MAE = 0.0784 eV) does not improve over the Full result, consistent with the observation that the extrapolation cannot compensate for the missing diffuse functions.

Across the Gaussian 16 methods that were used with the aug-cc-pVQZ basis set, B2PLYP exhibits the largest MAE (0.3091 eV) with a positive MSE (0.2536), mostly influenced by the overestimation for fluorine. The composite methods G3 (MAE = 0.0491 eV, MSE = -0.0491 eV) and CBS-QB3 (MAE = 0.0578 eV, MSE = -0.0407 eV) provide the best performance among these methods, although none of them achieves chemical accuracy on average. For G3, the equality between MAE and MSE indicates a systematic underestimation for all six atoms. QCISD(T) (MAE = 0.0637 eV), MP2 (MAE = 0.0826 eV), and CCSD(T) (MAE = 0.0756 eV) with the aug-cc-pVQZ basis set perform somewhat worse. For comparison, the corresponding ORCA CCSD(T)/aug-cc-pVQZ result gives a lower MAE of 0.0490 eV (MSE = -0.0474 eV). The difference between the Gaussian and ORCA CCSD(T) values can be attributed mainly to the frozen-core approximation employed in Gaussian calculations, while the ORCA results were obtained with NoFrozenCore.

Overall, the statistical performance is summarized graphically in Figure 11, which reports the MAE values averaged over the C, N, O, F, P, and S atoms. The CCSD(T) and CCSDT CBS extrapolations with aug-cc-pVXZ-DK have the lowest MAE values, while the ANO-RCC and IP-EOM-CCSD results exhibit consistently higher error. Among the methods with the frozen-core approximation, G3 and CBS-QB3 provide the best performance, however none of them achieve chemical accuracy on average.

4. Conclusions

In this work, the first ionization energies of the H, C, N, O, F, P, and S atoms were investigated using several quantum chemical methods, with the aim of identifying the computational protocols able to reproduce the experimental values within chemical accuracy. The methods included the electron propagator approaches OVGF and P3+, as implemented in Gaussian 16, the coupled-cluster methods CCSD(T), CCSDT, and IP-EOM-CCSD, as implemented in ORCA 6.1.0, and the composite methods G3 and CBS-QB3. Several post-Hartree-Fock and DFT methods were also tested using the energy-difference approach.

The OVGF and P3+ methods did not reach chemical accuracy on average for the six atoms (C, N, O, F, P, S). For all basis sets, OVGF performed better than P3+, while P3+ constantly underestimated the experimental ionization energies, as shown by the negative MSE. Among the tested basis sets, aug-cc-pVQZ gave the best results for both methods, followed by cc-pVQZ and 6-311+G(2df,p). Chemical accuracy was achieved only in some cases, specifically with OVGF for phosphorus for all three basis sets, and for oxygen and fluorine when the aug-cc-pVQZ basis set was used. The best agreement obtained with the propagator methods was found for P atom, which has a large HOMO – HOMO-1 gap.

The CCSD(T) and CCSDT methods, used together with the aug-cc-pVXZ-DK basis sets and the CBS extrapolation scheme provided the most accurate result. The corresponding MAE values were 0.0174 eV for CCSD(T) and 0.0168 eV for CCSDT. The difference between the two methods at the CBS limit was small for each atom, showing that the perturbative triples correction in CCSD(T) is enough for an accurate description of these ionization energies. Therefore, the much more computationally expensive iterative treatment of triple excitations in CCSDT did not bring significant improvement for the ionization energies of the atoms that were studied. Moreover, considering the much higher computational cost of CCSDT, CCSD(T) combined with CBS extrapolation represents the best compromise between the accuracy and the hardware resources.

The scalar relativistic contribution introduced through the DKH2 Hamiltonian was small, but consistent. Its size increased with the atomic number, from about 0.0002 eV for hydrogen to approximately 0.0142 eV for sulfur. For hydrogen, which is a one-electron system, the ionization energy is described at the Hartree-Fock level. All HF values obtained with the tested basis set families remained within chemical accuracy, and the CBS-QB3 gave the closest agreement with the experimental value.

The IP-EOM-CCSD method showed a different behavior. Since this method calculates the ionization energies directly from the ionized roots of the neutral reference state, it provides an independent comparison with the energy difference results. At the finite basis sets, IP-EOM-CCSD often gave values close to those obtained with CCSD(T), due to the basis set incompleteness error, which compensated for the missing triple excitations. At the CBS limit, however, this effect was reduced, and the method began to overestimate the experimental values.

The ANO-RCC basis sets showed a less uniform convergence than the aug-cc-pVXZ-DK basis sets. The CBS(T,Q) extrapolation based on the TZP and QZP contractions did not always improve the results, and in some cases, was worse than the Full result. This suggests that ANO-RCC basis sets may not be ideal for the calculations of these atomic ionization energies.

The Gaussian 16 results obtained with the default frozen-core approximation also showed that core-valence correlation is important for achieving chemical accuracy. Among these methods, G3 and CBS-QB3 gave the best overall agreement with the experiment, but none of them reaches chemical accuracy on average over the six atoms. The B2PLYP functional shows a particularly large overestimation for fluorine, which influenced its overall MAE. The comparison between Gaussian CCSD(T)/aug-cc-pVQZ and the corresponding all-electron ORCA result showed that the differences can be significant due to the frozen-core approximation. This confirms that the treatment of core-valence correlation must be considered carefully when ionization energies are compared at this level of accuracy.

Overall, the results show that the most robust approach for the atomic ionization energies studied here is CCSD(T) with the aug-cc-pVXZ-DK basis sets, all-electron correlation, DKH2 scalar relativistic corrections, and the CBS(Q,5) extrapolation scheme. CCSDT confirms the accuracy of the perturbative triples treatment in CCSD(T), while IP-EOM-CCSD provides a useful direct comparison. Thus, the consistency of the results obtained in this study supports the use of this protocol as a reference for the ionization energies of the H, C, N, O, F, P, and S atoms found in proteins and amino acids, as well as in the related benchmark studies.

Author Contributions

Conceptualization, V.C.; methodology, S.S., V.C., S.C.C.; investigation, S.S., S.C.C.; data curation, S.S.; writing—original draft preparation, S.S., S.C.C.; writing—review and editing, V.C., S.S., S.C.C.; supervision, V.C.; All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

EPT	Electron propagator theory
OVGF	Outer Valence Green’s Function
P3+	Renormalized partial third-order quasiparticle
DFT	Density functional theory
MAE	Mean absolute error
MSE	Mean signed error
HOMO	Highest occupied molecular orbital
LUMO	Lowest unoccupied molecular orbital
IE	Ionization energy
CCSD(T)	Coupled-Cluster with Single and Double excitations and Perturbative Triples
CCSDT	Coupled-Cluster with single, double, and triple excitations
IP-EOM-CC	Ionization Potential Equation-of-Motion Coupled-Cluster Singles and Doubles

References

Turner, D.W.; Baker, C.; Baker, A.D.; Brundle, C.R. Molecular photoelectron spectroscopy. A Handbook of He 584 Å spectra; Wiley Interscience: London, 1970. [Google Scholar]
Yu, H.; Yang, H.; Shi, E.; Tang, W. Development and clinical application of phosphorus-containing drugs. Med. Drug. Discov. 2020, 8, 100063. [Google Scholar] [CrossRef] [PubMed]
Wang, J.; Sanchez-Roselló, M.; Aceña, J.L.; del Pozo, C.; Sorochinsky, A.E.; Fustero, S.; Soloshonok, V.A.; Liu, H. Fluorine in pharmaceutical industry: Fluorine-containing drugs introduced to the market in the last decade (2001−2011). Chem. Rev. 2014, 114(4), 2432–2506. [Google Scholar] [CrossRef] [PubMed]
Odar, C.; Winkler, M.; Wiltschi, B. Fluoro amino acids: a rarity in nature, yet a prospect for protein engineering. Biotechnol. J. 2015, 10(3), 427–446. [Google Scholar] [CrossRef] [PubMed]
Arribat, M.; Cavelier, F.; Rémond, E. Phosphorus-containing amino acids with a P–C bond in the side chain or a P–O, P–S or P–N bond: from synthesis to applications. RSC Adv. 2020, 10(11), 6678–6724. [Google Scholar] [CrossRef]
Markovic, M.; Ben-Shabat, S.; Dahan, A. Prodrugs for improved drug delivery: lessons learned from recently developed and marketed products. Pharm. 2020, 12(11), 1031. [Google Scholar] [CrossRef]
Bogojeski, M.; Vogt-Maranto, L.; Tuckerman, M.E.; Müller, K.-R.; Burke, K. Quantum chemical accuracy from density functional approximations via machine learning. Nat. Commun. 2020, 11, 5223. [Google Scholar] [CrossRef]
McKechnie, S.; Booth, G.H.; Cohen, A.J.; Cole, J.M. On the accuracy of density functional theory and wave function methods for calculating vertical ionization energies. J. Chem. Phys. 2015, 142(19), 194114. [Google Scholar] [CrossRef] [PubMed]
Jursic, B. S. Computation of some ionization potentials for second-row elements by ab initio and density functional theory methods. Int. J. Quantum Chem. 1997, 64(2), 255–261. [Google Scholar] [CrossRef]
Ortiz, J.V. Electron propagator theory: an approach to prediction and interpretation in quantum chemistry. Wiley Interdiscip. Rev. Comput. Mol. Sci. 2013, 3(2), 123–142. [Google Scholar] [CrossRef]
Hedin, L. New method for calculating the one-particle Green’s function with application to the electron-gas problem. Phys. Rev. 1965, 139, A796–A823. [Google Scholar] [CrossRef]
Cederbaum, L.S.; Von Niessen, W. Direct calculation of ionization potentials of atoms and molecules: application to Ne. Chem. Phys. Lett. 1974, 24(2), 263–266. [Google Scholar] [CrossRef]
Cederbaum, L.S. One-body Green’s function for atoms and molecules: theory and application. J. Phys. B Atom. Molec. Phys. 1975, 8(2), 290–303. [Google Scholar] [CrossRef]
Von Niessen, W.; Schirmer, J.; Cederbaum, L.S. Computational methods for the one-particle Green’s function. Comput. Phys. Rep. 1984, 1(2), 57–125. [Google Scholar] [CrossRef]
Danovich, D. Green’s function methods for calculating ionization potentials, electron affinities, and excitation energies. Wiley Interdiscip. Rev. Comput. Mol. Sci. 2011, 1(3), 377–387. [Google Scholar] [CrossRef]
Dolgounitcheva, O.; Díaz-Tinoco, M.; Zakrzewski, V.G.; Richard, R.M.; Marom, N.; Sherrill, C.D.; Ortiz, J.V. Accurate ionization potentials and electron affinities of acceptor molecules IV: Electron-Propagator methods. J. Chem. Theory Comput. 2016, 12(2), 627−637. [Google Scholar] [CrossRef] [PubMed]
Ortiz, J.V. An efficient, renormalized self-energy for calculating the electron binding energies of closed-shell molecules and anions. Int. J. Quantum Chem. 2005, 105(6), 803–808. [Google Scholar] [CrossRef]
Melin, J.; Singh, R.K.; Mishra, M.K.; Ortiz, J.V. Tautomeric forms of azolide anions: vertical electron detachment energies and Dyson orbitals. J. Phys. Chem. A 2007, 111(50), 13069–13074. [Google Scholar] [CrossRef]
Ortiz, J.V. Dyson-orbital concepts for description of electrons in molecules. J. Chem. Phys. 2020, 153, 070902. [Google Scholar] [CrossRef]
Zakrzewski, V.G.; Ortiz, J.V.; Nichols, J.A.; Heryadi, D.; Yeager, D.L.; Golab, J.T. Comparison of perturbative and multiconfigurational electron propagator methods. Int. J. Quantum Chem. 1996, 60(1), 29–36. [Google Scholar] [CrossRef]
Corzo, H.H.; Galano, A.; Dolgounitcheva, O.; Zakrzewski, V.G.; Ortiz, J.V. NR2 and P3+: Accurate, efficient Electron-Propagator methods for calculating valence, vertical ionization energies of closed-shell molecules. J. Phys. Chem. A 2015, 119(33), 8813−8821. [Google Scholar] [CrossRef]
Dolgounitcheva, O.; Zakrzewski, V.G.; Ortiz, J.V. Electron propagator calculations on uracil and adenine ionization energies. Int. J. Quantum Chem. 2000, 80(4−5), 831−835. [Google Scholar] [CrossRef]
Ortiz, J.V. Partial third-order quasiparticle theory: Comparisons for closed-shell ionization energies and an application to the Borazine photoelectron spectrum. J. Chem. Phys. 1996, 104, 7599–7605. [Google Scholar] [CrossRef]
Grimme, S. Semiempirical hybrid density functional with perturbative second-order correlation. J. Chem. Phys. 2006, 124, 034108. [Google Scholar] [CrossRef]
Møller, C.; Plesset, M.S. Note on an approximation treatment for many-electron systems. Phys. Rev. 1934, 46, 618–622. [Google Scholar] [CrossRef]
Purvis, G.D., III; Bartlett, R.J. A full coupled-cluster singles and doubles model: The inclusion of disconnected triples. J. Chem. Phys. 1982, 76, 1910–1918. [Google Scholar] [CrossRef]
Raghavachari, K.; Trucks, G.W.; Pople, J.A.; Head-Gordon, M. A fifth-order perturbation comparison of electron correlation theories. Chem. Phys. Lett. 1989, 157(6), 479–483. [Google Scholar] [CrossRef]
Curtiss, L.A.; Raghavachari, K.; Redfern, P.C.; Rassolov, V.; Pople, J.A. Gaussian-3 (G3) theory for molecules containing first and second-row atoms. J. Chem. Phys. 1998, 109, 7764–7776. [Google Scholar] [CrossRef]
Montgomery, J.A., Jr.; Frisch, M.J.; Ochterski, J.W.; Petersson, G.A. A complete basis set model chemistry. VI. Use of density functional geometries and frequencies. J. Chem. Phys. 1999, 110, 2822–2827. [Google Scholar] [CrossRef]
Gaussian 16, Revision C.01; Frisch, M.J.; Trucks, G.W.; Schlegel, H.B.; Scuseria, G.E.; Robb, M.A.; Cheeseman, J.R.; Scalmani, G.; Barone, V.; Petersson, G.A.; Nakatsuji, H.; Li, X.; Caricato, M.; Marenich, A.V.; Bloino, J.; Janesko, B.G.; Gomperts, R.; Mennucci, B.; Hratchian, H.P.; Ortiz, J.V.; Izmaylov, A.F.; Sonnenberg, J.L.; Williams-Young, D.; Ding, F.; Lipparini, F.; Egidi, F.; Goings, J.; Peng, B.; Petrone, A.; Henderson, T.; Ranasinghe, D.; Zakrzewski, V.G.; Gao, J.; Rega, N.; Zheng, G.; Liang, W.; Hada, M.; Ehara, M.; Toyota, K.; Fukuda, R.; Hasegawa, J.; Ishida, M.; Nakajima, T.; Honda, Y.; Kitao, O.; Nakai, H.; Vreven, T.; Throssell, K.; Montgomery, J.A., Jr.; Peralta, J.E.; Ogliaro, F.; Bearpark, M.J.; Heyd, J.J.; Brothers, E.N.; Kudin, K.N.; Staroverov, V.N.; Keith, T.A.; Kobayashi, R.; Normand, J.; Raghavachari, K.; Rendell, A.P.; Burant, J.C.; Iyengar, S.S.; Tomasi, J.; Cossi, M.; Millam, J.M.; Klene, M.; Adamo, C.; Cammi, R.; Ochterski, J.W.; Martin, R.L.; Morokuma, K.; Farkas, O.
Bufnea, D.; Niculescu, V.; Silaghi, G.; Sterca, A. Babeş-Bolyai University’s high performance computing center. Stud. Univ. Babeş-Bolyai Inform. 2016, LXI(2), 54–69. [Google Scholar]
Noga, J.; Bartlett, R.J. The full CCSDT model for molecular electronic structure. J. Chem. Phys. 1987, 86(12), 7041–7050. [Google Scholar] [CrossRef]
Stanton, J.F.; Bartlett, R.J. The equation of motion coupled-cluster method. A systematic biorthogonal approach to molecular excitation energies, transition probabilities, and excited state properties. J. Chem. Phys. 1993, 98(9), 7029–7039. [Google Scholar] [CrossRef]
Hirata, S.; Nooijen, M.; Bartlett, R.J. High-order determinantal equation-of-motion coupled-cluster calculations for ionized and electron-attached states. Chem. Phys. Lett. 2000, 328(4–6), 459–468. [Google Scholar] [CrossRef]
Neese, F. Software update: the ORCA program system—Version 6.0. WIREs Comput. Mol. Sci. 2025, 15(1), e70019. [Google Scholar] [CrossRef]
Dunning, T.H., Jr. Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogen. J. Chem. Phys. 1989, 90(2), 1007–1023. [Google Scholar] [CrossRef]
Kendall, R.A.; Dunning, T.H., Jr.; Harrison, R.J. Electron affinities of the first-row atoms revisited. Systematic basis sets and wave functions. J. Chem. Phys. 1992, 96(9), 6796–6806. [Google Scholar] [CrossRef]
de Jong, W.A.; Harrison, R.J.; Dixon, D.A. Parallel Douglas–Kroll energy and gradients in NWChem: Estimating scalar relativistic effects using Douglas–Kroll contracted basis sets. J. Chem. Phys. 2001, 114(1), 48–53. [Google Scholar] [CrossRef]
Roos, B.O.; Lindh, R.; Malmqvist, P.-Å.; Veryazov, V.; Widmark, P.-O. Main group atoms and dimers studied with a new relativistic ANO basis set. J. Phys. Chem. A 2004, 108(15), 2851–2858. [Google Scholar] [CrossRef]
Douglas, M.; Kroll, N.M. Quantum electrodynamical corrections to the fine structure of helium. Ann. Phys. 1974, 82(1), 89–155. [Google Scholar] [CrossRef]
Hess, B.A. Relativistic electronic-structure calculations employing a two-component no-pair formalism with external-field projection operators. Phys. Rev. A 1986, 33(6), 3742–3748. [Google Scholar] [CrossRef] [PubMed]
Helgaker, T.; Klopper, W.; Koch, H.; Noga, J. Basis-set convergence of correlated calculations on water. J. Chem. Phys. 1997, 106(23), 9639–9646. [Google Scholar] [CrossRef]
Halkier, A.; Helgaker, T.; Jørgensen, P.; Klopper, W.; Koch, H.; Olsen, J.; Wilson, A.K. Basis-set convergence in correlated calculations on Ne, N2, and H2O. Chem. Phys. Lett. 1998, 286(3–4), 243–252. [Google Scholar] [CrossRef]
Richard, R.M.; Marshall, M.S.; Dolgounitcheva, O.; Ortiz, J.V.; Brédas, J.-L.; Marom, N.; Sherrill, C.D. Accurate ionization potentials and electron affinities of acceptor molecules I. Reference data at the CCSD(T) complete basis set limit. J. Chem. Theory Comput. 2016, 12(2), 595–604. [Google Scholar] [CrossRef]
Gałyńska, M.; Boguslawski, K. Benchmarking ionization potentials from pCCD tailored coupled cluster models. J. Chem. Theory Comput. 2024, 20(10), 4182–4195. [Google Scholar] [CrossRef]
Kramida, A.; Ralchenko, Yu.; Reader, J. NIST ASD Team NIST Atomic Spectra Database (ver. 5.12). National Institute of Standards and Technology: Gaithersburg, MD, 2024. Available online: https://physics.nist.gov/asd. [CrossRef]

Figure 1. Schematic representation of the atomic orbitals of C, N, O, F, P, and S for ionization energy calculations using the OVGF method.

Figure 2. Valence orbitals of the S atom (α type in the left column and β type in the right column) calculated using the OVGF method with the aug-cc-pVQZ basis set.

Figure 3. Deviations (in eV) from the experimental data of the calculated ionization energy of the C, N, O, F, P, and S atoms using the OVGF (solid bars) and P3+ (hatched bars) methods. The horizontal lines define the interval of chemical accuracy.

Figure 4. Deviations ΔIE (eV) of the calculated ionization energies of the hydrogen atom from the experimental value. The horizontal lines define the interval of chemical accuracy.

Figure 5. Deviations ΔIE (eV) of the calculated ionization energies of the carbon atom from the experimental value for CCSD(T), CCSDT, IP-EOM-CCSD, and the methods employing frozen-core approximation. The horizontal lines define the interval of chemical accuracy.

Figure 6. Deviations ΔIE (eV) of the calculated ionization energies of the nitrogen atom from the experimental value for CCSD(T), CCSDT, IP-EOM-CCSD, and the methods employing frozen-core approximation. The horizontal lines define the interval of chemical accuracy.

Figure 7. Deviations ΔIE (eV) of the calculated ionization energies of the oxygen atom from the experimental value for CCSD(T), CCSDT, IP-EOM-CCSD, and the methods employing frozen-core approximation. The horizontal lines define the interval of chemical accuracy.

Figure 8. Deviations ΔIE (eV) of the calculated ionization energies of the fluorine atom from the experimental value for CCSD(T), CCSDT, IP-EOM-CCSD, and the methods employing frozen-core approximation. The horizontal lines define the interval of chemical accuracy.

Figure 9. Deviations ΔIE (eV) of the calculated ionization energies of the phosphorus atom from the experimental value for CCSD(T), CCSDT, IP-EOM-CCSD, and the methods employing frozen-core approximation. The horizontal lines define the interval of chemical accuracy.

Figure 10. Deviations ΔIE (eV) of the calculated ionization energies of the sulfur atom from the experimental value for CCSD(T), CCSDT, IP-EOM-CCSD, and the methods employing frozen-core approximation. The horizontal lines define the interval of chemical accuracy.

Figure 11. Mean absolute error (MAE, in eV) of the calculated ionization energies averaged over the C, N, O, F, P, and S atoms, obtained with CCSD(T), CCSDT, and IP-EOM-CCSD, and the methods employing frozen-core approximation. The horizontal line defines the interval of chemical accuracy.

Table 1. Calculated values with the OVGF and P3+ methods for the ionization energies of the C, N, O, F, P, and S atoms, with different basis sets.

Atom	IE_exp (eV) [46]	Basis set	Multiplicity	IE_OVGF (eV)	ΔIE_OVGF (eV)	IE_P3+ (eV)	ΔIE_P3+ (eV)	HOMO / LUMO	Orbital window
C	11.2603	cc-pVQZ	3	11.3757	0.1154	11.166	-0.0943	2a / 4a	1-6
		aug-cc-pVQZ	3	11.3874	0.1271	11.176	-0.0843
		6-311+G(2df,p)	3	11.3571	0.0968	11.111	-0.1493
N	14.5341	cc-pVQZ	4	14.6263	0.0922	14.385	-0.1491	2a / 2b	1-6
		aug-cc-pVQZ	4	14.6543	0.1202	14.403	-0.1311
		6-311+G(2df,p)	4	14.6278	0.0937	14.331	-0.2031
O	13.6181	cc-pVQZ	3	13.5643	-0.0538	13.229	-0.3891	2b / 3b	1-5
		aug-cc-pVQZ	3	13.6264	0.0083	13.275	-0.3431
		6-311+G(2df,p)	3	13.4939	-0.1242	13.099	-0.5191
F	17.4228	cc-pVQZ	2	17.3020	-0.1208	17.057	-0.3658	2b / 4b	1-5
		aug-cc-pVQZ	2	17.3882	-0.0346	17.122	-0.3008
		6-311+G(2df,p)	2	17.2914	-0.1314	16.953	-0.4698
P	10.4867	cc-pVQZ	4	10.4968	0.0101	10.423	-0.0637	2a / 2b	1-5
		aug-cc-pVQZ	4	10.5139	0.0272	10.429	-0.0577
		6-311+G(2df,p)	4	10.4506	-0.0361	10.350	-0.1367
S	10.3600	cc-pVQZ	3	10.2209	-0.1391	10.087	-0.2730	2b / 3b	1-7
		aug-cc-pVQZ	3	10.2476	-0.1124	10.106	-0.2540
		6-311+G(2df,p)	3	10.0617	-0.2983	9.910	-0.4500

Table 2. The MAE and MSE errors for C, N, O, F, P, and S atoms, computed for different basis sets for the OVGF and P3+ methods.

Basis set	MAE OVGF (eV)	MSE OVGF (eV)	MAE P3+ (eV)	MSE P3+ (eV)
cc-pVQZ	0.0886	-0.0160	0.2225	-0.2225
aug-cc-pVQZ	0.0716	0.0226	0.1952	-0.1952
6-311+G(2df,p)	0.1301	-0.0666	0.3213	-0.3213

Table 3. Ionization energies of the H atom computed using HF with multiple basis sets and with the composite methods G3 and CBS-QB3.

Atom	Method	Basis set	Multiplicity (neutral/cation)	IE_calc (eV)	IE_exp (eV) [46]	ΔIE (eV)
H	HF	aug-cc-pVQZ-DK	2/1	13.6045	13.5984	0.0060
		aug-cc-pV5Z-DK	2/1	13.6057	13.5984	0.0073
		aug-cc-pVQZ	2/1	13.6043	13.5984	0.0058
		aug-cc-pV5Z	2/1	13.6056	13.5984	0.0071
		ANO-RCC-TZP	2/1	13.6051	13.5984	0.0067
		ANO-RCC-QZP	2/1	13.6054	13.5984	0.0070
		ANO-RCC-Full	2/1	13.6054	13.5984	0.0070
	G3		2/1	13.6330	13.5984	0.0345
	CBS-QB3		2/1	13.6007	13.5984	0.0023

Table 4. Ionization energies of the C atom computed using CCSD(T), CCSDT, and IP-EOM-CCSD with multiple basis sets and their CBS extrapolation, together with B2PLYP, MP2, CCSD(T), and QCISD(T) obtained with aug-cc-pVQZ, and with the composite G3 and CBS-QB3 methods.

Atom	Method	Basis set	Multiplicity (neutral/cation)	IE_calc (eV)	IE_exp (eV) [46]	ΔIE (eV)
C	CCSD(T)	aug-cc-pVQZ-DK	3/2	11.2312	11.2603	-0.0291
		aug-cc-pV5Z-DK		11.2476		-0.0127
		SCF(5Z) + corr_CBS(Q,5)		11.2635		0.0032
		aug-cc-pVQZ		11.2349		-0.0254
		aug-cc-pV5Z		11.2513		-0.0090
		SCF(5Z) + corr_CBS(Q,5)		11.2672		0.0069
		ANO-RCC-TZP		11.1962		-0.0641
		ANO-RCC-QZP		11.2163		-0.0439
		ANO-RCC-Full		11.2274		-0.0329
		SCF(QZ) + corr_CBS(T,Q)		11.2337		-0.0266
	CCSDT	aug-cc-pVQZ-DK		11.2356		-0.0246
		aug-cc-pV5Z-DK		11.2516		-0.0086
		SCF(5Z) + corr_CBS(Q,5)		11.2671		0.0068
		aug-cc-pVQZ		11.2394		-0.0209
		aug-cc-pV5Z		11.2553		-0.0050
		SCF(5Z) + corr_CBS(Q,5)		11.2708		0.0105
		ANO-RCC-TZP		11.2014		-0.0589
		ANO-RCC-QZP		11.2213		-0.0390
		ANO-RCC-Full		11.2319		-0.0284
		SCF(QZ) + corr_CBS(T,Q)		11.2384		-0.0219
	IP-EOM-CCSD	aug-cc-pVQZ-DK		11.2928		0.0325
		aug-cc-pV5Z-DK		11.3149		0.0546
		CBS(Q,5)		11.3381		0.0778
		aug-cc-pVQZ		11.2966		0.0363
		aug-cc-pV5Z		11.3187		0.0584
		CBS(Q,5)		11.3419		0.0816
		ANO-RCC-TZP		11.2442		-0.0161
		ANO-RCC-QZP		11.2782		0.0179
		ANO-RCC-Full		11.2972		0.0370
		CBS(T,Q)		11.3029		0.0426
	B2PLYP	aug-cc-pVQZ		11.3495		0.0893
	MP2	aug-cc-pVQZ		11.2801		0.0198
	CCSD(T)	aug-cc-pVQZ		11.1420		-0.1183
	QCISD(T)	aug-cc-pVQZ		11.2072		-0.0531
	G3			11.2114		-0.0489
	CBS-QB3			11.1925		-0.0678

Table 5. Ionization energies of the N atom computed using CCSD(T), CCSDT, and IP-EOM-CCSD with multiple basis sets and their CBS extrapolation, together with B2PLYP, MP2, CCSD(T), and QCISD(T) obtained with aug-cc-pVQZ, and with the composite G3 and CBS-QB3 methods.

Atom	Method	Basis set	Multiplicity (neutral/cation)	IE_calc (eV)	IE_exp (eV) [46]	ΔIE (eV)
N	CCSD(T)	aug-cc-pVQZ-DK	4/3	14.5189	14.5341	-0.0152
		aug-cc-pV5Z-DK		14.5346		0.0005
		SCF(5Z) + corr_CBS(Q,5)		14.5506		0.0165
		aug-cc-pVQZ		14.5250		-0.0091
		aug-cc-pV5Z		14.5407		0.0066
		SCF(5Z) + corr_CBS(Q,5)		14.5567		0.0226
		ANO-RCC-TZP		14.4838		-0.0504
		ANO-RCC-QZP		14.4976		-0.0365
		ANO-RCC-Full		14.5173		-0.0168
		SCF(QZ) + corr_CBS(T,Q)		14.5114		-0.0227
	CCSDT	aug-cc-pVQZ-DK		14.5192		-0.0149
		aug-cc-pV5Z-DK		14.5346		0.0004
		SCF(5Z) + corr_CBS(Q,5)		14.5502		0.0161
		aug-cc-pVQZ		14.5253		-0.0088
		aug-cc-pV5Z		14.5407		0.0066
		SCF(5Z) + corr_CBS(Q,5)		14.5563		0.0222
		ANO-RCC-TZP		14.4845		-0.0496
		ANO-RCC-QZP		14.4982		-0.0359
		ANO-RCC-Full		14.5176		-0.0166
		SCF(QZ) + corr_CBS(T,Q)		14.5119		-0.0223
	IP-EOM-CCSD	aug-cc-pVQZ-DK		14.5441		0.0100
		aug-cc-pV5Z-DK		14.5679		0.0338
		CBS(Q,5)		14.5930		0.0588
		aug-cc-pVQZ		14.5503		0.0162
		aug-cc-pV5Z		14.5741		0.0400
		CBS(Q,5)		14.5990		0.0649
		ANO-RCC-TZP		14.4863		-0.0478
		ANO-RCC-QZP		14.5233		-0.0109
		ANO-RCC-Full		14.5487		0.0146
		CBS(T,Q)		14.5502		0.0161
	B2PLYP	aug-cc-pVQZ		14.5171		-0.0171
	MP2	aug-cc-pVQZ		14.6026		0.0684
	CCSD(T)	aug-cc-pVQZ		14.5001		-0.0340
	QCISD(T)	aug-cc-pVQZ		14.5001		-0.0340
	G3			14.5068		-0.0273
	CBS-QB3			14.4927		-0.0415

Table 6. Ionization energies of the O atom computed using CCSD(T), CCSDT, and IP-EOM-CCSD with multiple basis sets and their CBS extrapolation, together with B2PLYP, MP2, CCSD(T), and QCISD(T) obtained with aug-cc-pVQZ, and with the composite G3 and CBS-QB3 methods.

Atom	Method	Basis set	Multiplicity (neutral/cation)	IE_calc (eV)	IE_exp (eV) [46]	ΔIE (eV)
O	CCSD(T)	aug-cc-pVQZ-DK	3/4	13.5204	13.6181	-0.0977
		aug-cc-pV5Z-DK		13.5620		-0.0560
		SCF(5Z) + corr_CBS(Q,5)		13.6066		-0.0115
		aug-cc-pVQZ		13.5294		-0.0887
		aug-cc-pV5Z		13.5710		-0.0471
		SCF(5Z) + corr_CBS(Q,5)		13.6155		-0.0026
		ANO-RCC-TZP		13.3830		-0.2350
		ANO-RCC-QZP		13.4440		-0.1741
		ANO-RCC-Full		13.5149		-0.1031
		SCF(QZ) + corr_CBS(T,Q)		13.4996		-0.1185
	CCSDT	aug-cc-pVQZ-DK		13.5238		-0.0942
		aug-cc-pV5Z-DK		13.5651		-0.0529
		SCF(5Z) + corr_CBS(Q,5)		13.6094		-0.0087
		aug-cc-pVQZ		13.5328		-0.0853
		aug-cc-pV5Z		13.5741		-0.0440
		SCF(5Z) + corr_CBS(Q,5)		13.6183		0.0002
		ANO-RCC-TZP		13.3866		-0.2314
		ANO-RCC-QZP		13.4477		-0.1703
		ANO-RCC-Full		13.5183		-0.0998
		SCF(QZ) + corr_CBS(T,Q)		13.5034		-0.1147
	IP-EOM-CCSD	aug-cc-pVQZ-DK		13.5450		-0.0730
		aug-cc-pV5Z-DK		13.5954		-0.0226
		CBS(Q,5)		13.6483		0.0302
		aug-cc-pVQZ		13.5540		-0.0641
		aug-cc-pV5Z		13.6043		-0.0138
		CBS(Q,5)		13.6571		0.0390
		ANO-RCC-TZP		13.3743		-0.2438
		ANO-RCC-QZP		13.4606		-0.1574
		ANO-RCC-Full		13.5454		-0.0726
		CBS(T,Q)		13.5236		-0.0944
	B2PLYP	aug-cc-pVQZ		13.8146		0.1965
	MP2	aug-cc-pVQZ		13.4611		-0.1570
	CCSD(T)	aug-cc-pVQZ		13.5137		-0.1044
	QCISD(T)	aug-cc-pVQZ		13.5149		-0.1032
	G3			13.5478		-0.0702
	CBS-QB3			13.5869		-0.0311

Table 7. Ionization energies of the F atom computed using CCSD(T), CCSDT, and IP-EOM-CCSD with multiple basis sets and their CBS extrapolation, together with B2PLYP, MP2, CCSD(T), and QCISD(T) obtained with aug-cc-pVQZ, and with the composite G3 and CBS-QB3 methods.

Atom	Method	Basis set	Multiplicity (neutral/cation)	IE_calc (eV)	IE_exp (eV) [46]	ΔIE (eV)
F	CCSD(T)	aug-cc-pVQZ-DK	2/3	17.3488	17.4228	-0.0740
		aug-cc-pV5Z-DK		17.3863		-0.0365
		SCF(5Z) + corr_CBS(Q,5)		17.4283		0.0055
		aug-cc-pVQZ		17.3616		-0.0612
		aug-cc-pV5Z		17.3990		-0.0238
		SCF(5Z) + corr_CBS(Q,5)		17.4410		0.0182
		ANO-RCC-TZP		17.1979		-0.2249
		ANO-RCC-QZP		17.2638		-0.1591
		ANO-RCC-Full		17.3443		-0.0785
		SCF(QZ) + corr_CBS(T,Q)		17.3251		-0.0977
	CCSDT	aug-cc-pVQZ-DK		17.3488		-0.0740
		aug-cc-pV5Z-DK		17.3859		-0.0369
		SCF(5Z) + corr_CBS(Q,5)		17.4275		0.0047
		aug-cc-pVQZ		17.3616		-0.0612
		aug-cc-pV5Z		17.3987		-0.0241
		SCF(5Z) + corr_CBS(Q,5)		17.4403		0.0175
		ANO-RCC-TZP		17.1990		-0.2238
		ANO-RCC-QZP		17.2642		-0.1586
		ANO-RCC-Full		17.3442		-0.0787
		SCF(QZ) + corr_CBS(T,Q)		17.3252		-0.0977
	IP-EOM-CCSD	aug-cc-pVQZ-DK		17.3211		-0.1017
		aug-cc-pV5Z-DK		17.3731		-0.0498
		CBS(Q,5)		17.4276		0.0047
		aug-cc-pVQZ		17.3339		-0.0889
		aug-cc-pV5Z		17.3858		-0.0370
		CBS(Q,5)		17.4402		0.0174
		ANO-RCC-TZP		17.1354		-0.2874
		ANO-RCC-QZP		17.2259		-0.1969
		ANO-RCC-Full		17.3241		-0.0987
		CBS(T,Q)		17.2920		-0.1309
	B2PLYP	aug-cc-pVQZ		18.8251		1.4022
	MP2	aug-cc-pVQZ		17.3958		-0.0270
	CCSD(T)	aug-cc-pVQZ		17.3464		-0.0764
	QCISD(T)	aug-cc-pVQZ		17.3486		-0.0743
	G3			17.3884		-0.0344
	CBS-QB3			17.4741		0.0513

Table 8. Ionization energies of the P atom computed using CCSD(T), CCSDT, and IP-EOM-CCSD with multiple basis sets and their CBS extrapolation, together with B2PLYP, MP2, CCSD(T), and QCISD(T) obtained with aug-cc-pVQZ, and with the composite G3 and CBS-QB3 methods.

Atom	Method	Basis set	Multiplicity (neutral/cation)	IE_calc (eV)	IE_exp (eV) [46]	ΔIE (eV)
P	CCSD(T)	aug-cc-pVQZ-DK	4/3	10.4796	10.4867	-0.0071
		aug-cc-pV5Z-DK		10.5003		0.0136
		SCF(5Z) + corr_CBS(Q,5)		10.5242		0.0375
		aug-cc-pVQZ		10.4916		0.0049
		aug-cc-pV5Z		10.5124		0.0257
		SCF(5Z) + corr_CBS(Q,5)		10.5365		0.0498
		ANO-RCC-TZP		10.4497		-0.0370
		ANO-RCC-QZP		10.4594		-0.0273
		ANO-RCC-Full		10.4890		0.0023
		SCF(QZ) + corr_CBS(T,Q)		10.4672		-0.0195
	CCSDT	aug-cc-pVQZ-DK		10.4820		-0.0047
		aug-cc-pV5Z-DK		10.5016		0.0150
		SCF(5Z) + corr_CBS(Q,5)		10.5245		0.0378
		aug-cc-pVQZ		10.4939		0.0072
		aug-cc-pV5Z		10.5137		0.0270
		SCF(5Z) + corr_CBS(Q,5)		10.5368		0.0501
		ANO-RCC-TZP		10.4535		-0.0332
		ANO-RCC-QZP		10.4624		-0.0243
		ANO-RCC-Full		10.4909		0.0043
		SCF(QZ) + corr_CBS(T,Q)		10.4695		-0.0172
	IP-EOM-CCSD	aug-cc-pVQZ-DK		10.5059		0.0192
		aug-cc-pV5Z-DK		10.5429		0.0562
		CBS(Q,5)		10.5817		0.0951
		aug-cc-pVQZ		10.5177		0.0310
		aug-cc-pV5Z		10.5549		0.0682
		CBS(Q,5)		10.5939		0.1072
		ANO-RCC-TZP		10.4708		-0.0159
		ANO-RCC-QZP		10.4865		-0.0002
		ANO-RCC-Full		10.5337		0.0471
		CBS(T,Q)		10.4980		0.0113
	B2PLYP	aug-cc-pVQZ		10.3422		-0.1445
	MP2	aug-cc-pVQZ		10.4896		0.0029
	CCSD(T)	aug-cc-pVQZ		10.5006		0.0139
	QCISD(T)	aug-cc-pVQZ		10.4992		0.0125
	G3			10.4637		-0.0230
	CBS-QB3			10.4472		-0.0394

Table 9. Ionization energies of the S atom computed using CCSD(T), CCSDT, and IP-EOM-CCSD with multiple basis sets and their CBS extrapolation, together with B2PLYP, MP2, CCSD(T), and QCISD(T) obtained with aug-cc-pVQZ, and with the composite G3 and CBS-QB3 methods.

Atom	Method	Basis set	Multiplicity (neutral/cation)	IE_calc (eV)	IE_exp (eV) [46]	ΔIE (eV)
S	CCSD(T)	aug-cc-pVQZ-DK	3/4	10.2414	10.3600	-0.1186
		aug-cc-pV5Z-DK		10.2842		-0.0758
		SCF(5Z) + corr_CBS(Q,5)		10.3297		-0.0304
		aug-cc-pVQZ		10.2554		-0.1046
		aug-cc-pV5Z		10.2984		-0.0616
		SCF(5Z) + corr_CBS(Q,5)		10.3441		-0.0159
		ANO-RCC-TZP		10.1113		-0.2487
		ANO-RCC-QZP		10.1488		-0.2112
		ANO-RCC-Full		10.2542		-0.1058
		SCF(QZ) + corr_CBS(T,Q)		10.1746		-0.1854
	CCSDT	aug-cc-pVQZ-DK		10.2452		-0.1148
		aug-cc-pV5Z-DK		10.2879		-0.0721
		SCF(5Z) + corr_CBS(Q,5)		10.3332		-0.0268
		aug-cc-pVQZ		10.2592		-0.1008
		aug-cc-pV5Z		10.3021		-0.0579
		SCF(5Z) + corr_CBS(Q,5)		10.3476		-0.0124
		ANO-RCC-TZP		10.1140		-0.2460
		ANO-RCC-QZP		10.1517		-0.2083
		ANO-RCC-Full		10.2581		-0.1019
		SCF(QZ) + corr_CBS(T,Q)		10.1778		-0.1823
	IP-EOM-CCSD	aug-cc-pVQZ-DK		10.3185		-0.0415
		aug-cc-pV5Z-DK		10.3769		0.0169
		CBS(Q,5)		10.4382		0.0782
		aug-cc-pVQZ		10.3330		-0.0270
		aug-cc-pV5Z		10.3915		0.0315
		CBS(Q,5)		10.4530		0.0930
		ANO-RCC-TZP		10.1865		-0.1735
		ANO-RCC-QZP		10.2308		-0.1292
		ANO-RCC-Full		10.3493		-0.0107
		CBS(T,Q)		10.2631		-0.0969
	B2PLYP	aug-cc-pVQZ		10.3553		-0.0047
	MP2	aug-cc-pVQZ		10.1396		-0.2204
	CCSD(T)	aug-cc-pVQZ		10.2536		-0.1064
	QCISD(T)	aug-cc-pVQZ		10.2550		-0.1051
	G3			10.2691		-0.0909
	CBS-QB3			10.2444		-0.1156

Table 10. Energies, types, and degeneracies of the HOMO and HOMO-1 orbitals for the atoms C, N, O, F, P, and S and the HOMO – HOMO-1 energy gap calculated at QCISD(T)/aug-cc-pVQZ level of theory.

	C	N	O	F	P	S
HOMO type and degeneracy	α 2	α 3	β 1	β 2	α 3	β 1
HOMO-1 type and degeneracy	β 1	β 1	α 1	α 2	β 1	α 1
E_HOMO (a.u.)	-0.43903	-0.57090	-0.52173	-0.67998	-0.39209	-0.37946
E_HOMO-1 (a.u.)	-0.58357	-0.72606	-0.61167	-0.73174	-0.55635	-0.41562
ΔE_HOMO-HOMO-1 (eV)	3.93	4.22	2.45	1.41	4.47	0.98

Table 11. Mean absolute error (MAE) and mean signed error (MSE) in eV, for each combination of method and basis set, averaged over C, N, O, F, P, and S atoms. The chemical accuracy threshold is 0.0434 eV (1kcal/mol).

Method	Basis set	MAE (eV)	MSE (eV)
CCSD(T)	aug-cc-pVQZ-DK	0.0570	-0.0570
	aug-cc-pV5Z-DK	0.0325	-0.0278
	SCF(5Z) + corr_CBS(Q,5)	0.0174	0.0035
	aug-cc-pVQZ	0.0490	-0.0474
	aug-cc-pV5Z	0.0290	-0.0182
	SCF(5Z) + corr_CBS(Q,5)	0.0194	0.0132
	ANO-RCC-TZP	0.1433	-0.1433
	ANO-RCC-QZP	0.1087	-0.1087
	ANO-RCC-Full	0.0566	-0.0558
	SCF(QZ) + corr_CBS(T,Q)	0.0784	-0.0784
CCSDT	aug-cc-pVQZ-DK	0.0546	-0.0546
	aug-cc-pV5Z-DK	0.0310	-0.0259
	SCF(5Z) + corr_CBS(Q,5)	0.0168	0.0050
	aug-cc-pVQZ	0.0474	-0.0450
	aug-cc-pV5Z	0.0274	-0.0162
	SCF(5Z) + corr_CBS(Q,5)	0.0188	0.0147
	ANO-RCC-TZP	0.1405	-0.1405
	ANO-RCC-QZP	0.1061	-0.1061
	ANO-RCC-Full	0.0549	-0.0535
	SCF(QZ) + corr_CBS(T,Q)	0.0760	-0.0760
IP-EOM-CCSD	aug-cc-pVQZ-DK	0.0463	-0.0258
	aug-cc-pV5Z-DK	0.0390	0.0149
	CBS(Q,5)	0.0575	0.0575
	aug-cc-pVQZ	0.0440	-0.0161
	aug-cc-pV5Z	0.0415	0.0245
	CBS(Q,5)	0.0672	0.0672
	ANO-RCC-TZP	0.1307	-0.1307
	ANO-RCC-QZP	0.0854	-0.0795
	ANO-RCC-Full	0.0468	-0.0139
	CBS(T,Q)	0.0654	-0.0420
B2PLYP	aug-cc-pVQZ	0.3091	0.2536
MP2	aug-cc-pVQZ	0.0826	-0.0522
CCSD(T)	aug-cc-pVQZ	0.0756	-0.0709
QCISD(T)	aug-cc-pVQZ	0.0637	-0.0595
G3		0.0491	-0.0491
CBS-QB3		0.0578	-0.0407

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Towards Chemical Accuracy in Atomic Ionization Energies: The Case of H, C, N, O, F, P, and S Atoms

Abstract

Keywords:

Subject:

1. Introduction

2. Computational Methods

3. Results and Discussion

3.1. Ionization Energies for the C, N, O, F, P, and S Atoms Obtained with the OVGF and P3+ Methods

3.2. Ionization Energies of the H, C, N, O, F, P, and S Atoms Obtained with the Energy-Difference Approach and IP-EOM-CCSD

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

MDPI Initiatives

Important Links

Subscribe