2. Results and Discussion
For the CASF-2013 dataset, re-docking of the ligands in the protein-ligand complexes has been performed. The following data have been extracted from the 30 poses saved:
- i)
the best docking score (the lower the better) – hereinafter referred to as BestDS;
- ii)
the lowest root-mean-square deviation (RMSD) between the predicted pose and the ligand in the co-crystalized complex – hereinafter referred to as BestRMSD;
- iii)
RMSD between the pose with the best docking score and the ligand in the co-crystalized complex – hereinafter referred to as RMSD_BestDS;
- iv)
the docking score of the pose with the lowest RMSD to the ligand in the co-crystalized complex – hereinafter referred to as DS_BestRMSD.
For the sake of completeness, we have also investigated the possible relations to the experimental data available in CASF-2013. Based on the availability, Kd or Ki, expressed as (-logKd) or (-logKi), respectively, are used.
The collected information has been formatted for subsequent application of ICrA. All spreadsheets have been constructed in an identical way: the ligands from protein-ligand complexes have been considered as objects of research in terms of ICrA, while the different scoring functions (represented by BestDS, BestRMSD, RMSD_BestDS, or DS_BestRMSD), along with the binding affinity data, have been considered as criteria.
ICrA has been applied in two steps. Initially, it has been implemented on the data outputs extracted from molecular docking results as described above, with the conditionally defined in ICrAData values
α = 0.75 and
β = 0.25, based on which the ranges of
consonance and
dissonance are defined [
21].
Table 1 summarizes the results obtained for the degrees of agreement
µ between the five scoring functions of MOE and the binding affinity data of the studied complexes, after the ICrA application for all aforementioned docking outputs. Colors in
Table 1 reproduce the ones used in ICrAData, as explained above, for the conditionally defined in ICrAData values
α = 0.75 and
β = 0.25.
As seen from
Table 1, “varicolored” ICrA results have been obtained only when BestRMSD is considered. For the rest docking outputs ICrA does not outline any significant relations (other than in
dissonance) between the five investigated scoring functions in MOE, as well as between the scoring functions and the binding affinity data – all hit the
dissonance zone only (colored in magenta).
In the second step, the impact of the variations in the
α and
β values on the investigated relations has been explored. As mentioned above, the thresholds α and
β might be determined algorithmically, or chosen by the user. Since these parameters define the thresholds for
positive consonance/
dissonance/
negative consonance between the studied pairs of criteria, we investigated how their values could impact the relations between the scoring functions. We varied the difference between
α and
β (keeping their sum to 1.00) starting from 0.5 (
α = 0.75 and
β = 0.25) and decreasing it gradually to 0.2 (
α = 0.6 and
β = 0.4) to follow how these values affect the relations between the investigated scoring functions (for clarity, these analyses are reported using ICrA colors in supplementary tables S1-S5). The results are summarized in
Table 2 and
Table 3, based on the following two indicators: the number of pairs in
positive consonance for each docking output (
Table 2), and the number of docking outputs for each pair of scoring functions in
positive consonance (
Table 3).
The first indicator, number of pairs in positive consonance, could be considered as a measure for “sensitivity” of the corresponding docking output to the α and β values. The second one, number of docking outputs for each pair in positive consonance, allows to compare the scoring functions based on the number of the docking outputs – the more the docking outputs in positive consonance, the more similar is the performance of the studied pairs of scoring functions.
As expected, the decrease in the difference between α and β results in a higher number of pairs in positive consonance thus raising the question what are the most appropriate α and β values. The values of 0.67/0.33 and 0.65/0.35 show almost identical results, and the 0.67/0.33 appears a good compromise between “no” and “many” pairs in positive consonance allowing simultaneously certain tolerance in comparability between the studied criteria.
In addition, the values
α = 0.67 and
β = 0.33 allow the authors to present an alternative scale for
consonance/
dissonance. If the scale of
consonance/
dissonance outlined in Atanassov et al. [
21], might be considered as a scale of “quarters”, here а kind of scale of “thirds” is intentionally considered, corresponding to
α = 0.67 (approximately two-thirds) and
β = 0.33 (approximately one-third). For better understanding,
Figure 1 represents the respective interpretative intuitionistic triangle.
In this case, the pair of criteria are said to be in:
positive consonance, whenever ≥ 2/3 and < 1/3;
negative consonance, whenever < 1/3 and ≥ 2/3;
dissonance, whenever 0 ≤ < 2/3, 0 ≤ < 2/3 and 2/3 ≤ + ≤ 1; and
uncertainty, 0 ≤ < 2/3, 0 ≤ < 2/3 and 0 ≤ + < 2/3.
As seen from
Figure 1, the presented scale of “thirds” preserves the symmetry in the interpretative intuitionistic triangle, in accordance with the original “quarters”-based symmetry [
21]. Thus, the combination
α = 0.67 and
β = 0.33 appears an appropriate selection in this investigation.
According to the pairwise performance comparison of the MOE scoring functions (
Table 3), the most persistent results are:
- 1)
the absence of any kind of agreement in all explored values of
α and
β with the experimental data (except one, between GBVI/WSA dG - (-log
Kd) or (-log
Ki) at
α = 0.60 and
β = 0.40, rows highlighted in grey in
Table 3). Such result is in accordance with our previous studies [
19]. The lack of any agreement might be explained also with the fact that even implementing a variety of scoring terms and becoming more sophisticated, the scoring functions are still a computational approximation aiming to assist mostly in prediction of ligand binding poses as confirmed by the results with BestRMSD docking output (
Table 2).
- 2)
positive consonance between two scoring functions – Alpha HB and London dG: in particular, for 0.67/0.33 thresholds values they are comparable in all four docking outputs (a row in bold in
Table 3). The result suggests that these scoring functions might be used interchangeably. At the same time some pairs show small comparability (Affinity dG - London dG and GBVI/WSA dG - London dG) suggesting that they can complement each other in consensus docking studies.
Figure 2 and
Figure 3 demonstrate the results from ICrA implementation for the explored values of thresholds
α and
β, respectively for BestDS and RMSD_BestDS. Both figures show the ICrA screenshots at conditionally defined thresholds values
α = 0.75 and
β = 0.25 (subplot a),
α = 0.70 and
β = 0.30 (subplot b),
α = 0.67 and
β = 0.33 (subplot c),
α = 0.65 and
β = 0.35 (subplot d), and
α = 0.60 and
β = 0.40 (subplot e).
As seen from
Figure 2, applying new values of thresholds leads to appearance of pairs in
positive consonance. For
α = 0.70 and
β = 0.30 (
Figure 2b) and
α = 0.67 and
β = 0.33 (
Figure 2c),
positive consonance appears between the scoring functions Alpha HB - London dG, toward no identified significant relations with the conditionally defined thresholds values (
Figure 2a). Further decreasing the difference between
α and
β leads to one more pair in
positive consonance – Alpha HB - ASE for
α = 0.65 and
β = 0.35 (
Figure 2d), and additionally 3 more pairs in
positive consonance – namely Affinity dG - Alpha HB, Affinity dG - ASE, and ASE - London dG at
α = 0.60 and
β = 0.40 (
Figure 2e).
As seen from
Figure 3 and
Table 1 as well, RMSD_BestDS is the docking output with the highest number of newly appeared significant relations when the new values of the thresholds are applied. Altogether 5 pairs of scoring functions show
positive consonance, namely: Affinity dG - Alpha HB, Affinity dG - ASE, Affinity dG - GBVI/WSA dG, Alpha HB - GBVI/WSA dG, and Alpha HB - London dG at
α = 0.67 and
β = 0.33 (
Figure 3c) in comparison to no significant relations identified at the conditionally defined thresholds values (
Figure 3a) and
α = 0.70 and
β = 0.30 (
Figure 3b). Further decreasing of the difference between α and β leads to two more pairs in
positive consonance – between ASE - GBVI/WSA dG and ASE - London dG at
α = 0.65 and
β = 0.35 (
Figure 3d), and even to 3 additional ones – between Affinity dG - London dG, Alpha HB - ASE, and GBVI/WSA dG - London dG at
α = 0.60 and
β = 0.40 (
Figure 3e).
As mentioned above, the most “varicolored” picture from ICrA implementation is when the BestRMSD is considered (results not shown as a figure). As seen from
Table 1,
positive consonance has been observed for almost all pairs of scoring functions, while the other two pairs of scoring functions – ASE - GBVI/WSA dG and GBVI/WSA dG - London dG, are very close to the conditionally defined thresholds values
α = 0.75 and
β = 0.25. Then, still at the thresholds values
α = 0.70 and
β = 0.30, all pairs of scoring functions become in
positive consonance. Based on this analysis, one may conclude that according to the BestRMSD, all scoring functions give quite similar results.
For the DS_BestRMSD output, only one pair of scoring functions – Alpha HB - London dG falls into the interval of
positive consonance (
Table 1, results not shown as a figure) when applying values of thresholds
α = 0.70 and
β = 0.30. Further decrease in the difference between
α and
β leads to three more pairs in
positive consonance – between Affinity dG - GBVI/WSA dG, Alpha HB - ASE, and ASE - London dG, only at
α = 0.60 and
β = 0.40.
For the completeness of the comparison of the five scoring functions, a correlation analysis (CA) has also been performed.
Table 4 summarizes the results obtained by ICrA and CA for all docking outputs. CA shows higher correlation only for BestRMSD, while for the other docking outputs the observed correlations are relatively low. In case of BestRMSD, the highest correlation coefficients somehow coincide with the highest values of ICrA degrees of agreement. In particular, the relation between Alpha HB and London dG is evaluated with the highest values of degree of agreement by ICrA and with the second-best correlation coefficient by CA.
Figure 4 illustrates the correlation in terms of CA between Alpha HB and London dG for BestRMSD.
The absence of correlation, estimated by the Pearson correlation coefficients, between the docking scores and the experimental data on binding affinity of the ligands in the studied complexes is not surprising. This result could not be related only to the heterogeneity of the experimental data (different measures of affinity, different methods and experimental protocols, various protein-ligand complexes) in the used dataset, but rather to the fact that molecular docking has not initially been designed for the purpose of correlation of docking scores with experimental binding affinities [
22]. Absence of such correlations has also been confirmed in our previous studies employing much more consistent experimental data on binding affinities of a homologous series of 88 benzamidine type ligands toward thrombin, trypsin, and factor Xa [
19].
As seen from
Table 4 and
Figure 4, ICrA may offer enhanced capabilities over CA by enabling the identification of additional relationships among the evaluated scoring functions. This might be explained by the fact that ICrA, as well as CA, reports the degree of coincidence – in terms of ICrA, this is
positive consonance. Both CA and ICrA report the negative correlation (
negative consonance in terms of ICrA), in which the values for one criterion increase while at the same time the values of the other criterion decrease. Unlike CA, ICrA also allows for classifying the criteria relations in dissonance, as well as accounting of the degree of uncertainty, which makes the advantage of ICrA over CA.