3.4. Initial Construction of the Instrument
As part of the construction process of the 5P–ESG composite index, the study initially considered a total of 28 conceptually grouped variables aimed at broadly capturing the financial and sustainability dimensions associated with the five Ps of sustainable development: People, Planet, Prosperity, Peace, and Partnerships.
The preliminary selection of variables was based on three fundamental criteria:
Theoretical support in the literature on sustainable finance, ESG, and corporate performance;
International standards such as GRI, SASB, and MSCI ESG Ratings;
Availability and comparability of information for companies listed on the Colombian Stock Exchange.
This process resulted in a broader initial instrument, which was subsequently subjected to statistical cleaning and validation procedures in order to construct a final version that is more parsimonious, robust, and operationally viable (see
Table 1).
Table 1 presents the preliminary instrument comprising 28 indicators, designed to broadly cover the five dimensions of People, Planet, Prosperity, Peace, and Partnerships.
However, to ensure that the composite index was operationally viable, comparable across issuers, and statistically stable, a data reduction process based on four sequential filters (F1–F4) was applied, combining expert judgment (content validity) and statistical criteria (empirical validity). The objective was to obtain a parsimonious set of variables with high financial interpretability and internal consistency, enabling the construction of pillar-level scores and the overall I5P index.
F1. Content and applicability filter (expert judgment).
A technical review of the preliminary list was conducted to assess: (i) alignment with the 5P construct, (ii) conceptual clarity and directionality, and (iii) financial relevance for corporate analysis. Priority was given to indicators with direct interpretability for investors (e.g., verifiable policies, comparable ratios, efficiency metrics, and distribution measures). Variables whose measurement was highly heterogeneous across sectors or not consistently reportable across issuers were identified and excluded at this stage.
F2. Availability and comparability filter within the MSCI COLCAP universe.
Data availability for each indicator was audited for the period under analysis. Indicators with systematic data gaps or inconsistent definitions across firms were excluded. This filter is critical in composite index construction, as it avoids biases arising from excessive imputation and ensures that rankings reflect actual differences in corporate performance rather than information gaps.
F3. Preliminary statistical quality filter (variability and redundancy).
Indicators exhibiting (i) null or near-zero variance (i.e., lacking discriminatory power across issuers), and (ii) redundancy (very high correlations implying double counting of the same underlying phenomenon) were removed. These controls enhance index stability and reduce the risk that final scores are disproportionately driven by a single subset of variables.
F4. Statistical validation of the instrument (reliability and structure).
For the refined set of indicators, internal consistency tests (Cronbach’s alpha at the pillar and global levels) and factorability tests (KMO and Bartlett’s test of sphericity) were applied as prerequisites for exploratory factor analysis (EFA).
Subsequently, EFA confirmed that the retained indicators exhibited an interpretable factor structure and adequate factor loadings, supporting the empirical coherence of the measurement instrument.
In summary, the filters—defined ex ante and designed to be replicable—followed the logic outlined below:
F1 = Content validity (expert judgment: 5P relevance and financial interpretability).
F2 = Availability and comparability (consistent public data for COLCAP issuers; low missingness).
F3 = Preliminary statistical quality (sufficient variability and non-redundancy; avoidance of non-discriminatory items).
F4 = Empirical validation (contribution to reliability and structure: internal consistency and interpretable factor patterns).
Overall, the instrument refinement process was structured as a sequential and replicable filtering procedure (F1–F4), aimed at ensuring that the resulting index is comparable across issuers, operationally measurable, and statistically robust. This approach avoids the inclusion of indicators with limited availability or heterogeneous measurement and mitigates double counting arising from redundancy. Accordingly, priority was first given to content validity and financial interpretability (F1), followed by empirical feasibility within the MSCI COLCAP universe (F2), then discriminatory capacity and preliminary statistical quality (F3), and finally empirical coherence through reliability and structural tests (F4). The incremental logic of the procedure is summarized in
Table 2.
Based on the traceability presented in
Table 2, the filtering process made it possible to move from the preliminary set of 28 indicators to a parsimonious core of variables with high inter-firm comparability, direct financial interpretability, and conceptual coherence with the 5P framework. Accordingly,
Table 3 presents the decisive filter and the complete structure and specifications for each indicator, following a logic analogous to PRISMA reporting applied to index construction. Likewise, the core instrument (n = 16) used in the subsequent stages of the study is consolidated: direction homogenization, winsorization, normalization (Z-scores), calculation of pillar scores, and construction of the composite index (I5P–Score), as well as internal consistency and structural tests (Cronbach’s alpha, KMO/Bartlett, and EFA) (See
Table 3).
Preliminary statistical treatment of the data
The construction of the synthetic scores associated with the five dimensions of the 5P model (People, Planet, Prosperity, Peace, and Partnerships) followed a standardized statistical procedure designed to ensure comparability across heterogeneous indicators, reduce distortions caused by extreme values, and provide the statistical consistency required for a robust composite index.
The index design adopted a four-step standardization procedure, in line with OECD recommendations [
1] and the specialized literature on ESG ratings [
2,
3]:
(i) data cleaning and winsorization,
(ii) standardized normalization,
(iii) aggregation of items by dimension, and
(iv) estimation of the final score for each pillar.
This procedure ensures the statistical stability of the estimates and prevents extreme values or scale differences from affecting comparability across firms included in the MSCI COLCAP index.
First, a bilateral winsorization procedure (scientific control of outliers) was applied, setting the 5th and 95th percentiles as lower and upper bounds, respectively. This method allows to:
(i) mitigate the influence of extreme values without eliminating observations, thus preserving the full sample structure;
(ii) reduce the impact of outliers associated with atypical variations or measurement errors;
(iii) retain all cases in the analysis, unlike trimming procedures that remove observations; and
(iv) improve the stability of means and dispersions in sensitive indicators such as ratios, rates, operational days, and risk-management-related metrics.
Given that the data matrix includes quantitative and proportional performance indicators across the five dimensions (People, Planet, Prosperity, Peace, and Partnerships), winsorization was applied exclusively to continuous variables:
PEO1–PEO2 (People)
PLA1–PLA2 (Planet)
PRO1–PRO6 (Prosperity)
PEA1–PEA3 (Peace)
ALI1–ALI3 (Partnerships)
Binary/categorical columns (values 0–1) do not require winsorization.
Now, for each of the numerical indicators:
Each variable was sorted in ascending order.
The 5th (P5) and 95th (P95) percentiles were identified.
Values below P5 were replaced with the exact value of the 5th percentile.
Values above P95 were replaced with the corresponding 95th percentile.
Accordingly, for each indicator
, its winsorized version
was defined as:
Where and correspond to the 5th and 95th percentiles of each quantitative variable.The transformation preserves the relative structure of the dataset while reducing the influence of extreme outliers.
Winsorization resulted in:
A reduction in dispersion for highly volatile indicators such as PRO1, PRO4, and PLA2.
Greater stability in means, eliminating variations induced by extreme values.
Improved symmetry in a considerable portion of the Prosperity metrics, where several companies exhibited unusually high values in periods of operating days (PRO3).
Removal of adverse bias in indicators with negative proportions or values exceeding 100%, which distorted benchmarking.
Secondly, the variables were normalized using Z-scores, with the objective of unifying the measurement scales. The transformation was performed using:
Where:
is the winsorized value of indicator for company ,
is the mean of the indicator,
is the corresponding standard deviation.
After normalization, all indicators were expressed on a common scale (mean = 0; standard deviation = 1), ensuring comparability and statistical consistency.
For variables where lower values are preferable (“the lower, the better”), an additional transformation was applied:
Dichotomous variables (0/1) were kept in their original state. The process described above can be seen in
Table 5 (See
Table 5).
3.7. Estimation of Weights Using Principal Component Analysis (PCA)
With the aim of transforming the pillar scores into a sustainable scoring model, endogenous weights were estimated using Principal Component Analysis (PCA). This procedure allows identifying the relative contribution of each dimension of the 5P model to the aggregated sustainable-financial performance, avoiding the use of arbitrary weights.
Prior to the analysis, the pillar scores were re-standardized:
Subsequently, PCA was applied to the matrix of standardized scores , extracting the first principal component, which captures the largest proportion of the joint variance of the five pillars. The factor loadings associated with this component were interpreted as indicators of the relative importance of each dimension.
These loadings were transformed into normalized weights using:
Where corresponds to the factor loading of pillar .
Formulation of the I5P–Score Mathematical ModelThe 5P–ESG Composite Index is formally defined as a sustainable scoring model, analogous in structure to classical financial evaluation models. For each company
, the index is expressed as:
Where is the score of company in pillar , and is the weight estimated through PCA.
In the specific case of this study, the obtained weights were:
Thus, the final model is defined as:
The above expression constitutes the mathematical core of the research work and represents an integrated model for evaluating the financial and sustainable performance of companies, built based on the conceptual structure of the 5Ps and validated using multivariate statistical techniques.
Classification of Sustainable-Financial Performance Based on the I5P–Score
To interpret the I5P–Score in an operational manner and avoid the use of arbitrary absolute thresholds, a relative classification was adopted based on quantiles of the distribution of the indicator within the analyzed universe (MSCI COLCAP companies). This strategy is consistent with the nature of composite indices constructed from standardization and endogenous weights, whose scale depends on the set of observations and the period considered.
Two cut-off points were defined based on the empirical percentiles of the I5P–Score.
Based on these cut-offs, each company
was classified into three levels:
The thresholds and were estimated directly from the observed values of the index in the sample, ensuring that the classification is replicable (given the same data and the same analysis period) and suitable for moderate-sized samples, avoiding groups that are too small.