4. Discussion
In the present study, the reliability of the SF-36 and RAND-36 scales was compared in a sample of patients with CVD, using various statistical methods. The results show that both scales, SF-36 and RAND-36, have a similar and adequate reliability for the sample of patients with cardiac pathology. The internal consistency of the complete scales and their dimensions is good as shown by the Cronbach’s alpha values in all cases. Furthermore, there is a correlation between the items of each dimension in all cases except within the Physical Functioning dimension, in the correlation between items 3a and 3i and between items 3a and 3j where there is no significant correlation in any of the scales; however, the item-total correlation is significant in all dimensions for both scales. The correlation between the scores of all dimensions is also significant. The reliability assessed by the half-and-half method also yields results that confirm the good reliability of both scales. No differences were detected between the scales in any of these values.
The demographic profile of the sample in this study, comprising 85.2% men and a mean age of 60.27 years, is consistent with previous literature on CVD. Several studies have documented that CVD, especially coronary heart disease, tend to be more prevalent in men than in women [
19,
20,
21,
22,
23]. Women are twice as likely to have good cardiovascular health as men [
24].
However, this demographic profile also highlights a common limitation in cardiovascular research: the under-representation of women, which could influence the generalisability of the results [
22,
23,
25]. It is important that future studies seek to balance gender representation to better understand how these scales function in women with CVD, who often present with different clinical manifestations and responses to treatments. [
23,
25,
26].
Internal consistency, assessed by Cronbach’s alpha, showed values above 0.80 for the full scale including all 36 items, for the full scale including all 8 dimensions and for most dimensions in both scales. This finding is consistent with previous research reporting that both the SF-36 and the RAND-36 are robust instruments for measuring HRQoL in patients with chronic diseases, including CVD [
6,
7,
8,
11,
12,
13,
14,
27,
28,
29,
30,
31]. L The general health dimension showed a Cronbach’s alpha .756 in the SF-36 and .746 in the RAND-36 and the social functioning dimension .666 in the SF-36 and .773 in the RAND-36; which, although lower scores, are acceptable values.
RAND-36 can be considered an alternative version of the MOS SF-36, thus both instruments must meet the same psychometric standards as established by Ware and Sherbourne [
6] who considered that the most important for the MOS SF-36 are that the reliabilities must be acceptable and that the correlation between items of an item should be higher with its own subscale than with the other subscales of the instrument [
14]. In this research, regarding item and item-total correlations, the results show that both the SF-36 and the RAND-36 have a coherent internal structure, with item-total correlations exceeding the recommended threshold of 0.30 [
32]. This implies that each item contributes adequately to the overall measure of the dimension to which it belongs, which supports the internal validity of both scales. These correlations reinforce the idea that items within each dimension are measuring similar constructs, which is essential for the clinical interpretation of the scores obtained [
6,
11,
12,
28,
30,
31].
Reliability analysis using the half-and-half method also showed positive results for both scales, with coefficients indicating high measurement stability. This is particularly relevant in clinical settings, where it is crucial that assessment tools are able to provide consistent results over time and across different population subgroups [
33]. Consistency in measurement supports the use of these instruments in daily clinical practice to assess HRQoL in patients with CVD.
Despite the similarities in reliability between the two scales, it is important to highlight some theoretical and methodological differences that could influence the selection of one over the other in future research. The SF-36 was originally developed as a generic instrument, applicable to a wide range of health conditions and populations, while the RAND-36 is derived from the SF-36 but with differences in the coding of certain items and the handling of missing data [
13]. However, the results of this study suggest that, in terms of reliability, both scales are interchangeable in assessing HRQoL in patients with CVD.
This study presents several limitations that should be considered when interpreting the results. First, the sample consisted exclusively of patients with CVD, which limits the generalization of the findings to other populations with different health conditions. Previous studies have indicated that the psychometric properties of the SF-36 and RAND-36 scales may vary depending on the population studied, particularly when dealing with different chronic diseases or healthy individuals [
34,
35].
Second, although a comprehensive comparative analysis of reliability was conducted, the study did not include an assessment of the convergent or discriminant validity of the scales. Validity is a critical component to ensure that the scales effectively measure what they are intended to measure [
36], and its absence in this analysis limits the full interpretation of the utility of each tool. Sensitivity to change, an essential property for determining the ability of the scales to detect variations in health status over time, was also not explored, which could be especially relevant in monitoring patients with chronic diseases.
Finally, the cross-sectional design of the study prevents establishing causal relationships between the observed psychometric properties and the health condition of the patients. The cross-sectional nature limits the ability to assess the stability of psychometric properties over time. Longitudinal studies would be necessary to confirm the stability of these properties at different stages of CVD and in other chronic conditions [
37].
This study presents several strengths that reinforce the validity of its findings. First, one of the main strengths is the use of a rigorous methodological approach to evaluate the reliability of the SF-36 and RAND-36 scales in a specific sample of patients with CVD. The use of multiple statistical methods, such as Cronbach’s alpha, item-total correlation, and split-half reliability analysis, provides a comprehensive and robust view of the internal consistency of these scales. The diversity of analyses used allows for a more thorough assessment of reliability, which increases confidence in the results obtained [
38].
Sample size is a crucial aspect that can influence both the strengths and limitations of a study. In this case, the use of a sample of 413 patients with CVD represents an important strength of the study. An adequate sample size allows for more precise estimates of psychometric properties, such as reliability, and ensures greater statistical power to detect significant differences and relationships in the analyses [
39]. Additionally, with a sample of this size, the external validity of the results is improved, making the conclusions more generalizable to the broader population with CVD [
40].
Another strength is the focus on a specific clinical population. This approach allows for a detailed and relevant assessment of the reliability of the scales in a context of great clinical importance, making the results directly applicable to practice in the cardiovascular field. The specificity of the sample provides a solid foundation for the clinical interpretation of the psychometric properties of the scales in patients with CVD, which is crucial for the personalization of care and follow-up of these patients [
41].
Additionally, the study contributes to the field by comparing two widely used scales, the SF-36 and the RAND-36, facilitating informed decision-making about which tool is more suitable in different research and clinical practice contexts. The direct comparison between these two scales under the same conditions and within the same population is a valuable contribution that allows researchers and clinicians to select the most appropriate scale according to their specific needs.
From the findings of this study, several lines of research can be identified that could further enrich the knowledge about the assessment of HRQoL in patients with CVD. These include studying the sensitivity to change to detect variations in patients’ quality of life as their disease progresses or as they respond to therapeutic interventions; validation in other cultural and demographic contexts, such as different ethnic groups, socioeconomic levels, or patients from various geographic regions; and research on the predictive validity of long-term clinical outcomes, such as morbidity, mortality, or rehospitalization in patients with CVD. This could make these scales more powerful tools for clinical management and treatment planning.