3.2. Discussion
The experimental results show that the lower and upper caves of the San Michele di Mezzo sanctuary cannot be considered acoustically equivalent spaces. Although they belong to the same religious complex and share a rock-cut origin, the measured acoustic indicators reveal different frequency-dependent behaviours. This outcome is relevant from a vibroacoustic monitoring perspective because it confirms that portable and non-invasive field measurements can distinguish measurable acoustic responses within a small and irregular heritage site, even when only a conservative source–receiver configuration is adopted.
A first relevant difference concerns the low–mid frequency range. At 250 Hz, the lower cave shows markedly higher clarity and definition values than the upper cave, with C50 = 2.11 dB, C80 = 5.19 dB and D50 = 61.90%, whereas the corresponding values in the upper cave are C50 = −4.05 dB, C80 = −1.71 dB and D50 = 28.30%. This difference is particularly meaningful because low–mid frequency bands contribute to the perceived body and support of the human voice, while clarity and definition parameters describe the balance between early and late acoustic energy. In this sense, the measured response of the lower cave is more consistent with voice-related ritual practices than that of the upper cave in this specific frequency band.
The comparison becomes more nuanced at higher frequencies. At 500 Hz, the two caves show very close values for C50 and D50, whereas at 1000 Hz and above the differences are less systematic. At 4000 Hz and 8000 Hz, the upper cave even shows higher C50 and D50 values than the lower cave. This trend indicates that the acoustic advantage of the lower cave is not absolute across the whole spectrum. Rather, the distinction between the two environments is frequency-dependent. Such a result is important because it prevents an oversimplified interpretation of the lower cave as globally “better” from an acoustic point of view. The measured data support a more precise conclusion: the lower cave exhibits more favourable clarity and definition values in selected low–mid frequency bands, while the two spaces become more comparable, and in some cases inverted, in the medium–high and high-frequency ranges.
To place the San Michele di Mezzo results in a broader acoustic context, the measured indicators were compared with published datasets from selected cave and cave-like environments, reported in
Appendix A [
11,
12,
13]. This comparison is not intended to define universal reference values for caves, because the considered sites differ in volume, morphology, surface conditions, measurement protocols and cultural function. Rather, it provides an interpretative frame for evaluating whether the measured behaviour of San Michele falls within the variability already observed in analogous environments.
The reverberation-related indicators show that San Michele occupies an intermediate position between highly reverberant large cave spaces and smaller or more acoustically controlled cave environments. At 1000 Hz, EDT is 1.309 s in the lower cave and 1.271 s in the upper cave. These values are close to those reported for La Pasiega Turret cave and Tito Bustillo cave, both equal to 1.40 s, and higher than La Garma cave, equal to 0.56 s. They are also lower than the values reported for the Pertosa caves, where EDT at 1000 Hz ranges from 1.61 s in the Castle Hall to 3.83 s in the Large Hall. This comparison indicates that the San Michele caves do not behave as extremely reverberant large cavities, but rather as compact rock-cut spaces with moderate mid-frequency decay.
The same pattern emerges from T30. At 1000 Hz, T30 is 1.600 s in the lower cave and 1.592 s in the upper cave, close to the Castle Hall of Pertosa, equal to 1.46 s, and lower than the Large Hall and Throne Hall of Pertosa, equal to 3.83 s and 2.81 s, respectively. At 4000 Hz, San Michele shows T30 values of 1.079 s in the lower cave and 0.993 s in the upper cave, again below the values reported for the larger Pertosa spaces. This comparison supports the interpretation of San Michele as a small cave sanctuary with measurable reverberant behaviour, but without the long decay times typical of larger cave halls.
The clarity indicators provide an even more informative comparison. At 250 Hz, the lower cave of San Michele shows C50 = 2.11 dB, which is higher than the values reported for several cave sites in the comparative dataset, including El Castillo, La Pasiega Turret and Tito Bustillo, and close to the range of sites with more favourable early-to-late energy balance, such as Las Chimeneas and La Garma. In contrast, the upper cave shows C50 = −4.05 dB at the same frequency. This comparison reinforces the internal differentiation between the two San Michele spaces: the lower cave is not only different from the upper cave, but also shows clarity values that, in selected bands, are comparable with or higher than those of several cave environments reported in the literature.
A similar observation can be made for C80. At 250 Hz, the lower cave reaches C80 = 5.19 dB, whereas the upper cave has C80 = −1.71 dB. The lower cave value is higher than those reported for the Hall of Bulls in Lascaux IV and for the Pertosa halls at the same frequency. At 1000 Hz, the lower cave shows C80 = 1.89 dB and the upper cave C80 = 2.04 dB, while the Pertosa halls range from −3.88 dB to 1.17 dB and Lascaux IV shows −3.5 dB. These values suggest that, in the considered frequency bands, the San Michele caves are not characterized by the very low clarity values observed in some larger or more reverberant cave environments.
The D50 comparison further highlights the specific behaviour of the lower cave. At 250 Hz, D50 is 61.90% in the lower cave and 28.30% in the upper cave. The lower cave value is substantially higher than those reported for the Pertosa halls at the same frequency, which range from 12.78% to 23.27%, whereas the upper cave is closer to that comparative range. At 1000 Hz and 4000 Hz, both San Michele caves show D50 values comparable with or higher than several Pertosa spaces, with the exception of the low 2000 Hz value reported for the upper cave. This confirms that the lower cave presents a relatively high early-energy fraction in selected frequency bands, consistently with the measured C50 and C80 values.
Overall, the comparative analysis strengthens the interpretation of San Michele di Mezzo as an acoustically differentiated cave sanctuary. The lower cave does not merely perform better than the upper cave in an internal comparison; in selected frequency bands, it also exhibits clarity and definition values that are favourable when placed beside published datasets from other cave or cave-like environments. At the same time, the comparison confirms the need for caution: the acoustic response of caves is highly site-specific, and no single parameter can be used to define the acoustic quality of a heritage space independently of morphology, volume, measurement setup and historical use.
This frequency-dependent behaviour is compatible with the irregular morphology of cave-like environments. In such spaces, the acoustic response is controlled by a combination of volume, enclosure, surface roughness, local cavities, openings and non-uniform boundary conditions. Unlike regular halls, rock-cut spaces do not necessarily exhibit smooth or predictable acoustic trends over the frequency spectrum. Experimental studies on cave acoustics have shown that local acoustic responses may vary substantially within the same site and may be associated with specific spatial positions, ritual areas or perceptual effects. The San Michele di Mezzo results are consistent with this broader evidence: the two caves belong to the same sanctuary, but their measured acoustic responses differ in selected bands because their geometries and boundary conditions are not equivalent.
From the point of view of cultural heritage interpretation, the most relevant finding is the acoustic differentiation of the lower cave. The lower cave preserves one of the oldest devotional nuclei of the sanctuary and contains the fresco of the Virgin and Child, probably dating to the end of the twelfth century. The experimental data do not demonstrate intentional acoustic design, nor do they prove that the space was selected because of its acoustic properties. Such conclusions would exceed the available evidence. However, the measured indicators show that the long-lasting ritual centrality of the lower cave is compatible with acoustic conditions that, in selected frequency bands, support vocal clarity and early sound definition. This compatibility is sufficient to connect the physical measurements with the historical use of the space without falling into deterministic or speculative interpretations.
This point is particularly important for the study of intangible heritage. In worship spaces, sound is not merely a physical by-product of ritual action. Spoken prayer, chant, liturgical recitation and collective listening are embodied practices that depend on the acoustic behaviour of the place in which they occur. Previous research on ancient theatres, churches and other performance or worship spaces has shown that acoustic properties, including sound level, reverberation, clarity and speech transmission, can influence intelligibility, audibility, perceived acoustic comfort and the suitability of spaces for collective listening practices. In the present case, vibroacoustic monitoring provides a quantitative layer of evidence that complements the historical and architectural reading of the sanctuary. The tangible features of the lower cave—its morphology, enclosure and devotional apparatus—are therefore linked to an intangible sound-related dimension that can be investigated through physical measurement.
The methodological contribution of this study lies in the conservative use of field vibroacoustic monitoring. The work does not rely on unsupported transfer-function estimates between adjacent spaces, nor on a numerical model of the sanctuary. Instead, it uses directly measured quantities: time histories, third-octave-band spectra and room-acoustic indicators derived from impulse-response measurements. This approach is less ambitious than a complete acoustic mapping or a calibrated numerical simulation, but it is more robust for a first-level investigation of a fragile heritage site. It also responds to a practical need in cultural heritage research: the availability of non-invasive, portable and cost-effective procedures that can be applied in sites where extensive instrumentation is not feasible.
The relevance of these results is not limited to heritage interpretation. Natural caves and artificial cavity-like spaces are relevant in several applied contexts where acoustic response affects communication, safety, comfort and spatial use. In tourist and show caves, room-acoustic indicators such as EDT, T20, C80, D50 and STI have been used to evaluate visitor experience, guided-tour communication, performance suitability and emergency information systems. Recent studies on karst tourist caves have shown that in situ impulse-response measurements, combined with geometric documentation, can support acoustic optimization for guided tours and safety communications [
14]. Similarly, acoustic measurements in Italian caves used for concerts and theatrical performances have demonstrated the relevance of reverberation, clarity and definition parameters for assessing the functional suitability of natural underground spaces [
12].
A further field of transferability is represented by underground transport infrastructure and other long or irregular artificial enclosures. Subway stations, tunnels, underground passages and similar confined spaces are characterized by non-diffuse sound fields, complex geometries and strong dependence of speech intelligibility on reverberation and source–receiver configuration. For example, studies on underground railway stations and subway platforms have shown that sound-field characteristics directly affect public-address systems, emergency communication, perceived comfort and acoustic design strategies [
15,
16]. Another potential related application can be identified in underground mining environments, where high background noise and confined geometries can reduce verbal communication effectiveness and make acoustic intelligibility a relevant factor for ordinary operational communication, safety and occupational acoustic comfort [
17,
18]. From this perspective, the protocol adopted in the present study, based on portable measurements, time-domain inspection, third-octave-band analysis and impulse-response-derived indicators, can be regarded as a first-level monitoring approach potentially transferable to other confined or semi-confined spaces where rapid, non-invasive acoustic assessment is required for different purposes.
This transferability does not imply that the same interpretative criteria can be applied unchanged across different domains. Heritage caves, tourist caves, subway platforms and artificial tunnels differ in scale, material properties, occupancy, background noise and functional requirements. However, they share a common vibroacoustic problem: the need to characterize how irregular or elongated cavities shape mechanical-wave propagation and how this affects human perception, information transmission and use of space. In this sense, the San Michele di Mezzo case study contributes not only to heritage interpretation, but also to the broader development of applied vibroacoustic monitoring procedures for complex cavity-like environments.
At the same time, the limitations of the study must be clearly acknowledged. The use of a single microphone position and a single source–receiver configuration does not allow the reconstruction of the full spatial variability of the acoustic field. The use of a directional loudspeaker, although compatible with the comparative scope of the work, does not reproduce the ideal conditions of standardized measurements based on omnidirectional sources. Moreover, the complex geometry of the caves prevents the direct generalization of the measured values to the entire volume of each space. For these reasons, the results should be interpreted as representative of the adopted monitoring configuration and as comparative indicators of acoustic differentiation, not as exhaustive acoustic certification of the sanctuary.
These limitations do not invalidate the main result of the study. The objective was not to produce a complete acoustic model of San Michele di Mezzo, but to verify whether a portable vibroacoustic monitoring protocol could identify measurable differences between the lower and upper caves. This objective was achieved. The comparison between the two spaces shows that the lower cave has a distinct acoustic response in selected frequency bands relevant to vocal practices, especially at 250 Hz, while the higher-frequency behaviour is more comparable between the two caves. The outcome is therefore not a generic claim of acoustic superiority, but a measured and frequency-dependent distinction between two historically and architecturally different spaces.
Future research should extend the monitoring protocol by increasing the number of source and receiver positions, repeating measurements under different environmental conditions and integrating the experimental data with three-dimensional geometric documentation of the cave spaces. A denser spatial sampling would allow the variability of acoustic indicators within each cave to be assessed, while numerical modelling could help distinguish the effects of volume, surface morphology and openings on the measured response. Additional perceptual tests or voice-based measurements could also be useful to evaluate more directly the relationship between acoustic indicators and ritual vocal practices. Within this broader research path, the present study provides a first experimental basis for interpreting San Michele di Mezzo as a heritage site in which architecture, acoustic response and intangible devotional use are physically and historically interconnected.