2. Materials and Methods
The study took place on the campus of the University of Massachusetts in Amherst, Mass., USA (USDA Hardiness Zone 5b). In July 2021, 18 experienced arborists who held the International Society of Arboriculture’s (ISA) Tree Risk Assessment Qualification (TRAQ) (among other credentials) assessed the likelihood of stem failure due to decay of 30 trees using 5 (basic and advanced) assessment techniques.
We selected trees for the field assessment based on practical considerations. The first was the availability of sonic and electrical resistance (ER) tomograms taken of the trunk taken within 2 m of the ground. Tomograms had been previously obtained using a PiCUS Sonic Tomograph 3, a TreeTronic 3 for ERT, and the Caliper 3 Geometry Measurement System (Argus Electronic GMBH, Rostock, Germany) following the methods of [
23]. A second consideration was variation in compartmentalization response: weak (
Pinus) and strong (
Quercus). Finally, only (i) larger individuals (> 50 cm stem diameter measured 1.4 m above ground—“DBH”) and (ii) individuals that were close enough to one another that they could be grouped by location were selected. In the latter case, we selected individuals in six discrete clusters around the campus. We selected clusters of individuals for two reasons: (i) it included a variety of landscape settings (open space or near infrastructure such as roads, buildings, and parking lots); and (ii) it limited travel time to maximize the number of individuals that could be assessed in the two days when assessors visited campus. Prior to conducting the study, we pre-tested the methods and determined an efficient route to assess as many trees as possible in two days.
We recruited assessors from our professional networks, inviting only experienced assessors who (i) held the TRAQ credential, (ii) regularly performed risk assessments as part of their professional practice, and (iii) were familiar with advanced decay detection techniques such as resistance drilling and tomography. We offered continuing education units to assessors, but neither financial compensation nor reimbursement of travel expenses.
Before assessors arrived on campus in July 2020 to participate in the study, we used a Resistograph® F500-S (IML North America, Moultonborough, N.H., USA) to determine the thickness of sound wood (
t) at between three and six locations spaced at approximately even intervals around the stem circumference and at the same height as the tomogram. For each location, we computed the
t/
R ratio, where
R is trunk radius [
31]. We attached flagging to the stem to indicate the locations of the tomography and Resistograph measurements (
Figure 1).
We provided each assessor a binder that included a sheet for each tree. The sheet contained the following information: genus and species, DBH, height, the Resistograph output (
Figure 2), and the sonic and ERT tomograms (
Figure 3). Output from the Resistograph included a scaled diagram of the cross-section of the stem and lines indicating where drillings were made, the height and stem diameter where the drillings were made, the mean
t value, and a table of all
t/
R values. The tomograms included the percentages of cross-sectional area that were sound or decayed. The decayed proportion of the cross-section was computed automatically from the combined areas of blue and purple in the sonic tomogram. Since we used the default settings (SoT1 calculation option and minimum velocity established at 50%), the resulting tomogram depicts the greatest possible area of decay in comparison to those generated using SoT2 and an expanded color space to view the minimum percent velocities. But the computed proportion of decayed wood indicated at the top of the tomogram assessors viewed during the study (e.g.,
Figure 3) did not include areas of intermediate velocities. We explained this to assessors prior to the field study. After the field study, we computed the loss in section modulus due to decay (
ZLOSS) from each sonic tomogram following the method of [
32].
We instructed assessors to assign a rating of the likelihood of stem failure due to decay (“LoFR”) within 2 m of the ground and reminded them not to assess likelihood of failure of other parts of the tree. We used LoFRs from [
13] and provided assessors with the definitions (listed in the Introduction). We instructed assessors to assign their LoFR based on a timeframe of three years.
Assessors performed five consecutive assessments of the LoFR. In order, assessments were as follows:
visual assessment of the tree and its surroundings;
sounding the trunk with a plastic mallet;
viewing the Resistograph output (
Figure 2);
consulting with a randomly assigned assessor.
Assessment techniques (a) and (b) are part of a Level 2 (“basic”) risk assessment [
13]. Assessment techniques (c) and (d) are more sophisticated techniques to assess the amount and location (i.e., the “extent”) of decay and part of a Level 3 (“advanced”) risk assessment [
13]. For odd-numbered trees, assessors viewed the resistance drilling output (c) before the viewing the tomogram (d); for even-numbered trees, assessors viewed the tomogram first. Consulting with a peer is not explicitly recommended in common professional guidelines [
13,
33]. Within each cluster of trees, assessors were randomly paired and inspected individual trees at their own pace.
After each of the five assessments [(a) – (e)] on a tree, assessors completed a survey to indicate their LoFR and describe the factor(s) (e.g., species, decay severity, tree size, exposure, lean, crown, etc.) that most influenced their LoFR, and if the additional information gained in the assessment technique changed their LoFR.
Assessors also self-reported the following information on the survey: years of experience performing tree risk assessments; number of trees assessed annually; relevant credentials in addition to TRAQ; and how frequently they used assessment techniques (b), (c), and (d) as part of their professional practice.
During the field study, not every assessor completed all five assessments of every tree. As a result, approximately 15% of the expected dataset was missing values. We used multivariate imputation by chained equations [
34,
35] to impute the most likely value for each missing value to obtain a full dataset prior to OLR analyses.
The university campus is well maintained, and no assessor assigned a LoFR of four (“imminent”) to any tree. Consequently, we coded LoFR ordinally as one (“improbable”), two (“possible”), or three (“probable”) and built ordinal logistic regression (OLR) models to investigate the effect of assessment technique on LoFR. All analyses were performed using the statistical language R, v4.1.2 [
36]. In the OLR models, we included covariates describing trees [genus; DBH; percent of cross-sectional area with decay (from tomograms); average sound wood thickness (
t) from the Resistograph output;
t/R, where
R is stem radius; Z
LOSS] and participants (years of experience; frequency of using a mallet, resistance drilling, and tomography when conducting risk assessments). We also included tree and assessor identification as random effects in each OLR model. We built models with the “clmm” function from the “Ordinal” package by iteratively adding covariates as single effects or interactions with the main effect of assessment technique [
37]. Since the order of assessments differed between even- (viewed tomogram before Resistograph output) and odd-numbered (viewed Resistograph output before tomogram) trees, the variable “assessment technique” contained ten levels that represented an interaction between the five assessment techniques and even- or odd-numbered trees. We then selected the best model using lowest AICc scores.
In addition to the OLR analyses, we created a contingency table with four rows (one for each of the assessment techniques that followed the initial visual assessment) and two columns (to indicate whether the additional information gained for the assessment technique changed (“Yes”) or did not change (“No”) assessors’ LoFRs). We used a test to determine whether the proportion of affirmative and negative responses varied among assessment techniques.
Lastly, we investigated the influence of the random variables in the OLR model (assessor and tree) on LoFR. To investigate the influence of assessors, we evaluated if the consistency in assessor LoFRs changed among the five assessment techniques or four frequency-of-use categories of the tomogram or Resistograph. We quantified LoFR consistency with the “betadisper” function in the “vegan” package, which performed a multivariate test of homogeneity of variances on a Bray-Curtis (rank-based) dissimilarity matrix of the proportional distribution of LoFRs [
38]. A multivariate approach was needed to evaluate inconsistencies of LoFRs with a single test.
To investigate the influence of trees between the initial visual assessment and each subsequent assessment technique, we computed the ratio of the weighted mean change in LoFR to the proportion of unchanged LoFRs for each tree. The ratio illustrated the frequency, magnitude, and direction of changes in LoFR from the initial visual assessment. We computed the ratio () as follows:
Compute the difference in LoFR from the initial visual LoFR:
where
,
, and
, are indices for the 4 assessment techniques following the initial visual assessment (indicated by the subscript
), 30 trees, and 18 assessors, respectively.
Compute the proportion of unchanged LoFRs (i.e.,
) for each tree and assessment technique:
Compute the weighted mean change in LoFR:
where
is a weighting factor of 1 (for LoFRs that changed one level from the initial visual assessment, e.g., from probable to possible or improbable to possible) or 2 (for LoFRs that changed two levels, e.g., from probable to improbable).
For each tree and assessment technique,
We thus computed 30 values of for each of the 4 assessment techniques that followed the initial visual assessment. From the resulting distribution of 120 values of , we considered only values in the upper and lower quartiles as having increased and decreased LoFR, respectively. We considered values of within the interquartile range (IQR) as having the same LoFR as the initial visual assessment. In the rest of the paper, we refer to “increased”, “decreased”, or “unchanged” LoFRs rather than values of in upper quartile, lower quartile, and IQR, respectively.
We described the basic assessment techniques as “consistent” if the LoFR assigned in the mallet assessment was unchanged from the initial visual assessment, and “inconsistent” if the LoFR assigned in the mallet assessment was greater or less than the initial visual assessment. We described the advanced assessment techniques as consistent if the change in LoFR from the initial visual assessment was the same for both advanced assessment techniques. We described the advanced assessment techniques as inconsistent if the change in LoFR from the initial visual assessment was not the same for both advanced assessment techniques. With respect to changes in LoFR from the initial visual assessment, we described the effect of the consultation assessment as “confirming” (or not) the basic and advanced assessments. If the LoFR assigned in the mallet and consultation assessments was unchanged from the initial visual assessment, the consultation assessment confirmed the basic assessment techniques. Similarly, if the LoFR was greater than or less than the initial visual assessment for both advanced assessment techniques and the consultation assessment, the consultation confirmed the advanced assessment techniques.