3. Sequential Likelihood Ratio Tests for Unstable Points Identification – SLRTUPI
One of the questions that arises is how to define the number of points to be monitored and their locations. Clearly, geodesists need to work in an interdisciplinary way with other professional/areas so that an alternative model may be properly formulated [
18]. However, even with such information, we may still have doubts whether there are points that a priori were assumed to be stable, whereas in fact they may be shifting. From a practical point of view, therefore, we often do not know well how many and which points are or are not stable. Here, all geodetic network points are subjected to be inspected, regardless of whether they are located on structure or not. Thus, a more conservative alternative hypothesis would be to assume that there is at least one point that is moving among all points involved in the geodetic network. Consequently, we would have
alternative models for
to be tested against the null hypothesis, i.e.:
Note that
represents the ith column vector of the displacement-design matrix
. For the case of the network illustrated in
Figure 1 and
Figure 2, we would then have 6 alternative hypotheses, since we have a total of
points, so that
This means that we would have to test
against
, i.e.:
We are now dealing with testing involving multiple hypotheses. Now, we are interested in knowing which of the alternative hypotheses may lead to the rejection of the null hypothesis with a certain probability. To answer this, the test statistic in Eq. (18) is now computed for each of such tests
, so that we would have a vector of test statistics, denoted by
. In that case, the test statistic coming into effect is the maximum test value, which is computed as
The decision rule for this case is given by:
The decision rule in (23) states that if none of the tests get rejected, then we accept the null hypothesis . If the is larger than some percentile of its probability distribution (i.e., some critical value ), then there is evidence that there is a displaced point in the structure. In this case, we can only assume that detection occurred. The identification, however, is not straightforward. The identification (or localization) of the displaced point consists of seeking the point that produced the maximum test statistic , and whose value is greater than some critical value . Thus, point identification only happens when point detection necessarily exists, i.e., “point identification” only occurs when the null hypothesis is rejected. This means that the correct detection does not necessarily imply correct identification. For instance, we may correctly detect the occurrence of deformation, but wrongly identify a point as displaced, while in fact it is another point that has changed. The description of the success and failure rates will be covered in the next sections.
It is important to highlight that the maximum test statistic, now generally denoted by
, is treated directly as a test statistic. Thus, the distribution of
cannot be derived from well-known test distributions (e.g., Chi-squared distribution). Therefore, critical values cannot be taken from a statistical table but must be computed numerically. Here, the critical value
is computed by Monte Carlo such that a user-defined Type I decision error rate “
” for the proposed procedure is warranted [
39]. In the next section, we provide details on how to compute the critical value
for the proposed sequential procedure.
The procedure described so far allows only one single point to be identified. However, we may have multiple points displaced simultaneously. A more appropriate approach would be to apply a sequential test to decide the number and location of shifted points for the case where the null model is rejected. If the maximum test statistic value exceeds the critical value at the significance level adopted, i.e., if
, the test statistics proceeds for
(two possible unstable points). Since we do not know which group of two points might be shifting, we have to compute the test statistics by Eq. (18) from the possible combinations for two unstable points
, and then identifying the corresponding candidate group for
through the maximum value of the test statistic
. Note that the extreme test statistic
returns only one group of points or point. In general, the number of possible groups of points, denoted by
, is given by:
For instance, for a geodetic network with
points and by assuming
two-point group to be tested, we would have a total of
possible combinations. That would be the case for the geodetic network presented here, as can be seen either by
Figure 1 or
Figure 2, then we would have
alternative models, i.e.,
. Note that for
it is a particular case of having
, as previously presented. Thus, the general cases of the alternative hypotheses can be expressed as:
If the identified point with
is also contained in the case of
, i.e. if the one point flagged initially is among one of the pairs flagged subsequently, then the null hypothesis becomes specified according to the model identified at
and the alternative hypothesis to the model defined at
, as can be seen the example below:
For example, if
and the point A in the geodetic network illustrated in
Figure 1 (or
Figure 2) was identified by the
, the model under the null hypothesis would become
. If in the next step for
, points A and C were identified by the
, then the alternative hypothesis would be defined as
. Note that
is a subset of
. To decide which of these models to select, we compute the test statistic based on the maximum likelihood ratio between
and
, denoted by
[
37]:
The result of Eq. (27) comes from the ratio between the maximum of the probability density function of
under
and
, and the fact that the null hypothesis is (and must be) a subset of the alternative hypothesis. Thus, the selection of the most likely hypothesis is based on the following decision:
where
and
are the least-squares estimated observation errors for the model under the null and alternative hypothesis, respectively. For the example above, we would have
for the null model and
for the alternative one.
If the null hypothesis is not rejected, the testing ends and only the point corresponding to is flagged as unstable. On the other hand, if the null hypothesis is rejected again, then a new step is started by taking the model under the alternative hypothesis identified in the previous step as the null hypothesis. It proceeds by computing the test statistics for according to Eq. (18) for all possible combinations given by Eq. (24), identifying the corresponding candidate group for through the maximum value of the test statistic , and then checking if one of the points flagged in the previous step (i.e., for ) is among those flagged in the current step for . If this is the case, then the test is applied through Eq. (26) to decide between the null model defined by the identified model for and the alternative model for . This sequential procedure is repeated until any of the following conditions is met:
- (i)
The current null hypothesis fails to be rejected.
- (ii)
More than one group of geodetic network points is identified by the extreme statistic for a given , i.e., for the case where the hypotheses cannot be separable (statistical overlap), and therefore the identification cannot be accomplished.
- (iii)
If the group of point(s) flagged by is not fully contained in the group of points flagged by , then the procedure ends with the null hypothesis given by the group of point(s) flagged by .
- (iv)
Iteration reaches the threshold (i.e., until the maximum number of points to be evaluated is fully inspected). The definition of the maximum number of points will be detailed in the next section.
In general, therefore, the sequential testing procedure proposed here is based on likelihood ratio, which we now call Sequential Likelihood Ratio Tests for Unstable Points Identification (SLRTUPI). It consists in determining whether the additional subset of displaced point, considered in every new alternative hypothesis, is statistically significant or not, in terms of its impact on the quadratic form of the estimated observation errors, similar to its form for outlier detection [
27]. Consequently, we can identify the number and the location of the displaced geodetic network points.
Figure 3 provides step-by-step how to run the SLRTUPI procedure.
3.1. Monte Carlo Approach for Controlling the False Detection Rate
There is the probability of committing at least one false discovery, or Type I error when performing multiple hypotheses tests. Thus, Type I Error rate for the case where SLRTUPI is in play corresponds to the probability of incorrectly detecting at least one point as displaced while in fact there is none (i.e., accept at least one alternative hypothesis, when, in fact, the null hypothesis is true). This means that the SLRTUPI Type I decision error does not depend on all subsequent test steps, only on the extreme test statistic computed in its first step, i.e., only for
in Eq. (22). The risk of rejecting a true
is now
one-fold: the undesired random event “reject a true
” can occur in any of the
tests. Let the probability of rejecting a true
in test
be
(the so-called “experimentwise error rate”) and let
. Furthermore, assume the random events “reject a true
in test
” to be approximately statistically independent. Then the total probability of rejecting a true
in the multiple hypothesis test (the so-called “familywise error rate”, denoted here by
) is [
11]:
The classical and well-known procedure to control the
is to apply the Bonferroni equation by choosing [
40]:
Unfortunately, the test statistics
and consequently the random events “reject a true
in test
” are statistically dependent. The extreme statistic
captures such dependencies, as it is extracted from the
. If such dependencies are neglected, then the computed critical values are erroneous, and the test decisions do not have the user-defined familywise error rate
[
34]. Here, the maximum test value
in Equation (22) is treated directly as a test statistic [
39]. Note that when using Equation (22) as a test statistic, the decision rule is based on a one-sided test of the form
, as can be seen in expression (23). However, the distributions of
cannot be derived from well-known test distributions (e.g.,
-distribution). Therefore, critical values cannot be taken from a statistical table but should be computed numerically. A rigorous computation of critical values requires a Monte Carlo technique.
The procedure to compute the critical values for is given step-by-step as follows:
Generate a sequence of
random vectors of the measurement errors for both epoch 1
and epoch 2
k = 1,...,
m of the desired distribution. e.g.:
where
is known as the number of Monte Carlo experiments. In addition, Matlab's “
mvnrnd” command may be used in this step, for example.
For each pair
and
,
k = 1,...,
m compute the differences in errors between the two epochs, i.e.:
Apply the least-squares to estimate the observation errors, as follows:
where
is known as the redundancy matrix, which is given by:
with
the identity matrix.
Assemble the displacement-design matrix ,i = 1,..., np for each Monte Carlo experiment, according to equation (16), but the signs of the coefficients of the displacement-design matrix given by k = 1,..., m, and compute the test statistic by (18) and (22). The frequency distribution of approximates the probability distribution of .
Sort in ascending order the
, getting a sorted vector
, such that:
The sorted values in
provide a discrete representation of the cumulative density function of the maximum test statistic
.
Determine the critical value
:
where
denotes rounding down to next integer that indicates the position of the selected elements in the ascending order of
. This position corresponds to a critical value for a stipulated overall false detection probability
. This can be done for a sequence of values
in parallel [
34].
The Matlab function "kslrtupi.m" was elaborated for the computation of critical values. Inputs in the function are , where “” is the user-defined number of Monte Carlo experiments.
3.2. Determining the maximum possible number of points
Here, we objectively and universally demonstrate how to obtain the maximum number of points “
” to be inspected by the SLRTUPI sequential procedure. The maximum number of points is also defined sequentially. The procedure to define the maximum number of points “
” consists in finding a regular model (i.e., a matrix
with full column rank) and, of course, that model has enough redundancy for the identification of unstable points. The step-by-step procedure to obtain the maximum possible number of points
to be inspected by SLRTUPI is given by the flowchart in
Figure 5.
If the full displacement-design matrix is specified by taking the total number of points in the network, i.e., by taking according to equation (16), then we fall into a problem in which there is a rank deficiency. Usually, the rank defect is greater than or equal to 1, because , where is the number of columns of the matrix . In this situation, the determinant of the normal matrix is zero. This means that such matrix cannot be considered invertible – the matrix is then said to be singular. Consequently, it is not possible to compute the test statistics by Equation (18). This demonstrates that .
By reducing the number of points by one unit, i.e., , we expect to find regular models (i.e., without rank deficiency). For that, it should be checked again whether there is rank defect or not. But now we have at our disposal not only one single group of points, but a combination of group of points. If at least one of the matrices presents rank defect, then must be reduced again by one unit, i.e., , and the rank computation is then repeated for group of points. If the rank defect persists, then the rank computation is repeated by decreasing to , otherwise we proceed to evaluate whether the test statistics computed in Equation (18) are different from each other. If there are at least two statistics with equal values, then a statistical overlap will be flagged. In that case, therefore, we should reduce to and restart the procedure. If the models are found to be regular and there is no statistical overlap, then the process ends with the value found.
Important to highlight that the rank of the matrix
depends on the behaviour of the signs of the coefficients of its displacement-design submatrix
. The signs will also depend on the geometry of the geodetic network, as well as the unknown magnitude and number of points that have been shifted. Remember, however,
captures these quantities. Although we have the limitation of computing the test statistics a priori, the procedure proposed above allows to determine the maximum number of testable points for identification. Generally, the choice of this maximum number is restricted to a subjective choice [
11,
13]. Here, Matlab function "pmax.m" was developed for the automatic computation of the maximum number of points. Inputs in the function are
and
, i.e.,
. Below are presented two possible scenarios that may occur in obtaining the
for the geodetic network presented in the scope of this article (
Figure 1).
Example 1:
Note that the number of columns in the matrix (38) is
but its
. In that case, we would fall into a rank deficiency problem, because
. If we considered that
, then we would have
group by Eq. (24), namely
. However, in this case, the determinant of the normal matrix would be equal to zero, denoted by
, which would characterize it as a singular matrix, and consequently would not allow the computation of the test statistics by the Eq. (18). By reducing
to one unit (
), we would have now a total of
groups of 5 points, which would be the following:
;
;
;
;
; and
. All these groups would now have full column rank with
and
. Thus, we would end and set
.
However, another important analysis is needed. Although the determinants of the matrices constructed for each group of points are non-zero, their values are all equal to . Mostly, the test statistics computed for each of those groups by Eq. (18) would also have the same values. So, we would not be able to identify the group by the , since all the statistics are the same, i.e., there would be a statistical overlap. Consequently, we would only be able to get the information about displacement detection (i.e., ), but not identification. In order to have the possibility of identification, we would then decrease again to one unit (i.e., ), which would lead to a total of groups of 4 points. Now, the test statistics for each of would result of values different from each other, making identification possible. As a result, the maximum number of points to be considered would be for identification, which for that example we would get .
In this scenario, we would fall back into a problem where the matrix holds rank defect, but now with . If , we would have groups of 5 points to be considered, namely: ; ; and . Unfortunately, the rank of any of these groups would be equal to , and therefore we would still remain with rank deficiency problem. Moreover, the determinant of any normal matrix of these groups would be zero, i.e., . By reducing to one unit (i.e., ), we would have a total of groups of 4 points, but still with rank defect for the following groups: and As there would still be a lack of full column rank for some groups, we would again reduce . Now we would have a larger number of groups than the previous case, but of size equals to , i.e., groups of 3 points. In the latter case, however, we would still have only one single group of points that would cause a rank defect, namely . Again, we would proceed to decrease the maximum number of points followed by computing the rank of the matrices. Finally, we would find the maximum number of points, which in that case would be . All these groups of 2 points would have their respective matrices with full column rank and theirs test statistics would have different values from each other. Thus, would be the dimension found, so that all groups of 2 points would be testable for identification.
In addition to the examples given above, we may have cases where , which allows us to identify only one single point. In that case, the SLRTUPI procedure would not proceed in its sequential form, but would restrict only in its first step, i.e., only in the identification of a point by means of . And, if , then in that case we would have a situation in which it is not possible to detect displacements, since the network does not have enough redundancy for this purpose. This latter case may occur if the geodetic network is very poorly designed so that it will not be possible to detect displacements.