Preprint
Article

Correspondence Analysis for Assessing Departures from Perfect Symmetry Using the Cressie-Read Family of Divergence Statistics

This version is not peer-reviewed.

Submitted:

05 June 2024

Posted:

06 June 2024

You are already at the latest version

A peer-reviewed article of this preprint also exists.

Abstract
Recently, there appeared in this journal (Beh and Lombardo \textbf{2022}, {\it Symmetry}, 14, 1103) a paper that showed how to perform a correspondence analysis on a two-way contingency table where Bowker's statistic lies at the numerical heart of this analysis. Thus, we showed how this statistic can be used to visually identify departures from perfect symmetry. Interestingly, Bowker's statistic is a special case of the symmetry-version of the Cressie-Read family of divergence statistics. Therefore, this paper presents a new framework for visually assessing departures from perfect symmetry using a second-order Taylor series approximation of the Cressie-Read family of divergence statistics.
Keywords: 
;  ;  ;  

1. Introduction

The correspondence analysis of a symmetric S × S contingency table, N has been a topic of research undertaken by, for example, Greenacre [22] and Beh and Lombardo [10]. Both approaches involve the partition of N such that
N = Y + K = 1 2 N + N T 1 2 N N T
where Y is the matrix that reflects the symmetric part of the table and K reflects the skew-symmetric part. This partition was considered in various context by many including, but certainly not limited to, Bove [14], Section 3 Constantine and Gower [16] and Gower [25].
The methods of Greenacre [22] and Beh and Lombardo [11] approach the visualisation of the departure from perfect symmetry using correspondence analysis by partitioning the transformed contingency table into a skew matrix and a skew-symmetric matrix, as (1) does. While both correspondence analysis approaches have (1) as a common thread, they are quite different. Greenacre [22] uses Pearson’s chi-squared statistic, X 2 , that centres the elements of the contingency table with respect to the mean of the row and column marginal totals, yielding two low-dimensional displays; one depicting departures from perfect symmetry and the other depicting departures from skew-symmetry. On the other-hand, Beh and Lombardo [11] use Bowker’s chi-squared statistic, X B 2 [15], producing a single low-dimensional display depicting departures from perfect symmetry.
While the difference between X 2 and X B 2 is that the former assesses departures from complete independence while the latter assesses departures from perfect symmetry, both can be expressed as a special case of the Cressie-Read family of divergence statistics (Cressie and Read [17]) when viewed as a goodness-of-fit measure. For more on how correspondence analysis can be performed on a two-way contingency table using the Cressie-Read family of divergence statistics refer to Beh and Lombardo [11]. Their method includes, as special cases, the classical approach to correspondence analysis [6,8,21,28], log-ratio analysis (LRA) [23,24] and the Hellinger Distance Decomposition (HDD) method [18,19]. Rather than using Pearson’s statistic as the numerical foundation, LRA and HDD use the modified log-likelihood ratio statistic [29] and the Freeman-Tukey statistic [20], respectively.
This paper presents a family of symmetry divergent statistics based on the two recent works of Beh and Lombardo [10], Beh and Lombardo [11] by demonstrating how a correspondence analysis can be performed for assessing departures from perfect symmetry of N . To do so, this paper is divided into five further sections. Section 2 gives an overview of the classic test of the departure from perfect symmetry for a two-way contingency table. It also describes how the Cressie-Read family of divergence statistics can be used for performing such a test. This family is dependent on the power parameter δ where changes in δ lead to special cases of the family. In this paper we focus on the second order approximation of this family which yields exactly Pearson’s chi-squared statistic ( δ = 1 ), the Freeman-Tukey statistic ( δ = 1 / 2 ) and the modified likelihood ratio statistic ( δ = 0 ). Section 3 describes the core interest of this paper; the development of a correspondence analysis framework that can be applied to a two-way contingency table to visualise sources of departure from perfect symmetry when using the Cressie-Read family of divergence statistics. As part of this discussion, we also show that when δ = 1 , the method described by Beh and Lombardo [10] is a special case of this new framework. Two examples are given that demonstrate the various features of this new framework. Section 4 studies a 4 × 4 artificial contingency table which exhibits perfect symmetry when a constant C = 0 is added to a cell frequency. As C increases, the artificial table exhibits features consistent with increasing departures from perfect symmetry and so this example examines the features of this correspondence analysis framework as C and δ change. Our second example (Section 5) examines the data of Wiepkema [33] that is concerned with 12 pre- and post-courtship behaviours of a small European fish called a bitterling (Rhodeus amarus Bloch). Some final comments on the framework outlined here are made in Section 6.

2. Test of Perfect Symmetry and the Cressie-Read Family of Divergence Statistics

2.1. Notation

Suppose we have an S × S contingency table, N , where the i , j th cell entry has a frequency of n i j for i = 1 , 2 , , S and j = 1 , 2 , , S . Let the grand total of N be n and let the matrix of relative frequencies be P so that its i , j th cell entry is p i j = n i j / n where i = 1 S j = 1 S p i j = 1 . Define the ith row marginal proportion by p i = j = 1 S p i j . Similarly, define the jth column marginal proportion as p j = i = 1 S p i j .

2.2. Testing Depatures from a Hypothesised p i j

Testing whether there is evidence of a statistically significant association between the row and column variables of N can be made by considering any member of the Cressie-Read family of divergence statistics
CR δ = 2 n δ δ + 1 i = 1 S j = 1 S p i j p i j p ^ i j δ 1 ,
for any δ , , where p ^ i j is some value of p i j under a well defined null hypothesis. When assessing departures from complete independence p ^ i j = p i p j for i , j = 1 , 2 , , S so that (2) is a chi-squared random variable with S 1 2 degrees of freedom. However, Cressie and Read [17] also presented a second order approximation of (2) around p i j / p ^ i j δ = 1 that is very useful for the purposes of applying correspondence analysis to N. This approximation is
CR δ CR * δ = n i = 1 S j = 1 S p ^ i j 1 δ p i j p ^ i j δ 1 2 .

2.3. Testing Departures from Complete Independence

Of course, any reasonable choice of p ^ i j may be defined but we will confine ourselves to briefly discussing its definition under complete independence and perfect symmetry. For the case where one is interested in assessing departures from complete independence of the variables of N, (2) is expressed as
CR A δ = 2 n δ δ + 1 i = 1 S j = 1 S p i j p i j p i p j δ 1 .
The subscript “A” has been added to the left-hand side to show that this family of statistics assesses departures from complete association. The general nature of (2), and (4), ensures that specific values of δ lead to well defined and well understood measures of association, all of which are chi-squared random variables. These include Pearson’s chi-squared statistic, the log-likelihood ratio statistic, and the Freeman-Tukey statistic which are X 2 = CR δ = 1 , G 2 = CR δ = 0 and T 2 = CR δ = 1 / 2 , respectively. The modified chi-squared statistic, the modified log-likelihood ratio statistic and the Cressie-Read statistic are also special cases such that N 2 = CR δ = 2 and M 2 = CR δ = 1 , and C 2 = CR δ = 2 / 3 , respectively.
Beh and Lombardo [11] showed that a correspondence analysis of N when assessing departures from independence can be undertaken by making use of the second order Taylor series approximation of (2) around
p i j p ^ i j δ = p i j p i p j δ = 1
resulting in
CR A δ CR A * δ = n i = 1 S j = 1 S p i p j 1 δ p i j p i p j δ 1 2 .
This approximation may be obtained by substituting p ^ i j = p i p j into (3). See also pp. 94 – 95 Cressie and Read [17] for a derivation of this approximation of (4). This family of statistics gives exactly the following commonly used chi-squared statistics: Pearson’s statistic X 2 = CR A 1 = CR A * 1 , the Freeman-Tukey statistic T 2 = CR A 1 / 2 = CR A * 1 / 2 and the modified log-likelihood ratio statistic M 2 = CR A 1 = CR A * 0 . In the context of correspondence analysis, X 2 serves as numerical foundations of the traditional approach, T 2 is the foundations of the method described in Beh, Lombardo and Alberti [12] and Cuadras and Cuadras [18], while M 2 serves as the foundations of LRA, a variant of correspondence analysis described by Greenacre [23]. A fourth chi-squared statistic that is commonly used in the context of contingency table analysis, and was discussed from a correspondence analysis perspective by Beh and Lombardo [11] is when δ = 2 / 3 yielding CR A * 2 / 3 , a second order approximation of the Cressie-Read statistic so that CR A 2 / 3 CR A * 2 / 3 .

2.4. Testing for Departures from Perfect Symmetry

Sometimes it is the case that the two variables of N are, say, identical but measured over two different time periods. It may be that these same variables are collected between two different cohorts. In cases such as these, it is more typical to analyse the departures from perfect symmetry between the rows and columns of N. Therefore, when testing for departures from perfect symmetry, the null hypothesis is
H 0 : p i j = p j i
for i = 1 , 2 , , I and for j = 1 , 2 , , J .
Assessing whether there exists any evidence of symmetry between the variables of N in the population, p. 427 Agresti [1] and p. 321 Anderson [4] showed that the most appropriate choice of p ^ i j is
p ^ i j = p i j + p j i 2 .
Therefore, the Cressie-Read family of divergence statistics can be defined for testing departures from perfect symmetry so that (2) can be expressed as
CR S δ = 2 n δ δ + 1 i = 1 S j = 1 S p i j 2 p i j p i j + p j i δ 1 ,
and is a chi-squared random variable with S S 1 / 2 degrees of freedom. The subscript “S” has been added to the left-hand side of (7) to show that this family of statistics assesses departures from perfect symmetry. This statistic has been the topic of interest by Tomizawa, Seo and Yamamoto [30], Ando, Hoshi, Ishii and 98izawa [5] and Altun and Saraçbaşi [2]. Our focus will be to examine the role of a second order approximation of CR S δ for performing a correspondence analysis to visually detect departures from perfect symmetry. Therefore, it presents a more general framework to the correspondence analysis discussed by Greenacre [22] and Beh and Lombardo [11].

2.5. A Second Order Approximation

A second-order Taylor series approximation of (7) around
p i j p ^ i j δ = 2 p i j p i j + p j i δ = 1
can be obtained by substituting (6) into (3). Doing so yields the family of asymptotically chi-squared random variables with S S 1 / 2 degrees of freedom under the null hypothesis of perfect symmetry, (6),
CR S * δ = n 2 i = 1 S j = 1 S p i j + p j i 1 δ 2 p i j p i j + p j i δ 1 2
or, alternatively but equivalently,
CR S * δ = n i > j S p i j + p j i 1 δ 2 p i j p i j + p j i δ 1 2 .
There are three special cases of this family of divergence statistics that we shall consider in our analysis of symmetry in a two-way contingency table. The first is when δ = 1 :
X S 2 = CR S * 1 = n 2 i = 1 S j = 1 S p i j p j i 2 p i j + p j i = n i > j S p i j p j i 2 p i j + p j i
which is just Bowker’s chi-squared statistic [15]. Beh and Lombardo [10] used this statistic as the basis for performing correspondence analysis to assess departures from perfect symmetry in N.
Secondly, suppose that we consider the case where (8) is evaluated when δ = 1 / 2 . Then, we can show that
T S 2 = CR S * 1 2 = 4 n i = 1 S j = 1 S p i j p i j + p j i 2 2
is the Freeman-Tukey statistic when assessing departures from perfect symmetry.
The third special case of (8) is when δ = 0 . For this value of δ , (8) does not exist. However, we can obtain the limiting value of (8) as δ 0 . Doing so means that we can use the Box-Cox transformation so that
lim δ 0 1 δ 2 p i j p i j + p j i δ 1 = ln 2 p i j p i j + p j i .
Therefore,
CR S * 0 = lim δ 0 CR S * δ = n i = 1 S j = 1 S p i j + p j i 2 lim δ 0 1 δ 2 p i j p i j + p j i δ 1 2
simplies to
M S 2 = CR S * 0 = n i = 1 S j = 1 S p i j + p j i 2 ln 2 p i j p i j + p j i
and is the modified version of the log-likelihood ratio statistic when testing for perfect symmetry in N . Note that eq. (8.2-11) Bishop, Fienberg and Holland [13], p. 489 Haberman [26] and eq. (1.3) Ireland, Ku and Kullback [27] gave the (unmodified) log-likelihood ratio statistic:
M ˜ S 2 = n i = 1 S j = 1 S p i j ln 2 p i j p i j + p j i
for assessing departures from perfect symmetry in a S × S contingency table.
When there is perfect symmetry between the variables of N so that (5) holds, (9), (10) and (11) will be zero. When there exists a statistically significant departure from perfect symmetry, we can visually assess the statistical significance of this departure then using correspondence analysis. We shall now show how (8) can be used to perform a correspondence analysis on N when assessing these departures.

3. Correspondence Analysis & Perfect Symmetry

3.1. The Divergence Residual

To perform a correspondence analysis on N under the null hypothesis of perfect symmetry we first define the S × S matrix of divergence residuals, S δ , where its i , j th element is
s i j δ = 1 δ p i j + p j i 2 2 p i j p i j + p j i δ 1 .
Therefore, the sum-of-squares of these residuals gives (8) so that
CR S * δ = n i = 1 S j = 1 S s i j 2 δ
= n trace S δ T S δ
= n trace S δ S δ T .
Note that when i = j (so that we are concerned with the diagonal elements of S δ ) these residuals are zero for all δ . Three examples of the form that (12) takes is when δ = 1 , 1 / 2 and (approaching) 0. Respectively, these values of δ give the residuals
s i j 1 = 1 2 p i j p j i p i j + p j i s i j 1 2 = 2 2 p i j p i j + p j i = 2 p i j p i j + p j i 2 s i j 0 = p i j + p j i 2 ln 2 p i j p i j + p j i .
The first of these, s i j 1 , is the i , j th Bowker residual described by eq. (7) Beh and Lombardo [10] so that n times its sum-of-squares produces Bowker’s statistic, (9).
The second and third residuals are akin to the Freeman-Tukey residual and modified log-likelihood ratio residual, respectively, described by Beh and Lombardo [11], but are used when assessing departures from perfect symmetry. Note that n times the sum of squares of these two residuals gives (10) and (11), respectively.
For s i j 0 , it is assumed that all cells of the contingency table have non-zero frequencies so that 0 < p i j < 1 for i = 1 , 2 , , S and j = 1 , 2 , , S . This is to avoid any problems with calculating the natural logarithm of zero. In the event that a zero cell frequency is observed, a simple remedy is to replace it with a small value, say 0.01. Alternatively, one may use more objective methods to accommodate for a zero cell frequency. Other residuals can also be obtained using alternative values of δ .

3.2. Is the Matrix of Divergence Residuals Skew-Symmetric?

One of the benefits of using Bowker’s statistic as the numerical basis on which to perform correspondence analysis is that the resulting matrix of divergence residuals is skew-symmetric. That is, when δ = 1 , S 1 has the property that S 1 T = S 1 . Therefore, s i i 1 = 0 and s i j 1 = s j i 1 for i j , i , j . It also means that the singular values, and the left and right singular vectors, of S 1 can be calculated by applying an eigen-decomposition to S 1 T S 1 or, equivalently, S 1 2 . Ward and Gray [32] and p. 113 Gower [25] discuss that for a S × S skew-symmetric matrix, like S 1 , that if S is odd then there will always be a zero eigen-value and S 1 positive eigen-values. If S is even there will always be S eigen-values that exist in pairs [16].
When δ 1 , S δ is not a skew-symmetric matrix, since there will be at least one cell where s i j δ s j i δ , i j , unless there is perfect symmetry between the variables of N .

3.3. Singular Value Decomposition and the Divergence Residual

When assessing departures from perfect symmetry in N , the correspondence analysis approach of Beh and Lombardo [10] involves applying a singular value decomposition (SVD) to the matrix S 1 . Since Bowker’s statistic is a special case of (8), this suggests that a more general family of correspondence analysis techniques can be developed for visualising departures from perfect symmetry. Such a general family can be developed using the family of statistics generated from (8). Therefore, a new general family of correspondence analysis techniques can be obtained by applying a SVD to S δ such that, for the i , j th cell,
1 δ p i j + p j i 2 2 p i j p i j + p j i δ 1 = m = 1 M a i m δ λ m δ b j m δ
where
i = 1 I a i m δ a i m δ = 1 m = m 0 m m , j = 1 J b j m δ b j m δ = 1 m = m 0 m m ,
and M is the maximum number of dimensions required to depict all of the association that exists between the variables of the contingency table. When δ = 1 then M = S if S is even and M = S 1 if S is odd. For other values of δ , M = S . The quantities a i m δ and b j m δ are the ith and jth element, respectively, of the mth left and right singular vectors of the matrix of divergence residuals for a fixed δ . The mth largest singular value is λ m δ so that 1 > λ 1 δ > λ 2 δ > , λ M * δ > 0 .
The matrix form of (15) and (16) is
S δ = A δ Δ δ B T
with
A δ T A δ = I M and B δ T B δ = I M
being the matrix form of (16). Here, I M is a M × M identity matrix, A δ is the S × M matrix where the i , m th element is a i m δ , B δ is the S × M matrix where the j , m th element is b j m δ , and Δ δ is the M × M diagonal matrix of singular values with λ m as it’s m , m th element.
While the Cressie-Read family of divergence statistics can be expressed in terms of S δ – see (13) and (14) – it can also be expressed in terms of its singular values. To show this, substituting (17) into (13) leads to
CR * δ = n trace A δ Δ δ B δ T T A δ Δ δ B δ T = n trace B δ Δ δ 2 B δ T = n trace Δ δ 2
when B δ is of full rank so that B δ B δ T = B δ T B δ = I M . Therefore, the total inertia of N can be expressed as the sum-of-squares of the squared singular values so that
CR * δ n = m = 1 M λ m 2 δ .
Expressing the total inertia in this manner is analogous to the total inertia of p. 22 Beh and Lombardo [11] when the Cressie-Read family of divergence statistics is used as the numerical basis of the correspondence analysis of a two-way contingency table.

3.4. The Principal Inertia Values

Beh and Lombardo [10] showed that when assessing departures from perfect symmetry when N is a 2 × 2 contingency table, S 1 , has two equal singular-values whose squared values are
λ 1 2 = λ 2 2 = s 21 2 1 = 1 2 p 21 p 12 p 21 + p 12
and are the principal inertia values of the first two dimensions of the correspondence plot when analysing a two-way contingency table. Similarly, when symmetry is of concern for 3 × 3 contingency table, the three principal inertia values are
λ 1 2 = λ 2 2 = CR * 1 2 n and λ 3 2 = 0 .
For both sized N , the sum of their squared singular values gives Bowker’s statistic.
When analysing the symmetry of a two-way table using the Cressie-Read family of divergence statistics we can consider values of δ 1 . For example, when S δ is of rank 2 then amending Appendix A of [10] for δ 1 shows that there will be two unequal singular values whose squares are
λ 1 2 δ = max s 12 2 δ , s 21 2 δ
λ 1 2 δ = min s 12 2 δ , s 21 2 δ .
Note that when δ = 1 these squared singular values simplify to (19).
When δ 1 , then (19) and (20) are satisfied only when there exists perfect symmetry between the variables of N (in which case all squared singular values will be zero). This is because when δ 1 , S δ is not a skew-symmetric matrix.
We now turn our attention to the construction of the M-dimensional correspondence plot by defining and describing the principal coordinates for each row and column of N .

3.5. Principal Coordinates

When visually portraying the categories of N, define the metric matrix by
D ˜ = D I + D J 2 .
Then the matrix of row and column principal coordinates is
F δ = D ˜ 1 / 2 A δ Δ δ
G δ = D ˜ 1 / 2 B δ Δ δ ,
respectively. These provide a more general set of principal coordinates than those of eqs. (14) & (15) Beh and Lombardo [10] who were concerned only with the case when δ = 1 . Although, the principal coordinates of eqs. (14) & (15) Beh and Lombardo [10] can be obtained by simply substituting δ = 1 into (23) and (24).
Defining the row and column principal coordinates by (23) and (24), respectively, means that the row and column spaces have the same metric that is based on the aggregation of p i j across the two variables, irrespective of the value of δ . Such an aggregation is done since (7) relies only on the cell proportions p i j and p j i .
Post-multiplying both sides of (23) by B δ T and simplifying gives us an alternative expression for the row principal coordinates
F δ = D ˜ 1 / 2 S δ B δ .
Similarly, it can be shown that the column principal coordinates can be expressed in terms of S δ such that
G δ = D ˜ 1 / 2 S δ T A δ .
As we have already shown, S δ is not a skew-symmetric matrix unless δ = 1 . In the event that δ = 1 then, as shown by Beh and Lombardo [10],
G 1 = D ˜ 1 / 2 A 1 Δ 1 J M T = F 1 J M T
where J M is an M × M block-diagonal and orthogonal skew-symmetric matrix so that
J M T J M = J M J M T = M .
See Section 5.1 Beh and Lombardo [10] for examples of J M when M = 2 , 3 and 4.

3.6. On the Total Inertia and the Origin

The total inertia of the two-way contingency table can be expressed in terms of the matrices of row and column principal coordinated given by (23) and (24). To show this, suppose we consider the total inertia in terms of the row principal coordinates. Then,
trace F δ T D ˜ F δ = trace D ˜ 1 / 2 A δ Δ δ T D ˜ D ˜ 1 / 2 A δ Δ δ = trace Δ δ A δ T A δ Δ δ = trace Δ δ 2 = CR * δ n .
Similarly, we can also show that the total inertia can be expressed in terms of the column principal coordinates so that
CR * δ n = trace G δ T D ˜ G δ .
Therefore, if there is perfect symmetry between the rows and columns of our two-way contingency table then the total inertia will be zero. When this happens, the position of the row and column principal coordinates will be located at the origin. Therefore, the origin is interpreted as the point in the low-dimensional space where there is perfect symmetry between the row and column variables. The further a point is away from this origin then the more deviation it has from the null hypothesis of perfect symmetry. When assessing departures from complete independence assessing the contribution of a row and column point to the association structure can be undertaken using the closed-form equations that yield confidence regions for each point; see Beh [7] and Beh and Lombardo [9] when δ = 1 and Alzahrani, Beh and Stojanovski [3] for other values of δ . Such regions have not yet been developed for studying departures from perfect symmetry and so we shall leave this for future study.

4. Example 1: Artificial Data

4.1. The Data

To examine how the Cressie-Read family of divergence statistics can be used for the purposes of applying correspondence analysis to visually assess departures from perfect symmetry we consider the artificial data set given in Table 1. Beh and Lombardo [10] used this contingency table to highlight the features obtained when using Bowker’s statistic and so we shall focus on showing the features of the correspondence analysis using (8) for δ = 1 , 1 / 2 and 0. Thus the numerical foundations of this variant of correspondence analysis uses the modified log-likelihood ratio statistic, M S 2 , the Freeman-Tukey statistic T S 2 and Pearson’s chi-squared statistic X S 2 . In Table 1 the 2 , 1 th cell frequency is 20 + C where C 20 is a constant; note that Beh and Lombardo [10] considered the case where C 0 for their analysis of Table 1. When C = 0 then the variables of Table 1 exhibit perfect symmetry and as C the departure from perfect symmetry between the variables becomes more apparent. The sample size of Table 1 is n = 680 + C .

4.2. The Family of Divergence Statistics

Since there exists perfect symmetry in all but two cells of Table 1 we only need to confine ourselves to examining the difference between the 2 , 1 th and 1 , 2 th cells. Of course, when there is perfect symmetry then s 12 δ = s 21 δ and this happens only when C = 0 . We shall examine the changes in these two values as C and δ change. Therefore, to assess the departure from perfect symmetry in Table 1 we shall do so by comparing
n · s 12 δ = 1 δ 40 + C 2 40 40 + C δ 1
and
n · s 21 δ = 1 δ 40 + C 2 40 + 2 C 40 + C δ 1
for δ 0 , otherwise
n · s 12 0 = 40 + C 2 ln 40 40 + C
and
n · s 21 0 = 40 + C 2 ln 40 + 2 C 40 + C
using the Box-Cox transformation.
Therefore, when assessing departures from perfect symmetry for the data in Table 1 the Cressie-Read family of divergence statistics can be expressed in terms of C 20 and δ , so that
CR * δ = n s 12 2 δ + s 21 2 δ = 1 δ 2 40 + C 2 40 40 + C δ 1 2 + 40 + 2 C 40 + C δ 1 2 .
We can immediately see that this family of statistics can also be derived by substituting (21) and (22) into (18) (since M = 2 ) yielding the equivalent expression
CR * δ = n λ 1 2 δ + λ 2 2 δ .
Substituting δ = 1 into (27) for Table 1 of
X S 2 = CR * 1 = C 2 40 + C
which is Bowker’s statistic derived by eq. (21) Beh and Lombardo [10]. Similarly, the Freeman-Tukey statistic, (10), and modified log-likelihood ratio statistic, (11), can be written in terms of C so that
T S 2 = CR * 1 2 = 8 40 + C 1 2 40 + C 40 + 40 + 2 C M S 2 = CR * 0 = 40 + C ln 40 40 + 2 C 40 + C 2 .
A visual representation of X S 2 , T S 2 , and M S 2 versus C 20 , 100 at unitary increments is given in Figure 1; the horizontal line is the quantile of the chi-squared distribution with 6 degrees of freedom for α = 0.05 so that χ 0.95 2 6 = 12.5916 . Figure 1 shows that all three statistics are quite similar, especially for values of C 15 , 50 . Note that when C = 0 these three statistics are all zero showing there is perfect symmetry in Table 1. When C 20 , 0 all three statistics decrease to zero and then increase for C > 0 . Therefore, there is a minimum value of C that will lead to the rejection of the null hypothesis of perfect symmetry. We now investigate what this value of C is for X S 2 , T S 2 and M S 2 .

4.3. On the Departure from Perfect Symmetry

Beh and Lombardo [10] show that for Bowker’s statistic, CR * 1 , there is a statistically significant departure from perfect symmetry at the α level of significance, when
C > χ α 2 6 + χ α 2 6 2 + 160 χ α 2 6 2 ,
where χ α 2 6 is 1 α quantile of the chi-squared distribution with S S 1 / 2 = 4 4 1 / 2 = 6 degrees of freedom. For example, the minimum value of C when α = 0.05 is 29.59, or 30 when rounded up to an integer value. Hence, the 2 , 1 th cell frequency must be at least 50 to detect any departure from perfect symmetry when performing the test at the 0.05 level of significance using Bowker’s statistic.
There is also a second solution to C that leads to a rejection of the null hypothesis of perfect symmetry. This is when
20 C < χ α 2 6 χ α 2 6 2 + 160 χ α 2 6 2
yeilding an upper bound of this interval of C = 17.01 . Thus, there is a rejection of the null hypothesis of perfect symmetry when the 2 , 1 th cell frequency is less than 2.99, or 2 when rounding down to an integer value.
Figure 1 shows fairly similar values of C (and n 11 ) are required when assessing the test of perfect symmetry using T S 2 and M S 2 . Although obtaining a simple expression to determine these cell counts, like (28) and (29) do for Bowker’s statistic, is not straightforward. However, numerical methods show that the values of C that produce a statistically significant T S 2 are when C < 15.99 and C > 28.50 , for α = 0.05 and for 6 degrees of freedom. So, when using T S 2 , the values of the 2 , 1 th cell frequency that ensure that the null hypothesis of perfect symmetry is rejected at the 5% level of significance is n 11 49 and 0 n 11 4 .
Similarly, numerical methods show that when using M S 2 the value of C that rejects the null hypothesis of perfect symmetry is C < 15.52 and C > 27.94 . Therefore, the values of the 2 , 1 th cell frequency that ensure that the null hypothesis of perfect symmetry is rejected at the 5% level of significance is n 11 48 and 0 n 11 4 .
To keep any further analysis of Table 1 simple we shall now confine our attention to values of C 0 like Beh and Lombardo [10] did for their analysis of the table.

4.4. Features of Correspondence Analysis & Symmetry

4.4.1. The Matrix of Divergence Residuals

We can derive the matrix of divergence residuals, S δ for Table 1. Based on (25) and (26) this matrix is
S δ = 1 δ 40 + C 2 680 + C 0 40 40 + C δ 1 0 0 40 + 2 C 40 + C δ 1 0 0 0 0 0 0 0 0 0 0 0
when δ 0 . For example, when δ = 1 then
S 1 = C 2 680 + C 40 + C 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0
which is a skew-symmetric matrix since s i j 1 s j i 1 , i , j = 1 , , 4 . When δ = 0 and δ = 1 / 2 , then the matrix of divergence residuals is
S 0 = 40 + C 2 680 + C 0 ln 40 40 + C 0 0 ln 40 + 2 C 40 + C 0 0 0 0 0 0 0 0 0 0 0
and
S 1 / 2 = 2 680 + C 0 40 40 + C 0 0 40 + 2 C 40 + C 0 0 0 0 0 0 0 0 0 0 0 ,
respectively. These two matrices are not skew-symmetric matrix since s 12 δ s 21 δ for δ = 0 and 1/2 unless C = 0 . In this case there is perfect symmetry in Table 1 so that s 12 δ = s 21 δ = 0 .

4.4.2. The Singular Values

The structure of the 4 × 4 matrix S δ given by (30) is identical to the 2 × 2 matrix obtained by removing the zero rows and columns of the matrix. Appendix A Beh and Lombardo [10] derived the two singular values of S 1 for Table 1 and showed them to be
λ 1 1 = λ 2 1 = C 2 680 + C 40 + C
for C > 0 and are both zero when C = 0 . When δ 1 the two singular values are not equivalent since, for these δ values, S δ is not a skew-symmetric matrix. Although, the two singular values will be approximately equivalent when δ 1 . Adjusting the derivation of Appendix A Beh and Lombardo [10] for δ 0 and C 0 , the two singular values of Table 1 are
λ 1 δ = s 12 δ n = 1 δ 40 + C 2 680 + C 40 40 + C δ 1
λ 2 δ = s 21 δ n = 1 δ 40 + C 2 680 + C 40 + 2 C 40 + C δ 1
so that λ 1 δ λ 2 δ when C 0 . Therefore, while the 4 × 4 diagonal matrix of eigen-values consists of zero values for the 3 , 3 th and 4 , 4 th elements, the 2 × 2 matrix of non-zero eigen-values is
Δ δ = 40 + C 1 / 2 δ δ 2 680 + C 40 + C δ 40 δ 0 0 40 + 2 C δ 40 + C δ
so that both singular values remain positive when C > 0 . For example, when δ = 1 these two singular values simplify to (31). Similarly, when δ = 1 / 2 ,
λ 1 1 2 = s 21 1 / 2 n = 2 680 + C 40 + C 40 λ 2 1 2 = s 21 1 / 2 n = 2 680 + C 40 + 2 C 40 + C .
When δ = 0 , then applying the Box-Cox transformation to (32) and (33) yields, for C 0 , the two singular values
λ 1 0 = s 12 0 n = 40 + C 2 680 + C ln 40 40 + C λ 2 0 = s 21 0 n = 40 + C 2 680 + C ln 40 + 2 C 40 + C .
Figure 2 displays λ 1 δ versus C 0 , 100 for δ = 0 , 1 / 2 and 1, while Figure 3 shows λ 2 δ versus C 0 , 100 ; the vertical axis of both figures are identically scaled to enable an easy comparison of the two singular values. These two figures show that, for all values of C, λ 1 1 = λ 2 1 as expected since S 1 is a skew-symmetric matrix. It also shows that λ 1 δ = λ 2 δ = 0 when C = 0 . A comparison of Figure 2 and Figure 3 shows that λ 1 δ > λ 2 δ for C > 0 .
Figure 2 shows that, for λ 1 δ , as δ moves from 0 to 1 the singular value decreases in magnitude for all C. However, the values of λ 2 δ increases as δ goes from 0 to 1, although any difference between values of λ 2 δ for a given C > 0 is not as large as the differences observed between the λ 1 δ values.
Suppose we define λ Diff δ = λ 1 δ λ 2 δ so that, from (34),
λ Diff δ = 40 + C 1 / 2 δ δ 2 680 + C 2 40 + C δ 40 + 2 C δ 40 δ 0
for all values of δ ; note that this difference is zero when δ = 1 . This difference is also zero when C = 0 irrespective of the choice of δ . A plot of this difference versus C 0 , 100 is given in Figure 4. It confirms that λ 1 1 = λ 2 1 while the difference between the two singular values is at its largest when δ = 0 . Thus, LRA will produce a more heavily dominant first dimension than its second dimension when compared with the correspondence analysis approach of Beh and Lombardo [10]. In fact, Figure 2 and Figure 3 show that when δ = 0 the first singular value will be larger than the first singular value when performing HDD and correspondence analysis to assess departures from perfect symmetry. Therefore, the first dimension of an LRA will always account for a larger proportion of any departure from perfect symmetry than HDD and correspondence analysis.

4.4.3. Principal Coordinates

To derive the row and principal coordinates, (23) and (24), we first need to determine D ˜ . p. 11 Beh and Lombardo [10] showed that for Table 1
D ˜ = 200 + C 2 680 + C 0 0 0 0 400 + C 2 680 + C 0 0 0 0 150 680 + C 0 0 0 0 230 680 + C
so that
D ˜ 1 / 2 = 680 + C 2 200 + C 0 0 0 0 2 400 + C 0 0 0 0 1 150 0 0 0 0 1 230 .
We also have the matrix of left and right singular vectors which are
A δ = 1 0 0 1 0 0 0 0 and B δ = 0 1 1 0 0 0 0 0
when δ 0 , 1 and, when δ 1 ,
A δ = 0 1 1 0 0 0 0 0 and B δ = 1 0 0 1 0 0 0 0 .
When δ 0 , 1 then, using (34), (35), and A δ from (36), the elements of the matrix of row principal coordinates, (23), can be expressed in terms of δ and C so that
F δ = 40 + C 1 / 2 δ δ 40 + C δ 40 δ 200 + C 0 0 40 + 2 C δ 40 + C δ 400 + C 0 0 0 0 .
Therefore, changing δ and C does not influence the position of the principal coordinates of the third and fourth rows of Table 1. This make sense since there is perfect symmetry for these two rows and so that their position in the correspondence plot is at the origin. Note that for δ 0 , 1 , the 1 , 1 th and 2 , 2 t h elements of F δ , denoted by f 11 C , δ and f 22 C , δ , respectively, are both negative for C > 0 . The link between them is
f 11 C , δ f 22 C , δ = K C , δ 400 + C 200 + C > 400 + C 200 + C
where
K C , δ = 1 40 40 + C δ 40 + 2 C 40 + C δ 1 > 1
for δ 0 , 1 and C > 0 . Thus, the magnitude of f 11 C , δ will always be at least 400 + C / 200 + C times larger than the magnitude of f 22 C , δ for all δ 0 , 1 . For example, when C = 50 in Table 1, the lower bound of this ratio is 450 / 250 = 3 / 5 = 1.3416 and this will occur as δ 1 . Therefore, when C > 0 , f 11 C , δ will always lie at least 1.3416 times further from the origin than f 22 C , δ . Thus row 1 of Table 1 will contribute more to any departure from perfect symmetry than row 2, irrespective of the choice of δ . When δ = 1 then K C , 1 = 1 so that the row principal coordinates of eq. (23) Beh and Lombardo [10] are derived. Also, when δ = 1 , the link between f 12 C , 1 and f 21 C , 1 can be established using (37) instead of (36) and is
f 12 C , 1 f 21 C , 1 = 400 + C 200 + C .
This is identical to the ratio derived by Section 6.3 Beh and Lombardo [10] when using Bowker’s statistic to assess departures from perfect symmetry.
We can also obtain similar expressions for the column principal coordinates. Substituting (34), and (35), and B δ from (36), into (24) leaves us with
G δ = 40 + C 1 / 2 δ δ 0 40 + 2 C δ 40 + C δ 200 + C 40 + C δ 40 δ 400 + C 0 0 0 0 0 .
Note that the choice of δ and C does not influence the position of the third and fourth columns in the two-dimensional correspondence plot, where they lie at the origin. This makes sense since these columns of Table 1 are perfectly symmetrical with the third and fourth rows of the contingency table. Something else to note is that the 1 , 2 th element of G δ , denoted by g 12 δ is negative for δ 0 , 1 . Also, the 2 , 1 th element of G δ , denoted by g 21 δ is always positive for these values of δ . Therefore, the ratio of these two coordinates is always negative and is
g 12 C , δ g 21 C , δ = 1 K C , δ 400 + C 200 + C .
Therefore,
K C , δ = f 11 C , δ f 22 C , δ · g 21 C , δ g 12 C , δ
and shows that the relationship between the first and second row and column principal coordinates remains constant for some given value of C and δ .

4.5. The Correspondence Plots

Figure 5 gives the correspondence plot of Table 1 for δ = 1 , 1 / 2 and 0; these are constructed with X S 2 , T S 2 and M S 2 , respectively, as their numerical foundation with C = 50 .
Suppose we consider first the correspondence plot (Figure 5a) which can also be obtained using the technique outlined in Beh and Lombardo [10]. It shows that R3, R4, C3 and C4 are located at the origin. This should not be surprising for two related reasons: (1) there is perfect symmetry between R3 and C3, and between R4 and C4, and (2) these rows and columns are not influenced by the magnitude of C. Thus, these four categories of Table 1 play no part in determining the magnitude of Bowker’s statistic. Instead, X B 2 is influenced solely by the row categories R1 and R2, and the column categories C1 and C2, since C impacts on the symmetry (or lack thereof) of the 1 , 2 th and 2 , 1 th cell frequencies of Table 1. However, there is a noticeable difference in the position of R1 and C1 showing that there is a large departure from perfect symmetry between these categories; a feature present because C = 50 . Similarly, R2 and C2 are situated at quite a distance from each other showing the influence of C on their position in the correspondence plot. However, since this distance appears shorter than between R1 and C1 this shows the influence of C impacts more on the symmetry between R1 and C1 than it does on the symmetry between R2 and C2.
The configuration of points in Figure 5b,c are quite similar, although appear quite different when compared with the configuration of points in Figure 5a. However, since λ 1 1 = λ 2 1 then the configuration of points in Figure 5a remains unchanged if it is rotated clockwise 90 degrees and reflected along the first dimension. Doing so produces a configuration of points that is comparable to Figure 5b,c and, since M = 2 for our three values of δ , the three correspondence plots in Figure 5 depict all of the departures that exist from perfect symmetry. The only noticeable difference between the three plots is the percentage of the total inertia accounted for by the two dimensions. While all three plots display 100% of the departures from perfect symmetry (and are therefore excellent visual depictions) the first dimension is very much the most dominant when δ = 0 , accounting for 77.1% of M S 2 , while 64.5% of T S 2 is accounted for along this dimension when δ = 1 / 2 . This confirms the findings in our discussion of Figure 4.

5. Example 2: Pre- and Post-Courtship Behaviour of Bitterlings

5.1. The Data

We now move away from the analysis in Section 4 of the artificial contingency table and turn our attention to a more practical application. Consider Table 2 where S = 12 that originally comes from the extensive study of Table II Wiepkema [33]. The data concerns the pre- and post-courtship behaviour of male bitterlings (Rhodeus amarus Bloch), a small European fish where the behaviour is classified according to 12 traits. Here we use the (pre/POST)-courtship labelling convention that is an adaptation of the one used by Wiepkema [33] and van der Heijden [31]: jerking (jk/JK), turning beats (tu/TU), head butting (hb/HB), chasing (cs/CS), fleeing (fl/FL), quivering (qu/QU), leading (le/LE), head-down posture (hd/HD), skimming (sk/SK), snapping (sn/SN), chafing (cf/CF) and fin-flickering (ff/FF).
Table 2 was the subject of a classical correspondence analysis performed by van der Heijden [31] where departures from complete independence were assessed. Given the symmetric nature of the variables, we shall now perform a correspondence analysis using (8) to assess any departures from perfect symmetry that may exist in the data.

5.2. Test of the Departure from Perfect Symmetry

Of the 144 cells in Table 2 there are 22 zero cell frequencies (or 15.3% of the cells). The affect of this is that there are 16 values of p i j + p j i that are zero which means that (8) involves 16 instances where a division by zero occurs. To overcome this problem 0.01 has been added to each cell of the contingency table. Doing this leads to Bowker’s statistic, (9), of 277.801 while (10) and (11) are 333.9 and 671.0, respectively. With 12 12 1 / 2 = 66 degrees of freedom, these three statistics have a p-value that is less that 0.0001. Therefore, there is enough evidence in Table 2 to conclude that there is a statistically significant departure from perfect symmetry. That is, there is at least one of the 12 pairings of the pre- and post-courtship behaviour that is statistically different.

5.3. On the Divergence Residuals

One may evaluate where these departures from perfect symmetry lie by observing the elements of S δ . Table 3 gives these residuals for δ = 1 , 1 / 2 and 0. Note that since all diagonal elements of S δ are zero they have been omitted from Table 3. Those residuals designated “ < 0.001 ” are residuals lying within the interval ± 0.0000001 , 0.0001 .
The largest (negative and positive) divergence residuals for our three values of δ appear in bolded text in Table 3. We can see that the largest positive residuals are for the pre- and post-courtship pairs (hd, LE), (sk, HD) and (qu, SK). These combinations reflect that there are more observations in these cells that what would be expected if there were perfect symmetry between the variables. For example, for (LE, hd) the observed cell count is 167 while the expected number of observations under perfect symmetry is 167 + 7 / 2 = 87 . On the other hand, the largest negative residuals are for the pairs (QU, sk), (HD, le) and (SK, hd) and reflect those cells where the observed cell count is smaller than what is expected under perfect symmetry. This can be see with the (HD, le) pairing where the observed cell count is 7 and the expected number of observations under perfect symmetry is 87 (as we showed above). Therefore, these three pairs of pre- and post-courtship behaviour are the reverse of those pairings with a large positive divergence residual.
Table 3 also shows that s i j 1 = s j i 1 for all i , j = 1 , 2 , , 12 . For our other two values of δ there is either perfect or near perfect symmetry since the i , j th and j , i th divergence residuals are of the same or similar magnitude (differing only in their sign). Although, there are clear differences in magnitude of some of these residuals. For example s 87 0 = 0.132 (corresponding to the (HD/le) pair) while s 78 0 = 0.046 (corresponding to the (LE/hd) pair). Comparing these divergence residuals shows that the negative interaction between HD and le is about three times greater than the positive interaction between hd and LE.
The similarities, and differences, in these divergence residuals can be visualised using the correspondence plot. We now turn our attention to the correspondence plot of Table 2 when δ = 1 , 1 / 2 and 0.

5.4. Visualising the Departures from Perfect Symmetry

To visualise where the departures from perfect symmetry exist we construct the correspondence plot using the principal coordinates of (23) and (24) for δ = 1 , 1 / 2 and 0. These plots are given in Figure 6 where departures from perfect symmetry are assessed using the statistics, (9) (11) and (10) for δ = 1 , 1 / 2 and 0, respectively. These three correspondence plots provide an excellent visual depiction of departures from perfect symmetry in Table 2 since they all account for about 84% of the total inertia calculated using X S 2 , T S 2 and M S 2 .
The first thing to note about the configuration of points in the three correspondence plots of Figure 6 is that there is a large cluster of points that lie close to the origin. In fact, most of the categories of Table 2 lie at, or near, the origin with only a few categories that lie at a distance from the origin. Therefore, the three plots of Figure 6 show that most of the categories of Table 2 are fairly consistent with what is expected under perfect symmetry. Note that we are not saying here that all of the categories located in close proximity to the origin are perfectly symmetric. This can be achieved by determining the 100 1 α % confidence region for each category (for some level of significance, α ) which is beyond the scope of this paper. Although, when assessing departures from complete independence, such regions were recently developed by Alzahrani, Beh and Stojanovski [3] and are based on those described in Beh [7] and Beh and Lombardo [9].
We now turn our attention to those categories that are located relatively far from the origin. The three plots of Figure 6 show these to be le/LE (pre- and post-courtship leading), sk/SK (pre- and post-courtship skimming), and hd/HD (pre- and post-courtship head-down posture). Therefore it is these three behaviours that deviate the most from what would be expected if there were perfect symmetry in Table 2, and are the dominant source for why the p-value of (9), (10) and (11) is very small. Interestingly, these are three of the four behaviours that Wiepkema [33, p. 131] and van der Heijden [31, p. 56] note as being the sexual factors that underlay bitterling courtship behaviour. The fourth trait they identified was quivering (qu/QU). Note that there is a relatively large negative divergence residual between sk and QU in Table 3 which suggests that a pre-courtship skimming behaviour is unlikely to lead to a quivering post-courtship behaviour. While QU lies relatively close to the origin for all four values of δ , it does lie at a distance from sk. However, there are many other post-courtship behaviours that lie close to the origin of their correspondence plot and hence at a distance from sk and have a relatively small divergence residual. So is there really an under-count of pre-courtship skimming behaviour and post-courtship quivering? While adding a third dimension does not add a great deal to our visual display of the departures from perfect symmetry, they do show, for our three δ values, that the proximity of sk from the origin is matched by the proximity that le and/or hd (depending on the choice of δ ). Therefore, the third dimension does add additional context to the differences highlighted in Table 3 between sk and QU.
Suppose we now discuss other courtship behaviours and pairs that are located relatively far from each other. The first thing to point out here is that for our three values of δ , LE, HD and SK are all located in different parts of their correspondence plot. This suggests that the post-courtship behaviours of leading, head-down posture and skimming all contribute differently to the lack of perfect symmetry in Table 2. So too are their pre-courtship behaviours le, hd and sk. Interestingly, each of these three pre-courtship behaviours is not followed by their post-courtship behaviour. That is, for example, a pre-courtship display of leading is not followed by a post-courtship display of leading. In fact, Figure 6 shows that the differences between these three courtship behaviours is quite consistent.
While there are differences in pre- and post-courtship behaviours there are also some clearly defined pairings that can be identified by observing where departures from perfect symmetry exist. These are for the pairings of (hd, LE), (sk, HD) when δ = 1 and 1/2; recall that the divergence residuals for these pairs in Table 3 is relatively large and positive. This suggests that when assessing the departures from independence using the statistics (9) and (10), a pre-courtship display of head-down posture is followed by a leading post-courtship display, while a pre-courtship display of skimming is followed by a post-courtship display of head-down posture. Only when δ = 0 does there appear to be quite a difference between sk and HD; in fact, Figure 6a shows that a pre-courtship display of skimming is equally likely to lead to a post-courtship behaviour of head-down posture and skimming, although the link between the (sk, SK) and (sk, HD) pairs is not strong when δ = 0 .

6. Discussion

When numerically assessing departures from perfect symmetry one need not be confined to Bowker’s statistic [15], defined here by (9). There are a range of alternative statistics that can be considered and have been available for many decades; here we have focused our attention on the Freeman-Tukey statistic, T S 2 , and the modified log-likelihood ratio statistic M S 2 . These statistics are special cases of the Cressie-Read family of divergence statistics, defined by (2), as well as the second order Taylor series approximation of this family; see (3).
This paper has demonstrated how (3) can be used as the numerical foundations for performing a correspondence analysis to visualise departures from perfect symmetry. A special case of this family is when δ = 1 leading to the correspondence analysis technique recently described by Beh and Lombardo [10]. While we have discussed that any value of δ can be considered when performing this analysis, there are advantages in considering δ = 1 , δ = 1 / 2 and δ = 0 . With such flexibility in the choice of δ , one may well ask what is the most appropriate choice of δ to use? We discussed this issue when showing the links between (4) and correspondence analysis when assessing departures from complete independence; see Beh and Lombardo [11]. We described in that paper that the choice of δ may depend on many factors, including “the structure of the data, the output that is generated from the analysis or the ease and interpretability that a value of δ provides” (p. 38). However, there are other factors that may impact on the choice of δ . As the applications have shown, one may wish to choose the value of δ that yields the greatest percentage of the total inertia in a two-dimensional, say, correspondence plot; this depends greatly on the data structure that is being assessed for departures from perfect symmetry. One may consider δ = 1 to be an ideal choice for numerous reasons including (1) it leads to the more traditional correspondence analysis (2) the total inertia is measured using the well known and well understood Bowker’s statistic, and (3) the first two dimensions will account for the same percentage of the total inertia. This third reason also means that the analyst is provided with flexibilities to rotate and/or reflect the configuration of points around either dimension without affecting the general interpretability of the configuration. As the application to Table 2 also shows, of the three values we considered, δ = 1 also leads to the greatest percentage of the total inertia being visualised.
The next step in the evolution of this method of correspondence analysis is to derive the confidence regions alluded to Section 3.6 for visualising those categories that are statistically significant contributors to the global measure of the departure from perfect symmetry. Such regions expand upon those describe by Alzahrani, Beh and Stojanovski [3] and complement the correspondence analysis framework developed by Beh and Lombardo [11]. We shall leave this, and other further developments of the method of correspondence analysis outlined in this paper, for future work.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data in Table 1 is artificial and comes from Beh and Lombardo [10]. The data in Table 2 is from Wiepkema [33] and also appears in van der Heijden [31].

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Agresti, A. Categorical Data Analysis, 3rd ed.; Wiley: New York, NY, USA, 2013. [Google Scholar]
  2. Altun, G.; Saraçbaşi, T. Determination of model fitting with power-divergence-type measure of departure from symmetry for sparse and non-sparse contingency tables. Communications in Statistics – Simulation and Computation 2022, 51, 4087–4111. [Google Scholar] [CrossRef]
  3. Alzahrani, A.; Beh, E.J.; Stojanovski, E. Confidence regions for simple correspondence analysis using the Cressie-Read family of divergence statistics. Electronic Journal of Applied Statistical Analysis 2023, 16, 423–448. [Google Scholar]
  4. Anderson, E.B. The Statistical Analysis of Categorical Data; Springer: Berlin/Heidelberg, Germany, 1991. [Google Scholar]
  5. Ando, S.; Hoshi, H.; Ishii, A.; Tomizawa, S. A generalized two-dimensional index to measure the degree of deviation from double symmetry in square contingency tables. Symmetry 2021, 13, 2067, (10 pages). [Google Scholar] [CrossRef]
  6. Beh, E.J. Simple correspondence analysis: A bibliographic review. International Statistical Review 2004, 72, 257–284. [Google Scholar] [CrossRef]
  7. Beh, E.J. Elliptical confidence regions for simple correspondence analysis. Journal of Statistical Planning and Inference 2010, 140, 2582–2588. [Google Scholar] [CrossRef]
  8. Beh, E.J.; Lombardo, R. Correspondence Analysis: Theory, Practice and New Strategies; Wiley: Chichester, UK, 2014. [Google Scholar]
  9. Beh, E.J.; Lombardo, R. Confidence regions and approximate p-values for classical and non symmetric correspondence analysis. Communications in Statistics - Theory and Methods 2015, 44, 95–114. [Google Scholar] [CrossRef]
  10. Beh, E.J.; Lombardo, R. Visualising departures from symmetry and Bowker’s X2 statistic. Symmetry 2022, 14, 1103, (25 pages). [Google Scholar] [CrossRef]
  11. Beh, E.J.; Lombardo, R. Correspondence analysis and the Cressie-Read family of divergence statistics. International Statistical Review 2024, 92, 17–42. [Google Scholar] [CrossRef]
  12. Beh, E.J.; Lombardo, R.; Alberti, G. Correspondence analysis and the Freeman-Tukey statistic: A study of archaeological data. Computational Statistics and Data Analysis 2018, 128, 73–86. [Google Scholar] [CrossRef]
  13. Bishop. Y.M.; Fienberg, S.E.; Holland, P.W. Discrete Multivariate Analysis: Theory and Practice; MIT Press: Cambridge, MA, USA, 1975. [Google Scholar]
  14. Bove, G. Asymmetric multidimensional scaling and correspondence analysis for square tables. Statistica Applicata 1992, 4, 587–574. [Google Scholar]
  15. Bowker, A.H. A test for symmetry in contingency tables. Journal of the American Statistical Association 1948, 43, 572–598. [Google Scholar] [CrossRef] [PubMed]
  16. Constantine, A.G.; Gower, J.C. Graphical representation of asymmetry. Applied Statistics 1978, 27, 297–304. [Google Scholar] [CrossRef]
  17. Cressie, N.A.C.; Read, T.R.C. Multinomial goodness-of-fit tests. Journal of the Royal Statistical Society (Series B, Methodological) 1984, 46, 440–464. [Google Scholar] [CrossRef]
  18. Cuadras, C.M.; Cuadras, D. A parametric approach to correspondence analysis. Linear Algebra and its Applications 2006, 417, 64–74. [Google Scholar] [CrossRef]
  19. Cuadras, C.M.; Cuadras, D.; Greenacre, M.J. A comparison of different methods of representing categorical data. Communications in Statistics – Simulation and Computation 2006, 35, 447–459. [Google Scholar] [CrossRef]
  20. Freeman, M.F.; Tukey, J.W. Transformations related to the angular and square root. The Annals of Mathematical Statistics 1950, 21, 607–611. [Google Scholar] [CrossRef]
  21. Greenacre, M.J. Theory and Applications of Correspondence Analysis; Academic Press: London, UK, 1984. [Google Scholar]
  22. Greenacre, M. Correspondence analysis of square asymmetric matrices. Journal of the Royal Statistical Society (Series C, Applied Statistics) 2000, 49, 297–310. [Google Scholar] [CrossRef]
  23. Greenacre, M. Power transformations in correspondence analysis. Computational Statistics and Data Analysis 2009, 53, 3107–3116. [Google Scholar] [CrossRef]
  24. Greenacre, M. Log-ratio analysis is a limiting case of correspondence analysis. Mathematical Geosciences 2010, 42, 129–134. [Google Scholar] [CrossRef]
  25. Gower, J.C. The analysis of asymmetry and orthogonality. In Recent Developments in Statistics; Barra, J.R., Brodeau, F., Romer, G., van Cutsem, B., Eds.; North-Holland: Amsterdam, The Netherlands, 1977; pp. 109–123. [Google Scholar]
  26. Haberman, S.J. Analysis of Qualitative Data, Volume 2: New Developments; Academic Press: New York, 1979. [Google Scholar]
  27. Ireland, C.T.; Ku, H.H.; Kullback, S. Symmetry and marginal homogeneity of an r×r contingency table. Journal of the American Statistical Association 1969, 64, 1323–1341. [Google Scholar] [CrossRef]
  28. Lebart, L.; Morineau, A.; Warwick, K.M. Multivariate Descriptive Statistical Analysis: Correspondence Analysis and Related Techniques for Large Matrices; Wiley: New York, 1984. [Google Scholar]
  29. Neyman, J. Contributions to the theory of the χ2 test. In Proceedings of the Berkeley Symposium on Mathematical Statistics and Probability; Neyman, J. Ed.; Statistical Laboratory of the University of California, Berkeley, 1949; pp. 239 -– 273.
  30. Tomizawa, S.; Seo, T.; Yamamoto, H. Power-divergence-type measure of departure from symmetry for square contingency tables that have nominal categories. Journal of Applied Statistics 1998, 25, 387–398. [Google Scholar] [CrossRef]
  31. van der Heijden, P.G.M.; de Vries, H.; van Hooff, J.A.R.A.M. Correspondence analysis of transition matrices, with special attention to missing entries and asymmetry. Animal Behavior 1990, 40, 49–64. [Google Scholar] [CrossRef]
  32. Ward, R.C.; Gray, L.J. Eigensystem computation for skew-symmetric matrices and a class of symmetric matrices. ACM Transactions on Mathematical Software 1978, 4, 278–285. [Google Scholar] [CrossRef]
  33. Wiepkema, P.R. An ethological analysis of the reproductive behaviour of the bitterling (Rhodeus amarus Bloch). Archives Néerlandaises de Zoologie 1961, 14, 103–199. [Google Scholar] [CrossRef]
Figure 1. X S 2 , T S 2 and M S 2 versus C 20 , 100 at unitary increments for Table 1.
Figure 1. X S 2 , T S 2 and M S 2 versus C 20 , 100 at unitary increments for Table 1.
Preprints 108423 g001
Figure 2. λ 1 δ versus C 0 , 100 for Table 1; δ = 1 , 1 / 2 and 0
Figure 2. λ 1 δ versus C 0 , 100 for Table 1; δ = 1 , 1 / 2 and 0
Preprints 108423 g002
Figure 3. λ 2 δ versus C 0 , 100 for Table 1; δ = 1 , 1 / 2 and 0
Figure 3. λ 2 δ versus C 0 , 100 for Table 1; δ = 1 , 1 / 2 and 0
Preprints 108423 g003
Figure 4. λ Diff δ versus C 0 for Table 1; δ = 1 , 1 / 2 and 0
Figure 4. λ Diff δ versus C 0 for Table 1; δ = 1 , 1 / 2 and 0
Preprints 108423 g004
Figure 5. Correspondence plot for Table 1 with C = 50 where (a) δ = 1 , (b) δ = 1 / 2 and (c) δ = 0
Figure 5. Correspondence plot for Table 1 with C = 50 where (a) δ = 1 , (b) δ = 1 / 2 and (c) δ = 0
Preprints 108423 g005
Figure 6. Correspondence plot for Table 2 where (a) δ = 0 , (b) δ = 1 / 2 and (c) δ = 1
Figure 6. Correspondence plot for Table 2 where (a) δ = 0 , (b) δ = 1 / 2 and (c) δ = 1
Preprints 108423 g006
Table 1. A near-symmetric artificial contingency table where C is a non-negative integer.
Table 1. A near-symmetric artificial contingency table where C is a non-negative integer.
Columns
Rows C1 C2 C3 C4 Total
R1 10 20 30 40 100
R2 20 + C 50 60 70 200 + C
R3 30 60 20 40 150
R4 40 70 40 80 230
Total 100 + C 200 150 230 680 + C
Table 2. The pre- and post-courtship behaviour of bitterlings (Rhodeus amarus Bloch, Source: [33] ).
Table 2. The pre- and post-courtship behaviour of bitterlings (Rhodeus amarus Bloch, Source: [33] ).
Pre-Courtship Behaviour
Post- jk tu hb cs fl qu le hd sk sn cf ff Total
JK 654 128 172 56 27 25 1 28 0 46 14 18 1169
TU 101 132 62 27 5 1 1 11 0 8 5 9 362
HB 171 62 197 130 0 25 0 50 14 18 14 12 693
CS 60 22 152 135 0 8 0 43 16 15 12 4 467
FL 19 2 0 0 419 19 0 2 0 17 5 11 494
QU 36 1 18 5 12 789 119 295 26 70 1 14 1386
LE 4 0 0 0 0 57 167 73 0 8 0 0 309
HD 22 9 40 37 5 245 7 171 287 53 8 13 897
SK 3 2 7 38 0 120 8 134 19 28 4 9 363
SN 42 2 17 16 20 70 11 67 9 225 12 12 503
CF 18 3 10 13 6 5 0 8 0 24 97 9 193
FF 27 3 6 5 10 13 0 18 0 10 8 29 129
Total 1157 366 681 462 504 1377 314 900 371 522 180 131 6965
Table 3. The matrix of divergence residuals, S δ for δ = 1 , 1 / 2 and 0.
Table 3. The matrix of divergence residuals, S δ for δ = 1 , 1 / 2 and 0.
Pre-Courtship Behaviour
Post- δ jk tu hb cs fl qu le hd sk sn cf ff
1 0.015 <0.001 -0.003 0.010 -0.012 -0.011 0.007 -0.015 0.004 -0.006 -0.011
JK 1/2 0.015 <0.001 -0.003 0.010 -0.013 -0.014 0.007 -0.027 0.004 -0.006 -0.012
0 0.014 <0.001 -0.003 0.009 -0.013 -0.017 0.007 -0.074 0.004 -0.006 -0.013
1 -0.015 0 0.006 0.010 0 0.008 0.004 -0.012 0.016 0.006 0.015
TU 1/2 -0.016 0 0.006 0.009 0 0.007 0.004 -0.022 0.014 0.006 0.013
0 -0.016 0 0.006 0.008 0 0.006 0.004 -0.056 0.013 0.005 0.012
1 <0.001 0 -0.011 0 0.009 0 0.009 0.013 0.001 0.007 0.012
HB 1/2 <0.001 0 -0.011 0 0.009 0 0.009 0.012 0.001 0.007 0.011
0 <0.001 0 -0.012 0 0.008 0 0.008 0.011 0.001 0.006 0.010
1 0.003 -0.006 0.011 0 0.007 0 0.006 -0.025 -0.002 -0.002 -0.003
CS 1/2 0.003 -0.006 0.011 0 0.007 0 0.006 -0.029 -0.002 -0.002 -0.003
0 0.003 -0.006 0.011 0 0.006 0 0.005 -0.033 -0.002 -0.002 -0.003
1 -0.010 -0.010 0 0 0.011 0 -0.010 0 -0.004 -0.003 0.002
FL 1/2 -0.010 -0.011 0 0 0.010 0 -0.011 0 -0.004 -0.003 0.002
0 -0.011 -0.013 0 0 0.010 0 -0.013 0 -0.004 -0.003 0.002
1 0.012 0 -0.009 -0.007 -0.011 0.040 0.018 -0.066 0 -0.014 0.002
QU 1/2 0.011 0 -0.009 -0.008 -0.011 0.037 0.018 -0.083 0 -0.017 0.002
0 0.011 0 -0.010 -0.008 -0.012 0.034 0.017 -0.106 0 -0.023 0.002
1 0.011 -0.008 0 0 0 -0.040 0.063 -0.024 -0.006 0 0
LE 1/2 0.010 -0.015 0 0 0 -0.044 0.053 -0.046 -0.006 0 0
0 0.009 -0.034 0 0 0 -0.049 0.046 -0.144 -0.006 0 0
1 -0.007 -0.004 -0.009 -0.006 0.010 -0.018 -0.063 0.063 -0.011 0 -0.008
HD 1/2 -0.007 -0.004 -0.009 -0.006 0.009 -0.019 -0.088 0.058 -0.011 0 -0.008
0 -0.008 -0.004 -0.009 -0.006 0.008 -0.019 -0.132 0.054 -0.012 0 -0.008
1 0.015 0.012 -0.013 0.025 0 0.066 0.024 -0.063 0.026 0.017 0
SK 1/2 0.012 0.010 -0.014 0.023 0 0.058 0.020 -0.070 0.024 0.014 0
0 0.010 0.008 -0.016 0.021 0 0.051 0.017 -0.079 0.021 0.012 0
1 -0.004 -0.016 -0.001 0.002 0.004 0 0.006 0.011 -0.026 -0.017 0.004
SN 1/2 -0.004 -0.020 -0.001 0.002 0.004 0 0.006 0.011 -0.031 -0.019 0.004
0 -0.004 -0.024 -0.001 0.001 0.004 0 0.005 0.010 -0.037 -0.021 0.003
1 0.006 -0.006 -0.007 0.002 0.003 0.014 0 0 -0.017 0.017 0.002
CF 1/2 0.006 -0.006 -0.007 0.002 0.002 0.012 0 0 -0.032 0.016 0.002
0 0.006 -0.007 -0.008 0.002 0.002 0.011 0 0 -0.090 0.015 0.002
1 0.011 -0.015 -0.012 0.003 -0.002 -0.002 0 0.008 0 -0.004 -0.002
FF 1/2 0.011 -0.017 -0.013 0.003 -0.002 -0.002 0 0.007 0 -0.004 -0.002
0 0.010 -0.020 -0.015 0.003 -0.002 -0.002 0 0.007 0 -0.004 -0.002
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Downloads

81

Views

31

Comments

0

Subscription

Notify me about updates to this article or when a peer-reviewed version is published.

Email

Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2025 MDPI (Basel, Switzerland) unless otherwise stated