2-EXPTIME

In mathematics and multivariate statistics, the centering matrix^[1] is a symmetric and idempotent matrix, which when multiplied with a vector has the same effect as subtracting the mean of the components of the vector from every component.

Definition

The centering matrix of size n is defined as the n-by-n matrix

C_{n} = I_{n} - \frac{1}{n} 𝕆

where $I_{n}$ is the identity matrix of size n and $𝕆$ is an n-by-n matrix of all 1's. This can also be written as:

C_{n} = I_{n} - \frac{1}{n} 1 1^{⊤}

where $1$ is the column-vector of n ones and where $⊤$ denotes matrix transpose.

For example

C_{1} = [\begin{matrix} 0 \end{matrix}]

,

C_{2} = [\begin{array}{rrr} 1 & 0 \\ 0 & 1 \end{array}] - \frac{1}{2} [\begin{array}{rrr} 1 & 1 \\ 1 & 1 \end{array}] = [\begin{array}{rrr} \frac{1}{2} & - \frac{1}{2} \\ - \frac{1}{2} & \frac{1}{2} \end{array}]

,

C_{3} = [\begin{array}{rrr} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array}] - \frac{1}{3} [\begin{array}{rrr} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{array}] = [\begin{array}{rrr} \frac{2}{3} & - \frac{1}{3} & - \frac{1}{3} \\ - \frac{1}{3} & \frac{2}{3} & - \frac{1}{3} \\ - \frac{1}{3} & - \frac{1}{3} & \frac{2}{3} \end{array}]

Properties

Given a column-vector, $v$ of size n, the centering property of $C_{n}$ can be expressed as

C_{n} v = v - (\frac{1}{n} 1^{'} v) 1

where $\frac{1}{n} 1^{'} v$ is the mean of the components of $v$ .

$C_{n}$ is symmetric positive semi-definite.

$C_{n}$ is idempotent, so that $C_{n}^{k} = C_{n}$ , for $k = 1, 2, \dots$ . Once the mean has been removed, it is zero and removing it again has no effect.

$C_{n}$ is singular. The effects of applying the transformation $C_{n} v$ cannot be reversed.

$C_{n}$ has the eigenvalue 1 of multiplicity n − 1 and eigenvalue 0 of multiplicity 1.

$C_{n}$ has a nullspace of dimension 1, along the vector $1$ .

$C_{n}$ is a projection matrix. That is, $C_{n} v$ is a projection of $v$ onto the (n − 1)-dimensional subspace that is orthogonal to the nullspace $1$ . (This is the subspace of all n-vectors whose components sum to zero.)

Application

Although multiplication by the centering matrix is not a computationally efficient way of removing the mean from a vector, it forms an analytical tool that conveniently and succinctly expresses mean removal. It can be used not only to remove the mean of a single vector, but also of multiple vectors stored in the rows or columns of a matrix. For an m-by-n matrix $X$ , the multiplication $C_{m} X$ removes the means from each of the n columns, while $X C_{n}$ removes the means from each of the m rows.

The centering matrix provides in particular a succinct way to express the scatter matrix, $S = (X - μ 1^{'}) (X - μ 1^{'})^{'}$ of a data sample $X$ , where $μ = \frac{1}{n} X 1$ is the sample mean. The centering matrix allows us to express the scatter matrix more compactly as

S = X C_{n} (X C_{n})^{'} = X C_{n} C_{n} X^{'} = X C_{n} X^{'} .

$C_{n}$ is the covariance matrix of the multinomial distribution, in the special case where the parameters of that distribution are $k = n$ , and $p_{1} = p_{2} = \dots = p_{n} = \frac{1}{n}$ .

References

↑ John I. Marden, Analyzing and Modeling Rank Data, Chapman & Hall, 1995, ISBN 0-412-99521-2, page 59.

[1] John I. Marden, Analyzing and Modeling Rank Data, Chapman & Hall, 1995, ISBN 0-412-99521-2, page 59.

[1]

2-EXPTIME

Contents

Definition

Properties

Application

References

Navigation menu

2-EXPTIME

Definition

Properties

Application

References

Navigation menu

Search