Transform coding can exploit correlation in signals to concentrate its information1 in a subset of transformed elements called coefficients, by decorrelating the input samples [4]. Normally, after the transformation, quantization [2] of the signal is more effective2 when the energy of the signal is accumulated in an small number of coefficients because we can dedicate more bits to encode the more energetic ones.
In general, transform domains require larger dynamic ranges than the original ones.
Both, TC and VQ [3] works exploiting the correlation between samples, although SQ (Scalar Quantization) does not. Therefore, we can expect that the RD performance [1] of a (TC+SQ)-based codec should perform in the RD domain similarly to VQ.
All linear3
transforms can be described as a matrix-vector product [5]
Transforms are used in signal coding to provide relative (between subbands) energy
compaction. The capatility of a transform to achieve this effect can be estimated by
the so called transform coding gain [6, 4] defined by
Some transforms, such as the DCT are applied by 2D blocks which (for example, of
Rate-control is mainly performed through the configuration of the quantization step sizes. Notice that, in general, if the transform is orthogonal and therefore the subbands are independent, the quantization step size of a subband should be inversely proportional to the subband gain.
[1] V. González-Ruiz. Information Theory.
[2] V. González-Ruiz. Scalar Quantization.
[3] V. González-Ruiz. Vector Quantization.
[4] K. Sayood. Introduction to Data Compression. Morgan Kaufmann, 2017.
[5] G. Strang. Linear Algebra and Its Applications. Belmont, CA: Thomson, Brooks/Cole, 2006.
[6] M. Vetterli and J. Kovačević. Wavelets and Subband Coding. Prentice-hall, 1995.
[7] M. Vetterli, J. Kovačević, and V.K. Goyal. Foundations of Signal Processing. Cambridge University Press, 2014.