Table 2 Upper bounds on the number of blocks that can be processed in parallel by an s.p.e.d. 2D-BDCT. indicates a direct or a fast implementation of the DCT. is equivalent to a 16-bit fixed point implementation. and are equivalent to a single precision and a double precision floating point implementations, respectively. We have assumed .