TY - GEN
T1 - Low-power multiplierless DCT for image/video coders
AU - Kim, Byoung Il
AU - Ziavras, Sotirios G.
N1 - Copyright:
Copyright 2009 Elsevier B.V., All rights reserved.
PY - 2009
Y1 - 2009
N2 - A multiplierless discrete cosine transform (DCT) architecture is proposed to improve the power efficiency of image/video coders. Power reduction is achieved by minimizing both the number of arithmetic operations and their bit width. To minimize arithmetic-operation redundancy, our DCT design focuses on Chen's factorization approach and the constant matrix multiplication (CMM) problem. The 8x1 DCT is decomposed using six two-input butterfly networks. Each butterfly is for 2x2 matrix multiplication and requires a maximum of eight adders/subtractors with 13-bit cosine coefficients. Consequently, the proposed 8x1 DCT architecture is composed of 56 adders and subtractors, which represent a reduction of 61.9% and 46.1% in arithmetic operations compared to the conventional NEDA and CORDIC architectures, respectively. To further improve the power efficiency, an adaptive companding scheme is proposed. The proposed DCT architecture was implemented on a Xilinx FPGA. The results from power estimation show that our architecture can reduce the power dissipation by up to 90% compared to conventional multiplierless DCT architectures.
AB - A multiplierless discrete cosine transform (DCT) architecture is proposed to improve the power efficiency of image/video coders. Power reduction is achieved by minimizing both the number of arithmetic operations and their bit width. To minimize arithmetic-operation redundancy, our DCT design focuses on Chen's factorization approach and the constant matrix multiplication (CMM) problem. The 8x1 DCT is decomposed using six two-input butterfly networks. Each butterfly is for 2x2 matrix multiplication and requires a maximum of eight adders/subtractors with 13-bit cosine coefficients. Consequently, the proposed 8x1 DCT architecture is composed of 56 adders and subtractors, which represent a reduction of 61.9% and 46.1% in arithmetic operations compared to the conventional NEDA and CORDIC architectures, respectively. To further improve the power efficiency, an adaptive companding scheme is proposed. The proposed DCT architecture was implemented on a Xilinx FPGA. The results from power estimation show that our architecture can reduce the power dissipation by up to 90% compared to conventional multiplierless DCT architectures.
KW - Constant matrix multiplication (CMM)
KW - Discrete cosine transform (DCT)
KW - Multiplierless DCT
KW - Power dissipation
UR - http://www.scopus.com/inward/record.url?scp=70350245007&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70350245007&partnerID=8YFLogxK
U2 - 10.1109/ISCE.2009.5156873
DO - 10.1109/ISCE.2009.5156873
M3 - Conference contribution
AN - SCOPUS:70350245007
SN - 9781424429769
T3 - Digest of Technical Papers - IEEE International Conference on Consumer Electronics
SP - 133
EP - 136
BT - 2009 IEEE 13th International Symposium on Consumer Electronics, ISCE 2009
T2 - 2009 IEEE 13th International Symposium on Consumer Electronics, ISCE 2009
Y2 - 25 May 2009 through 28 May 2009
ER -