An 8-Point IDCT computing resource implemented on a TriMedia/CPU64 reconfigurable functional unit
2001
This paper presents the implementation of an 8-point Inverse Discrete Cosine Transform (IDCT) comput- ing resource on a TriMedia/CPU64 FPGA-based Reconfig- urable Functional Unit (RFU). TriMedia/CPU64 is a 64-bit 5 issue-slot VLIW processor launching a long instruction ev- ery clock cycle. The RFU consists mainly of an FPGA core, and is embedded into the TriMedia as any other hardwired functional unit, i.e., it receives instructions from the instruc- tion decoder, reads its input arguments from and writes the computed values back to the register file. To reduce the computational complexity of IDCT, we used a modified ver- sion of the Loeffler algorithm which requires 14 multipli- cations. Since each multiplicand is a 16-bit signed number represented in 2's complement notation, while each multi- plier is a positive constant of 15 bits or less, we employed a "multiplication-by-constant" scheme which was optimized against the multiplier. To increase the throughput of the IDCT computing resource, we propose a pipeline implemen- tation. When mapped on an ACEX EP1K100 FPGA-based RFU, our 8-point IDCT computing resource exhibits a la- tency of 16 TriMedia cycles, a recovery of 2 cycles, and oc- cupies 42% of the logic cells of the device.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
17
References
0
Citations
NaN
KQI