Efficient asynchronous bundled-data pipelines for DCT matrix-vector multiplication

  • Sunan Tugsinavisut
  • , Youpyo Hong
  • , Daewook Kim
  • , Kyeounsoo Kim
  • , Peter A. Beerel

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

This paper demonstrates the design of efficient asynchronous bundled-data pipelines for the matrix-vector multiplication core of discrete cosine transforms (DCTs). The architecture is optimized for both zero and small-valued data, typical in DCT applications, yielding both high average performance and low average power. The proposed bundled-data pipelines include novel data-dependent delay lines with integrated control circuitry to efficiently implement speculative completion sensing. The control circuits are based on a novel control-circuit template that simplifies the design of such nonlinear pipelines. Extensive post-layout back-end timing analysis was performed to gain confidence in the timing margins as well as to quantify performance and energy. Comparison with a synchronous counterpart suggests that our best asynchronous design yields 30% higher average throughput with negligible energy overhead.

Original languageEnglish
Pages (from-to)448-461
Number of pages14
JournalIEEE Transactions on Very Large Scale Integration (VLSI) Systems
Volume13
Issue number4
DOIs
StatePublished - Apr 2005

Keywords

  • Asynchronous pipelines
  • Bundled-data pipelines
  • Control circuit templates
  • Discrete cosine transforms
  • Matrix-vector multiplication
  • Precharged full buffer
  • True four-phase full buffer

Fingerprint

Dive into the research topics of 'Efficient asynchronous bundled-data pipelines for DCT matrix-vector multiplication'. Together they form a unique fingerprint.

Cite this