Optimistic Parallelization of Floating-Point Accumulation

Kapre, Nachiket; DeHon, André

Optimistic Parallelization of Floating-Point Accumulation

Files

04272867.pdf (386.88 KB)

Penn collection

Departmental Papers (ESE)

Subject

floating-point addition
parallel prefix
optimistic parallelism

Permalink

https://repository.upenn.edu/handle/20.500.14332/33553

View all metadata

Author

Kapre, Nachiket

DeHon, André

Abstract

Floating-point arithmetic is notoriously non-associative due to the limited precision representation which demands intermediate values be rounded to fit in the available precision. The resulting cyclic dependency in floating-point accumulation inhibits parallelization of the computation, including efficient use of pipelining. In practice, however, we observe that floating-point operations are mostly associative. This observation can be exploited to parallelize floating-point accumulation using a form of optimistic concurrency. In this scheme, we first compute an optimistic associative approximation to the sum and then relax the computation by iteratively propagating errors until the correct sum is obtained. We map this computation to a network of 16 statically-scheduled, pipelined, double-precision floating-point adders on the Virtex-4 LX160 (-12) device where each floating-point adder runs at 296MHz and has a pipeline depth of 10. On this 16 PE design, we demonstrate an average speedup of 6× with randomly generated data and 3-7× with summations extracted from Conjugate Gradient benchmarks.

Date of presentation

2007-06-25

Conference name

Departmental Papers (ESE)

Conference dates

2023-05-17T02:12:54.000

Comments

Copyright 2008 IEEE. Reprinted from Proceedings of the 18th IEEE International Symposium on Computer Arithmetic ARITH '07, pages 205-216. This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of the University of Pennsylvania's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

Collection

Presentations