Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [day] [month] [year] [list]
Date: Sun, 28 Apr 2019 00:13:34 +0200
From: Szabolcs Nagy <>
Subject: [PATCH 0/3] fp_barrier improvements

fp_barrier does not need to drop excess precision, so the type is
changed accordingly.

i386 fp_barrier is now "more efficient" than in the previous patch.

at least the first two patches shoud improve things on x86.

Szabolcs Nagy (3):
  math: keep excess precision in fp_barrier
  x86: optimize fp_arch.h
  math: use fp_force_eval and fp_barrier in fma

 arch/aarch64/fp_arch.h |  6 ++++--
 arch/i386/fp_arch.h    | 48 ++++++++++++++++++++++++++++++++++++++++++
 arch/x32/fp_arch.h     | 40 +++++++++++++++++++++++++++++++++++
 arch/x86_64/fp_arch.h  | 40 +++++++++++++++++++++++++++++++++++
 src/internal/libm.h    | 14 ++++++------
 src/math/fma.c         |  8 +++----
 6 files changed, 143 insertions(+), 13 deletions(-)
 create mode 100644 arch/i386/fp_arch.h
 create mode 100644 arch/x32/fp_arch.h
 create mode 100644 arch/x86_64/fp_arch.h


View attachment "0001-math-keep-excess-precision-in-fp_barrier.patch" of type "text/x-diff" (1922 bytes)

View attachment "0002-x86-optimize-fp_arch.h.patch" of type "text/x-diff" (4048 bytes)

View attachment "0003-math-use-fp_force_eval-and-fp_barrier-in-fma.patch" of type "text/x-diff" (1133 bytes)

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.