Date: Wed, 25 Mar 2020 22:45:45 +0100
From: Szabolcs Nagy <nsz@...t70.net>
To: musl@...ts.openwall.com
Subject: [PATCH] aarch64: add optimized memcpy, memmove and memset

minimal edits to the upstream version, for easier updates and because
this code was benchmarked across many cores. gcc generates slow code
for the current C implementations.

the integer memcpy was chosen instead of the simd one: it performs
better on little cores, so i think it is the more conservative choice
for now.

note: there are upcoming security architectures which may mean updates
to these functions (BTI: landing pads, PAUTH: return address signing,
MTE: 16-byte tag granule may affect optimized strcmp etc; not relevant
yet), but runtime support for these will need other libc changes.

View attachment "0001-aarch64-add-optimized-memcpy-memmove-and-memset.patch" of type "text/x-diff" (10109 bytes)
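
For readers unfamiliar with the integer versus simd distinction, here is a minimal C sketch of what copying through integer registers can look like. It is not the code from the attached patch, which is aarch64 assembly. The names sketch_copy, load64 and store64 are hypothetical, and the overlapping final store is only one common way to handle a tail without a byte loop.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helpers: a fixed-size memcpy is the portable way to express
   an unaligned 8-byte access; compilers typically lower it to a single
   integer load or store. */
static uint64_t load64(const unsigned char *p)
{
    uint64_t v;
    memcpy(&v, p, sizeof v);
    return v;
}

static void store64(unsigned char *p, uint64_t v)
{
    memcpy(p, &v, sizeof v);
}

/* Illustrative only: copy n bytes between non-overlapping buffers using
   64-bit integer accesses instead of vector registers. */
void *sketch_copy(void *restrict dst, const void *restrict src, size_t n)
{
    unsigned char *d = dst;
    const unsigned char *s = src;

    if (n < 8) {
        /* Too short for a 64-bit access: copy byte by byte. */
        for (size_t i = 0; i < n; i++)
            d[i] = s[i];
        return dst;
    }

    /* Copy whole 8-byte words from the front. */
    for (size_t i = 0; i + 8 <= n; i += 8)
        store64(d + i, load64(s + i));

    /* Cover the remaining 0..7 bytes with one final 8-byte copy that ends
       exactly at n; it may rewrite bytes that were already copied. */
    store64(d + n - 8, load64(s + n - 8));
    return dst;
}
```

A real implementation would branch on size classes and unroll the main loop. The sketch only shows that the data path can stay in general-purpose registers.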