|
|
Message-ID: <20251104210156.GZ1827@brightrain.aerifal.cx> Date: Tue, 4 Nov 2025 16:01:56 -0500 From: Rich Felker <dalias@...c.org> To: The 8472 <the8472.rs@...inite-source.de> Cc: Alejandro Colomar <alx@...nel.org>, Thiago Macieira <thiago@...ieira.org>, Florian Weimer <fw@...eb.enyo.de>, libc-alpha@...rceware.org, musl@...ts.openwall.com, Arthur O'Dwyer <arthur.j.odwyer@...il.com>, Jonathan Wakely <jwakely@...hat.com> Subject: Re: Re: realloci(): A realloc() variant that works in-place On Tue, Nov 04, 2025 at 12:51:16AM +0100, The 8472 wrote: > Hello, > > On 03/11/2025 22:28, Rich Felker wrote: > > On Mon, Nov 03, 2025 at 10:36:07AM +0100, Alejandro Colomar wrote: > > > Hi Rich, > > > > > > On Sun, Nov 02, 2025 at 07:28:57PM -0500, Rich Felker wrote: > > > > On Mon, Nov 03, 2025 at 12:58:39AM +0100, Alejandro Colomar wrote: > > > > > > All this will need fine-tuning once implementations exist. > > > > > > > > > > > > > So, why not require the caller to not ask too much? We could go back to > > > > > > > reporting an error if there's not enough memory. > > > > > > > > > > > > > > Of course, it would still guarantee no errors when shrinking, but > > > > > > > I think we could error out when growing. > > > > > > > > > > > > I'd prefer no errors either way. If there isn't memory to grow the underlying > > > > > > space (a brk() system call returns ENOMEM), then realloci() returns as much as > > > > > > it could get but not more. > > > > > > > > > > The problem is that this is asking the implementation to speculate. > > > > > > > > > > Consider the case that a realloci() implementation knows that the > > > > > requested size fails. Let's put some arbitrary numbers: > > > > > > > > > > old_size = 10000; > > > > > requested_size = 30000; > > > > > > > > > > It knows the block can grow to somewhere between 10000 (which it > > > > > currently has) and 30000 (the system reported ENOMEM), but now it has > > > > > the task of allocating as much as it can get. Should it do a binary > > > > > search of the size? Try 20000, then if it fails try 15000, etc.? > > > > > That's speculation, and it would make this function too slow. > > > > > > > > I don't see any plausible implementation in which this involved a > > > > binary search. Either you have fixed-size slots in which case you just > > > > look at the size of the slot to see what the max obtainable is, or you > > > > have a dlmalloc-like situation where you check the size of the > > > > adjacent free block (if any) to determine the max obtainable. These > > > > are O(1) operations. > > > > > > I was thinking of mremap(2) without MREMAP_MAYMOVE. > > > > OK, this whole conversation is mixing up unrelated things: > > > > 1. In-place realloc to avoid relatively-expensive memcpy > > 2. In-place realloc to avoid updating pointers > > > > The case where mremap would be used is utterly irrelevant to (1). And > > further, the cost of the mremap operation is so high (syscall > > overhead, page table/TLB synchronization) that any cost of updating > > pointers because the object moved is dwarfed and thereby irrelevant > > too. > > > > So I don't see why anyone should care about this case. > > > > Moreover, I see (2) as entirely misguided. The whole provenance model > > makes it broken to try to rely on pointer values not changing, and no > > code should be trying to do that. A new allocator interface should not > > be pandering to this very fragile, very likely to be broken by > > compiler transformations, utterly backwards practice. Just treat the > > old pointer as invalid and always update like you're supposed to, > > regardless of whether the value is different. > > > > Rich > > > > On the Rust side we have uses for both these scenarios, and more. > > A) A strictly in-place realloc is useful for collections and > arenas that have outstanding borrows (thus cannot move) > but want to try growing in-place before they have to allocate > another chunk. This "useful" needs to be quantified. Only in very very rare cases will in-place expansion even be possible. The vast majority of the time, you must allocate another discontiguous chunk to meet the above contractual obligation anyway. > For those a metadata-update or mremap without MAYMOVE is fine. > > B) Collections that want to resize and can change their pointer > but need custom data movement, i.e. not a plain memcpy from the old > to the new location. A VecDeque that needs to copy its front and > tail to different locations after a resize. A Vec wants to copy > fewer bytes than its allocated size. > > In these cases mremap(MREMAP_MAYMOVE) is fine but memcpy should > be avoided and we would fallback to malloc + custom copy operations > + free. malloc + custom copy + free sounds like it's always the right solution. Simply ignore the existence of realloc entirely. It's not useful. > C) Alignment-changing reallocations, for example to go from a Box<[u8]> > to Box<[f32]>. In those cases mremap and memcpy are both fine but for > large allocations the former would be preferred. Why is alignment-changing reallocation a thing? Why would the type of an object change like this? Even if it does, all allocations are always inherently aligned sufficiently for the alignment requirement of any non-over-aligned type. Rich
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.