replace the system allocator in executables #18915

thestinger · 2014-11-13T06:10:16Z

This adds support for replacing the system allocator with jemalloc by
overriding the weak symbols on Linux (including Android). It is disabled
by default with #![no_std] and can be toggled via a compiler switch. It
will be possible to extend this to other platforms in the future.

This results in a performance improvement for memory allocation in C
along with reduced fragmentation. For example, the time spent on LLVM
passes in the Rust compiler on Linux is cut by 10% and peak memory usage
is reduced by 15%.

Closes #18896
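A minimal sketch of the weak-definition mechanism the description refers to, under stated assumptions: `lib_alloc`, `lib_free` and `lib_roundtrip` are hypothetical names standing in for the libc entry points (so the example leaves the real heap alone), and `__attribute__((weak))` is the GCC/Clang spelling. It is not the code in this pull request.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical library-side defaults, marked weak so that a program may
 * replace them. Here they simply forward to the platform allocator. */
__attribute__((weak)) void *lib_alloc(size_t size) {
    return malloc(size);
}

__attribute__((weak)) void lib_free(void *ptr) {
    free(ptr);
}

/* Round trip through whichever definitions the linker selected.
 * Returns 1 on success, 0 if the allocation failed. */
int lib_roundtrip(size_t size) {
    unsigned char *p = lib_alloc(size);
    if (p == NULL) {
        return 0;
    }
    p[0] = 0x5a;            /* touch the memory to show it is usable */
    assert(p[0] == 0x5a);
    lib_free(p);
    return 1;
}
```

If another object file in the same link defined `lib_alloc` and `lib_free` without the attribute, forwarding to a different allocator, `lib_roundtrip` would call those definitions instead, with no change to this file.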

thestinger · 2014-11-13T06:28:21Z

This provides one piece of the puzzle but there are various ways this could be improved. It is possible to replace the platform allocator on OS X / Windows but it's not as simple as overriding weak symbols. It could also use symbol aliases instead of wrappers in the case where liballoc is being statically linked.
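The wrapper-versus-alias point can be sketched as follows. This assumes GCC or Clang targeting ELF, where the `alias` attribute requires the target to be defined in the same translation unit. `je_mallocx` here is a stand-in built on `malloc` so the example compiles without jemalloc, and `mallocx_wrapper` / `mallocx_alias` are hypothetical names chosen so both forms can coexist in one file.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Stand-in for jemalloc's prefixed entry point (assumption: the real one
 * comes from a jemalloc build configured with a je_ prefix). */
void *je_mallocx(size_t size, int flags) {
    (void)flags;             /* the stand-in ignores jemalloc's flags */
    return malloc(size);
}

/* Wrapper form: a separate function that forwards the call. */
void *mallocx_wrapper(size_t size, int flags) {
    return je_mallocx(size, flags);
}

/* Alias form: a second symbol name bound to the same address, so a
 * call through it does not pass through a forwarding function. */
void *mallocx_alias(size_t size, int flags)
    __attribute__((alias("je_mallocx")));
```

Because an alias can only name a definition in the same translation unit, this form fits the case mentioned above where the allocator is statically linked and built together with the shim.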

Future improvements

It would be nice if an alternate global allocator could be dropped in at runtime (for example, an asserts build of jemalloc or a different allocator), but it's not yet clear whether there's a good way to do it beyond 2 layers of indirection (wrapper functions marked as weak). On some platforms like Windows, the cost of mixed allocator usage is outweighed by the gains of using a better allocator in Rust code. However, it's still a performance / memory usage loss relative to using a single great allocator in both Rust and C, due to various forms of fragmentation. On FreeBSD, the system allocator is jemalloc, so Rust should avoid bundling it there in the future.

andrew-d · 2014-11-14T01:28:43Z

> For example, the time spent on LLVM passes in the Rust compiler on Linux is cut by 10% and peak memory usage is reduced by 15%.

👍 🍰

brson · 2014-11-14T16:55:06Z

@thestinger Thanks for following up with this patch.

I can't offhand think of any complications this might cause so I'm tentatively in favor.

I imagine the ultimate design here might change, along with changes to both -C flags and target specs, but I'm currently thinking of both of those as unstable, so not too concerned about expanding them.

It's worth noting that even after this patch rustc's memory usage will not be improved on OS X or Windows.

brson · 2014-11-14T16:58:35Z

Actually, @thestinger will this result in overriding malloc on Mac via the Zone allocator API? What happens on Windows with this patch?

brson · 2014-11-14T16:59:23Z

Oh, duh. This specifically targets Linux so has no effect on OS X and Windows.

yuriks · 2014-11-14T17:19:03Z

@brson My understanding from the previous debacle is that Rust currently already overrides the allocator on OS X.

alexcrichton · 2014-11-16T21:18:46Z

src/rt/rust_malloc.c

+
+void *mallocx(size_t size, int flags) {
+    return je_mallocx(size, flags);
+}


How come this reexports a number of jemalloc symbols without the je_ prefix? I would expect the standard libc weak symbols to be exposed, but the jemalloc symbols aren't able to be overridden, right?

thestinger · 2014-11-19

None of the public symbols defined by jemalloc are weak symbols. I'm exporting these to address the demand that mallocx be usable as it is in vanilla jemalloc with no prefix.

alexcrichton · 2014-11-19

Can you elaborate on this "demand" a little more? This is basically one of the possible shims rustc can inject, and the purpose is to override the system malloc/free, and I am unaware of the desire to export jemalloc-specific symbols as well.

thestinger · 2014-11-19

It's necessary for Rust's jemalloc to satisfy the needs of third party code calling into jemalloc. That was the primary argument against the last pull request...

thestinger · 2014-11-19

If Rust doesn't do this, then third party code using jemalloc cannot be used. C libraries don't usually have versions in the symbol names, so you can't just have multiple copies living side-by-side without problems.

alexcrichton · 2014-11-20

I was under the impression that this "third party code" was primarily code in other processes that Rust itself was linked into. Either via a staticlib, dylib, or dlopen()'d dylib. Within a Rust executable itself (which this PR is focused on), however, I don't think that this would help too much. Libraries should be written knowing that the allocator is not their decision, and should plan appropriately (not relying on an upstream definition of jemalloc). Native code linked into an executable cannot rely on the existence of these symbols as the compiler is the one choosing whether to link in jemalloc or not, not the code itself.

Note that I'm just looking at this from the perspective of having this shim be as small as possible. I'd rather stick to well-known standardized APIs like malloc than duplicate the nonstandard APIs of jemalloc. If these were to fall out of sync with the jemalloc definitions, then I imagine badness could ensue.

thestinger · 2014-11-20

> I was under the impression that this "third party code" was primarily code in other processes that Rust itself was linked into.

If that code depends on jemalloc, then it will need to be using Rust's jemalloc.

> I was under the impression that this "third party code" was primarily code in other processes that Rust itself was linked into. Either via a staticlib, dylib, or dlopen()'d dylib. Within a Rust executable itself (which this PR is focused on), however, I don't think that this would help too much. Libraries should be written knowing that the allocator is not their decision, and should plan appropriately (not relying on an upstream definition of jemalloc). Native code linked into an executable cannot rely on the existence of these symbols as the compiler is the one choosing whether to link in jemalloc or not, not the code itself.

The only argument against the previous one was that it would break code relying on mixing mallocx and free. The previous pull request was simpler and didn't have the added overhead of these wrapper functions. I'll just reopen it in favour of this one if that dubious argument has been abandoned.

> I'd rather stick to well-known standardized apis like malloc than duplicate the nonstandard apis of jemalloc.

They are not "duplicated" in any way. It is manually removing the prefix because you rejected my pull request doing this the easy and low-overhead way by using the default configuration.

> If these were to fall out of sync with the jemalloc definitions, then I imagine badness could ensue.

It's a stable API. There was a long deprecation period for the old experimental API before the shift to this one.

alexcrichton · 2014-11-16T21:19:07Z

I've looked this over and I've just got one question about the number of reexported symbols, but otherwise looks good to me.

thestinger · 2014-11-20T13:33:20Z

Is third party code mixing mallocx + free something that has to be supported or not? If not, then I will just reopen the last pull request as clearly things have changed since then.
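To make the "mixing mallocx + free" question concrete, here is a toy model, not real allocator code. The underlying rule is that a pointer must be released by the same allocator that produced it. Everything below is hypothetical: two fixed-size bump heaps play the roles of the system allocator and the bundled jemalloc, and `mixed_free_is_valid` models third-party code that calls `mallocx` and later `free`.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Toy bump heap standing in for a real allocator. */
typedef struct {
    unsigned char arena[256];
    size_t used;
} toy_heap;

toy_heap libc_heap;     /* plays the role of the system malloc/free */
toy_heap jemalloc_heap; /* plays the role of the bundled jemalloc */

void *toy_alloc(toy_heap *heap, size_t size) {
    if (size > sizeof heap->arena - heap->used) {
        return NULL;
    }
    void *p = heap->arena + heap->used;
    heap->used += size;
    return p;
}

/* A heap can only release memory that lies inside its own arena. */
int toy_owns(const toy_heap *heap, const void *p) {
    uintptr_t base = (uintptr_t)heap->arena;
    uintptr_t addr = (uintptr_t)p;
    return addr >= base && addr < base + sizeof heap->arena;
}

/* Models third-party code doing `p = mallocx(n, 0); ... free(p);`.
 * `free_overridden` says whether free() has been redirected to the
 * same allocator that backs mallocx(). Returns 1 if free() receives
 * a pointer it owns, 0 if it is handed a foreign pointer. */
int mixed_free_is_valid(int free_overridden) {
    void *p = toy_alloc(&jemalloc_heap, 16);   /* the mallocx() call */
    toy_heap *target = free_overridden ? &jemalloc_heap : &libc_heap;
    return p != NULL && toy_owns(target, p);   /* the free() call */
}
```

In this model the mixed pattern is only valid when `free` and `mallocx` resolve to the same heap, which is the situation the question above is asking about.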

alexcrichton · 2014-11-20T19:02:17Z

I've discussed this with @nikomatsakis, and I would be OK for now with adding a comment to the C file explicitly stating that the re-export of any non-libc symbol is experimental and may change in the future. Can you add a comment to that effect?

alexcrichton · 2014-12-08T19:18:05Z

Closing due to inactivity, but feel free to reopen with my comment addressed!

alexcrichton reviewed Nov 16, 2014
thestinger closed this Nov 26, 2014

thestinger reopened this Nov 26, 2014

alexcrichton closed this Dec 8, 2014

thestinger deleted the jemalloc branch January 28, 2015 03:09
