romenskiy2012/jemalloc

mirror of https://github.com/jemalloc/jemalloc.git synced 2026-04-14 14:41:42 +03:00

Author	SHA1	Message	Date
Guangli Dai	01e9ecbeb2	Remove build-time configuration 'config_limit_usize_gap'	2025-05-06 14:47:35 -07:00
Slobodan Predolac	852da1be15	Add experimental option force using SYS_process_madvise	2025-04-28 18:45:30 -07:00
Jay Lee	a4defdb854	detect false failure of strerror_r See tikv/jemallocator#108. In a summary, test on `strerror_r` can fail due to reasons other than `strerror_r` itself, so add an additional test to determine the failure is expected. Signed-off-by: Jay Lee <BusyJayLee@gmail.com>	2025-03-17 17:50:20 -07:00
Qi Wang	22440a0207	Implement process_madvise support. Add opt.process_madvise_max_batch which determines if process_madvise is enabled (non-zero) and the max # of regions in each batch. Added another limiting factor which is the space to reserve on stack, which results in the max batch of 128.	2025-03-07 15:32:32 -08:00
guangli-dai	c067a55c79	Introducing a new usize calculation policy Converting size to usize is what jemalloc has been doing by ceiling size to the closest size class. However, this causes lots of memory waste with HPA enabled. This commit changes how usize is calculated so that the gap between two contiguous usize is no larger than a page. Specifically, this commit includes the following changes: 1. Adding a build-time config option (--enable-limit-usize-gap) and a runtime one (limit_usize_gap) to guard the changes. When build-time config is enabled, some minor CPU overhead is expected because usize will be stored and accessed apart from index. When runtime option is also enabled (it can only be enabled with the build-time config enabled), a new usize calculation approach will be employed. This new calculation will ceil size to the closest multiple of PAGE for all sizes larger than USIZE_GROW_SLOW_THRESHOLD instead of using the size classes. Note when the build-time config is enabled, the runtime option is default on. 2. Prepare tcache for size to grow by PAGE over GROUP*PAGE. To prepare for the upcoming changes where size class grows by PAGE when larger than NGROUP * PAGE, disable the tcache when it is larger than 2 * NGROUP * PAGE. The threshold for tcache is set higher to prevent perf regression as much as possible while usizes between NGROUP * PAGE and 2 * NGROUP * PAGE happen to grow by PAGE. 3. Prepare pac and hpa psset for size to grow by PAGE over GROUP*PAGE For PAC, to avoid having too many bins, arena bins still have the same layout. This means some extra search is needed for a page-level request that is not aligned with the original size class: it should also search the heap before the current index since the previous heap might also be able to have some allocations satisfying it. The same changes apply to HPA's psset. This search relies on the enumeration of the heap because not all allocs in the previous heap are guaranteed to satisfy the request. To balance the memory and CPU overhead, we currently enumerate at most a fixed number of nodes before concluding none can satisfy the request during an enumeration. 4. Add bytes counter to arena large stats. To prepare for the upcoming usize changes, stats collected by multiplying alive allocations and the bin size is no longer accurate. Thus, add separate counters to record the bytes malloced and dalloced. 5. Change structs use when freeing to avoid using index2size for large sizes. - Change the definition of emap_alloc_ctx_t - Change the read of both from edata_t. - Change the assignment and usage of emap_alloc_ctx_t. - Change other callsites of index2size. Note the changes in the data structure, i.e., emap_alloc_ctx_t, will be used when the build-time config (--enable-limit-usize-gap) is enabled but they will store the same value as index2size(szind) if the runtime option (opt_limit_usize_gap) is not enabled. 6. Adapt hpa to the usize changes. Change the settings in sec to limit its usage for sizes larger than USIZE_GROW_SLOW_THRESHOLD and modify corresponding tests. 7. Modify usize calculation and corresponding tests. Change the sz_s2u_compute. Note sz_index2size is not always safe now while sz_size2index still works as expected.	2025-03-06 15:08:13 -08:00
roblabla	c17bf8b368	Disable config from file or envvar with build flag This adds a new autoconf flag, --disable-user-config, which disables reading the configuration from /etc/malloc.conf or the MALLOC_CONF environment variable. This can be useful when integrating jemalloc in a binary that internally handles all aspects of the configuration and shouldn't be impacted by ambient change in the environment.	2025-02-05 15:01:50 -08:00
Dmitry Ilvokhin	34c823f147	Add autoconf options to enable sanitizers This commit allows enabling sanitizers with autoconf options, instead of modifying `CFLAGS`, `CXXFLAGS` and `LDFLAGS` directly. * `--enable-tsan` option to enable Thread Sanitizer. * `--enable-ubsan` option to enable Undefined Behaviour Sanitizer. End goal is to speed up development by finding problems quickly, early and easier. Eventually, when all current issues are fixed, we can enable sanitizers in CI. Fortunately, there are not a lot of problems we need to fix. Address Sanitizer is a bit controversial, because it replaces memory allocator, so we decided to leave it out for a while. Below are a couple of examples of how tests look under different sanitizers at the moment. ``` $ ../configure --enable-tsan --enable-debug <...> asan : 0 tsan : 1 ubsan : 0 $ make -j`nproc` check <...> Thread T13 (tid=332043, running) created by main thread at: #0 pthread_create <null> (libtsan.so.0+0x61748) #1 thd_create ../test/src/thd.c:25 (bin_batching+0x5631ca) #2 stress_run ../test/unit/bin_batching.c:148 (bin_batching+0x40364c) #3 test_races ../test/unit/bin_batching.c:249 (bin_batching+0x403d79) #4 p_test_impl ../test/src/test.c:149 (bin_batching+0x562811) #5 p_test_no_reentrancy ../test/src/test.c:213 (bin_batching+0x562d35) #6 main ../test/unit/bin_batching.c:268 (bin_batching+0x40417e) SUMMARY: ThreadSanitizer: data race ../include/jemalloc/internal/edata.h:498 in edata_nfree_inc ``` ``` $ ../configure --enable-ubsan --enable-debug <...> asan : 0 tsan : 0 ubsan : 1 $ make -j`nproc` check <...> === test/unit/hash === ../test/unit/hash.c:119:16: runtime error: left shift of 176 by 24 places cannot be represented in type 'int' <...> ```	2025-02-05 14:28:28 -08:00
Dan Horák	17881ebbfd	Add configure check for gettid() presence The gettid() function is available on Linux in glibc only since version 2.30. There are supported distributions that still use older glibc version. Thus add a configure check if the gettid() function is available and extend the check in src/prof_stack_range.c so it's skipped also when gettid() isn't available. Fixes: https://github.com/jemalloc/jemalloc/issues/2740	2024-12-17 12:40:54 -08:00
Dmitry Ilvokhin	0ce13c6fb5	Add opt `hpa_hugify_sync` to hugify synchronously Linux 6.1 introduced `MADV_COLLAPSE` flag to perform a best-effort synchronous collapse of the native pages mapped by the memory range into transparent huge pages. Synchronous hugification might be beneficial for at least two reasons: we are not relying on khugepaged anymore and get instant feedback if the range wasn't hugified. If `hpa_hugify_sync` option is on, we'll try to perform a synchronous collapse and if it wasn't successful, we'll fall back to asynchronous behaviour.	2024-11-20 10:52:52 -08:00
Dmitry Ilvokhin	a361e886e2	Move `je_cv_thp` logic closer to definition	2024-11-20 10:52:52 -08:00
Qi Wang	6d625d5e5e	Add support for clock_gettime_nsec_np() Prefer clock_gettime_nsec_np(CLOCK_UPTIME_RAW) to mach_absolute_time().	2024-10-14 10:33:27 -07:00
Nathan Slingerland	edc1576f03	Add safe frame-pointer backtrace unwinder	2024-10-01 11:01:56 -07:00
Guangli Dai	0181aaa495	Optimize edata_cmp_summary_compare when __uint128_t is available	2024-09-23 16:23:42 -07:00
roblabla	734f29ce56	Fix compilation with MSVC 2022 On MSVC, log is an intrinsic that doesn't require libm. However, AC_SEARCH_LIBS does not successfully detect this, as it will try to compile a program using the wrong signature for log. Newer versions of MSVC CL detect this and reject the program with the following messages: conftest.c(40): warning C4391: 'char log()': incorrect return type for intrinsic function, expected 'double' conftest.c(44): error C2168: 'log': too few actual parameters for intrinsic function Since log is always available on MSVC (it's been around since the dawn of time), we simply always assume it's there if MSVC is detected.	2024-09-23 10:42:31 -07:00
Kevin Svetlitski	120abd703a	Add support for the `deprecated` attribute This is useful for enforcing the usage of getter/setter functions to access fields which are considered private or have unique access constraints.	2023-08-04 15:47:05 -07:00
Kevin Svetlitski	ea5b7bea31	Add configuration option controlling DSS support In many environments, the fallback `sbrk(2)` allocation path is never used even if the system supports the syscall; if you're at the point where `mmap(2)` is failing, `sbrk(2)` is unlikely to succeed. Without changing the default, I've added the ability to disable the usage of DSS altogether, so that you do not need to pay for the additional code size and handful of extra runtime branches in such environments.	2023-08-03 11:52:25 -07:00
Kevin Svetlitski	cdb2c0e02f	Implement C23's `free_sized` and `free_aligned_sized` [N2699 - Sized Memory Deallocation](https://www.open-std.org/jtc1/sc22/wg14/www/docs/n2699.htm) introduced two new functions which were incorporated into the C23 standard, `free_sized` and `free_aligned_sized`. Both already have analogues in Jemalloc, all we are doing here is adding the appropriate wrappers.	2023-07-20 15:06:41 -07:00
Kevin Svetlitski	5711dc31d8	Only enable `-Wstrict-prototypes` in CI to unbreak feature detection Adding `-Wstrict-prototypes` to the default `CFLAGS` in PR #2473 had the non-obvious side-effect of breaking configure-time feature detection, because the [test-program `autoconf` generates for feature detection](https://www.gnu.org/software/autoconf/manual/autoconf-2.67/html_node/Generating-Sources.html#:~:text=main%20()) defines `main` as: ```c int main() ``` Which causes all feature checks to fail, since this triggers `-Wstrict-prototypes` and the feature checks use `-Werror`. Resolved by only adding `-Wstrict-prototypes` to `EXTRA_{CFLAGS,CXXFLAGS}` in CI, since these flags are not used during feature detection and we control which compiler is used.	2023-07-06 18:03:13 -07:00
Qi Wang	602edd7566	Enabled -Wstrict-prototypes and fixed warnings.	2023-07-06 12:00:02 -07:00
Kevin Svetlitski	f2e00d2fd3	Remove trailing whitespace Additionally, added a GitHub Action to ensure no more trailing whitespace will creep in again in the future. I'm excluding Markdown files from this check, since trailing whitespace is significant there, and also excluding `build-aux/install-sh` because there is significant trailing whitespace on the line that sets `defaultIFS`.	2023-06-23 11:58:18 -07:00
Qi Wang	6ea8a7e928	Add config detection for JEMALLOC_HAVE_PTHREAD_SET_NAME_NP. and use it on the background thread name setting.	2023-05-11 09:10:57 -07:00
guangli-dai	31e01a98f1	Fix the rdtscp detection bug and add prefix for the macro.	2023-03-23 11:16:19 -07:00
Marvin Schmidt	aba1645f2d	configure: Handle -linux-musl hosts properly This is the same as the `--linux*` case with the two exceptions that we don't set glibc=1 and don't define JEMALLOC_USE_CXX_THROW	2023-03-16 12:11:40 -07:00
Qi Wang	d503d72129	Add the missing descriptions in AC_DEFINE	2023-03-14 16:47:00 -07:00
guangli-dai	09e4b38fb1	Use asm volatile during benchmarks.	2023-02-24 11:17:48 -08:00
Qi Wang	481bbfc990	Add a configure option --enable-force-getenv. Allows the use of getenv() rather than secure_getenv() to read MALLOC_CONF. This helps in situations where hosts are under full control, and setting MALLOC_CONF is needed while also setuid. Disabled by default.	2022-11-04 13:37:14 -07:00
David Carlier	4c95c953e2	fix build for non linux/BSD platforms.	2022-10-03 10:42:09 -07:00
David Carlier	4fc5c4fbac	New configure option '--enable-pageid' for Linux The option makes jemalloc use prctl with PR_SET_VMA to tag memory mappings with "jemalloc_pg" or "jemalloc_pg_overcommit". This allows to easily identify jemalloc's mappings in /proc/<pid>/maps. PR_SET_VMA is only available in Linux 5.17 and above.	2022-06-09 18:54:08 -07:00
Qi Wang	b950934916	Enable retain by default on macOS. High number of mappings result in unusually high fork() cost on macOS. Retain fixes the issue, at a small cost of extra VM space reserved.	2022-06-09 11:37:44 -07:00
David Carlier	df8f7d10af	Implement malloc_getcpu for amd64 and arm64 macOS This enables per CPU arena on MacOS	2022-06-08 15:13:55 -07:00
Alex Lapenkou	5b1f2cc5d7	Implement pvalloc replacement Despite being an obsolete function, pvalloc is still present in GLIBC and should work correctly when jemalloc replaces libc allocator.	2022-05-18 17:01:09 -07:00
Qi Wang	8cb814629a	Make the default option of zero realloc match the system allocator.	2022-05-05 17:11:18 -07:00
Qi Wang	7ae0f15c59	Add a default page size when cross-compile for Apple M1. When cross-compile for M1 and no page size specified, use the default 16K and skip detecting the page size (which is likely incorrect).	2022-03-21 14:30:48 -07:00
Alex Lapenkov	eb65d1b078	Fix FreeBSD system jemalloc TSD cleanup Before this commit, in case FreeBSD libc jemalloc was overridden by another jemalloc, proper thread shutdown callback was involved only for the overriding jemalloc. A call to _malloc_thread_cleanup from libthr would be redirected to user jemalloc, leaving data about dead threads hanging in system jemalloc. This change tackles the issue in two ways. First, for current and old system jemallocs, which we can not modify, the overriding jemalloc would locate and invoke system cleanup routine. For upcoming jemalloc integrations, the cleanup registering function will also be redirected to user jemalloc, which means that system jemalloc's cleanup routine will be registered in user's jemalloc and a single call to _malloc_thread_cleanup will be sufficient to invoke both callbacks.	2022-03-02 10:10:27 -08:00
Qi Wang	063d134aeb	Properly detect background thread support on Darwin. When cross-compile, the host type / abi should be checked to determine background thread compatibility.	2022-02-15 10:10:11 -08:00
Alex Lapenkou	eafd2ac39f	Forbid spaces in prefix and exec_prefix Spaces in these are also not handled correctly by Make, so there's sense in not allowing that.	2022-01-19 12:28:16 -08:00
Alex Lapenkou	36a09ba2c7	Forbid spaces in install suffix To avoid potential issues with removing unintended files after 'make uninstall', spaces are no longer allowed in install suffix. It's worth mentioning, that with GNU Make on Linux spaces in install suffix didn't work anyway, leading to errors in the Makefile. But being verbose about this restriction makes it more transparent for the developers.	2022-01-19 12:28:16 -08:00
Qi Wang	8b49eb132e	Fix the HELP_STRING of --enable-doc.	2022-01-11 21:15:24 -08:00
Jonathan Swinney	89fe8ee6bf	Use the isb instruction instead of yield for spin locks on arm isb introduces a small delay which is closer to the x86 pause instruction.	2022-01-10 15:29:56 -08:00
Qi Wang	b75822bc6e	Implement use-after-free detection using junk and stash. On deallocation, sampled pointers (specially aligned) get junked and stashed into tcache (to prevent immediate reuse). The expected behavior is to have read-after-free corrupted and stopped by the junk-filling, while write-after-free is checked when flushing the stashed pointers.	2021-12-29 14:44:43 -08:00
Joshua Watt	e491df1d2f	Fix warnings when using autoheader.	2021-12-22 13:57:41 -08:00
David CARLIER	113e8e68e1	freebsd 14 build fix proposal. seems to have introduced finally more linux api cpu affinity (sched_* family) compatibility detected at configure time thus adjusting accordingly.	2021-12-06 13:15:21 -08:00
David CARLIER	cf9724531a	Darwin malloc_size override support proposal. Darwin has an API similar to Linux/FreeBSD's malloc_usable_size.	2021-10-01 14:32:40 -07:00
Qi Wang	deb8e62a83	Implement guard pages. Adding guarded extents, which are regular extents surrounded by guard pages (mprotected). To reduce syscalls, small guarded extents are cached as a separate eset in ecache, and decay through the dirty / muzzy / retained pipeline as usual.	2021-09-26 16:30:15 -07:00
Alex Lapenkou	26140dd246	Reject --enable-prof-libunwind without --enable-prof Prior to the change you could specify --enable-prof-libunwind without --enable-prof which would do effectively nothing. This was confusing as I expected --enable-prof-libunwind to act like --enable-prof, but use libunwind.	2021-09-13 14:02:40 -07:00
Alex Lapenkou	2c625d5cd9	Fix warnings when compiled with clang When clang sees an unknown warning option, unlike gcc it doesn't fail the build with error. It issues a warning. Hence JE_CFLAGS_ADD with warning options that didn't exist in clang would still mark those options as available. This led to several warnings when built with clang or "gcc" on OSX. This change fixes those warnings by simply making clang fail builds with non-existent warning options.	2021-08-13 14:14:46 -07:00
David Goldblatt	4452a4812f	Add opt.experimental_infallible_new. This allows a guarantee that operator new never throws. Fix the .gitignore rules to include test/integration/cpp while we're here.	2021-06-24 12:22:51 -07:00
David Goldblatt	0689448b1e	Travis: Unbreak the builds. In the hopes of future-proofing as much as possible, jump to the latest distribution Travis supports.	2021-06-24 07:40:28 -07:00
Ondřej Surý	2c0f4c2ac3	Fix typo in configure.ac: experimetal -> experimental	2021-05-25 08:20:37 -07:00
Qi Wang	11127240ca	Remove redundant enable-debug definition in configure.	2021-03-12 11:30:56 -08:00
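
The usize policy described in commit c067a55c79 can be illustrated with a minimal sketch. It is not jemalloc's code: the `sketch_*` names, the 4 KiB page size, and the 64 KiB threshold are assumptions chosen for illustration, and the classic size classes are approximated as four classes per power-of-two group. It only shows the idea that, above a threshold, a request is rounded up to a multiple of the page size instead of to the next size class.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed values for illustration only; not jemalloc's actual constants. */
#define SKETCH_PAGE ((size_t)4096)
#define SKETCH_GROW_SLOW_THRESHOLD ((size_t)65536)

/* Round size up to the next multiple of the (assumed) page size. */
static size_t sketch_page_ceil(size_t size) {
    return (size + SKETCH_PAGE - 1) & ~(SKETCH_PAGE - 1);
}

/*
 * Approximation of the classic policy: four size classes per
 * power-of-two group, so classes in (pow2/2, pow2] are pow2/8 apart.
 */
static size_t sketch_size_class_ceil(size_t size) {
    size_t pow2 = 1;
    while (pow2 < size) {
        pow2 <<= 1;
    }
    size_t delta = pow2 / 8;
    if (delta == 0) {
        return size; /* tiny sizes are left unchanged in this sketch */
    }
    return (size + delta - 1) & ~(delta - 1);
}

/*
 * Hypothetical size-to-usize conversion. With limit_usize_gap set,
 * sizes above the threshold are rounded to a page multiple, so two
 * neighbouring usize values differ by at most one page.
 */
static size_t sketch_s2u(size_t size, int limit_usize_gap) {
    assert(size <= SIZE_MAX / 2); /* keeps the rounding free of overflow */
    if (limit_usize_gap && size > SKETCH_GROW_SLOW_THRESHOLD) {
        return sketch_page_ceil(size);
    }
    return sketch_size_class_ceil(size);
}
```

Under these assumptions, a 100000-byte request maps to 114688 bytes (the 112 KiB class) with the option off and to 102400 bytes (25 pages) with it on, which is the kind of waste reduction the commit message refers to.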


343 commits