romenskiy2012/jemalloc

mirror of https://github.com/jemalloc/jemalloc.git synced 2026-05-15 09:26:21 +03:00

Author	SHA1	Message	Date
Guangli Dai	1dfa6f7aa4	Replace PAI vtable dispatch with direct calls The pai_t interface implements C-style polymorphism via function pointers to abstract over PAC and HPA. This abstraction provides no real benefit: only two implementations exist, the dispatcher already knows which one to use, and HPA stubs 2 of 5 operations. Remove the runtime dispatch in favor of direct calls. This commit: - Promotes pac_alloc/expand/shrink/dalloc/time_until_deferred_work to external linkage and replaces the pai_t self parameter with pac_t pac. - Promotes hpa_alloc/expand/shrink/dalloc/time_until_deferred_work to external linkage and replaces pai_t self with hpa_shard_t shard. - Updates hpa_dalloc_batch's signature to take hpa_shard_t * directly and removes the hpa_from_pai container-of helper. Updates internal callers in hpa_alloc, hpa_dalloc, and hpa_sec_flush_impl. - Drops the vtable assignments from pac_init() and hpa_shard_init(). - Replaces pai_alloc/dalloc/etc. dispatch in pa.c with direct calls. HPA expand and shrink (which are unconditional failure stubs) are skipped entirely for HPA-owned extents. - Removes the pa_get_pai() helper. - Updates tests in test/unit/hpa.c and test/unit/hpa_sec_integration.c to call hpa_alloc/dalloc/etc. directly. The pai_t struct field stays as dead weight in pac_t and hpa_shard_t; it is removed in the next commit along with pai.h itself. No behavioral changes.	2026-05-12 13:43:16 -07:00
Slobodan Predolac	4cc497a4a7	Add configurable HPA opts, shard count override, and bump MAX_ALLOCATIONS Three changes to make pa_microbench easier to drive for fragmentation experiments: - Replace HPA_SHARD_OPTS_DEFAULT use with a single editable g_hpa_opts global. The microbench does not consult MALLOC_CONF for HPA shard opts, so this is the place to set the baseline configuration (slab_max_alloc, hugification_threshold, dirty_mult, hugify_delay_ms, purge_threshold, hugify_style, etc.). - Add -n/--nshards N to override the shard count derived from the trace. When set, each event is routed to (event->shard_ind % N), letting us study the impact of arena consolidation. Without the flag the behavior is unchanged (num_shards = max_shard_id + 1). - Bump MAX_ALLOCATIONS from 10M to 200M so the full ~50M-event adfinder trace (and similar) fits in the in-memory event buffer.	2026-04-30 17:48:32 -04:00
Slobodan Predolac	76015f519f	Fix bug in pa_microbench pa_microbench was creating its own emap_t per shard on top of the arena_emap_global that JET malloc initializes during jet_malloc(16) at startup, breaking the production assumption of one rtree per process. Fix it by reusing the existing JET emap.	2026-04-23 18:36:13 -04:00
Slobodan Predolac	7c40be249c	Add npurges and npurge_passes to output of pa_benchmark	2026-03-10 18:14:33 -07:00
Slobodan Predolac	707aab0c95	[pa-bench] Add clock to pa benchmark	2026-03-10 18:14:33 -07:00
guangli-dai	261591f123	Add a page-allocator microbenchmark.	2026-03-10 18:14:33 -07:00
guangli-dai	56cdce8592	Adding trace analysis in preparation for page allocator microbenchmark.	2026-03-10 18:14:33 -07:00
guangli-dai	6200e8987f	Reformat the codebase with the clang-format 18.	2026-03-10 18:14:33 -07:00
Shirui Cheng	8fefabd3a4	increase the ncached_max in fill_flush test case to 1024	2024-08-06 13:16:09 -07:00
Qi Wang	602edd7566	Enabled -Wstrict-prototypes and fixed warnings.	2023-07-06 12:00:02 -07:00
Kevin Svetlitski	f2e00d2fd3	Remove trailing whitespace Additionally, added a GitHub Action to ensure no more trailing whitespace will creep in again in the future. I'm excluding Markdown files from this check, since trailing whitespace is significant there, and also excluding `build-aux/install-sh` because there is significant trailing whitespace on the line that sets `defaultIFS`.	2023-06-23 11:58:18 -07:00
guangli-dai	09e4b38fb1	Use asm volatile during benchmarks.	2023-02-24 11:17:48 -08:00
guangli-dai	a74acb57e8	Fix dividing 0 error in stress/cpp/microbench Summary: Per issue #2356, some CXX compilers may optimize away the new/delete operation in stress/cpp/microbench.cpp. Thus, this commit (1) bumps the time interval to 1 if it is 0, and (2) modifies the pointers in the microbench to volatile.	2022-12-06 10:46:14 -08:00
Guangli Dai	e8f9f13811	Inline free and sdallocx into operator delete	2022-11-21 11:14:05 -08:00
guangli-dai	06374d2a6a	Benchmark operator delete Added the microbenchmark for operator delete. Also modified bench.h so that it can be used in C++.	2022-11-21 11:14:05 -08:00
David CARLIER	cf9724531a	Darwin malloc_size override support proposal. Darwin has similar api than Linux/FreeBSD's malloc_usable_size.	2021-10-01 14:32:40 -07:00
Yinan Zhang	e82771807e	Cache mallctl mib for batch allocation stress test	2020-12-07 09:10:11 -08:00
Yinan Zhang	0dfdd31e0f	Add tiny batch size to batch allocation stress test	2020-12-07 09:10:11 -08:00
David Goldblatt	b0ffa39cac	Mallctl stress test: fix a type. The mallctlbymib_long helper was copy-pasted from mallctlbymib_short, and incorrectly used its output variable (a char ) rather than the output variable of the mallctl call it was using (a uint64_t), causing breakages when sizeof(char ) differed from sizeof(uint64_t).	2020-08-17 12:50:14 -07:00
David Goldblatt	32d4673221	Add a mallctl speed stress test.	2020-08-13 10:03:15 -07:00
Yinan Zhang	e032a1a1de	Add a stress test for batch allocation	2020-08-03 09:36:40 -07:00
Yinan Zhang	d8cea87562	Move size inspections to test/analyze	2020-06-26 09:45:28 -07:00
David Goldblatt	7e09a57b39	stress/sizes: Fix an off-by-one issue. Algorithmically, a size greater than 1024 ZB could access one-past-the-end of the sizes array. This couldn't really happen since SIZE_MAX is less than 1024 ZB on all platforms we support (and we pick the arguments to this function to be reasonable anyways), but it's not like there's any reason not to fix it, either.	2020-06-16 10:34:19 -07:00
David Goldblatt	dcfa6fd507	stress/sizes: Add a couple more types.	2020-06-16 10:34:19 -07:00
David Goldblatt	17a64fe91c	Add a small program to print data structure sizes.	2020-06-11 08:13:38 -07:00
David Goldblatt	d338dd45d7	Tcache: Make incremental gc bytes configurable.	2020-05-16 13:34:23 -07:00
David Goldblatt	97b7a9cf77	Add a fill/flush microbenchmark.	2020-05-16 13:34:23 -07:00
David Goldblatt	2c09d43494	Add a benchmark of large allocations.	2020-05-04 12:36:45 -07:00
David Goldblatt	1f71e1ca43	Add hook microbenchmark.	2018-08-09 13:16:54 -07:00
David Goldblatt	4d2e4bf5eb	Get rid of most of the various inline macros.	2017-04-24 10:33:21 -07:00
David Goldblatt	b407a65401	Add basic reentrancy-checking support, and allow arena_new to reenter. This checks whether or not we're reentrant using thread-local data, and, if we are, moves certain internal allocations to use arena 0 (which should be properly initialized after bootstrapping). The immediate thing this allows is spinning up threads in arena_new, which will enable spinning up background threads there.	2017-04-07 14:10:27 -07:00
Jason Evans	f408643a4c	Remove extraneous parens around return arguments. This resolves #540.	2017-01-20 21:43:07 -08:00
Jason Evans	c4c2592c83	Update brace style. Add braces around single-line blocks, and remove line breaks before function-opening braces. This resolves #537.	2017-01-20 21:43:07 -08:00
Jason Evans	ffbb7dac3d	Remove leading blank lines from function bodies. This resolves #535.	2017-01-13 14:49:24 -08:00
Jason Evans	f6bd2e5a17	Code formatting fixes.	2016-03-23 16:15:42 -07:00
Jason Evans	5fae7dc1b3	Fix MinGW-related portability issues. Create and use FMT* macros that are equivalent to the PRI* macros that inttypes.h defines. This allows uniform use of the Unix-specific format specifiers, e.g. "%zu", as well as avoiding Windows-specific definitions of e.g. PRIu64. Add ffs()/ffsl() support for compiling with gcc. Extract compatibility definitions of ENOENT, EINVAL, EAGAIN, EPERM, ENOMEM, and ENORANGE into include/msvc_compat/windows_extra.h and use the file for tests as well as for core jemalloc code.	2015-07-23 13:56:25 -07:00
Jason Evans	acbcbad1e1	Thwart compiler optimizations.	2014-10-15 14:49:14 -07:00
Jason Evans	47395a1b4c	Avoid purging in microbench when lazy-lock is enabled.	2014-10-04 14:59:38 -07:00
Daniel Micay	4cfe55166e	Add support for sized deallocation. This adds a new `sdallocx` function to the external API, allowing the size to be passed by the caller. It avoids some extra reads in the thread cache fast path. In the case where stats are enabled, this avoids the work of calculating the size from the pointer. An assertion validates the size that's passed in, so enabling debugging will allow users of the API to debug cases where an incorrect size is passed in. The performance win for a contrived microbenchmark doing an allocation and immediately freeing it is ~10%. It may have a different impact on a real workload. Closes #28	2014-09-08 17:34:24 -07:00
Jason Evans	c3f8650749	Add relevant function attributes to [msn]allocx().	2014-09-08 16:47:51 -07:00
Jason Evans	a1f3929ffd	Thwart optimization of free(malloc(1)) in microbench.	2014-09-08 16:23:48 -07:00
Daniel Micay	c3bfe9569a	avoid conflict with the POSIX timer_t type It hits a compilation error with glibc 2.19 without a rename.	2014-09-08 01:20:44 -04:00
Jason Evans	423d78a21b	Add microbench tests.	2014-09-07 19:58:04 -07:00

43 commits