[AutoBump] Merge with a6bb8a70 (Jan 20) (5) #543

jorickert · 2025-05-20T11:37:41Z

No description provided.

completing fpurge interception for mac too.

Instruction `preld` is used to prefetch one cache-line of data from memory in advance into the cache. This commit allows it to be generated automatically.

…3484)

…pass (llvm#118437) Inspired by https://reviews.llvm.org/D146600, this commit adds some TTI hooks for LoongArch to make LoopDataPrefetch pass really work. Including: - `getCacheLineSize()`: 64 for loongarch64. - `getPrefetchDistance()`: After testing SPEC CPU 2017, improvements taken by prefetching are more obvious when set PrefetchDistance to 200(results shown blow), although different benchmarks fit for different best choice. - `enableWritePrefetching()`: store prefetch is supported by LoongArch, so set WritePrefetching to true in default. - `getMinPrefetchStride()` and `getMaxPrefetchIterationsAhead()` still use default values: 1 and UINT_MAX, so not override them. After this commit, the test added by https://reviews.llvm.org/D146600 can generate llvm.prefetch intrinsic IR correctly. Results of spec2017rate benchmarks (testing date: ref, copies: 1): - For all C/C++ benchmarks, compared to O3+novec/lsx/lasx, prefetch can bring about -1.58%/0.31%/0.07% performance improvement for int benchmarks and 3.26%/3.73%/3.78% improvement for floating point benchmarks. (Only O3+novec+prefetch decreases when testing intrate.) - But prefetch results in performance reduction almost for every Fortran benchmark compiled by flang. While considering all C/C++/Fortran benchmarks, prefetch performance will decrease about 1% ~ 5%. FIXME: Keep `loongarch-enable-loop-data-prefetch` option default to false for now due to the bad effect for Fortran.

Needed for libstdc++ 15 compatibility.

devnexen and others added 7 commits January 20, 2025 07:55

[compiler-rt] rtsan pipe2 interception for Linux. (llvm#123517)

02909a4

completing fpurge interception for mac too.

[clang][bytecode] Fix discarding DerivedToBase casts (llvm#123523)

6972788

[LoongArch] Add generation support for preld instruction (llvm#118436)

84220ec

Instruction `preld` is used to prefetch one cache-line of data from memory in advance into the cache. This commit allows it to be generated automatically.

[compiler-rt][rtsan] intercept getpeername/recvmmsg/sendmmsg (llvm#12…

18d5d84

…3484)

[MLIR] Add missing include (NFC)

a6bb8a7

Needed for libstdc++ 15 compatibility.

[AutoBump] Merge with a6bb8a7 (Jan 20)

5558ce1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AutoBump] Merge with a6bb8a70 (Jan 20) (5) #543

[AutoBump] Merge with a6bb8a70 (Jan 20) (5) #543

Uh oh!

jorickert commented May 20, 2025

Uh oh!

Uh oh!

[AutoBump] Merge with a6bb8a70 (Jan 20) (5) #543

Are you sure you want to change the base?

[AutoBump] Merge with a6bb8a70 (Jan 20) (5) #543

Uh oh!

Conversation

jorickert commented May 20, 2025

Uh oh!

Uh oh!