glibc

Commit Graph

Author	SHA1	Message	Date
Osama Abdelkader	ff041e8f8e	benchtests: Add benchmarks for frexp functions Add benchmark support for frexp, frexpf, and frexpl to measure the performance improvement of the fast path optimization. - Created frexp-inputs, frexpf-inputs, frexpl-inputs with random test values - Added frexp, frexpf, frexpl to bench-math list - Added CFLAGS to disable builtins for accurate benchmarking These benchmarks will be used to quantify the performance gains from the fast path optimization for normal floating-point numbers. Signed-off-by: Osama Abdelkader <osama.abdelkader@gmail.com>	2025-11-10 08:58:16 -03:00
Adhemerval Zanella	b983c854e6	math: Sync acosh from CORE-MATH The c9abdf80 fix handle some cases for RNDZ. Checked on x86_64-linux-gnu.	2025-11-10 08:58:14 -03:00
Adhemerval Zanella	50000da305	Filter out internal abort during ld.so build clang might generate an abort call when cleanup functions (set by __attribute__ ((cleanup)) calls functions not marked as nothrow. The hurd already provides abort for the loader at sysdeps/mach/hurd/dl-sysdep.c, and adding it rtld-stubbed-symbols triggers duplicate symbols. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2025-11-10 08:58:12 -03:00
Adhemerval Zanella	e3394859ee	x86: Fix THREAD_GSCOPE_RESET_FLAG build on clang clang does not support __seg_fs in asm constraint. Reviewed-by: Uros Bizjak <ubizjak@gmail.com>	2025-11-10 08:58:10 -03:00
Adhemerval Zanella	427c25278d	x86: Adapt "%v" usage on clang to emit VEX enconding clang does not support the %v to select the AVX encoding, nor the '%d' asm contrain, and for AVX build it requires all 3 arguments. This patch add a new internal header, math-inline-asm.h, that adds functions to abstract the inline asm required differences between gcc and clang. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2025-11-10 08:58:06 -03:00
Adhemerval Zanella	d25db12c2a	x86: math: Use of __libgcc_cmp_return__ iff compiler supports it clang does not support '__attribute__ ((mode (__libgcc_cmp_return__)))', so use a more close related type instead fo the default 'int'.	2025-11-10 08:57:59 -03:00
Adhemerval Zanella	bd6b49e11d	string: Check if attribute can declared after function declaration Some symbols that might be auto-generated by the compiler are redefined to internal alias (for instance mempcpy to __mempcpy). However, if fortify is enabled, the fortify wrapper is define before the alias re-defined and clang warns attribute declaration must precede definition. Use an asm alias if compiler does not support it, instead of an attribute. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2025-11-10 08:57:56 -03:00
Adhemerval Zanella	9044d61ff9	nss: Suppress clang -Wstring-plus-int on __nss_shlib_revision definition clang issues an warning that adding 'unsigned long' to a string does not append to the string. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2025-11-10 08:57:53 -03:00
Adhemerval Zanella	ab1a96c978	stdlib: Remove mp_clz_tab.c The count_leading_zeros is not used anymore, so there is no need to provide the table for possible usage. The hppa already provides the compat symbol on libgcc-compat.c. Reviewed-by: Collin Funk <collin.funk1@gmail.com>	2025-11-10 08:57:52 -03:00
Samuel Thibault	494ba55d0b	hurd: make setpriority clamp nice values to 0..2*NZERO-1 [BZ #33614 ]	2025-11-09 11:43:35 +01:00
Samuel Thibault	a701ecc9f6	Revert "hurd: Make rename refuse trailing slashes [BZ #32570 ]" This reverts commit `2ae4ec56c2`. This introduced regressions, as rename should accept trailing slashes for directories: BZ #33607, BZ #33608 This was rather fixed on the server side: https://cgit.git.savannah.gnu.org/cgit/hurd/hurd.git/commit/?id=e34000cb395a135dd2ad5c13e6f6d4c5c1006389	2025-11-09 00:06:38 +01:00
Samuel Thibault	23db68ed9d	Add missing $(rpath-link) to elf/ld.so --library-path This is notably needed for GNU/Hurd's libmachuser.so and libhurduser.so.	2025-11-08 21:47:25 +01:00
Samuel Thibault	4b5eeec17b	Add missing $(rpath-link) to /elf/ld.so --library-path This is notably needed for GNU/Hurd's libmachuser.so and libhurduser.so.	2025-11-08 17:44:48 +01:00
Carlos O'Donell	ea8e2b96d8	nss: Add ERANGE testing to tst-nss-test4 (bug 33361) This adds testing for the fix added in commit: `0fceed2545` "nss: Group merge does not react to ERANGE during merge (bug 33361)" The in-use group size is increased large enough to trigger ERANGE for initial buffers and cause a retry. The actualy size is approximately twice that required to trigger the defect, though any size larger than NSS_BUFLEN_GROUP triggers the defect. Without the fix the group is not merged and the failure is detected, but with the fix the ERANGE error is handled, buffers are enlarged and subsequently correctly merged. Tested with a/b testing before and after patching. Tested on x86_64 with no regression. Co-authored-by: Patsy Griffin <patsy@redhat.com> Reviewed-by: DJ Delorie <dj@redhat.com>	2025-11-08 09:28:11 -05:00
Florian Weimer	259adb087d	aarch64: Remove $(aarch64-bti) check The variable was removed in commit `2c421fc430` ("AArch64: Cleanup PAC and BTI"), so this Makefile fragment is always excluded. Reviewed-by: Yury Khrustalev <yury.khrustalev@arm.com>	2025-11-07 14:12:01 +01:00
Yury Khrustalev	58a31b4316	posix: Fix invalid flags test for p{write,read}v2 Two tests fail from time to time when a new flag is added for the p{write,read}v2 functions in a new Linux kernel: - misc/tst-preadvwritev2 - misc/tst-preadvwritev64v2 This disrupts when testing Glibc on a system with a newer kernel and it seems we can try improve testing for invalid flags setting all the bits that are not supposed to be supported (rather than setting only the next unsupported bit). Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-11-07 08:57:09 +00:00
Florian Weimer	b64335ff11	support: Exit on consistency check failure in resolv_response_add_name Using TEST_VERIFY (crname_target != crname) instructs some analysis tools that crname_target == crname might hold. Under this assumption, they report a use-after-free for crname_target->offset below, caused by the previous free (crname). Reviewed-by: Collin Funk <collin.funk1@gmail.com>	2025-11-07 08:50:44 +01:00
Joe Ramsay	e45af510bc	AArch64: Fix instability in AdvSIMD sinh Previously presence of special-cases in one lane could affect the results in other lanes due to unconditional scalar fallback. The old WANT_SIMD_EXCEPT option (which has never been enabled in libmvec) has been removed from AOR, making it easier to spot and fix this. No measured change in performance. This patch applies cleanly as far back as 2.41, however there are conflicts with 2.40 where sinh was first introduced. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-06 18:30:47 +00:00
Joe Ramsay	6c22823da5	AArch64: Fix instability in AdvSIMD tan Previously presence of special-cases in one lane could affect the results in other lanes due to unconditional scalar fallback. The old WANT_SIMD_EXCEPT option (which has never been enabled in libmvec) has been removed from AOR, making it easier to spot and fix this. 4% improvement in throughput with GCC 14 on Neoverse V1. This bug is present as far back as 2.39 (where tan was first introduced). Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-06 18:30:47 +00:00
Joe Ramsay	5b82fb1882	AArch64: Optimise SVE scalar callbacks Instead of using SVE instructions to marshall special results into the correct lane, just write the entire vector (and the predicate) to memory, then use cheaper scalar operations. Geomean speedup of 16% in special intervals on Neoverse with GCC 14. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-06 15:45:37 +00:00
Florian Weimer	20a2a75608	support: Fix FILE * leak in check_for_unshare_hints in test-container The file opened via fopen is never closed.	2025-11-06 16:27:51 +01:00
H.J. Lu	975c8c4e22	i386: Simplify powl computation for small integral y [BZ #33586 ] On i386, tests added by commit `1b657c53c2` Author: Siddhesh Poyarekar <siddhesh@sourceware.org> Date: Fri Oct 10 20:21:13 2025 -0400 Simplify powl computation for small integral y [BZ #33411] exposed the same bug in i386 e_powl.S: FAIL: math/test-float64x-pow original exit status 1 testing _Float64x (without inline functions) Failure: pow (0x1p+8192, 0x1p+0): Exception "Overflow" set Failure: pow_downward (0x1p+8192, 0x1p+0): Exception "Overflow" set Failure: pow_towardzero (0x1p+8192, 0x1p+0): Exception "Overflow" set Failure: pow_upward (0x1p+8192, 0x1p+0): Exception "Overflow" set FAIL: math/test-ldouble-pow original exit status 1 testing long double (without inline functions) Failure: pow (0x1p+8192, 0x1p+0): Exception "Overflow" set Failure: pow_downward (0x1p+8192, 0x1p+0): Exception "Overflow" set Failure: pow_towardzero (0x1p+8192, 0x1p+0): Exception "Overflow" set Failure: pow_upward (0x1p+8192, 0x1p+0): Exception "Overflow" set Port x86-64 e_powl.S fix to i386 e_powl.S. This fixes BZ #33586. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sunil K Pandey <skpgkp2@gmail.com>	2025-11-06 07:04:17 +08:00
Adhemerval Zanella	3078358ac6	math: Remove the SVID error handling from tgammaf It improves latency for about 1.5% and throughput for about 2-4%. Tested on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-05 10:19:37 -03:00
Adhemerval Zanella	de0e623434	math: Remove the SVID error handling from lgammaf/lgammaf_r It improves latency throughput for about 2%. Tested on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-05 09:27:07 -03:00
Adhemerval Zanella	c0be0b4527	Add FD_PIDFS_ROOT from Linux 6.17 to bits/fcntl-linux.h It was added by commit 3941e37f62fe2c3c8b8675c12183185f20450539 Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	1e750f62c4	Add AT_EXECVE_CHECK from Linux 6.14 to bits/fcntl-linux.h It was added by commit a5874fde3c0884a33ed4145101052318c5e17c74 Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	04e6bdb437	Add AT_HANDLE_CONNECTABLE from Linux 6.13 to bits/fcntl-linux.h It was added by commit c374196b2b9f4b803fccd59ed82f0712041e21e1. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	f8a6b02223	linux: Update statx-generic.h with linux 6.16 It adds the new fields on generic statx struct from Linux commit 5d894321c49e61379189b0ff605f316e39cbd1e9. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	cf73362a7e	linux: Update statx-generic.h with linux 6.14 It adds the new constant STATX_DIO_READ_ALIGN and related fields in generic statx struct from Linux commit 7ed6cbe0f8caa6ee38a2dc8f1b925acb904cc01f. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	5a2ace34fc	linux: Add STATX_WRITE_ATOMIC/STATX_ATTR_WRITE_ATOMIC definitions to generic statx The commit `fc650bfd71` added STATX_WRITE_ATOMIC/STATX_ATTR_WRITE_ATOMIC on the statx-generic.h without updating the generic statx struct. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	4540cc2237	linux: Add STATX_SUBVOL definition to generic statx The commit `bbb6a742c8` added STATX_SUBVOL on the statx-generic.h without updating the generic statx struct.	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	2ae6152144	linux: Add STATX_DOALIGN definition to generic statx The commit `07937809ac` added STATX_MNT_ID_UNIQUE on the statx-generic.h without updating the generic statx struct.	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	0d0726e4eb	linux: Add STATX_MNT_ID_UNIQUE definition to generic statx The commit `88a2cf6c4b` added STATX_MNT_ID_UNIQUE on the statx-generic.h without updating the generic statx struct.	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	03d9cb23b8	Update syscall lists for Linux 6.17 Linux 6.16 adds no new syscalls, while Linux 6.17 adds file_getattr and file_setattr (commit be7efb2d20d67f334a7de2aef77ae6c69367e646). Update syscall-names.list and regenerate the arch-syscall.h headers with build-many-glibcs.py update-syscalls.	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	c0c9524a11	Update PIDFD_* constants for Linux 6.17 The pidfd interface was extended with: * PIDFD_GET_INFO and pidfd_info (along with related extra flags) to allow get information about the process without the need to parse /proc (commit cdda1f26e74ba, Linux 6.13). * PIDFD_SELF_{THREAD,THREAD_GROUP,SELF,SELF_PROCESS} to allow pidfd_send_signal refer to the own process or thread lead groups without the need of allocating a file descriptor (commit f08d0c3a71114, Linux 6.15). * PIDFD_INFO_COREDUMP that extends PIDFD_GET_INFO to obtain coredump information. Linux uAPI header defines both PIDFD_SELF_THREAD and PIDFD_SELF_THREAD_GROUP on linux/fcntl.h (since they reserve part of the AT_* values), however for glibc I do not see any good reason to add pidfd definitions on fcntl-linux.h. The tst-pidfd.c is extended with some PIDFD_SELF_* tests and a new ‘tst-pidfd_getinfo.c’ test is added to check PIDFD_GET_INFO. The PIDFD_INFO_COREDUMP tests would require very large and complex tests that are already covered by kernel tests. Checked on aarch64-linux-gnu and x86_64-linux-gnu on kernels 6.8 and 6.17. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	bd7be9f447	Update kernel version to 6.17 in header constant tests There are no new constants covered by tst-mman-consts.py, tst-mount-consts.py or tst-sched-consts.py in Linux 6.17.	2025-11-05 07:15:52 -03:00
Adhemerval Zanella	7ec8eb5676	math: Remove the SVID error handling from atan2f It improves latency for about 3-6% and throughput for about 5-12%. Tested on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-05 07:15:52 -03:00
Collin Funk	a5cc3018f3	Add feature test macros for POSIX.1-2024. * include/features.h (_POSIX_C_SOURCE): Document the value of 202405L for POSIX.1-2024. Set it to 202405L when _GNU_SOURCE or _DEFAULT_SOURCE is defined. (_XOPEN_SOURCE): Document the value of 800 for POSIX-1.2024. Set it to 800 when _GNU_SOURCE is defined. (__USE_XOPEN2K24, __USE_XOPEN2K24XSI): New internal macros. Set them when _POSIX_C_SOURCE is 202405L or greater and/or when _XOPEN_SOURCE is 800 or greater. * manual/creature.texi (Feature Test Macros): Document the new values for _POSIX_C_SOURCE and _XOPEN_SOURCE. Reviewed-by: Carlos O'Donell <carlos@redhat.com> Signed-off-by: Collin Funk <collin.funk1@gmail.com>	2025-11-04 18:00:57 -08:00
Joseph Myers	26e4810210	Rename fromfp files in preparation for changing types for C23 As discussed in bug 28327, the fromfp functions changed type in C23 (compared to the version in TS 18661-1); they now return the same type as the floating-point argument, instead of intmax_t / uintmax_t. As with other such incompatible changes compared to the initial TS 18661 versions of interfaces (the types of totalorder functions, in particular), it seems appropriate to support only the new version as an API, not the old one (although many programs written for the old API might in fact work wtih the new one as well). Thus, the existing implementations should become compat symbols. They are sufficiently different from how I'd expect to implement the new version that using separate implementations in separate files is more convenient than trying to share code, and directly sharing testcases would be problematic as well. Rename the existing fromfp implementation and test files to names reflecting how they're intended to become compat symbols, so freeing up the existing filenames for a subsequent implementation of the C23 versions of these functions (which is the point at which the existing implementations would actually become compat symbols). gen-fromfp-tests.py and gen-fromfp-tests-inputs are not renamed; I think it will make sense to adapt the test generator to be able to generate most tests for both versions of the functions (with extra test inputs added that are only of interest with the C23 version). The ldbl-opt/nldbl-* files are also not renamed; since those are for a static only library, no compat versions are needed, and they'll just have their contents changed when the C23 version is implemented. Tested for x86_64, and with build-many-glibcs.py.	2025-11-04 23:41:35 +00:00
Joseph Myers	26d11a0944	Add C23 long_double_t, _FloatN_t C23 Annex H adds <math.h> typedefs long_double_t and _FloatN_t (originally introduced in TS 18661-3), analogous to float_t and double_t. Add these typedefs to glibc. (There are no _FloatNx_t typedefs.) C23 also slightly changes the rules for how such typedef names should be defined, compared to the definition in TS 18661-3. In both cases, <TYPE>_t corresponds to the evaluation format for <TYPE>, as specified by FLT_EVAL_METHOD (for which <math.h> uses glibc's internal __GLIBC_FLT_EVAL_METHOD). Specifically, each FLT_EVAL_METHOD value corresponds to some type U (for example, 64 corresponds to U = _Float64), and for types with exactly the same set of values as U, TS 18661-3 says expressions with those types are to be evaluated to the range and precision of type U (so <TYPE>_t is defined to U), whereas C23 only does that for types whose values are a strict subset of those of type U (so <TYPE>_t is defined to <TYPE>). As with other cases where semantics changed between TS 18661 and C23, this patch only implements the newer version of the semantics (including adjusting existing definitions of float_t and double_t as needed). The new semantics are contradictory between the main standard and Annex H for the case of FLT_EVAL_METHOD == 2 and the choice of double_t when double and long double have the same values (the main standard says it's defined as long double in that case, whereas Annex H would define it as double), which I've raised on the WG14 reflector (but I think setting FLT_EVAL_METHOD == 2 when double and long double have the same values is a fairly theoretical combination of features); for now glibc follows the value in the main standard in that case. Note that I think all existing GCC targets supported by glibc only use values -1, 0, 1, 2 or 16 for FLT_EVAL_METHOD (so most of the header code is somewhat theoretical, though potentially relevant with other compilers since the choice of FLT_EVAL_METHOD is only an API choice, not an ABI one; it can vary with compiler options, and these typedefs should not be used in ABIs). The testcase (expanded to cover the new typedefs) is really just repeating the same logic in a second place (so all it really tests is that __GLIBC_FLT_EVAL_METHOD is consistent with FLT_EVAL_METHOD). Tested for x86_64 and x86, and with build-many-glibcs.py.	2025-11-04 17:12:00 +00:00
Peter Bergner	47975914fb	riscv: Add vector registers to __SYSCALL_CLOBBERS The Linux kernel ABI specifies that the vector registers are not preserved across system calls, but the __SYSCALL_CLOBBERS macro doesn't mention them. This could possibly lead to compilers trying to keep data in the vector registers across the syscall leading to corruption. Add the vector registers to __SYSCALL_CLOBBERS when the vector extension is enabled. If the vector extension is enabled, then require GCC 15 or later and RVV 1.0 or later. Fixes: `36960f0c76` ("RISC-V: Linux Syscall Interface") Signed-off-by: Peter Bergner <bergner@tenstorrent.com>	2025-11-04 09:18:56 -06:00
Collin Funk	376e336d92	Regenerate charmap-kw.h and locfile-kw.h with gperf 3.3 In commit `970364dac0` we switched some /FALLTHROUGH/ comments to [[fallthrough]] to avoid warnings with Clang. However, since gperf emitted different output the buildbot failed. The buildbot has been updated to use gperf 3.3 which will use __attribute__ ((__fallthrough__)) where needed to avoid warnings [1]. This patch regenerates these files with the same version. [1] https://sourceware.org/pipermail/libc-testresults/2025q4/014123.html Reviewed-by: Mark Wielaard <mark@klomp.org>	2025-11-04 06:15:54 -08:00
Adhemerval Zanella	0dfc849eff	math: Remove the SVID error handling wrapper from sqrt i386 and m68k architectures should use math-use-builtins-sqrt.h rather than relying on architecture-specific or inline assembly implementations. The PowerPC optimization for PPC 601/603 (30 years old) is removed. Tested on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-04 04:14:01 -03:00
Adhemerval Zanella	f27a146409	math: Remove the SVID error handling from sinhf It improves latency for about 3-10% and throughput for about 5-15%. Tested on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-04 04:14:01 -03:00
Adhemerval Zanella	0e1a1178ee	math: Remove the SVID error handling from remainder The optimized i386 version is faster than the generic one, and gcc implements it through the builtin. This optimization enables us to migrate the implementation to a C version. The performance on a Zen3 chip is similar to the SVID one. The m68k provided an optimized version through __m81_u(remainderf) (mathimpl.h), and gcc does not implement it through a builtin (different than i386). Performance improves a bit on x86_64 (Zen3, gcc 15.2.1): reciprocal-throughput input master NO-SVID improvement x86_64 subnormals 18.8522 16.2506 13.80% x86_64 normal 421.8260 403.9270 4.24% x86_64 close-exponent 21.0579 18.7642 10.89% i686 subnormals 21.3443 21.4229 -0.37% i686 normal 525.8380 538.807 -2.47% i686 close-exponent 21.6589 21.7983 -0.64% Tested on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-04 04:14:01 -03:00
Adhemerval Zanella	c4c6c79d70	math: Remove the SVID error handling from remainderf The optimized i386 version is faster than the generic one, and gcc implements it through the builtin. This optimization enables us to migrate the implementation to a C version. The performance on a Zen3 chip is similar to the SVID one. The m68k provided an optimized version through __m81_u(remainderf) (mathimpl.h), and gcc does not implement it through a builtin (different than i386). Performance improves a bit on x86_64 (Zen3, gcc 15.2.1): reciprocal-throughput input master NO-SVID improvement x86_64 subnormals 17.5349 15.6125 10.96% x86_64 normal 53.8134 52.5754 2.30% x86_64 close-exponent 20.0211 18.6656 6.77% i686 subnormals 21.8105 20.1856 7.45% i686 normal 73.1945 71.2199 2.70% i686 close-exponent 22.2141 20.331 8.48% Tested on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-04 04:14:01 -03:00
Wilco Dijkstra	324c088a18	nptl: Remove ATOMIC_EXCHANGE_USES_CAS usage The only usage was for pthread_spin_lock, introduced by `12d2dd7060`, as a way to optimize the code for certain architectures. Now that atomic builtins are used by default, let the compiler use the best code sequence for the atomic exchange. Co-authored-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-04 04:14:01 -03:00
Wilco Dijkstra	53807741fb	Define __HAVE_64B_ATOMICS from compiler support Now that atomic builtins are used by default, we can rely on the compiler to define when to use 64-bit atomic operations. It allows the use of 64-bit atomic operations on some 32-bit ABIs where they were not previously enabled due to missing pre-processor handling: hppa, mips64n32, s390, and sparcv9. Co-authored-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: Uros Bizjak <ubizjak@gmail.com> Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-04 04:14:01 -03:00
Adhemerval Zanella	95a0ad1ea1	atomic: Consolidate atomic_write_barrier implementation All ABIs, except alpha and sparc, define it to atomic_full_barrier/__sync_synchronize, which can be mapped to __atomic_thread_fence (__ATOMIC_RELEASE). For alpha, it uses a 'wmb' which does not map to any of C11 barriers. For sparc it uses a stronger 'member #LoadStore \| #StoreStore', where the release barrier maps to just 'membar #StoreLoad'. The patch keeps the sparc definition. For PowerPC, it allows the use of lwsync for additional chips (since _ARCH_PWR4 does not cover all chips that support it). Tested on aarch64-linux-gnu. Co-authored-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com> Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-04 04:14:01 -03:00
Adhemerval Zanella	304b22d7f9	atomic: Consolidate atomic_read_barrier implementation All ABIs, except alpha, powerpc, and x86_64, define it to atomic_full_barrier/__sync_synchronize, which can be mapped to __atomic_thread_fence (__ATOMIC_SEQ_CST) in most cases, with the exception of aarch64 (where the acquire fence is generated as 'dmb ishld' instead of 'dmb ish'). For s390x, it defaults to a memory barrier where __sync_synchronize emits a 'bcr 15,0' (which the manual describes as pipeline synchronization). For PowerPC, it allows the use of lwsync for additional chips (since _ARCH_PWR4 does not cover all chips that support it). Tested on aarch64-linux-gnu, where the acquire produces a different instruction that the current code. Co-authored-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com> Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-11-04 04:14:01 -03:00

1 2 3 4 5 ...

43083 Commits All Branches Search

43083 Commits

All Branches