diff options
| author | Tom Smeding <tom@tomsmeding.com> | 2025-03-14 21:57:56 +0100 | 
|---|---|---|
| committer | Tom Smeding <tom@tomsmeding.com> | 2025-03-14 21:58:51 +0100 | 
| commit | 6276ed3c7bcd20c8b860e1275386ecd068671bcc (patch) | |
| tree | b2710f261d12a7a1b73962691c187752663543f6 /example | |
| parent | 308ca9fac150cd28d62afef852f26ae4c40fa5a0 (diff) | |
Optimise reductions and dotprod with more vectorisation
Turns out that if you don't supply -ffast-math, the C compiler will
faithfully reproduce your linear reduction order, which is rather
disastrous for parallelisation with vector units.
This changes the summation order, so numerical results might differ
slightly. To wit: the test suite needed adjustment.
Diffstat (limited to 'example')
0 files changed, 0 insertions, 0 deletions
