Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | arith: Don't FFI-import unused dotprod_*_strided ops | Tom Smeding | 2025-03-18 |
| | |||
* | Optimise reductions and dotprod with more vectorisation | Tom Smeding | 2025-03-14 |
| | | | | | | | | | Turns out that if you don't supply -ffast-math, the C compiler will faithfully reproduce your linear reduction order, which is rather disastrous for parallelisation with vector units. This changes the summation order, so numerical results might differ slightly. To wit: the test suite needed adjustment. | ||
* | arith: Correct rank arguments to C wrapper functions | Tom Smeding | 2025-03-13 |
| | |||
* | Add atan2 | Tom Smeding | 2025-03-13 |
| | |||
* | Implement quot/rem | Tom Smeding | 2025-03-13 |
| | |||
* | Slight cleanup of arith instances for Mixed/Ranked/Shaped | Tom Smeding | 2025-03-12 |
| | | | | Reformulate error messages; add NumElt superclass to FloatElt | ||
* | Binary ops without normalisation | Tom Smeding | 2025-03-12 |
| | | | | | | | | | Before: > sum(*) Double [1e6] stride 1; -1: OK > 68.9 ms ± 4.7 ms After: > sum(*) Double [1e6] stride 1; -1: OK > 1.44 ms ± 50 μs | ||
* | arith: Unary float ops on strided arrays without normalisation | Tom Smeding | 2025-03-05 |
| | |||
* | arith: Fix unary op stride bugs | Tom Smeding | 2025-02-18 |
| | |||
* | arith: Only strided unary int ops | Tom Smeding | 2025-02-16 |
| | | | | | This should have negligible overhead and will save a whole bunch of C code duplication when the FUnops are also converted to strided form. | ||
* | arith: Unary int ops on strided arrays without normalisation | Tom Smeding | 2025-02-16 |
| | |||
* | arith: Fix cover calculation in stridesDense | Tom Smeding | 2025-02-16 |
| | | | | | The calculation was incorrect for windowed arrays (as produced by RS.window) | ||
* | Add the criminally absent singletons to numEltMinIndex and numEltMaxIndex | Mikolaj Konarski | 2024-11-19 |
| | |||
* | arith: Respect offsets in dotprodinner | Tom Smeding | 2024-07-21 |
| | |||
* | Add {m,r,s}dot1Inner | Tom Smeding | 2024-06-19 |
| | |||
* | Clean up Foreign.hs | Tom Smeding | 2024-06-18 |
| | |||
* | More sensible argument order to reduce1 C op | Tom Smeding | 2024-06-18 |
| | |||
* | sumAllPrim | Tom Smeding | 2024-06-17 |
| | |||
* | Generalise some of the lifting functions to type-changing | Tom Smeding | 2024-06-17 |
| | |||
* | Conciseness cleanup | Tom Smeding | 2024-06-10 |
| | |||
* | Make arith code aware of negative strides | Tom Smeding | 2024-06-10 |
| | |||
* | Clean up some redundant catMaybes | Tom Smeding | 2024-06-10 |
| | |||
* | Dot product | Tom Smeding | 2024-06-10 |
| | |||
* | Rename arg{min,max} to {min,max}Index | Tom Smeding | 2024-06-10 |
| | |||
* | Fix extremum for replicated input arrays | Tom Smeding | 2024-06-10 |
| | |||
* | argmin and argmax | Tom Smeding | 2024-06-09 |
| | |||
* | Fix stride handling of binary arith ops | Tom Smeding | 2024-06-09 |
| | | | | liftVEltwise2 just completely ignored the existence of strides ._. | ||
* | Properly test C reductions | Tom Smeding | 2024-06-03 |
| | |||
* | Refactor Nested (modules, function names) | Tom Smeding | 2024-05-30 |
| | |||
* | Refactor Mixed (modules, regular function names) | Tom Smeding | 2024-05-30 |