aboutsummaryrefslogtreecommitdiff
path: root/src/Data/Array/Mixed/Internal
Commit message (Collapse)AuthorAge
* arith: Don't FFI-import unused dotprod_*_strided opsTom Smeding2025-03-18
|
* Optimise reductions and dotprod with more vectorisationTom Smeding2025-03-14
| | | | | | | | | Turns out that if you don't supply -ffast-math, the C compiler will faithfully reproduce your linear reduction order, which is rather disastrous for parallelisation with vector units. This changes the summation order, so numerical results might differ slightly. To wit: the test suite needed adjustment.
* arith: Correct rank arguments to C wrapper functionsTom Smeding2025-03-13
|
* Add atan2Tom Smeding2025-03-13
|
* Implement quot/remTom Smeding2025-03-13
|
* Slight cleanup of arith instances for Mixed/Ranked/ShapedTom Smeding2025-03-12
| | | | Reformulate error messages; add NumElt superclass to FloatElt
* Binary ops without normalisationTom Smeding2025-03-12
| | | | | | | | | Before: > sum(*) Double [1e6] stride 1; -1: OK > 68.9 ms ± 4.7 ms After: > sum(*) Double [1e6] stride 1; -1: OK > 1.44 ms ± 50 μs
* arith: Unary float ops on strided arrays without normalisationTom Smeding2025-03-05
|
* arith: Fix unary op stride bugsTom Smeding2025-02-18
|
* arith: Only strided unary int opsTom Smeding2025-02-16
| | | | | This should have negligible overhead and will save a whole bunch of C code duplication when the FUnops are also converted to strided form.
* arith: Unary int ops on strided arrays without normalisationTom Smeding2025-02-16
|
* arith: Fix cover calculation in stridesDenseTom Smeding2025-02-16
| | | | | The calculation was incorrect for windowed arrays (as produced by RS.window)
* Add the criminally absent singletons to numEltMinIndex and numEltMaxIndexMikolaj Konarski2024-11-19
|
* arith: Respect offsets in dotprodinnerTom Smeding2024-07-21
|
* Add {m,r,s}dot1InnerTom Smeding2024-06-19
|
* Clean up Foreign.hsTom Smeding2024-06-18
|
* More sensible argument order to reduce1 C opTom Smeding2024-06-18
|
* sumAllPrimTom Smeding2024-06-17
|
* Generalise some of the lifting functions to type-changingTom Smeding2024-06-17
|
* Conciseness cleanupTom Smeding2024-06-10
|
* Make arith code aware of negative stridesTom Smeding2024-06-10
|
* Clean up some redundant catMaybesTom Smeding2024-06-10
|
* Dot productTom Smeding2024-06-10
|
* Rename arg{min,max} to {min,max}IndexTom Smeding2024-06-10
|
* Fix extremum for replicated input arraysTom Smeding2024-06-10
|
* argmin and argmaxTom Smeding2024-06-09
|
* Fix stride handling of binary arith opsTom Smeding2024-06-09
| | | | liftVEltwise2 just completely ignored the existence of strides ._.
* Properly test C reductionsTom Smeding2024-06-03
|
* Refactor Nested (modules, function names)Tom Smeding2024-05-30
|
* Refactor Mixed (modules, regular function names)Tom Smeding2024-05-30