Commit Graph

11 Commits

Author SHA1 Message Date
kageru 146743d2c7
Use fixed array for lut generation
The old version that was generated at runtime didn’t actually contain
1.0 as a value, so getting full white output for int input was impossible.
2020-09-11 15:51:41 +02:00
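
Note on the commit above: the message does not show how the old runtime LUT was built, so the following is only a minimal sketch of one common way such a table ends up without 1.0 as its last value. It assumes a normalized float LUT with `n` entries; the function names `lut_exclusive` and `lut_inclusive` are hypothetical and do not come from the repository.

```rust
/// Hypothetical runtime LUT builder (not the repository's code).
/// Dividing by the entry count makes the last value (n - 1) / n,
/// so 1.0 never appears in the table.
fn lut_exclusive(n: usize) -> Vec<f32> {
    (0..n).map(|i| i as f32 / n as f32).collect()
}

/// Dividing by n - 1 instead makes the last entry exactly 1.0.
/// Assumes n >= 2.
fn lut_inclusive(n: usize) -> Vec<f32> {
    (0..n).map(|i| i as f32 / (n - 1) as f32).collect()
}

/// Scale a normalized value back to an 8-bit sample.
fn to_u8(v: f32) -> u8 {
    (v * 255.0).round() as u8
}
```

Under these assumptions, `to_u8(lut_exclusive(256)[255])` yields 254 while `to_u8(lut_inclusive(256)[255])` yields 255, which matches the symptom described in the commit message: full white is unreachable for integer input when the table tops out below 1.0.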
kageru d762821b88
Add benchmarks 2020-05-24 20:46:23 +02:00
kageru d9b960c6c1
Remove unneeded test import
Now this builds on stable again. I’ll put future benchmarks under
benches/ to be sure.
2020-05-17 00:22:53 +02:00
kageru 4831298a9b
Optimize LUT generation
Directly calling f32::mul_add is actually more accurate and faster here
because rustc seems to be unable to rearrange the original instructions
in a way that can utilize `fma`.
When directly comparing the two implementations, mul_add was about 10%
faster on my machine (Ryzen 1700 with native target in rustflags).
Relevant Godbolt: https://godbolt.org/z/DDRZ4-
2020-05-13 23:26:23 +02:00
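
Note on the commit above: a minimal sketch of the difference between a separate multiply and add and a direct `f32::mul_add` call. The helper names `scale_offset` and `scale_offset_fma` and their parameters are hypothetical; the actual LUT code is not shown in this log. `mul_add` computes `self * a + b` with a single rounding, which is why it can be more accurate, and it can map to a hardware `fma` instruction when the target supports it.

```rust
/// Separate multiply and add: the product is rounded to f32
/// before the addition, so two rounding steps occur.
fn scale_offset(x: f32, scale: f32, offset: f32) -> f32 {
    x * scale + offset
}

/// Fused multiply-add: computes x * scale + offset with one rounding.
fn scale_offset_fma(x: f32, scale: f32, offset: f32) -> f32 {
    x.mul_add(scale, offset)
}

/// Returns (separate, fused) for an input where the rounding of the
/// intermediate product matters: x = 1 + 2^-12, so the exact value of
/// x * x is 1 + 2^-11 + 2^-24, which does not fit in an f32.
fn rounding_demo() -> (f32, f32) {
    let x = 1.0_f32 + 2.0_f32.powi(-12);
    let c = -(1.0_f32 + 2.0_f32.powi(-11));
    (scale_offset(x, x, c), scale_offset_fma(x, x, c))
}
```

In `rounding_demo`, the separate version loses the low-order term of the product and returns 0.0, while the fused version keeps it and returns 2^-24. Whether the fused call is also faster depends on the target features in use, as the commit message notes for a native build.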
kageru a687dd90c4
Use macro for different bitdepths 2020-02-19 20:17:16 +01:00
kageru 13ac41ed97
Remove experiments 2019-08-12 15:01:07 +02:00
kageru cacc762810
Support all common formats 2019-06-02 23:06:24 +02:00
kageru 9236534dab
rustfmt 2019-06-02 15:31:27 +02:00
kageru b4eb5d8399
only enable mask filter and properly check the input format 2019-06-02 14:43:53 +02:00
kageru 783d36bbd2
working implementation with lut 2019-06-01 08:55:47 +02:00
kageru cf9c084cba
initial commit 2019-05-30 11:06:53 +00:00