[raymath] Added SSE to MatrixMultiply()
#5427
Merged
+62
−1
I saw your LLM challenge on Twitter and decided to try it. I had to edit it a bit to make it work, but hey. It runs and it does what you need it to. I saw #5316 and was like "oh yeah."
With the SSE version I am getting roughly half a second per 200,000,000 ops, with a range of 2.5 ns - 3.2 ns per op. The before picture is roughly one second per 200,000,000 ops, with a range of 4.6 ns - 5.1 ns per op.
I used an LLM to generate the test: https://gist.github.com/mcdubhghlas/53f780c35482aa87713f34ded941a67d
It was compiled using this:
cc benchmark.c src/libraylib.a -Isrc -O3 -march=native -lm -lpthread -ldl -lrt -lX11 -o bench
Let me know if that is insufficient. I genuinely was too busy to read over the documentation to see if you had a test suite. That said, I love what you do, Ray. Keep up the good work man :)
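For anyone skimming, here is a minimal sketch of the general idea behind an SSE 4x4 multiply. It is not the code from this PR. `Mat4`, `mat4_mul_scalar` and `mat4_mul_sse` are hypothetical names made up for illustration, and the row-major `float[16]` layout is an assumption that does not match raylib's `Matrix` struct. Each output row is built as a sum of the rows of `b`, scaled by the matching entries of the row of `a`. On targets without SSE it falls back to the scalar loop.

```c
#include <stddef.h>
#if defined(__SSE__)
#include <xmmintrin.h>  /* SSE intrinsics: _mm_loadu_ps, _mm_mul_ps, ... */
#endif

/* Hypothetical type for illustration: a row-major 4x4 float matrix.
   This is NOT raylib's Matrix struct, which has its own field layout. */
typedef struct Mat4 { float m[16]; } Mat4;

/* Plain scalar reference: returns a * b. */
Mat4 mat4_mul_scalar(const Mat4 *a, const Mat4 *b)
{
    Mat4 out;
    for (int i = 0; i < 4; i++) {
        for (int j = 0; j < 4; j++) {
            float sum = 0.0f;
            for (int k = 0; k < 4; k++) sum += a->m[i*4 + k] * b->m[k*4 + j];
            out.m[i*4 + j] = sum;
        }
    }
    return out;
}

/* SSE variant: returns a * b, computing one full output row per iteration. */
Mat4 mat4_mul_sse(const Mat4 *a, const Mat4 *b)
{
#if defined(__SSE__)
    Mat4 out;
    /* Load the four rows of b once; unaligned loads keep the sketch simple. */
    __m128 b0 = _mm_loadu_ps(&b->m[0]);
    __m128 b1 = _mm_loadu_ps(&b->m[4]);
    __m128 b2 = _mm_loadu_ps(&b->m[8]);
    __m128 b3 = _mm_loadu_ps(&b->m[12]);

    for (int i = 0; i < 4; i++) {
        /* row_i(out) = a[i][0]*b0 + a[i][1]*b1 + a[i][2]*b2 + a[i][3]*b3 */
        __m128 r = _mm_mul_ps(_mm_set1_ps(a->m[i*4 + 0]), b0);
        r = _mm_add_ps(r, _mm_mul_ps(_mm_set1_ps(a->m[i*4 + 1]), b1));
        r = _mm_add_ps(r, _mm_mul_ps(_mm_set1_ps(a->m[i*4 + 2]), b2));
        r = _mm_add_ps(r, _mm_mul_ps(_mm_set1_ps(a->m[i*4 + 3]), b3));
        _mm_storeu_ps(&out.m[i*4], r);
    }
    return out;
#else
    /* No SSE on this target: fall back to the scalar loop. */
    return mat4_mul_scalar(a, b);
#endif
}
```

The SSE path does 16 vector multiplies and 12 vector adds, against 64 scalar multiplies in the plain loop. How much of that shows up in a benchmark depends on the compiler and flags, since `-O3 -march=native` may auto-vectorize the scalar version too.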