(from parent 1: f79c875)
math/mpx-mul4-amd64-sse2.S: Improve the end-of-loop condition testing.
author Mark Wooding <mdw@distorted.org.uk> Thu, 7 Nov 2019 01:54:57 +0000 (01:54 +0000)
committer Mark Wooding <mdw@distorted.org.uk> Sat, 9 May 2020 19:57:33 +0000 (20:57 +0100)

Previously, I waited until `rdi' was set up for the next iteration
before comparing it against the limit. But in fact, `DV' already has
the right value, so we can compare earlier.
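
The reordering can be modelled in C as a rough sketch. This is not code from the repository: `step_old`, `step_new`, `struct step` and `check_orderings` are hypothetical names, and plain integers stand in for `DV`, `DVLO` and `rdi`. It only illustrates that testing `DV` directly gives the same exit decision as copying it to `rdi` first, and that `rdi` still receives the same value whenever the loop continues.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical model of the end-of-iteration step; not the real code. */
struct step {
    bool done;        /* would `jae 9f' be taken? */
    bool z_loaded;    /* was `rdi' set up on this path? */
    uintptr_t z;      /* value placed in `rdi', if loaded */
};

/* Old ordering: set up `rdi' first, then compare it against the limit. */
static struct step step_old(uintptr_t dv, uintptr_t dvlo)
{
    struct step s = { false, true, dv };   /* mov rdi, DV */
    s.done = (s.z >= dvlo);                /* cmp rdi, DVLO; jae 9f */
    return s;
}

/* New ordering: compare `DV' itself, and only then set up `rdi'. */
static struct step step_new(uintptr_t dv, uintptr_t dvlo)
{
    struct step s = { false, false, 0 };
    s.done = (dv >= dvlo);                 /* cmp DV, DVLO; jae 9f */
    if (!s.done) {
        s.z = dv;                          /* mov rdi, DV */
        s.z_loaded = true;
    }
    return s;
}

/* Walk `dv' in 16-byte steps up to and past an assumed limit and check
 * that both orderings agree on when to stop and on the value of `rdi'. */
static void check_orderings(void)
{
    const uintptr_t dvlo = 64;             /* assumed limit for the sketch */
    for (uintptr_t dv = 0; dv <= dvlo + 16; dv += 16) {
        struct step a = step_old(dv, dvlo);
        struct step b = step_new(dv, dvlo);
        assert(a.done == b.done);
        if (!b.done)
            assert(b.z_loaded && a.z == b.z);
    }
}
```

The comparison is unsigned in both the model and the assembler (`jae'), so the two orderings differ only in whether the register set-up is performed on the exit path.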
math/mpx-mul4-amd64-sse2.S
diff --git a/math/mpx-mul4-amd64-sse2.S b/math/mpx-mul4-amd64-sse2.S
index da3e6d6120adb85f4306be4b4cc0be9b4b645115..1c205a7388499e5b830b07adb18b309eb5851235 100644
--- a/math/mpx-mul4-amd64-sse2.S
+++ b/math/mpx-mul4-amd64-sse2.S
@@ -1270,10 +1270,10 @@ FUNC(mpxmont_redc4_amd64_sse2)
 	jb	7b
 	// All done for this iteration. Start the next.
-8:	mov	rdi, DV			// -> Z = dv[i]
-	mov	rbx, NV			// -> X = nv[0]
-	cmp	rdi, DVLO		// all done yet?
+	cmp	DV, DVLO		// all done yet?
 	jae	9f
+	mov	rdi, DV			// -> Z = dv[i]
+	mov	rbx, NV			// -> X = nv[0]
 	add	DV, 16
 	call	mont4
 	add	rdi, 16