chiark / gitweb /
math/mpx-mul4-x86-sse2.S: Optimize `squash'.
authorMark Wooding <mdw@distorted.org.uk>
Sat, 5 Nov 2016 21:28:22 +0000 (21:28 +0000)
committerMark Wooding <mdw@distorted.org.uk>
Mon, 3 Apr 2017 09:12:28 +0000 (10:12 +0100)
commitbd685577211d2706258095a8d3cbb6dd6efd937e
tree8d7531f5d8def61a8e217239e3ace8856a258779
parentd22691798187585df60826582ff71103d18b8367
math/mpx-mul4-x86-sse2.S: Optimize `squash'.

We can use `punpckldq' to assemble the 32-bit pieces, rather than a lot
of shifting to clear bits and then `por'.
math/mpx-mul4-x86-sse2.S