Alexander Kruppa
|
8100363a85
|
Version of mpn/x86_64/sandybridge/add_n.as, super-optimized for Haswell
New speed about 1.21c/l on Haswell, was 1.33c/l
|
2016-11-25 15:25:09 +01:00 |
|
Alexander Kruppa
|
6316e39430
|
Increasing copy with AVX2 for Haswell
|
2016-11-25 11:51:54 +01:00 |
|
Alexander Kruppa
|
29577b5109
|
Decreasing copy with AVX2 for Haswell
|
2016-11-24 02:01:38 +01:00 |
|
Alexander Kruppa
|
4660be16f6
|
AVX-based rshift for 4-issue Intel cpus (Haswell and newer)
|
2016-11-22 23:18:52 +01:00 |
|
Alexander Kruppa
|
105c26c466
|
AVX-based lshift for 4-issue Intel cpus (Haswell and newer)
|
2016-11-22 21:58:43 +01:00 |
|
Alexander Kruppa
|
99a1f8d05b
|
Add vzeroupper to avoid stall on Haswell if SSE2 code follows
|
2016-11-22 15:03:02 +01:00 |
|
Alexander Kruppa
|
aa75752824
|
AVX-based lshift1 and rshift1 for 4-issue Intel cpus (Haswell and newer)
|
2016-11-18 21:54:07 +01:00 |
|
William Hart
|
8435273a1a
|
Remove sb_div* small implementation (due to bug and due to being a very minor
performance improvement).
|
2015-11-13 14:47:44 +00:00 |
|
William Hart
|
45e7dbc9b4
|
Added piledriver, ivybridge, haswell to configure and fat build.
|
2014-03-25 17:32:34 +00:00 |
|