10 Commits

Author SHA1 Message Date
Alexander Kruppa
38e8585c05 Temporarily removed due to bug 2016-11-25 15:27:21 +01:00
Alexander Kruppa
8100363a85 Version of mpn/x86_64/sandybridge/add_n.as, super-optimized for Haswell. New speed about 1.21c/l on Haswell, was 1.33c/l 2016-11-25 15:25:09 +01:00
Alexander Kruppa
6316e39430 Increasing copy with AVX2 for Haswell 2016-11-25 11:51:54 +01:00
Alexander Kruppa
29577b5109 Decreasing copy with AVX2 for Haswell 2016-11-24 02:01:38 +01:00
Alexander Kruppa
4660be16f6 AVX-based rshift for 4-issue Intel cpus (Haswell and newer) 2016-11-22 23:18:52 +01:00
Alexander Kruppa
105c26c466 AVX-based lshift for 4-issue Intel cpus (Haswell and newer) 2016-11-22 21:58:43 +01:00
Alexander Kruppa
99a1f8d05b Add vzeroupper to avoid stall on Haswell if SSE2 code follows 2016-11-22 15:03:02 +01:00
Alexander Kruppa
aa75752824 AVX-based lshift1 and rshift1 for 4-issue Intel cpus (Haswell and newer) 2016-11-18 21:54:07 +01:00
William Hart
8435273a1a Remove sb_div* small implementation (due to bug and due to being a very minor performance improvement). 2015-11-13 14:47:44 +00:00
William Hart
45e7dbc9b4 Added piledriver, ivybridge, haswell to configure and fat build. 2014-03-25 17:32:34 +00:00