Alexander Kruppa
|
e508181a75
|
Version of mpn/x86_64/sandybridge/sub_n.as, super-optimized for Haswell
New speed about 1.20c/l on Haswell, was 1.33c/l
|
2016-11-28 19:43:46 +01:00 |
|
Brian Gladman
|
17b81f6006
|
add mpn_add_n and mpn_add_nc to x64 haswell build
|
2016-11-27 14:06:56 +00:00 |
|
Brian Gladman
|
d0d949835a
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2016-11-27 10:56:14 +00:00 |
|
Alexander Kruppa
|
5d75ebc8bf
|
Reduce number of registers used and use %defines for register names
|
2016-11-27 00:51:45 +01:00 |
|
Brian Gladman
|
d61bdcaf09
|
set release build of tests for foster linking
|
2016-11-26 22:52:26 +00:00 |
|
Brian Gladman
|
77b483e79f
|
add more win64 assembler for haswell
|
2016-11-26 22:35:25 +00:00 |
|
Brian Gladman
|
a95556b926
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2016-11-26 18:35:20 +00:00 |
|
Brian Gladman
|
ee198165c9
|
prepare to add win64 assembler code with parameters in XMM/YMM registers
|
2016-11-26 09:41:07 +00:00 |
|
Alexander Kruppa
|
d11c3ca728
|
Bugfix: operand name macros were wrong
|
2016-11-25 18:11:38 +01:00 |
|
Alexander Kruppa
|
ea49db539e
|
Revert "Temporarily removed due to bug"
This reverts commit 38e8585c05 .
|
2016-11-25 18:11:21 +01:00 |
|
Brian Gladman
|
ed3aa00581
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2016-11-25 15:49:11 +00:00 |
|
Alexander Kruppa
|
38e8585c05
|
Temporarily removed due to bug
|
2016-11-25 15:27:21 +01:00 |
|
Alexander Kruppa
|
8100363a85
|
Version of mpn/x86_64/sandybridge/add_n.as, super-optimized for Haswell
New speed about 1.21c/l on Haswell, was 1.33c/l
|
2016-11-25 15:25:09 +01:00 |
|
Alexander Kruppa
|
f7f64a4ff2
|
Add missing colon
|
2016-11-25 14:55:31 +01:00 |
|
Alexander Kruppa
|
6316e39430
|
Increasing copy with AVX2 for Haswell
|
2016-11-25 11:51:54 +01:00 |
|
Alexander Kruppa
|
29577b5109
|
Decreasing copy with AVX2 for Haswell
|
2016-11-24 02:01:38 +01:00 |
|
Alexander Kruppa
|
4660be16f6
|
AVX-based rshift for 4-issue Intel cpus (Haswell and newer)
|
2016-11-22 23:18:52 +01:00 |
|
Alexander Kruppa
|
105c26c466
|
AVX-based lshift for 4-issue Intel cpus (Haswell and newer)
|
2016-11-22 21:58:43 +01:00 |
|
Alexander Kruppa
|
99a1f8d05b
|
Add vzeroupper to avoid stall on Haswell if SSE2 code follows
|
2016-11-22 15:03:02 +01:00 |
|
Brian Gladman
|
0b48b8a843
|
add lshift1/rshiift1 to Visual Studio build
|
2016-11-18 22:03:24 +00:00 |
|
Alexander Kruppa
|
aa75752824
|
AVX-based lshift1 and rshift1 for 4-issue Intel cpus (Haswell and newer)
|
2016-11-18 21:54:07 +01:00 |
|
William Hart
|
8435273a1a
|
Remove sb_div* small implementation (due to bug and due to being a very minor
performance improvement).
|
2015-11-13 14:47:44 +00:00 |
|
William Hart
|
0deca61277
|
Merge remote-tracking branch 'brian/master'
|
2015-11-13 13:06:17 +00:00 |
|
William Hart
|
a56b786513
|
Correct tuning values for x86/nehalem.
|
2015-10-03 12:29:33 +00:00 |
|
Brian Gladman
|
1d5c6f0508
|
correct bug in Windows assembler for k10/k102
|
2015-09-08 22:48:15 +01:00 |
|
William Hart
|
f541fce21f
|
Removed slower sandybrige rshift1.
|
2015-06-12 11:27:42 +00:00 |
|
William Hart
|
de759a4525
|
Replace lahf and sahf with .byte declarations to support old coreutils such as
that used by OpenBSD-5.6.
|
2015-06-11 12:51:45 +00:00 |
|
William Hart
|
48bf24bdea
|
Add missing rshift1.as/asm in Sandy Bridge.
|
2015-06-10 01:40:32 +00:00 |
|
William Hart
|
acd49c3818
|
INCLUDES -> AM_CPPFLAGS as per autoreconf advice.
|
2015-06-09 17:21:21 +00:00 |
|
William Hart
|
33c8ccc034
|
Ran autoreconf -fiv --no-recursive.
|
2015-06-09 15:50:53 +00:00 |
|
William Hart
|
240255708f
|
Patch from Sage for fat binary build on 32 bit system.
|
2015-06-09 14:14:02 +00:00 |
|
William Hart
|
a6538ff19e
|
Small improvement due to Paul Zimmermann.
|
2015-02-03 17:13:23 +00:00 |
|
William Hart
|
4a5686d5fb
|
Added note about the revised version of Paul Zimmermann's article.
|
2015-02-03 16:59:31 +00:00 |
|
William Hart
|
24b7b6eab7
|
Small improvement, suggested by Paul Zimmermann.
|
2015-02-02 23:41:47 +00:00 |
|
William Hart
|
248e56c88d
|
Removed unnecessary line of code.
|
2015-02-02 13:57:01 +00:00 |
|
William Hart
|
95d4eab65c
|
Fixed bugs in invert.c and t-invert.c. The code was always computing correct
results, but this seems to be more of an accident than by design.
|
2015-02-02 12:30:50 +00:00 |
|
William Hart
|
a0e819627e
|
Merge remote-tracking branch 'brian/master'
|
2014-10-01 13:11:30 +01:00 |
|
BrianGladman
|
277565fdb4
|
1. Correct error in last commit for mpir_config.py
2. rebuild standard Windows builds
3. correct an error in win32 builds
|
2014-08-07 22:07:24 +01:00 |
|
BrianGladman
|
9194070b24
|
remove faulty tuning files from the mpn\x86w assembler directories
|
2014-07-14 22:14:57 +01:00 |
|
William Hart
|
b5f8c19e58
|
Remove a broken assert in the generic build.
|
2014-06-29 21:05:59 +01:00 |
|
William Hart
|
f996e6343f
|
Replace a gmp.h with mpir.h.
|
2014-06-26 14:33:32 +01:00 |
|
William Hart
|
bb15e496b6
|
Remove some compiler warnings.
|
2014-06-16 14:37:08 +01:00 |
|
William Hart
|
b67f620f08
|
Missing endline.
|
2014-06-16 14:19:48 +01:00 |
|
BrianGladman
|
d5f382c4f3
|
add the mpn_mul_fft function (needed by GMP-ECM) and update Windows builds accordingly
|
2014-06-12 14:34:20 +01:00 |
|
BrianGladman
|
206700ebe9
|
copy Atom tuning to Windows
|
2014-04-05 00:30:25 +01:00 |
|
William Hart
|
fedb3560cc
|
Added 64 bit atom tuning values. Thank you JP Flori!
|
2014-04-04 21:40:13 +01:00 |
|
BrianGladman
|
84a0c677b1
|
Merge branch 'master' of https://github.com/wbhart/mpir
|
2014-04-01 08:24:12 +01:00 |
|
BrianGladman
|
e3e00705de
|
add Ivybridge tuning on Windows
|
2014-04-01 08:23:35 +01:00 |
|
William Hart
|
af62ef2f1f
|
Put netburst tuning back.
|
2014-04-01 01:13:01 +01:00 |
|
BrianGladman
|
053a6cd91a
|
copy GCC tuning values over to Windows
|
2014-04-01 00:35:49 +01:00 |
|