Commit Graph

1229 Commits

Author SHA1 Message Date
BrianGladman
738c8f9987 Merge branch 'master' of https://github.com/wbhart/mpir
Conflicts:
	mpn/x86_64w/sqr_basecase.asm
2014-03-27 11:24:34 +00:00
BrianGladman
f1880215db merge Bill's latest changes 2014-03-27 07:58:58 +00:00
William Hart
4cfdfcba07 Fixed a funky bug which only shows up on Windows in mpn_mulmod_basecase. 2014-03-27 00:23:00 +00:00
William Hart
854405b4df Removed an unnecessary file and lots of very broken references to it. 2014-03-26 23:15:50 +00:00
William Hart
e924619fd7 Fix a bug in x86_64w/sqr_basecase.asm. 2014-03-26 20:48:06 +00:00
BrianGladman
41171b367a Adjust Windows builds for filename changes; correct error in sqr_basecase.asm; fix Windows build errors in DEBUG mode 2014-03-26 20:19:54 +00:00
William Hart
c0d68b3837 ARM tuning values (thanks Volker Braun). 2014-03-26 14:54:46 +00:00
William Hart
01404cb523 Merge remote-tracking branch 'brian/master' 2014-03-26 03:16:22 +00:00
William Hart
691f114d92 Removed some incorrect asserts from mulmod_2expp1_basecase.c. 2014-03-25 23:11:26 +00:00
William Hart
4ff1e2d684 Remove dc_divappr_q_n and renamed fft_negacylic to fft_negacyclic. 2014-03-25 21:17:35 +00:00
William Hart
9cf9b9bbaf Remove gcd_finda.asm and references to it. 2014-03-25 20:14:46 +00:00
William Hart
c8e4a9e590 Added bobcat tuning values. Thanks Frithjof Schulze. 2014-03-25 18:31:58 +00:00
William Hart
45e7dbc9b4 Added piledriver, ivybridge, haswell to configure and fat build. 2014-03-25 17:32:34 +00:00
William Hart
0795469230 Add new x86_64 assembly functions provided for all platforms to fat build. 2014-03-25 15:24:17 +00:00
William Hart
af5e5416c8 Remove unused tuning values, insert provisional FFT tuning values. See #92. 2014-03-25 14:33:43 +00:00
William Hart
08e6470110 Tuning for powerpc32. Remove adjustments to tuneup.c as they don't help on
ppc32.
2014-03-24 23:41:24 +00:00
William Hart
2e0b2137d6 Added tuning for powerpc64 and tried to fix at least one problem with tuning on
that platform.
2014-03-24 23:11:11 +00:00
William Hart
d6fe8951ee Added tuning for sparc32/v9 and default value for FFT_MULMOD_2EXPP1_CUTOFF. 2014-03-24 22:36:54 +00:00
William Hart
d17ac9957b Added x86/pentium4/sse2 tuning values. 2014-03-24 22:10:02 +00:00
William Hart
bcad8933d8 Whether is it nobler to do sparc64 tuning with an old gcc on a new machine or a
new gcc on an old machine... or to just tell sparc owners that tuning aint gonna
help their speed problem...
2014-03-24 21:07:07 +00:00
William Hart
984d9dbe8e Missing endlines and jacobi prototypes for gmp-impl.h. 2014-03-24 18:54:51 +00:00
William Hart
5bb2ca9aee Provisional sparc64 tuning values. 2014-03-24 18:21:47 +00:00
William Hart
10283de0c6 Spacing. 2014-03-24 17:33:45 +00:00
BrianGladman
f6691eaf65 update windows tuning by copying from GCC/Linux 2014-03-22 16:05:46 +00:00
BrianGladman
f2aad90064 Merge branch 'master' of https://github.com/wbhart/mpir 2014-03-22 12:07:35 +00:00
William Hart
a6b31574a0 Update final cutoff for ia64. 2014-03-21 18:37:23 +00:00
William Hart
403c9e4cd7 Updated ia64 tuning. 2014-03-21 18:28:58 +00:00
William Hart
06344c67e8 Added core2 tuning values. 2014-03-21 17:13:17 +00:00
William Hart
dfe64a438f Update nehalem tuning values. 2014-03-21 16:34:38 +00:00
William Hart
4dea3d29f9 Added piledriver tuning. 2014-03-21 16:21:44 +00:00
William Hart
b3471ddac5 Update sandybridge and westmere tuning values. 2014-03-21 16:05:02 +00:00
BrianGladman
d0bdadf0e4 Merge branch 'master' of https://github.com/wbhart/mpir 2014-03-21 15:59:48 +00:00
William Hart
25dc06c67f Add bulldozer tuning values (not currently used). 2014-03-21 15:44:22 +00:00
William Hart
34d5011417 Updated k10 tuning values. 2014-03-21 15:28:00 +00:00
William Hart
0f873fe8b5 Update k8 tuning values. 2014-03-21 15:20:55 +00:00
William Hart
c50bf32dfc Added netburst tuning. 2014-03-21 15:05:00 +00:00
BrianGladman
74524a3e29 Merge branch 'master' of https://github.com/wbhart/mpir 2014-03-20 16:16:06 +00:00
William Hart
a307e2221f New penryn tuning. 2014-03-20 16:12:00 +00:00
BrianGladman
8a93786668 Merge branch 'master' of https://github.com/wbhart/mpir 2014-03-20 15:53:33 +00:00
William Hart
3b6a11e4c5 Added tuning code from GMP for gcd, gcdext, jacobi and fac_ui. 2014-03-20 15:28:01 +00:00
BrianGladman
c3a91f5403 Merge branch 'master' of https://github.com/wbhart/mpir 2014-03-20 00:06:26 +00:00
William Hart
e6b6751c97 Update tuning for penryn. 2014-03-19 23:41:58 +00:00
William Hart
471fddeece Added tuning for small version of sb_divappr_q and sb_div_qr. 2014-03-19 23:09:38 +00:00
William Hart
5d35ea55a4 Better k10 tuning values. 2014-03-19 17:48:14 +00:00
William Hart
4fa350f51d Hack FFT usage into mpn_mulmod_2expp1. 2014-03-19 17:32:01 +00:00
BrianGladman
f17b419e36 Merge branch 'master' of https://github.com/wbhart/mpir 2014-02-28 17:10:31 +00:00
William Hart
8e3504e372 Updated to new bin_ui and bin_uiui code from GMP 5.1.3. Added a version of
mpn_divrem_hensel_rsh_qr_1 that takes a precomputed inverse.
2014-02-28 16:44:53 +00:00
BrianGladman
6e53dff494 add new factorial code and tests to Visual Studio builds 2014-02-28 15:12:11 +00:00
BrianGladman
edcefb28cb Update Windows Visual Studio 2013 build for latest changes 2014-02-27 22:16:06 +00:00
William Hart
67809f39a2 Update to GMP 5.1.3 factorial code. 2014-02-27 21:35:04 +00:00
William Hart
68c4a8d583 Apparently I didn't update these correctly. 2014-02-27 17:35:55 +00:00
William Hart
3c20c75b3b Whoops, forgot to git add these files at the last commit. 2014-02-27 16:11:37 +00:00
William Hart
0c96bff79f Add rudimentary mpn_mulmod_bnm1. Update to new gcd and xgcd code from GMP. 2014-02-27 16:00:08 +00:00
William Hart
d4adcd7ce6 Some very minor speedups/cleanups. 2014-02-27 12:30:53 +00:00
William Hart
c7cdb70393 Small improvement to sb_div_q. 2014-02-25 15:58:30 +00:00
BrianGladman
94c011b8a3 1. add revised core 2 assembler to the Windows build
2. add revised nehalem tuning on Windows
3. fix bug in speed on Windows x64
2014-02-24 21:07:18 +00:00
BrianGladman
e6047c1c9e Merge branch 'master' of https://github.com/wbhart/mpir 2014-02-23 21:30:13 +00:00
William Hart
b65ad160ab Fix slowdown on penryn, retune k10. 2014-02-23 17:47:35 +00:00
William Hart
796fb5b77e Switch back to single function for sb_divappr_q. 2014-02-21 16:15:43 +00:00
William Hart
d5a444bd02 Use GMP add/submul_1 on core2 as well. 2014-02-21 15:25:41 +00:00
William Hart
faa82ed6c9 Update penryn tuning. 2014-02-21 14:27:14 +00:00
William Hart
738fb46982 Fix a bug in sb_divappr_q. 2014-02-21 14:14:41 +00:00
William Hart
e9cd157e50 Improve tail of sb_divappr_q and fix bug. 2014-02-21 13:41:37 +00:00
William Hart
e41d78e02b Switch to GMP add/submul_1.asm on Penryn. 2014-02-21 11:22:14 +00:00
William Hart
8bb084e6d2 Speed up sb_divappr_q. 2014-02-21 10:57:56 +00:00
William Hart
5548023724 Switch to using both types of precomputed inverse. 2014-02-20 20:21:04 +00:00
BrianGladman
71f91234c2 Merge branch 'master' of https://github.com/wbhart/mpir 2014-02-19 07:38:40 +00:00
William Hart
fc7b94378a Updated atom tuning values. 2014-02-18 22:42:48 +00:00
BrianGladman
2e9c90554c update windows builds for GMP sqr_basecase change 2014-02-18 17:36:29 +00:00
William Hart
8873ab5738 Merge remote-tracking branch 'brian/master' 2014-02-18 17:24:53 +00:00
BrianGladman
1a41ad921f Merge branch 'master' of https://github.com/wbhart/mpir 2014-02-18 16:25:37 +00:00
William Hart
a1a268dc7e Corrected nehalem tuning value. 2014-02-18 12:42:07 +00:00
William Hart
c8c2a49c43 Update nehalem tuning values. 2014-02-18 11:29:26 +00:00
William Hart
2a741c97d0 On nehalem the GMP sqr_basecase is faster. Thus both nehalem and westmere can
use the mpn/x86_64/sqr_basecase.asm file.
2014-02-18 11:25:37 +00:00
William Hart
d10ddc5184 Adjust atom kara_sqr_n crossover. 2014-02-18 11:15:46 +00:00
BrianGladman
d76d8284b9 add GMP sqr_basecase assembler code for Windows 2014-02-17 23:06:06 +00:00
BrianGladman
b261394905 add new sqr_basecase code on Windows 2014-02-17 23:02:51 +00:00
William Hart
7c3794b221 Updated some tuning values for penryn. 2014-02-17 21:08:46 +00:00
William Hart
581be69339 Update sqr fft crossover. 2014-02-17 20:40:31 +00:00
William Hart
c63be9f5f7 Update k102 tuning. 2014-02-17 20:38:58 +00:00
William Hart
e2f5dd4c02 Switched to using correct basecase threshold in kara_sqr_n. 2014-02-17 20:23:26 +00:00
William Hart
48544b2e95 Sandybridge is faster with GMP sqr_basecase. 2014-02-17 19:52:48 +00:00
William Hart
edb937d0e3 GMP sqr_basecase is faster on westmere. 2014-02-17 19:37:58 +00:00
William Hart
06b0fea603 Update tuning values for netburst. 2014-02-17 19:14:15 +00:00
William Hart
01fc86a25e The GMP sqr_basecase is *way* faster on netburst. 2014-02-17 19:08:27 +00:00
William Hart
24d3b12a0a Use GMP version of sqr_basecase on AMD K8/K10/etc and Intel Core2/Penryn/etc. 2014-02-17 16:43:45 +00:00
William Hart
d9416e6bee Added some GMP m4 macros for new sqr_basecase code + license changes. 2014-02-17 15:54:07 +00:00
William Hart
449976bd8d Fix a slowdown in FFT (actually in mulmod_2expp1_basecase) when squaring. 2014-02-17 15:48:41 +00:00
William Hart
41674225e1 Cleaned up lots of C code, adding lots of whitespace. 2014-02-14 23:25:39 +00:00
William Hart
8437e1f92a Added comments to add_err1_n/add_err2_n.asm and fixed some comments in
sub_err1_n/sub_err2_n.asm.
2014-02-07 14:55:59 +00:00
William Hart
66827c7e00 Comments in sub_err1_n/sub_err2_n.asm. 2014-02-07 13:05:55 +00:00
William Hart
7e5589a8bb Ran autoreconf -i to change autotools versions to those of development machine. 2014-02-04 13:40:51 +00:00
William Hart
fbf04dda95 Minor improvement of divide and conquer divapprox based on proof in paper. 2014-02-01 00:06:00 +00:00
Brian Gladman
d6ca8cf09a Merge remote-tracking branch 'remotes/Bill/master' 2013-08-06 13:25:33 +01:00
Brian Gladman
fd76ae5284 adjust Intel macros in dedicated windows assembler 2013-08-06 13:25:11 +01:00
William Hart
0550f973bd Merge remote-tracking branch 'brian/master' 2013-08-06 13:20:09 +01:00
Jean-Pierre Flori
b3438467e6 Ran "autoreconf -fiv --no-recursive". 2013-08-05 00:34:35 +02:00
Jean-Pierre Flori
4491e7b7da Modify PROLOGUE declarations in x86_64w.
This let configure define correct HAVE_NATIVE_* constants.
This may break the VS builds.
2013-08-05 00:17:37 +02:00
Jean-Pierre Flori
bdd21aa9c4 Harmonize PROLOGUE in Win64 assembly files.
This breaks non-generic builds with MinGW.
2013-08-04 23:45:38 +02:00
Jean-Pierre Flori
f9c0c1381d Fix for MPIR_AS. 2013-08-02 21:01:04 +02:00