Commit Graph

549 Commits

Author SHA1 Message Date
jasonmoxham
6ee4e35940 New asm function mpn_mul_basecase for K8/K10/Core2/Penryn/Nehalem 2009-05-13 19:49:42 +00:00
gladman
1b83dc8e1c Update MPIR version number to 1.2.0 2009-05-13 10:52:36 +00:00
gladman
d6962d575f 1. Add new/changed Core2 assembler files to the Windows build
2. Workaround VC++ optimisation bug in mul_fft.c
2009-05-13 09:54:24 +00:00
gladman
47c7c6b832 Further update to Windows K8 build 2009-05-12 21:33:52 +00:00
wbhart
f9463aedd8 Went back to mpz/mul.c from mpir-1.1. 2009-05-12 19:50:50 +00:00
gladman
9918886c2f Update Windows K8 build to add new assembler 2009-05-12 19:37:47 +00:00
wbhart
21f51a706c Added toom32 for unbalanced multiplications. 2009-05-12 18:28:20 +00:00
wbhart
c6881fa3a9 Fixed bugs in Toom3 code. 2009-05-12 09:22:27 +00:00
wbhart
fb914ab4ac Fixed some buglets in toom4. 2009-05-11 15:16:53 +00:00
wbhart
ae48f90e2f Fix speed regression in mul.c, switch unbalanced toom back on. Add
missing toom3_interpolate prototype.
2009-05-11 12:30:17 +00:00
wbhart
4babcebbfa Turned off unbalanced multiplications as they slow things down.
-This line, and those below, will be ignored--

M    mpn/generic/mul.c
2009-05-11 11:06:38 +00:00
wbhart
102e4f59eb I put the wrong version of this in. Here it is back again. 2009-05-11 11:02:51 +00:00
wbhart
9e56c61071 Added toom42 and code to handle unbalanced multiplication. 2009-05-11 10:09:09 +00:00
jasonmoxham
0a1d07af4e New asm function mpn_sublsh1_n for K8/K10 2009-05-10 20:03:47 +00:00
jasonmoxham
574f3be308 New asm function mpn_divexact_byff for K8/K10/Core2/penryn/nehalem 2009-05-10 19:35:54 +00:00
jasonmoxham
359fab42b5 New asm functions mpn_rsh1add_n mpn_rsh1sub_n for K8/K10/Core2/penryn/nehalem 2009-05-10 18:46:48 +00:00
jasonmoxham
428e43b40e New asm functions mpn_addadd_n mpn_addsub_n mpn_subadd_n for K8/K10 2009-05-10 16:25:01 +00:00
wbhart
4c442dbf94 Went back to old mul.c (again) :-( 2009-05-10 14:09:59 +00:00
wbhart
1b58a8b49e Speed toom4 up by passing some arguments to the interpolate code in the
output space so they don't have to be moved at the end.
2009-05-10 13:45:27 +00:00
gladman
45e909b337 correct erroneous Windows addition to gmp-h.in 2009-05-10 13:15:12 +00:00
wbhart
e9bb26a854 Added missing #include "mpir.h". 2009-05-10 07:28:21 +00:00
wbhart
4f99bbe9fc Added missing toom3 file. 2009-05-10 07:15:02 +00:00
wbhart
9c79e0a98b Factored out mpn_toom3_sqr_n and mpn_toom3_mul_n and removed duplication
of mpn_toom3_interpolate. Rewrote mpn_toom3_sqr_n.
2009-05-10 07:12:38 +00:00
wbhart
44dadcf975 Reverted to earlier version of mul.c because the later version slows
down the 128x128 mul bench.
2009-05-10 04:39:49 +00:00
wbhart
c8aa69c789 Added toom3_mul_n with better memory usage. 2009-05-10 04:24:39 +00:00
jasonmoxham
90d8207a80 New functions mpn_sumdiff for core2/penryn/nehalem , or rather faster to do separate add and sub 2009-05-10 03:39:43 +00:00
jasonmoxham
b07549802a New asm functions mpn_add_n mpn_sub_n for Core2/penryn/nehalem 2009-05-10 01:26:52 +00:00
jasonmoxham
0c3c909910 New asm functions for mpn_copyi mpn_copyd for k8,k10,core2,penryn,nehalem 2009-05-10 00:20:44 +00:00
wbhart
1cc8b35cfe Another slight speedup. 2009-05-09 21:51:40 +00:00
wbhart
96e8e4e410 Added my copyright info. 2009-05-09 21:38:06 +00:00
wbhart
6ed1dd6474 Whoops I screwed up toom4 and toom7, putting them back now. 2009-05-09 21:23:15 +00:00
wbhart
7a0e036d36 Fixed toom4 and toom7 issues and added k8 tuning code. 2009-05-09 21:12:13 +00:00
wbhart
72f93a085c Added new toom3 code. 2009-05-09 20:56:34 +00:00
gladman
d942415a1c 1. Update Windows Powershell scripts in mpirbench to refer to MPIR rather than GMP
2. Update MPIR version number in Windows config files to 1.1.2 
3. Add an MSVC inline definition in in gmp-h.in
4. Correct locale test (as per GMP correction)
5. Add Windows x64 set/copy intrinsics to mul_fft.c (improves FFT speed score by 2%)
2009-05-09 13:26:27 +00:00
wbhart
0ba06242c6 Fixed some bugs in best_k code used by FFT. 2009-05-09 02:54:08 +00:00
wbhart
911916ce7e Fixed a carry issue with tc*_addmul which created a requirement for extra memory in toom code. 2009-05-08 14:12:47 +00:00
wbhart
5624d9a6fc New toom4 and toom7 code.
* Don't make copies before basecase multiplications
* Factor out interpolation code
* Convert interpolation code to twos complement
* Optimise code using new assembly functions where available
2009-05-08 13:21:14 +00:00
gladman
b55aae6dd5 Add Windows build files to bench in trunk 2009-05-06 18:35:56 +00:00
gladman
bd34c0bfc5 1. Update g2y.py, the GAS to YASM Python script
2. Provide tuning for new FFT code
3. Add some documentation to YASM assembler macros for Windows
2009-05-06 18:20:52 +00:00
wbhart
2ad5066cea Tried to clean up a little. 2009-05-05 23:52:02 +00:00
wbhart
49441a5e20 Fixed bug in mul_fft.c 2009-05-05 22:18:16 +00:00
gladman
7f8174f08d update g2y.py to allow command line parameters 2009-05-05 20:43:45 +00:00
gladman
eeaca671af Remove C99 features in mul_fft.c 2009-05-05 20:41:29 +00:00
wbhart
041df82e0d Added Zimmermann et al's FFT (after making a bug fix). 2009-05-05 12:27:29 +00:00
gladman
57f06bfe7e 2009-05-01 19:03:56 +00:00
gladman
d18fab116f Adjust Windows build slightly 2009-05-01 18:33:21 +00:00
gladman
8692baeb3d Make two 'ease of use' changes to the Windows build 2009-05-01 14:30:27 +00:00
jasonmoxham
8e7cb567a0 correct divexact_byBm1of speed measuring 2009-04-24 22:29:42 +00:00
wbhart
b399395e1b Added -apple to support very recent apple machines. 2009-04-23 15:42:05 +00:00
jasonmoxham
b52aa5a59e mpn_mul_2 doesn't do size=1 2009-04-22 22:05:08 +00:00