William Hart
|
3c20c75b3b
|
Whoops, forgot to git add these files at the last commit.
|
2014-02-27 16:11:37 +00:00 |
|
William Hart
|
0c96bff79f
|
Add rudimentary mpn_mulmod_bnm1. Update to new gcd and xgcd code from GMP.
|
2014-02-27 16:00:08 +00:00 |
|
William Hart
|
ce836c6f86
|
Update compiler flags for k10.
|
2014-02-27 12:44:44 +00:00 |
|
William Hart
|
d4adcd7ce6
|
Some very minor speedups/cleanups.
|
2014-02-27 12:30:53 +00:00 |
|
William Hart
|
cc92f1ab7e
|
Remove orphaned memory allocation.
|
2014-02-26 18:31:48 +00:00 |
|
William Hart
|
44c5877e6a
|
Add mpn_tdiv_q to speed.
|
2014-02-26 17:34:08 +00:00 |
|
William Hart
|
c7cdb70393
|
Small improvement to sb_div_q.
|
2014-02-25 15:58:30 +00:00 |
|
William Hart
|
bcf545c219
|
Speed up precomputed inverses.
|
2014-02-25 14:24:08 +00:00 |
|
BrianGladman
|
94c011b8a3
|
1. add revised core 2 assembler to the Windows build
2. add revised nehalem tuning on Windows
3. fix bug in speed on Windows x64
|
2014-02-24 21:07:18 +00:00 |
|
BrianGladman
|
e6047c1c9e
|
Merge branch 'master' of https://github.com/wbhart/mpir
|
2014-02-23 21:30:13 +00:00 |
|
William Hart
|
ec94b97a94
|
Slight speedup for k10.
|
2014-02-23 18:11:17 +00:00 |
|
William Hart
|
b65ad160ab
|
Fix slowdown on penryn, retune k10.
|
2014-02-23 17:47:35 +00:00 |
|
William Hart
|
796fb5b77e
|
Switch back to single function for sb_divappr_q.
|
2014-02-21 16:15:43 +00:00 |
|
William Hart
|
d5a444bd02
|
Use GMP add/submul_1 on core2 as well.
|
2014-02-21 15:25:41 +00:00 |
|
William Hart
|
faa82ed6c9
|
Update penryn tuning.
|
2014-02-21 14:27:14 +00:00 |
|
William Hart
|
738fb46982
|
Fix a bug in sb_divappr_q.
|
2014-02-21 14:14:41 +00:00 |
|
William Hart
|
e9cd157e50
|
Improve tail of sb_divappr_q and fix bug.
|
2014-02-21 13:41:37 +00:00 |
|
William Hart
|
e41d78e02b
|
Switch to GMP add/submul_1.asm on Penryn.
|
2014-02-21 11:22:14 +00:00 |
|
William Hart
|
3079fab0eb
|
Correct SPEED_MPN_DC_DIV_SMALL_Q
|
2014-02-21 11:02:35 +00:00 |
|
William Hart
|
8bb084e6d2
|
Speed up sb_divappr_q.
|
2014-02-21 10:57:56 +00:00 |
|
William Hart
|
5548023724
|
Switch to using both types of precomputed inverse.
|
2014-02-20 20:21:04 +00:00 |
|
William Hart
|
c33f9b04f7
|
Fix SPEED_DC_DIV_Q_SMALL and add sb_divappr_q to speed.
|
2014-02-20 18:49:28 +00:00 |
|
BrianGladman
|
71f91234c2
|
Merge branch 'master' of https://github.com/wbhart/mpir
|
2014-02-19 07:38:40 +00:00 |
|
William Hart
|
fc7b94378a
|
Updated atom tuning values.
|
2014-02-18 22:42:48 +00:00 |
|
BrianGladman
|
2e9c90554c
|
update windows builds for GMP sqr_basecase change
|
2014-02-18 17:36:29 +00:00 |
|
William Hart
|
8873ab5738
|
Merge remote-tracking branch 'brian/master'
|
2014-02-18 17:24:53 +00:00 |
|
BrianGladman
|
1a41ad921f
|
Merge branch 'master' of https://github.com/wbhart/mpir
|
2014-02-18 16:25:37 +00:00 |
|
William Hart
|
a1a268dc7e
|
Corrected nehalem tuning value.
|
2014-02-18 12:42:07 +00:00 |
|
William Hart
|
c8c2a49c43
|
Update nehalem tuning values.
|
2014-02-18 11:29:26 +00:00 |
|
William Hart
|
2a741c97d0
|
On nehalem the GMP sqr_basecase is faster. Thus both nehalem and westmere can
use the mpn/x86_64/sqr_basecase.asm file.
|
2014-02-18 11:25:37 +00:00 |
|
William Hart
|
d10ddc5184
|
Adjust atom kara_sqr_n crossover.
|
2014-02-18 11:15:46 +00:00 |
|
BrianGladman
|
2d7eb1d31c
|
adjust windows builds for new ser_basecase assembler code
|
2014-02-17 23:15:59 +00:00 |
|
BrianGladman
|
d76d8284b9
|
add GMP sqr_basecase assembler code for Windows
|
2014-02-17 23:06:06 +00:00 |
|
BrianGladman
|
b261394905
|
add new sqr_basecase code on Windows
|
2014-02-17 23:02:51 +00:00 |
|
BrianGladman
|
1e020f0151
|
add windows build for sandybridge
|
2014-02-17 23:00:31 +00:00 |
|
William Hart
|
7c3794b221
|
Updated some tuning values for penryn.
|
2014-02-17 21:08:46 +00:00 |
|
William Hart
|
581be69339
|
Update sqr fft crossover.
|
2014-02-17 20:40:31 +00:00 |
|
William Hart
|
c63be9f5f7
|
Update k102 tuning.
|
2014-02-17 20:38:58 +00:00 |
|
William Hart
|
e2f5dd4c02
|
Switched to using correct basecase threshold in kara_sqr_n.
|
2014-02-17 20:23:26 +00:00 |
|
William Hart
|
48544b2e95
|
Sandybridge is faster with GMP sqr_basecase.
|
2014-02-17 19:52:48 +00:00 |
|
William Hart
|
edb937d0e3
|
GMP sqr_basecase is faster on westmere.
|
2014-02-17 19:37:58 +00:00 |
|
William Hart
|
06b0fea603
|
Update tuning values for netburst.
|
2014-02-17 19:14:15 +00:00 |
|
William Hart
|
01fc86a25e
|
The GMP sqr_basecase is *way* faster on netburst.
|
2014-02-17 19:08:27 +00:00 |
|
William Hart
|
24d3b12a0a
|
Use GMP version of sqr_basecase on AMD K8/K10/etc and Intel Core2/Penryn/etc.
|
2014-02-17 16:43:45 +00:00 |
|
William Hart
|
d9416e6bee
|
Added some GMP m4 macros for new sqr_basecase code + license changes.
|
2014-02-17 15:54:07 +00:00 |
|
William Hart
|
449976bd8d
|
Fix a slowdown in FFT (actually in mulmod_2expp1_basecase) when squaring.
|
2014-02-17 15:48:41 +00:00 |
|
William Hart
|
7aa8f6364e
|
Further C code cleanups.
|
2014-02-16 17:23:27 +00:00 |
|
William Hart
|
41674225e1
|
Cleaned up lots of C code, adding lots of whitespace.
|
2014-02-14 23:25:39 +00:00 |
|
William Hart
|
8437e1f92a
|
Added comments to add_err1_n/add_err2_n.asm and fixed some comments in
sub_err1_n/sub_err2_n.asm.
|
2014-02-07 14:55:59 +00:00 |
|
William Hart
|
66827c7e00
|
Comments in sub_err1_n/sub_err2_n.asm.
|
2014-02-07 13:05:55 +00:00 |
|