(no author)
a5f8b60fe2
Fixed some incorrect things in the division code.
2010-02-22 00:45:04 +00:00
(no author)
69b3ac6b38
2010-02-21 19:51:12 +00:00
(no author)
0ad3dfc64c
Retune for Core2/penryn.
2010-02-21 17:08:33 +00:00
(no author)
f0ee2d2055
Added some extra checks to prevent infinite loops and did some minor
...
cleanup.
2010-02-21 16:58:09 +00:00
(no author)
007e9d8ed0
Corrected some bugs in the division code.
2010-02-21 15:47:37 +00:00
(no author)
7ee22203e6
Minor removal of comment.
2010-02-21 14:15:41 +00:00
(no author)
4538e699fa
Fixed more bugs.
2010-02-20 01:31:02 +00:00
(no author)
dd872ac9d0
Fixed a bug in inv_div_qr.
2010-02-20 01:14:43 +00:00
(no author)
c98e613e37
Fixed a bug in inv_divappr_q and added a test for inv_divappr_q.
2010-02-20 01:02:01 +00:00
(no author)
7befb9443c
Ran automake and fixed the test.
2010-02-19 22:52:35 +00:00
(no author)
54f313fb2d
Ran automake and fixed some bugs.
2010-02-19 21:59:10 +00:00
(no author)
27e7e1a492
Tuned yet again for core2/penryn.
2010-02-19 17:06:19 +00:00
(no author)
1d88fcbc0f
Fixed a tuning bug and tuned for K102.
2010-02-19 16:53:01 +00:00
(no author)
c649c25885
remove use of of GMP function not in MPIR (mpn_bdiv_dbm1)
2010-02-19 16:30:08 +00:00
(no author)
2175d5423a
Retuned for core2/penryn.
2010-02-19 16:20:55 +00:00
(no author)
596ed07fbc
Added some tuning for core2/penryn and some additional functions to be
...
built by tuning code.
2010-02-19 16:09:02 +00:00
(no author)
f50d7f04ff
Hopefully fixed tuning of division routines.
2010-02-19 15:22:22 +00:00
(no author)
f444a2bf6c
Attempt to tune some of the division functions.
2010-02-19 12:54:56 +00:00
(no author)
89ef2d6caf
Add logic functions in C (logops.c) to the Windows builds that need them (32-bit & generic C)
2010-02-18 18:59:54 +00:00
(no author)
5b52d6c93d
Update Windows tuning files (only nehalem done though)
2010-02-18 18:18:52 +00:00
(no author)
2e2976dd0e
Added tuning info for core2/penryn.
2010-02-18 16:03:50 +00:00
(no author)
19b37fb7c1
Added speed, tune and try code for toom8_mul/sqr and tuning for K102.
2010-02-18 14:50:40 +00:00
(no author)
d7b3af99f2
1. Add toom8 code to Windows build
...
2. tidy up gmp-mparam.h for a future tuning effort on Windows
2010-02-18 14:39:28 +00:00
(no author)
fea5f0c849
Added a missing mpir.h and some proxy tuning values for core2/penryn
...
toom8 squaring.
2010-02-18 13:46:24 +00:00
(no author)
9cb500d53b
Turned on toom8 squaring code.
2010-02-18 13:40:38 +00:00
(no author)
660b1caaf4
Fixed multiplications being too unbalanced for Toom8h.
2010-02-18 13:15:23 +00:00
(no author)
c4746c30a7
tune for windows nehalem
2010-02-17 10:43:21 +00:00
(no author)
bb96dd4a16
Finished putting too8 mul in.
2010-02-17 00:29:00 +00:00
(no author)
710f109c76
Added missing function and made some corrections.
2010-02-17 00:09:31 +00:00
(no author)
3fad25a740
Added toom8 files.
2010-02-16 23:47:07 +00:00
(no author)
11f57996a4
1. remove C99 out of order declarations in rootrem.c
...
2. Add latest source code to Windows build
2010-02-15 10:15:43 +00:00
(no author)
deb47bec36
Removed an unnecessary line.
2010-02-14 01:07:54 +00:00
(no author)
2515976d28
Minor cleanup.
2010-02-13 23:36:35 +00:00
(no author)
a39e1da77b
Whoops, correct a typo.
2010-02-13 21:55:35 +00:00
(no author)
6d174ba39a
Corrected small root case.
2010-02-13 21:54:26 +00:00
(no author)
a2c42da38f
Fiddled with tuning value.
2010-02-13 21:20:12 +00:00
(no author)
20ae4de5b1
Ran autoconf, connnected up rootrem code and changed tuning value.
2010-02-13 21:18:11 +00:00
(no author)
afd2f2bca4
Made some minor changes to rootrem code.
2010-02-13 21:11:18 +00:00
(no author)
01e8f2702a
Added GMP rootrem code and added crossover from old code, which is faster.
2010-02-13 21:01:24 +00:00
(no author)
ee376cce80
Turned on inv_div_qr code in the tdiv_qr function.
2010-02-13 18:34:48 +00:00
(no author)
397a0689cd
Turned on new inv_div_qr division code.
2010-02-13 18:09:07 +00:00
(no author)
d2eac73245
Added configure and prototype infor for new inv_div_qr(_n) functions and
...
fixed a typo.
2010-02-13 17:58:56 +00:00
(no author)
dcc043e3ab
Added code for balanced and unbalanced division with remainder using a
...
precomputed inverse.
2010-02-13 17:48:01 +00:00
(no author)
b4e21c591f
Fixed approximate quotient code.
2010-02-13 17:13:29 +00:00
(no author)
c100247b24
Added a test for inv_divappr_q_n and made it pass.
2010-02-11 20:36:56 +00:00
(no author)
e3b6db3580
Fixed some segfaults.
2010-02-11 18:52:17 +00:00
(no author)
31b09b4302
Fixed some typos.
2010-02-11 16:49:43 +00:00
(no author)
5924f0b6da
Added asymptotically fast division code.
2010-02-11 16:39:41 +00:00
(no author)
048820278c
Added tdiv_qr.c from GMP.
2010-02-11 14:23:21 +00:00
(no author)
64da22761e
Fixed a bug I accidentally introduced into the code.
2010-02-11 13:59:20 +00:00
(no author)
f83a14a538
More fiddling with tuning.
2010-02-11 00:20:44 +00:00
(no author)
1d08ade198
Added sb_div_qr and dc_div_qr from GMP sources. Fixed some
...
bugs, incorrect #defines and fiddled with tuning.
2010-02-10 22:48:46 +00:00
(no author)
490c26c544
Removed and error.
2010-02-10 11:38:05 +00:00
(no author)
25e405dee5
Restored some of original GMP code.
2010-02-10 11:36:40 +00:00
(no author)
808c57b5b3
Fixed some typos.
2010-02-10 11:22:39 +00:00
(no author)
f4a69eec60
Added sb_div_q, sb_divappr_q from GMP and tdiv_3by2 and invert_1 from GMP
...
and changed everything to use them.
2010-02-10 11:17:59 +00:00
(no author)
61e882ef48
Added sb_div_q based on dc_div_q code.
...
Added better tuning values.
Fixed a segfault.
2010-02-10 02:09:24 +00:00
(no author)
068f94dd5c
Fixed a segfault.
2010-02-09 23:25:48 +00:00
(no author)
9086a12d6c
Moved dcpi1_dc_div_q.c from GMP to dc_div_q.c (modified trivially).
...
Moved dcpi1_divappr_q.c from GMP to dc_divappr_q.c (modified heavily to use
only MPIR functions), removing dcpi1_divappr_q_n function. Moved div_q.c
from GMP to tdiv_q.c (modified trivially).
Currently tests do not pass. More work required!
2010-02-09 21:19:08 +00:00
(no author)
bf50a2d72e
Add another missed mp_bitcnt_t change
2010-02-09 10:22:47 +00:00
(no author)
1758461822
Another minor change to core2 tuning values. Seems to slightly improve
...
timings (though almost imperceptibly).
2010-02-09 00:43:06 +00:00
(no author)
88c8b3e0f3
Add mp_bitcnt_t type to MPIR
2010-02-08 17:06:12 +00:00
(no author)
00f292cc48
adjust windows tune slightly
2010-02-08 10:02:55 +00:00
(no author)
6b59c6b0ba
Fixed some bugs in mpn_sqr_n to do with SQR_THRESHOLDS.
2010-02-07 20:58:21 +00:00
(no author)
a909902fe4
Fixed a bug in toom7_sqr_n.
2010-02-07 20:46:54 +00:00
(no author)
015ce32bf7
correct bug in new assembler code
2010-02-07 17:59:56 +00:00
(no author)
3306eae2a2
Add HAVE_NATIVE defines for Windows code
2010-02-07 17:33:56 +00:00
(no author)
9639773959
Hand tuned SQR thresholds for core2/penryn.
2010-02-07 15:43:19 +00:00
(no author)
3c16b2f566
1. Update g2y.py to cope with GMP assembler
...
2. add new assembler to Windows build
2010-02-07 14:48:02 +00:00
(no author)
2edb3c830a
Added a clarification to the copyright.
2010-02-07 14:07:42 +00:00
(no author)
0c7b48b1fd
Convert divrem_2.asm to yasm format.
2010-02-07 14:02:39 +00:00
(no author)
a54d23d652
Convert divem_euclidean_qr_1.asm to yasm format.
2010-02-07 13:32:16 +00:00
(no author)
0f5bb75287
New tuning values for core2/penryn.
2010-02-07 03:28:55 +00:00
(no author)
c7ade0be6f
Replaced nhgcd2 with the GMP hgcd2.c file, removing the matrix1_vector fns and
...
replacing them with the original.
2010-02-06 15:51:49 +00:00
(no author)
06d4a32943
Added divrem_1 and divrem_2 x86_64 assembly code from GMP, replaced
...
divrem_euclidean_qr_1 with divrem_1.
2010-02-06 13:28:23 +00:00
(no author)
0bd0a87f7e
Temporarily committing these so they can be converted to yasm format.
2010-02-06 03:16:22 +00:00
wbhart
1c2f0506f5
Fixed a bug in mpn_gcdext reported by Dan Grayson.
2010-01-30 22:27:37 +00:00
wbhart
c8d33128b5
Made a change to hopefully support unaligned memory allocation as requested by
...
Dan Grayson. Passes make check and try mpn_lshift.
2010-01-10 23:54:55 +00:00
gladman
0ac38fd71c
1. repair Windows build script
...
2. Add tuning for nehalem on Windows
2010-01-06 15:07:32 +00:00
wbhart
0200f63f3b
Added tuning params for K102. Added new fft code to main directory.
2010-01-01 14:06:41 +00:00
gladman
73dccb229b
correct bug I introduced into invert.c
2009-12-09 20:30:32 +00:00
gladman
aaf7949430
1. update run-test.py for Windows build
...
2. remove test code in invert.c in Windows build
2009-12-09 20:02:31 +00:00
wbhart
52a2c8b839
Added my copyright to the file mpn/generic/invert.c.
2009-12-09 18:56:38 +00:00
wbhart
7db71713ef
Removed a mid block declaration.
2009-12-09 18:40:54 +00:00
gladman
8d7347c60c
1. Add mpn.invert test to Windows
...
2. remove test subroutine from mpn/invert.c
2009-12-09 09:06:31 +00:00
gladman
7e5051b5d6
1. Add a new test to Windows build
...
2. Remove C99 features from two new files
2009-12-07 09:18:30 +00:00
wbhart
d252f5b22e
Fixed the test code for mpz_likely_prime_p.
2009-12-07 01:06:47 +00:00
wbhart
ce52d5515b
Removed a small bit of unnecessary code.
2009-12-06 21:55:07 +00:00
wbhart
90f2cf62f0
Rewrte dc_divappr giving proof sketch and making it slightly different.
2009-12-06 21:14:36 +00:00
gladman
11c4a69572
Add Windows build for Intel nehalem
2009-12-03 17:32:55 +00:00
gladman
e3f6129a64
2009-12-02 17:09:35 +00:00
gladman
1485e3dc1b
1. Add Jason's new assembler code to the Windows builds
...
2. Tidy up assembler to prepare for Windows nehalem build
2009-12-02 16:24:00 +00:00
gladman
b086474ddf
Bring Windows naming into line with that on Linux
...
Rename 'amd64' elemnts to 'k8'
2009-12-02 12:29:38 +00:00
jasonmoxham
829dacd87a
New core2/penryn asm functions popcount hamdist
2009-11-19 10:53:45 +00:00
jasonmoxham
afc620f493
new K8 asm functions mpn_popcount mpn_hamdist
2009-11-19 10:38:40 +00:00
jasonmoxham
367f00f4fe
New K8 asm functions mpn_and,ior,xor
2009-11-19 10:35:17 +00:00
jasonmoxham
beb4d5b735
New K8 asm funtions mpn_lshift2,3,4,5,6
2009-11-19 10:32:03 +00:00
jasonmoxham
7bd0558974
New atom asm funtion mpn-copyi
2009-11-19 09:09:50 +00:00
jasonmoxham
ef2b3db6ef
remove # comments
2009-11-19 08:49:14 +00:00
jasonmoxham
8bcfe2975a
New nehalem asm mpn_copyi mpn_copyd
2009-11-19 08:45:27 +00:00
jasonmoxham
579e36f2b1
New nehalem asm mpn_com
2009-11-19 08:44:50 +00:00
jasonmoxham
59bf8d86e4
new nehalem asm logic mpn fn's
2009-11-19 08:37:54 +00:00
jasonmoxham
dcf3afa567
convert addlsh from gas to yasm format
2009-11-18 17:43:25 +00:00
wbhart
969b44ec5a
attempt to push windows tuning update
...
Signed-off-by: <Brian Gladman>
2009-11-17 13:34:35 +00:00
wbhart
dc983a5503
Added some clarifying comments to the mulmid code.
2009-11-17 13:34:30 +00:00
wbhart
9cb7c5cf1f
Switch over to exact quotient when divisor gets large in mpn_tdiv_q.
2009-11-17 13:15:23 +00:00
jasonmoxham
1757343142
new directory for core2-apple-darwin 32bit with no PIC code in it
2009-10-24 17:31:05 +00:00
wbhart
be0a749738
Oops, wrong author added to license info.
2009-10-19 05:59:49 +00:00
wbhart
ecdbd4a162
Added copyright info to GCD and XGCD code.
2009-10-19 05:49:29 +00:00
jasonmoxham
c6af9fbfc2
some more masm? movq/movd mixups
2009-10-16 00:45:14 +00:00
jasonmoxham
4ba747128d
change movq to movd for old masm assembler
2009-10-15 18:21:27 +00:00
jasonmoxham
be135c7347
change asm #comment to C comment
2009-10-15 18:13:19 +00:00
gladman
0965fc4532
update VC++ readme.txt and Core2 tuning
2009-10-15 16:24:35 +00:00
wbhart
19c4d52566
Fixed a carry bug in mpn_tdiv_q and added the failing cases to tests/mpz/t-tdiv.c.
2009-10-13 04:52:00 +00:00
wbhart
07bbd31e1d
Fixed more bugs in sb_divappr_q.
2009-10-12 02:05:16 +00:00
jasonmoxham
d0067e2f3b
fix broken sparc32/v9 add/sub code
2009-10-11 16:32:54 +00:00
wbhart
380b80372a
Fixed some bugs in mpn_tdiv_q and added a try test for mpn_tdiv_q.
2009-10-11 06:46:10 +00:00
jasonmoxham
6edccbf39a
adjust sparc32/v9=t2,ABI=32 params for mulhigh
2009-10-09 20:25:51 +00:00
wbhart
0f39b0f20c
Clarified some comments.
2009-10-09 04:17:32 +00:00
wbhart
5e38fe6d8b
Added some rough justifications for doing what we do in sb_divappr_q.
2009-10-09 03:48:31 +00:00
jasonmoxham
ce832e54a5
varro powerpc64/mode64 64bit params
2009-10-09 01:27:36 +00:00
jasonmoxham
b8c10614e7
ia64 params
2009-10-09 00:33:52 +00:00
jasonmoxham
45ec139248
pentium4 32bit params
2009-10-09 00:33:19 +00:00
jasonmoxham
b49482cf47
core2 32bit params
2009-10-09 00:10:27 +00:00
jasonmoxham
c9f16233b8
add back in old fft tuning values , better than nothing
2009-10-08 22:50:40 +00:00
jasonmoxham
297e277fe6
add mising fat entry for 32bit nano
2009-10-08 22:39:37 +00:00
jasonmoxham
518226d914
atom params
2009-10-08 22:06:01 +00:00
jasonmoxham
2488ed55e8
k7 params
2009-10-08 22:04:10 +00:00
jasonmoxham
93688a18b6
k10 params
2009-10-08 18:01:18 +00:00
jasonmoxham
1a7c40bb2b
k8 32bit params
2009-10-08 17:52:27 +00:00
jasonmoxham
265bd0a989
sparc64 params's
2009-10-08 17:49:13 +00:00
jasonmoxham
f09d7f9df1
sparc 32 bit param
2009-10-08 17:05:41 +00:00
jasonmoxham
2f138f7a16
core2 params
2009-10-08 15:48:27 +00:00
jasonmoxham
e49eccab57
k8 params
2009-10-08 15:30:59 +00:00
jasonmoxham
4053a62930
k102 params
2009-10-08 15:28:51 +00:00
wbhart
8258200ecb
Fixed a major bug in mpn_tdiv_q and extended some tests to run longer.
2009-10-08 11:54:06 +00:00
wbhart
a4d22db74e
Extended the mpz/t-tdiv test and fixed a bug in mpn_tdiv_q.
2009-10-08 02:54:11 +00:00
jasonmoxham
9fb5a0cb90
nehalem 32bit params
2009-10-07 12:21:08 +00:00
jasonmoxham
2821267426
nehalem 64 params
2009-10-07 12:00:35 +00:00
wbhart
6d16ef649e
Fixed unusual bug in sb_divappr_q (I think).
2009-10-07 09:58:10 +00:00
gladman
448c60fd77
Add latest division code to the Windows builds
2009-10-07 09:21:06 +00:00
wbhart
938a8de8ed
Added fast mpn_tdiv_q and test code and used it in mpz_tdiv_q.
2009-10-07 05:27:59 +00:00
gladman
f42829cd8e
Core2 assembler (copies) for Windows
2009-10-05 15:56:30 +00:00
jasonmoxham
ff4fc75bb8
New core2/penryn asm fns mod_1_? divrem_hensel_qr_1_2 rsh_divrem_hensel_qr_1_2
2009-10-05 15:02:50 +00:00
jasonmoxham
a10fd0d649
New atom asm functions mod_1_? rsh_divrem_hensel_qr_1_1 divrem_hensel_qr_1_1
2009-10-05 14:50:05 +00:00
jasonmoxham
dc1fc39381
New nehalem asm functions mod_1_? rsh_divrem_hensel_qr_1_2 divrem_hensel_qr_1_2
2009-10-05 14:19:08 +00:00
gladman
343fe6ce88
Add mod_1_3 assembler to Windows
2009-10-05 08:29:15 +00:00
jasonmoxham
f8c338ad86
remove old copy,zero from fft
2009-10-05 02:14:27 +00:00
jasonmoxham
4d25d12c67
remove old dead addsub
2009-10-05 02:05:18 +00:00
jasonmoxham
ad4a181469
New AMD asm function mpn_mod_1_3
2009-10-04 23:49:02 +00:00