Commit Graph

92 Commits

Author SHA1 Message Date
jasonmoxham
0ba8d8d2fb corrected K8 not.asm 2011-02-21 16:42:26 +00:00
jasonmoxham
cd6fabdacf New AMD mpn_not asm function 2011-02-20 23:13:34 +00:00
jasonmoxham
dd88a1b5fd New AMD mpn_mod_1_2 2011-02-20 13:29:40 +00:00
jasonmoxham
75d6bf6f07 correct wrong name in amd submul_1 2011-02-20 09:27:40 +00:00
jasonmoxham
a4cd2bfbd1 update AMD mul_1 and submul_1 , just like the new addmul_1 2011-02-11 22:05:06 +00:00
jasonmoxham
c5ae3bd756 New AMD addmul_1 , was 450bytes now 407bytes 2011-01-28 10:44:16 +00:00
jasonmoxham
f4cad36a24 change calling convention on the asm code in x86_64 for redc_1 2010-12-16 22:39:03 +00:00
jasonmoxham
e34cec1068 change name of redc_basecase to redc_1 and export 2010-12-16 02:32:35 +00:00
jasonmoxham
874e62cbf0 New K8 asm function mpn_store 2010-12-05 10:35:49 +00:00
jasonmoxham
77e6892a3c windows conversion script doesn't like sums in code 2010-12-05 01:34:06 +00:00
jasonmoxham
b820af3933 New K8 asm logic functions mpn_nand_n mpn_nior_n 2010-12-05 00:55:20 +00:00
jasonmoxham
b6c1ca003c new K8 asm function popcount hamdist 2010-12-03 23:22:36 +00:00
jasonmoxham
d9e61dbf34 new asm code for K10 popcount hamdist 2010-12-03 23:00:04 +00:00
jasonmoxham
411107f047 replace divebyff.* with divexact_byff.* 2010-08-13 13:23:52 +00:00
jasonmoxham
484cceadc4 new K8only dir for asm that is faster on K8 but NOT K10 2010-06-26 09:00:48 +00:00
mvngu
1d0a85ca7a k8: flavius.skynet Linux: tuning values for GCD_THRESHOLD and GCDEXT_THRESHOLD 2010-05-20 04:24:32 +00:00
jasonmoxham
53edce5a20 removed all old gcdext_threshold that were set to zero 2010-05-12 20:56:21 +00:00
wbhart
73c82fbe5c Added tuning for k102. 2010-05-11 23:36:25 +00:00
(no author)
2f7dfb1c01 copy old fft tables into the updated gmp-mparam.h 2010-03-25 00:03:23 +00:00
(no author)
68ddf76fae mpn/x86_64/k8/gmp-mparam.h from box2 2010-03-24 22:44:56 +00:00
(no author)
4edafb7ebc mpn/x86_64/k8/k10/k102/gmp-mparam.h from lena 2010-03-24 19:35:05 +00:00
(no author)
627e796d79 mpn/x86_64/k8/k10/gmp-mparam.h from cuda1 2010-03-24 17:39:26 +00:00
(no author)
c5ce40f051 Tuning for K10 and fixed a tuning bug. 2010-03-14 19:24:27 +00:00
(no author)
2e7f1349e4 Better tuning values and commit a modified file. 2010-03-14 02:13:05 +00:00
(no author)
e3dd3ca81a Fixed K102 tuning values. 2010-03-04 19:17:58 +00:00
(no author)
c7bd66e7a4 Finished tuning code for dc_bdiv fns and tuned for K102. 2010-03-03 00:30:14 +00:00
(no author)
1d88fcbc0f Fixed a tuning bug and tuned for K102. 2010-02-19 16:53:01 +00:00
(no author)
f50d7f04ff Hopefully fixed tuning of division routines. 2010-02-19 15:22:22 +00:00
(no author)
f444a2bf6c Attempt to tune some of the division functions. 2010-02-19 12:54:56 +00:00
(no author)
19b37fb7c1 Added speed, tune and try code for toom8_mul/sqr and tuning for K102. 2010-02-18 14:50:40 +00:00
(no author)
9cb500d53b Turned on toom8 squaring code. 2010-02-18 13:40:38 +00:00
(no author)
3fad25a740 Added toom8 files. 2010-02-16 23:47:07 +00:00
(no author)
a2c42da38f Fiddled with tuning value. 2010-02-13 21:20:12 +00:00
wbhart
c8d33128b5 Made a change to hopefully support unaligned memory allocation as requested by Dan Grayson. Passes make check and try mpn_lshift. 2010-01-10 23:54:55 +00:00
wbhart
0200f63f3b Added tuning params for K102. Added new fft code to main directory. 2010-01-01 14:06:41 +00:00
jasonmoxham
afc620f493 new K8 asm functions mpn_popcount mpn_hamdist 2009-11-19 10:38:40 +00:00
jasonmoxham
367f00f4fe New K8 asm functions mpn_and,ior,xor 2009-11-19 10:35:17 +00:00
jasonmoxham
beb4d5b735 New K8 asm functions mpn_lshift2,3,4,5,6 2009-11-19 10:32:03 +00:00
jasonmoxham
c6af9fbfc2 some more masm? movq/movd mixups 2009-10-16 00:45:14 +00:00
jasonmoxham
4ba747128d change movq to movd for old masm assembler 2009-10-15 18:21:27 +00:00
jasonmoxham
be135c7347 change asm #comment to C comment 2009-10-15 18:13:19 +00:00
jasonmoxham
c9f16233b8 add back in old fft tuning values , better than nothing 2009-10-08 22:50:40 +00:00
jasonmoxham
93688a18b6 k10 params 2009-10-08 18:01:18 +00:00
jasonmoxham
e49eccab57 k8 params 2009-10-08 15:30:59 +00:00
jasonmoxham
4053a62930 k102 params 2009-10-08 15:28:51 +00:00
jasonmoxham
ad4a181469 New AMD asm function mpn_mod_1_3 2009-10-04 23:49:02 +00:00
jasonmoxham
bf0f5c4e6f add carry limb into the existing rsh_divrem_hensel 2009-10-04 02:16:25 +00:00
jasonmoxham
cdaad5bffc New AMD asm function mpn_rsh_divrem_hensel_qr_1_2 2009-10-03 22:21:44 +00:00
jasonmoxham
90d8b76405 New amd asm function mpn_mod_1_2 2009-10-03 00:04:27 +00:00
jasonmoxham
520fec686d tweak mod_1_1 amd asm to full speed 2009-10-02 02:57:31 +00:00