Commit Graph

105 Commits

Author SHA1 Message Date
jasonmoxham
f1ec8a275e add half/double test code and correct return values 2011-07-13 12:54:47 +00:00
jasonmoxham
5955fc2424 mpn_half mpn_double asm for K8 2011-07-13 09:48:02 +00:00
jasonmoxham
0573a58d96 tweek karaadd/sub , add k10 versions 2011-07-05 11:20:40 +00:00
jasonmoxham
543331fc68 karasub change mirrored to karaadd 2011-07-04 20:41:38 +00:00
jasonmoxham
9638ffae9d karasub fixed rcx simplifications 2011-07-04 20:34:42 +00:00
jasonmoxham
94cd105050 karasub swap rcx,rdx save one inst 2011-07-04 20:13:45 +00:00
jasonmoxham
73e2007369 add new karaadd/sub to speed 2011-07-04 18:14:25 +00:00
jasonmoxham
96bad6323d removr 2.3 branch 2011-07-04 16:53:52 +00:00
jasonmoxham
38246bb348 kara tweek 2011-07-04 05:55:43 +00:00
jasonmoxham
a962f39ee1 karaadd karasub and toom22 2011-07-04 04:34:15 +00:00
jasonmoxham
75cc63f4c7 delete , for now k8's udiv.asm and umul.asm 2011-04-04 16:36:07 +00:00
jasonmoxham
0302f8f283 retune k10 params 2011-02-22 09:34:42 +00:00
jasonmoxham
9868d5ddef retune K8 params 2011-02-22 08:52:03 +00:00
jasonmoxham
0ba8d8d2fb corrected K8 not.asm 2011-02-21 16:42:26 +00:00
jasonmoxham
cd6fabdacf New AMD mpn_not asm function 2011-02-20 23:13:34 +00:00
jasonmoxham
dd88a1b5fd New AMD mpn_mod_1_2 2011-02-20 13:29:40 +00:00
jasonmoxham
75d6bf6f07 correct wrong name in amd submul_1 2011-02-20 09:27:40 +00:00
jasonmoxham
a4cd2bfbd1 update AMD mul_1 and submul_1 , just like the new addmul_1 2011-02-11 22:05:06 +00:00
jasonmoxham
c5ae3bd756 New AMD addmul_1 , was 450bytes now 407bytes 2011-01-28 10:44:16 +00:00
jasonmoxham
f4cad36a24 change calling convention on the asm code in x86_64 for redc_1 2010-12-16 22:39:03 +00:00
jasonmoxham
e34cec1068 change name of redc_basecase to redc_1 and export 2010-12-16 02:32:35 +00:00
jasonmoxham
874e62cbf0 New K8 asm function mpn_store 2010-12-05 10:35:49 +00:00
jasonmoxham
77e6892a3c windows conversion script doestn't like sums in code 2010-12-05 01:34:06 +00:00
jasonmoxham
b820af3933 New K8 asm logic functions mpn_nand_n mpn_nior_n 2010-12-05 00:55:20 +00:00
jasonmoxham
b6c1ca003c new K8 asm function popcount hamdist 2010-12-03 23:22:36 +00:00
jasonmoxham
d9e61dbf34 new asm code for K10 popcount hamdist 2010-12-03 23:00:04 +00:00
jasonmoxham
411107f047 replace divebyff.* with divexact_byff.* 2010-08-13 13:23:52 +00:00
jasonmoxham
484cceadc4 new K8only dir for asm that is faster on K8 but NOT K10 2010-06-26 09:00:48 +00:00
mvngu
1d0a85ca7a k8: flavius.skynet Linux: tuning values for GCD_THRESHOLD and GCDEXT_THRESHOLD 2010-05-20 04:24:32 +00:00
jasonmoxham
53edce5a20 removed all old gcdext_threshold that were set to zero 2010-05-12 20:56:21 +00:00
wbhart
73c82fbe5c Added tuning for k102. 2010-05-11 23:36:25 +00:00
(no author)
2f7dfb1c01 copy old fft tables into the updated gmp-mparam.h 2010-03-25 00:03:23 +00:00
(no author)
68ddf76fae mpn/x86_64/k8/gmp-mparam.h from box2 2010-03-24 22:44:56 +00:00
(no author)
4edafb7ebc mpn/x86_64/k8/k10/k102/gmp-mparam.h from lena 2010-03-24 19:35:05 +00:00
(no author)
627e796d79 mpn/x86_64/k8/k10/gmp-mparam.h from cuda1 2010-03-24 17:39:26 +00:00
(no author)
c5ce40f051 Tuning for K10 and fixed a tuning bug. 2010-03-14 19:24:27 +00:00
(no author)
2e7f1349e4 Better tuning values and commit a modified file. 2010-03-14 02:13:05 +00:00
(no author)
e3dd3ca81a Fixed K102 tuning values. 2010-03-04 19:17:58 +00:00
(no author)
c7bd66e7a4 Finished tuning code for dc_bdiv fns and tuned for K102. 2010-03-03 00:30:14 +00:00
(no author)
1d88fcbc0f Fixed a tuning bug and tuned for K102. 2010-02-19 16:53:01 +00:00
(no author)
f50d7f04ff Hopefully fixed tuning of division routines. 2010-02-19 15:22:22 +00:00
(no author)
f444a2bf6c Attempt to tune some of the division functions. 2010-02-19 12:54:56 +00:00
(no author)
19b37fb7c1 Added speed, tune and try code for toom8_mul/sqr and tuning for K102. 2010-02-18 14:50:40 +00:00
(no author)
9cb500d53b Turned on toom8 squaring code. 2010-02-18 13:40:38 +00:00
(no author)
3fad25a740 Added toom8 files. 2010-02-16 23:47:07 +00:00
(no author)
a2c42da38f Fiddled with tuning value. 2010-02-13 21:20:12 +00:00
wbhart
c8d33128b5 Made a change to hopefully support unaligned memory allocation as requested by
Dan Grayson. Passes make check and try mpn_lshift.
2010-01-10 23:54:55 +00:00
wbhart
0200f63f3b Added tuning params for K102. Added new fft code to main directory. 2010-01-01 14:06:41 +00:00
jasonmoxham
afc620f493 new K8 asm functions mpn_popcount mpn_hamdist 2009-11-19 10:38:40 +00:00
jasonmoxham
367f00f4fe New K8 asm functions mpn_and,ior,xor 2009-11-19 10:35:17 +00:00