jasonmoxham
|
0ba8d8d2fb
|
corrected K8 not.asm
|
2011-02-21 16:42:26 +00:00 |
|
jasonmoxham
|
cd6fabdacf
|
New AMD mpn_not asm function
|
2011-02-20 23:13:34 +00:00 |
|
jasonmoxham
|
dd88a1b5fd
|
New AMD mpn_mod_1_2
|
2011-02-20 13:29:40 +00:00 |
|
jasonmoxham
|
75d6bf6f07
|
correct wrong name in amd submul_1
|
2011-02-20 09:27:40 +00:00 |
|
jasonmoxham
|
a4cd2bfbd1
|
update AMD mul_1 and submul_1 , just like the new addmul_1
|
2011-02-11 22:05:06 +00:00 |
|
jasonmoxham
|
c5ae3bd756
|
New AMD addmul_1 , was 450bytes now 407bytes
|
2011-01-28 10:44:16 +00:00 |
|
jasonmoxham
|
f4cad36a24
|
change calling convention on the asm code in x86_64 for redc_1
|
2010-12-16 22:39:03 +00:00 |
|
jasonmoxham
|
e34cec1068
|
change name of redc_basecase to redc_1 and export
|
2010-12-16 02:32:35 +00:00 |
|
jasonmoxham
|
874e62cbf0
|
New K8 asm function mpn_store
|
2010-12-05 10:35:49 +00:00 |
|
jasonmoxham
|
77e6892a3c
|
windows conversion script doestn't like sums in code
|
2010-12-05 01:34:06 +00:00 |
|
jasonmoxham
|
b820af3933
|
New K8 asm logic functions mpn_nand_n mpn_nior_n
|
2010-12-05 00:55:20 +00:00 |
|
jasonmoxham
|
b6c1ca003c
|
new K8 asm function popcount hamdist
|
2010-12-03 23:22:36 +00:00 |
|
jasonmoxham
|
d9e61dbf34
|
new asm code for K10 popcount hamdist
|
2010-12-03 23:00:04 +00:00 |
|
jasonmoxham
|
411107f047
|
replace divebyff.* with divexact_byff.*
|
2010-08-13 13:23:52 +00:00 |
|
jasonmoxham
|
484cceadc4
|
new K8only dir for asm that is faster on K8 but NOT K10
|
2010-06-26 09:00:48 +00:00 |
|
mvngu
|
1d0a85ca7a
|
k8: flavius.skynet Linux: tuning values for GCD_THRESHOLD and GCDEXT_THRESHOLD
|
2010-05-20 04:24:32 +00:00 |
|
jasonmoxham
|
53edce5a20
|
removed all old gcdext_threshold that were set to zero
|
2010-05-12 20:56:21 +00:00 |
|
wbhart
|
73c82fbe5c
|
Added tuning for k102.
|
2010-05-11 23:36:25 +00:00 |
|
(no author)
|
2f7dfb1c01
|
copy old fft tables into the updated gmp-mparam.h
|
2010-03-25 00:03:23 +00:00 |
|
(no author)
|
68ddf76fae
|
mpn/x86_64/k8/gmp-mparam.h from box2
|
2010-03-24 22:44:56 +00:00 |
|
(no author)
|
4edafb7ebc
|
mpn/x86_64/k8/k10/k102/gmp-mparam.h from lena
|
2010-03-24 19:35:05 +00:00 |
|
(no author)
|
627e796d79
|
mpn/x86_64/k8/k10/gmp-mparam.h from cuda1
|
2010-03-24 17:39:26 +00:00 |
|
(no author)
|
c5ce40f051
|
Tuning for K10 and fixed a tuning bug.
|
2010-03-14 19:24:27 +00:00 |
|
(no author)
|
2e7f1349e4
|
Better tuning values and commit a modified file.
|
2010-03-14 02:13:05 +00:00 |
|
(no author)
|
e3dd3ca81a
|
Fixed K102 tuning values.
|
2010-03-04 19:17:58 +00:00 |
|
(no author)
|
c7bd66e7a4
|
Finished tuning code for dc_bdiv fns and tuned for K102.
|
2010-03-03 00:30:14 +00:00 |
|
(no author)
|
1d88fcbc0f
|
Fixed a tuning bug and tuned for K102.
|
2010-02-19 16:53:01 +00:00 |
|
(no author)
|
f50d7f04ff
|
Hopefully fixed tuning of division routines.
|
2010-02-19 15:22:22 +00:00 |
|
(no author)
|
f444a2bf6c
|
Attempt to tune some of the division functions.
|
2010-02-19 12:54:56 +00:00 |
|
(no author)
|
19b37fb7c1
|
Added speed, tune and try code for toom8_mul/sqr and tuning for K102.
|
2010-02-18 14:50:40 +00:00 |
|
(no author)
|
9cb500d53b
|
Turned on toom8 squaring code.
|
2010-02-18 13:40:38 +00:00 |
|
(no author)
|
3fad25a740
|
Added toom8 files.
|
2010-02-16 23:47:07 +00:00 |
|
(no author)
|
a2c42da38f
|
Fiddled with tuning value.
|
2010-02-13 21:20:12 +00:00 |
|
wbhart
|
c8d33128b5
|
Made a change to hopefully support unaligned memory allocation as requested by
Dan Grayson. Passes make check and try mpn_lshift.
|
2010-01-10 23:54:55 +00:00 |
|
wbhart
|
0200f63f3b
|
Added tuning params for K102. Added new fft code to main directory.
|
2010-01-01 14:06:41 +00:00 |
|
jasonmoxham
|
afc620f493
|
new K8 asm functions mpn_popcount mpn_hamdist
|
2009-11-19 10:38:40 +00:00 |
|
jasonmoxham
|
367f00f4fe
|
New K8 asm functions mpn_and,ior,xor
|
2009-11-19 10:35:17 +00:00 |
|
jasonmoxham
|
beb4d5b735
|
New K8 asm funtions mpn_lshift2,3,4,5,6
|
2009-11-19 10:32:03 +00:00 |
|
jasonmoxham
|
c6af9fbfc2
|
some more masm? movq/movd mixups
|
2009-10-16 00:45:14 +00:00 |
|
jasonmoxham
|
4ba747128d
|
change movq to movd for old masm assembler
|
2009-10-15 18:21:27 +00:00 |
|
jasonmoxham
|
be135c7347
|
change asm #comment to C comment
|
2009-10-15 18:13:19 +00:00 |
|
jasonmoxham
|
c9f16233b8
|
add back in old fft tuning values , better than nothing
|
2009-10-08 22:50:40 +00:00 |
|
jasonmoxham
|
93688a18b6
|
k10 params
|
2009-10-08 18:01:18 +00:00 |
|
jasonmoxham
|
e49eccab57
|
k8 params
|
2009-10-08 15:30:59 +00:00 |
|
jasonmoxham
|
4053a62930
|
k102 params
|
2009-10-08 15:28:51 +00:00 |
|
jasonmoxham
|
ad4a181469
|
New AMD asm function mpn_mod_1_3
|
2009-10-04 23:49:02 +00:00 |
|
jasonmoxham
|
bf0f5c4e6f
|
add carry limb into the existing rsh_divrem_hensel
|
2009-10-04 02:16:25 +00:00 |
|
jasonmoxham
|
cdaad5bffc
|
New AMD asm function mpn_rsh_divrem_hensel_qr_1_2
|
2009-10-03 22:21:44 +00:00 |
|
jasonmoxham
|
90d8b76405
|
New amd asm function mpn_mod_1_2
|
2009-10-03 00:04:27 +00:00 |
|
jasonmoxham
|
520fec686d
|
tweek mod_1_1 amd asm to full speed
|
2009-10-02 02:57:31 +00:00 |
|