jasonmoxham
|
beb4d5b735
|
New K8 asm functions mpn_lshift2,3,4,5,6
|
2009-11-19 10:32:03 +00:00 |
|
jasonmoxham
|
c6af9fbfc2
|
some more masm? movq/movd mixups
|
2009-10-16 00:45:14 +00:00 |
|
jasonmoxham
|
4ba747128d
|
change movq to movd for old masm assembler
|
2009-10-15 18:21:27 +00:00 |
|
jasonmoxham
|
be135c7347
|
change asm #comment to C comment
|
2009-10-15 18:13:19 +00:00 |
|
jasonmoxham
|
c9f16233b8
|
add back in old fft tuning values, better than nothing
|
2009-10-08 22:50:40 +00:00 |
|
jasonmoxham
|
93688a18b6
|
k10 params
|
2009-10-08 18:01:18 +00:00 |
|
jasonmoxham
|
e49eccab57
|
k8 params
|
2009-10-08 15:30:59 +00:00 |
|
jasonmoxham
|
4053a62930
|
k102 params
|
2009-10-08 15:28:51 +00:00 |
|
jasonmoxham
|
ad4a181469
|
New AMD asm function mpn_mod_1_3
|
2009-10-04 23:49:02 +00:00 |
|
jasonmoxham
|
bf0f5c4e6f
|
add carry limb into the existing rsh_divrem_hensel
|
2009-10-04 02:16:25 +00:00 |
|
jasonmoxham
|
cdaad5bffc
|
New AMD asm function mpn_rsh_divrem_hensel_qr_1_2
|
2009-10-03 22:21:44 +00:00 |
|
jasonmoxham
|
90d8b76405
|
New amd asm function mpn_mod_1_2
|
2009-10-03 00:04:27 +00:00 |
|
jasonmoxham
|
520fec686d
|
tweak mod_1_1 amd asm to full speed
|
2009-10-02 02:57:31 +00:00 |
|
jasonmoxham
|
adfded6fe5
|
split out mpn_mod_1_? from divrem_euclidean_r and add New asm function for AMD for mod_1_1
|
2009-10-01 22:04:24 +00:00 |
|
jasonmoxham
|
07f97a0963
|
new AMD asm function mpn_divrem_hensel_qr_1_2
|
2009-09-30 23:41:09 +00:00 |
|
jasonmoxham
|
24d1b6c39b
|
rename divrem_hensel amd asm to match
|
2009-09-30 02:52:41 +00:00 |
|
jasonmoxham
|
56801786a7
|
New asm functions for AMD divrem_hensel_qr_1 divrem_hensel_r_1
|
2009-09-29 23:52:09 +00:00 |
|
jasonmoxham
|
4783c9dc4e
|
New AMD rsh_divrem_hensel_qr_1 asm fn
|
2009-09-29 21:53:03 +00:00 |
|
jasonmoxham
|
9a315eef2c
|
mixed up gas and yasm syntax
|
2009-08-23 23:52:01 +00:00 |
|
jasonmoxham
|
1c4bb4fa9f
|
didn't like it as a macro, so new amd asm functions mpn_inclsh mpn_declsh
|
2009-08-23 23:44:19 +00:00 |
|
jasonmoxham
|
a614713922
|
New macro/function for AMD mpn_inclsh_n
|
2009-08-23 23:04:14 +00:00 |
|
jasonmoxham
|
573b911273
|
New asm functions K8/K10 mpn_addlsh_n mpn_sublsh_n and carry-in variants
|
2009-08-23 22:20:49 +00:00 |
|
jasonmoxham
|
303f9fb219
|
New K8/K10 asm function mpn_sub_err1_n
|
2009-08-18 22:36:21 +00:00 |
|
jasonmoxham
|
392ea17854
|
New K8/K10 asm function add_err1
|
2009-08-18 15:37:23 +00:00 |
|
jasonmoxham
|
1a7781ade8
|
New K8 asm function mpn_lshiftc
|
2009-08-12 03:13:11 +00:00 |
|
jasonmoxham
|
942d2666ca
|
Tweak K8 mpn_rshift
|
2009-08-11 02:27:30 +00:00 |
|
jasonmoxham
|
2d1deac90c
|
New K8 asm functions mpn_lshift2 mpn_rshift2
|
2009-07-24 11:54:46 +00:00 |
|
wbhart
|
b0db490a0b
|
K8 tuning values.
|
2009-05-28 00:12:42 +00:00 |
|
wbhart
|
5a67fa8b45
|
Added K10 tuning values.
|
2009-05-27 22:27:18 +00:00 |
|
jasonmoxham
|
e788f8b9b4
|
mul_basecase to yasm
|
2009-05-20 13:03:53 +00:00 |
|
jasonmoxham
|
5fb3fafe42
|
mul_basecase for GAS, so I know what the yasm conversion is of
|
2009-05-20 00:01:35 +00:00 |
|
wbhart
|
d96ef5e5d9
|
Reverted a change which slows things down on k8.
|
2009-05-19 02:04:09 +00:00 |
|
jasonmoxham
|
2768eeaaf0
|
New asm functions mpn_store MPN_ZERO for k8/k10/nehalem
|
2009-05-14 20:30:27 +00:00 |
|
jasonmoxham
|
3b7c555c8e
|
New generic functions/macros mpn_lshift2 mpn_rshift2 and tests/speed etc
|
2009-05-14 02:44:19 +00:00 |
|
jasonmoxham
|
ccf3200d93
|
mul_basecase tweaks
|
2009-05-13 22:51:35 +00:00 |
|
jasonmoxham
|
6ee4e35940
|
New asm function mpn_mul_basecase for K8/K10/Core2/Penryn/Nehalem
|
2009-05-13 19:49:42 +00:00 |
|
jasonmoxham
|
0a1d07af4e
|
New asm function mpn_sublsh1_n for K8/K10
|
2009-05-10 20:03:47 +00:00 |
|
jasonmoxham
|
574f3be308
|
New asm function mpn_divexact_byff for K8/K10/Core2/penryn/nehalem
|
2009-05-10 19:35:54 +00:00 |
|
jasonmoxham
|
359fab42b5
|
New asm functions mpn_rsh1add_n mpn_rsh1sub_n for K8/K10/Core2/penryn/nehalem
|
2009-05-10 18:46:48 +00:00 |
|
jasonmoxham
|
428e43b40e
|
New asm functions mpn_addadd_n mpn_addsub_n mpn_subadd_n for K8/K10
|
2009-05-10 16:25:01 +00:00 |
|
jasonmoxham
|
0c3c909910
|
New asm functions for mpn_copyi mpn_copyd for k8,k10,core2,penryn,nehalem
|
2009-05-10 00:20:44 +00:00 |
|
wbhart
|
7a0e036d36
|
Fixed toom4 and toom7 issues and added k8 tuning code.
|
2009-05-09 21:12:13 +00:00 |
|
wbhart
|
72f93a085c
|
Added new toom3 code.
|
2009-05-09 20:56:34 +00:00 |
|
jasonmoxham
|
3599d92433
|
converted addmul_2 to yasm
|
2009-04-14 17:00:30 +00:00 |
|
jasonmoxham
|
38072364ee
|
duplicate x86_64 mul_2.as to overcome fat issues
|
2009-04-13 20:32:16 +00:00 |
|
jasonmoxham
|
875685f2a1
|
update gmp-mparam's
|
2009-04-11 04:03:34 +00:00 |
|
jasonmoxham
|
6787300718
|
remove un-needed case in mul_basecase.as for x86_64
|
2009-04-02 00:25:40 +00:00 |
|
jasonmoxham
|
d6f0373c37
|
update gmp-mparam for k10,core2,penryn
|
2009-04-01 22:48:19 +00:00 |
|
jasonmoxham
|
5ecc4581da
|
nehalem,k8 tune params
|
2009-04-01 22:13:15 +00:00 |
|
jasonmoxham
|
863fd95eb1
|
update k10,nehalem,core2,penryn gmp-mparam.h
|
2009-04-01 13:37:17 +00:00 |
|
wbhart
|
e42709e967
|
Added toom4 multiplication.
|
2009-04-01 08:21:03 +00:00 |
|
jasonmoxham
|
8ca3be5bef
|
merge div-branch into trunk with svn merge -r 1782:1816 ../branches/x86_64-division/ run on my local trunk
|
2009-03-31 23:56:06 +00:00 |
|
wbhart
|
32409ddc7d
|
Removed superfluous instructions from conversion to yasm format in diveby3.
|
2009-03-29 19:05:14 +00:00 |
|
jasonmoxham
|
2235444edf
|
x86_64 mpn_subadd_n plus tests,tune
|
2009-03-29 10:49:51 +00:00 |
|
jasonmoxham
|
5a048dae03
|
merged x86_64 cpuid branch into trunk with svn merge -r 1755:1779 ../branches/x86_64_cpuid/ run in my local copy of trunk
|
2009-03-19 19:52:22 +00:00 |
|