jasonmoxham
|
70c9a062d9
|
New asm functions for nehalem mpn_add_err1_n mpn_sub_err1_n
|
2009-09-01 15:03:33 +00:00 |
|
jasonmoxham
|
9a315eef2c
|
mixed up gas and yasm syntax
|
2009-08-23 23:52:01 +00:00 |
|
jasonmoxham
|
1c4bb4fa9f
|
didn't like it as a macro , so new amd asm functions mpn_inclsh mpn_declsh
|
2009-08-23 23:44:19 +00:00 |
|
jasonmoxham
|
a614713922
|
New macro/function for AMD mpn_inclsh_n
|
2009-08-23 23:04:14 +00:00 |
|
jasonmoxham
|
573b911273
|
New asm functions K8/K10 mpn_addlsh_n mpn_sublsh_n and carry-in varients
|
2009-08-23 22:20:49 +00:00 |
|
jasonmoxham
|
7268e5f9ac
|
New asm function nehalem mpn_addlsh_n , delete old mpn_addlsh1_n
|
2009-08-23 17:57:21 +00:00 |
|
jasonmoxham
|
620c9e38df
|
core2/penryn new addlsh faster than old addlsh1 , so delete
|
2009-08-23 16:43:52 +00:00 |
|
jasonmoxham
|
211e597c89
|
add new function core2/penryn mpn_addlsh_n
|
2009-08-23 15:58:03 +00:00 |
|
jasonmoxham
|
303f9fb219
|
New K8/K10 asm function mpn_sub_err1_n
|
2009-08-18 22:36:21 +00:00 |
|
jasonmoxham
|
392ea17854
|
New K8/K10 asm function add_err1
|
2009-08-18 15:37:23 +00:00 |
|
jasonmoxham
|
4f9d128e34
|
New asm functions mpn_copyi for core2/penryn
|
2009-08-14 09:03:07 +00:00 |
|
jasonmoxham
|
a69bf92c40
|
New asm function nehalem mpn_store
|
2009-08-13 09:57:49 +00:00 |
|
jasonmoxham
|
59b98ca38f
|
New core2/penryn mpn_store
|
2009-08-13 08:59:39 +00:00 |
|
jasonmoxham
|
1a7781ade8
|
New K8 asm function mpn_lshiftc
|
2009-08-12 03:13:11 +00:00 |
|
jasonmoxham
|
942d2666ca
|
Tweek K8 mpn_rshift
|
2009-08-11 02:27:30 +00:00 |
|
jasonmoxham
|
2d1deac90c
|
New K8 asm functions mpn_lshift2 mpn_rshift2
|
2009-07-24 11:54:46 +00:00 |
|
jasonmoxham
|
2dae13c07c
|
New intel x86_64 assembler code for left/right shift
|
2009-07-24 11:50:15 +00:00 |
|
wbhart
|
cc86e972e5
|
Merged in David Harvey's mulmid code - not actually used by anything
yet. No division code.
|
2009-07-24 03:12:09 +00:00 |
|
wbhart
|
9d8438f70b
|
Added toom53 and fiddled with the toom4 cutoff on penryn.
|
2009-07-23 07:48:34 +00:00 |
|
wbhart
|
42c02e9733
|
Moved new mul_basecase into netburst directory.
|
2009-06-07 00:32:55 +00:00 |
|
wbhart
|
d599623d25
|
Added tuning values for netburst.
|
2009-06-06 23:37:19 +00:00 |
|
wbhart
|
6b49328621
|
Slightly relaxed the conditions used in make tune to prevent tuning malfunctions and added make tune values for netburst.
|
2009-06-02 15:35:31 +00:00 |
|
wbhart
|
bc25f5bacd
|
More generic x86_64 tuning values and values for fat binaries (taken from k8 values).
|
2009-05-31 18:47:52 +00:00 |
|
wbhart
|
987e5940a8
|
Added some generic x86_64 tuning values (just copied from K8).
|
2009-05-31 18:40:51 +00:00 |
|
jasonmoxham
|
4a767f802c
|
update fat for divrem_2
|
2009-05-29 17:28:19 +00:00 |
|
wbhart
|
b96d7f466b
|
Tuning parameters for Core2.
|
2009-05-28 09:20:35 +00:00 |
|
wbhart
|
b0db490a0b
|
K8 tuning values.
|
2009-05-28 00:12:42 +00:00 |
|
jasonmoxham
|
2fff63ed30
|
nehalem mparam update
|
2009-05-27 23:02:45 +00:00 |
|
wbhart
|
5a67fa8b45
|
Added K10 tuning values.
|
2009-05-27 22:27:18 +00:00 |
|
wbhart
|
f33c6a799e
|
Tuning parameters for penryn.
|
2009-05-27 19:34:25 +00:00 |
|
jasonmoxham
|
749c195a7c
|
Convert new divrem to yasm format
|
2009-05-27 14:28:30 +00:00 |
|
jasonmoxham
|
1ff359d28a
|
divrem_2 speedup
|
2009-05-27 11:21:58 +00:00 |
|
jasonmoxham
|
61cd1223a3
|
New asm function mpn_divrem_euclidean_qr_2 for X86_64
|
2009-05-20 17:58:41 +00:00 |
|
jasonmoxham
|
e788f8b9b4
|
mul_basecase to yasm
|
2009-05-20 13:03:53 +00:00 |
|
jasonmoxham
|
5fb3fafe42
|
mul_basecase for GAS, so I know what the yasm conversion is of
|
2009-05-20 00:01:35 +00:00 |
|
jasonmoxham
|
9c68614d77
|
Add new function generic mpn_divrem_euclidean_qr_2
|
2009-05-19 09:30:34 +00:00 |
|
wbhart
|
a49b4e2337
|
More tuning values, not that they make any difference.
|
2009-05-19 05:44:48 +00:00 |
|
wbhart
|
3d66bd5322
|
Added tuning values for core2 and fixed a minor bug in mul_n.c.
|
2009-05-19 04:03:49 +00:00 |
|
wbhart
|
d96ef5e5d9
|
Reverted a change which slows things down on k8.
|
2009-05-19 02:04:09 +00:00 |
|
wbhart
|
e330cc79bc
|
Fixed some bugs related to tuning gcdext and added tuning for toom4 and
toom7 squaring code.
|
2009-05-19 00:57:17 +00:00 |
|
jasonmoxham
|
2768eeaaf0
|
New asm functions mpn_store MPN_ZERO for k8/k10/nehalem
|
2009-05-14 20:30:27 +00:00 |
|
jasonmoxham
|
3b7c555c8e
|
New generic functions/macros mpn_lshift2 mpn_rshift2 and tests/speed etc
|
2009-05-14 02:44:19 +00:00 |
|
jasonmoxham
|
ccf3200d93
|
mul_basecase tweeks
|
2009-05-13 22:51:35 +00:00 |
|
jasonmoxham
|
6ee4e35940
|
New asm function mpn_mul_basecase for K8/K10/Core2/Penryn/Nehalem
|
2009-05-13 19:49:42 +00:00 |
|
jasonmoxham
|
0a1d07af4e
|
New asm function mpn_sublsh1_n for K8/K10
|
2009-05-10 20:03:47 +00:00 |
|
jasonmoxham
|
574f3be308
|
New asm function mpn_divexact_byff for K8/K10/Core2/penryn/nehalem
|
2009-05-10 19:35:54 +00:00 |
|
jasonmoxham
|
359fab42b5
|
New asm functions mpn_rsh1add_n mpn_rsh1sub_n for K8/K10/Core2/penryn/nehalem
|
2009-05-10 18:46:48 +00:00 |
|
jasonmoxham
|
428e43b40e
|
New asm functions mpn_addadd_n mpn_addsub_n mpn_subadd_n for K8/K10
|
2009-05-10 16:25:01 +00:00 |
|
jasonmoxham
|
90d8207a80
|
New functions mpn_sumdiff for core2/penryn/nehalem , or rather faster to do separate add and sub
|
2009-05-10 03:39:43 +00:00 |
|
jasonmoxham
|
b07549802a
|
New asm functions mpn_add_n mpn_sub_n for Core2/penryn/nehalem
|
2009-05-10 01:26:52 +00:00 |
|