wbhart
|
73c82fbe5c
|
Added tuning for k102.
|
2010-05-11 23:36:25 +00:00 |
|
wbhart
|
0291aa8fa5
|
Fixed the gcd tuning code and added core2/penryn tuning.
|
2010-05-11 23:27:43 +00:00 |
|
wbhart
|
236fcac5c7
|
Fixed the xgcd normalisation issue and redid the tuning code for gcd and
xgcd.
|
2010-05-11 18:37:34 +00:00 |
|
(no author)
|
d41166e589
|
retune atom params
|
2010-04-08 21:49:34 +00:00 |
|
(no author)
|
eec9eac26f
|
Added tuning for i7 (thanks to Gonzalo Tornaria).
|
2010-04-08 00:19:26 +00:00 |
|
(no author)
|
fde0379365
|
Tuning for netburst.
|
2010-04-02 04:48:42 +00:00 |
|
(no author)
|
2f7dfb1c01
|
copy old fft tables into the updated gmp-mparam.h
|
2010-03-25 00:03:23 +00:00 |
|
(no author)
|
68ddf76fae
|
mpn/x86_64/k8/gmp-mparam.h from box2
|
2010-03-24 22:44:56 +00:00 |
|
(no author)
|
4edafb7ebc
|
mpn/x86_64/k8/k10/k102/gmp-mparam.h from lena
|
2010-03-24 19:35:05 +00:00 |
|
(no author)
|
5a9c6aa3eb
|
mpn/x86_64/core2/penryn/gmp-mparam.h from mod
|
2010-03-24 19:32:19 +00:00 |
|
(no author)
|
6c57fd310f
|
mpn/x86_64/atom/gmp-mparam.h from atom
|
2010-03-24 18:11:22 +00:00 |
|
(no author)
|
1555039bd3
|
mpn/x86_64/core2/gmp-mparam.h from eno
|
2010-03-24 17:43:47 +00:00 |
|
(no author)
|
627e796d79
|
mpn/x86_64/k8/k10/gmp-mparam.h from cuda1
|
2010-03-24 17:39:26 +00:00 |
|
(no author)
|
5c8556ba1e
|
params for atom
|
2010-03-24 15:47:51 +00:00 |
|
(no author)
|
5b91e6fe9d
|
add divrem_2 to the fat structure
|
2010-03-18 15:27:42 +00:00 |
|
(no author)
|
c5ce40f051
|
Tuning for K10 and fixed a tuning bug.
|
2010-03-14 19:24:27 +00:00 |
|
(no author)
|
5bef504b53
|
Rearrange tuning code and retune for core2/penryn.
|
2010-03-14 18:50:31 +00:00 |
|
(no author)
|
2e7f1349e4
|
Better tuning values and commit a modified file.
|
2010-03-14 02:13:05 +00:00 |
|
(no author)
|
a504019ac0
|
Slight speedup of toom4 code.
|
2010-03-14 02:01:52 +00:00 |
|
(no author)
|
57e1701e57
|
Core2/penryn tuning.
|
2010-03-07 03:46:51 +00:00 |
|
(no author)
|
e3dd3ca81a
|
Fixed K102 tuning values.
|
2010-03-04 19:17:58 +00:00 |
|
(no author)
|
476da06377
|
Tuned for core2/penryn.
|
2010-03-03 00:33:07 +00:00 |
|
(no author)
|
c7bd66e7a4
|
Finished tuning code for dc_bdiv fns and tuned for K102.
|
2010-03-03 00:30:14 +00:00 |
|
(no author)
|
1ef58c1d86
|
Added mulmid_basecase assembly code and fixed a few build issues.
|
2010-02-23 23:50:37 +00:00 |
|
(no author)
|
5675665544
|
Added mulmid_basecase assembly code from David's patch.
|
2010-02-23 23:14:51 +00:00 |
|
(no author)
|
0ad3dfc64c
|
Retune for Core2/penryn.
|
2010-02-21 17:08:33 +00:00 |
|
(no author)
|
27e7e1a492
|
Tuned yet again for core2/penryn.
|
2010-02-19 17:06:19 +00:00 |
|
(no author)
|
1d88fcbc0f
|
Fixed a tuning bug and tuned for K102.
|
2010-02-19 16:53:01 +00:00 |
|
(no author)
|
2175d5423a
|
Retuned for core2/penryn.
|
2010-02-19 16:20:55 +00:00 |
|
(no author)
|
596ed07fbc
|
Added some tuning for core2/penryn and some additional functions to be
built by tuning code.
|
2010-02-19 16:09:02 +00:00 |
|
(no author)
|
f50d7f04ff
|
Hopefully fixed tuning of division routines.
|
2010-02-19 15:22:22 +00:00 |
|
(no author)
|
f444a2bf6c
|
Attempt to tune some of the division functions.
|
2010-02-19 12:54:56 +00:00 |
|
(no author)
|
2e2976dd0e
|
Added tuning info for core2/penryn.
|
2010-02-18 16:03:50 +00:00 |
|
(no author)
|
19b37fb7c1
|
Added speed, tune and try code for toom8_mul/sqr and tuning for K102.
|
2010-02-18 14:50:40 +00:00 |
|
(no author)
|
fea5f0c849
|
Added a missing mpir.h and some proxy tuning values for core2/penryn
toom8 squaring.
|
2010-02-18 13:46:24 +00:00 |
|
(no author)
|
9cb500d53b
|
Turned on toom8 squaring code.
|
2010-02-18 13:40:38 +00:00 |
|
(no author)
|
3fad25a740
|
Added toom8 files.
|
2010-02-16 23:47:07 +00:00 |
|
(no author)
|
a2c42da38f
|
Fiddled with tuning value.
|
2010-02-13 21:20:12 +00:00 |
|
(no author)
|
20ae4de5b1
|
Ran autoconf, connnected up rootrem code and changed tuning value.
|
2010-02-13 21:18:11 +00:00 |
|
(no author)
|
1758461822
|
Another minor change to core2 tuning values. Seems to slightly improve
timings (though almost imperceptibly).
|
2010-02-09 00:43:06 +00:00 |
|
(no author)
|
9639773959
|
Hand tuned SQR thresholds for core2/penryn.
|
2010-02-07 15:43:19 +00:00 |
|
(no author)
|
2edb3c830a
|
Added a clarification to the copyright.
|
2010-02-07 14:07:42 +00:00 |
|
(no author)
|
0c7b48b1fd
|
Convert divrem_2.asm to yasm format.
|
2010-02-07 14:02:39 +00:00 |
|
(no author)
|
a54d23d652
|
Convert divem_euclidean_qr_1.asm to yasm format.
|
2010-02-07 13:32:16 +00:00 |
|
(no author)
|
0f5bb75287
|
New tuning values for core2/penryn.
|
2010-02-07 03:28:55 +00:00 |
|
(no author)
|
06d4a32943
|
Added divrem_1 and divrem_2 x86_64 assembly code from GMP, replaced
divrem_euclidean_qr_1 with divrem_1.
|
2010-02-06 13:28:23 +00:00 |
|
(no author)
|
0bd0a87f7e
|
Temporarily committing these so they can be converted to yasm format.
|
2010-02-06 03:16:22 +00:00 |
|
wbhart
|
c8d33128b5
|
Made a change to hopefully support unaligned memory allocation as requested by
Dan Grayson. Passes make check and try mpn_lshift.
|
2010-01-10 23:54:55 +00:00 |
|
wbhart
|
0200f63f3b
|
Added tuning params for K102. Added new fft code to main directory.
|
2010-01-01 14:06:41 +00:00 |
|
jasonmoxham
|
829dacd87a
|
New core2/penryn asm functions popcount hamdist
|
2009-11-19 10:53:45 +00:00 |
|
jasonmoxham
|
afc620f493
|
new K8 asm functions mpn_popcount mpn_hamdist
|
2009-11-19 10:38:40 +00:00 |
|
jasonmoxham
|
367f00f4fe
|
New K8 asm functions mpn_and,ior,xor
|
2009-11-19 10:35:17 +00:00 |
|
jasonmoxham
|
beb4d5b735
|
New K8 asm funtions mpn_lshift2,3,4,5,6
|
2009-11-19 10:32:03 +00:00 |
|
jasonmoxham
|
7bd0558974
|
New atom asm funtion mpn-copyi
|
2009-11-19 09:09:50 +00:00 |
|
jasonmoxham
|
ef2b3db6ef
|
remove # comments
|
2009-11-19 08:49:14 +00:00 |
|
jasonmoxham
|
8bcfe2975a
|
New nehalem asm mpn_copyi mpn_copyd
|
2009-11-19 08:45:27 +00:00 |
|
jasonmoxham
|
579e36f2b1
|
New nehalem asm mpn_com
|
2009-11-19 08:44:50 +00:00 |
|
jasonmoxham
|
59bf8d86e4
|
new nehalem asm logic mpn fn's
|
2009-11-19 08:37:54 +00:00 |
|
jasonmoxham
|
dcf3afa567
|
convert addlsh from gas to yasm format
|
2009-11-18 17:43:25 +00:00 |
|
jasonmoxham
|
c6af9fbfc2
|
some more masm? movq/movd mixups
|
2009-10-16 00:45:14 +00:00 |
|
jasonmoxham
|
4ba747128d
|
change movq to movd for old masm assembler
|
2009-10-15 18:21:27 +00:00 |
|
jasonmoxham
|
be135c7347
|
change asm #comment to C comment
|
2009-10-15 18:13:19 +00:00 |
|
jasonmoxham
|
c9f16233b8
|
add back in old fft tuning values , better than nothing
|
2009-10-08 22:50:40 +00:00 |
|
jasonmoxham
|
518226d914
|
atom params
|
2009-10-08 22:06:01 +00:00 |
|
jasonmoxham
|
93688a18b6
|
k10 params
|
2009-10-08 18:01:18 +00:00 |
|
jasonmoxham
|
2f138f7a16
|
core2 params
|
2009-10-08 15:48:27 +00:00 |
|
jasonmoxham
|
e49eccab57
|
k8 params
|
2009-10-08 15:30:59 +00:00 |
|
jasonmoxham
|
4053a62930
|
k102 params
|
2009-10-08 15:28:51 +00:00 |
|
jasonmoxham
|
2821267426
|
nehalem 64 params
|
2009-10-07 12:00:35 +00:00 |
|
jasonmoxham
|
ff4fc75bb8
|
New core2/penryn asm fns mod_1_? divrem_hensel_qr_1_2 rsh_divrem_hensel_qr_1_2
|
2009-10-05 15:02:50 +00:00 |
|
jasonmoxham
|
a10fd0d649
|
New atom asm functions mod_1_? rsh_divrem_hensel_qr_1_1 divrem_hensel_qr_1_1
|
2009-10-05 14:50:05 +00:00 |
|
jasonmoxham
|
dc1fc39381
|
New nehalem asm functions mod_1_? rsh_divrem_hensel_qr_1_2 divrem_hensel_qr_1_2
|
2009-10-05 14:19:08 +00:00 |
|
jasonmoxham
|
ad4a181469
|
New AMD asm function mpn_mod_1_3
|
2009-10-04 23:49:02 +00:00 |
|
jasonmoxham
|
bf0f5c4e6f
|
add carry limb into the existing rsh_divrem_hensel
|
2009-10-04 02:16:25 +00:00 |
|
jasonmoxham
|
cdaad5bffc
|
New AMD asm function mpn_rsh_divrem_hensel_qr_1_2
|
2009-10-03 22:21:44 +00:00 |
|
jasonmoxham
|
90d8b76405
|
New amd asm function mpn_mod_1_2
|
2009-10-03 00:04:27 +00:00 |
|
jasonmoxham
|
520fec686d
|
tweek mod_1_1 amd asm to full speed
|
2009-10-02 02:57:31 +00:00 |
|
jasonmoxham
|
adfded6fe5
|
split out mpn_mod_1_? from divrem_euclidean_r and add New asm function for AMD for mod_1_1
|
2009-10-01 22:04:24 +00:00 |
|
jasonmoxham
|
07f97a0963
|
new AMD asm function mpn_divrem_hensel_qr_1_2
|
2009-09-30 23:41:09 +00:00 |
|
jasonmoxham
|
24d1b6c39b
|
rename divrem_hensel amd asm to match
|
2009-09-30 02:52:41 +00:00 |
|
jasonmoxham
|
56801786a7
|
New asm functions for AMD divrem_hensel_qr_1 divrem_hensel_r_1
|
2009-09-29 23:52:09 +00:00 |
|
jasonmoxham
|
4783c9dc4e
|
New AMD rsh_divrem_hensel_qr_1 asm fn
|
2009-09-29 21:53:03 +00:00 |
|
jasonmoxham
|
a2bf208858
|
add via nano cpuid and code path
|
2009-09-27 23:12:12 +00:00 |
|
jasonmoxham
|
f446d380ce
|
whoops uppercase instead of lower case
|
2009-09-08 02:05:31 +00:00 |
|
jasonmoxham
|
993fab5c21
|
rename K10_2 to K102 as autotools doesnt like - and fat mechanism doesn't like _
|
2009-09-08 01:55:29 +00:00 |
|
jasonmoxham
|
9010f58508
|
update fat to cope with K10_2 and core2,k8 etc on 32bit
|
2009-09-08 01:49:01 +00:00 |
|
jasonmoxham
|
a6542196d3
|
Select best asm functions from existing for Atom cpu 64bit
|
2009-09-06 12:49:19 +00:00 |
|
jasonmoxham
|
70c9a062d9
|
New asm functions for nehalem mpn_add_err1_n mpn_sub_err1_n
|
2009-09-01 15:03:33 +00:00 |
|
jasonmoxham
|
9a315eef2c
|
mixed up gas and yasm syntax
|
2009-08-23 23:52:01 +00:00 |
|
jasonmoxham
|
1c4bb4fa9f
|
didn't like it as a macro , so new amd asm functions mpn_inclsh mpn_declsh
|
2009-08-23 23:44:19 +00:00 |
|
jasonmoxham
|
a614713922
|
New macro/function for AMD mpn_inclsh_n
|
2009-08-23 23:04:14 +00:00 |
|
jasonmoxham
|
573b911273
|
New asm functions K8/K10 mpn_addlsh_n mpn_sublsh_n and carry-in varients
|
2009-08-23 22:20:49 +00:00 |
|
jasonmoxham
|
7268e5f9ac
|
New asm function nehalem mpn_addlsh_n , delete old mpn_addlsh1_n
|
2009-08-23 17:57:21 +00:00 |
|
jasonmoxham
|
620c9e38df
|
core2/penryn new addlsh faster than old addlsh1 , so delete
|
2009-08-23 16:43:52 +00:00 |
|
jasonmoxham
|
211e597c89
|
add new function core2/penryn mpn_addlsh_n
|
2009-08-23 15:58:03 +00:00 |
|
jasonmoxham
|
303f9fb219
|
New K8/K10 asm function mpn_sub_err1_n
|
2009-08-18 22:36:21 +00:00 |
|
jasonmoxham
|
392ea17854
|
New K8/K10 asm function add_err1
|
2009-08-18 15:37:23 +00:00 |
|
jasonmoxham
|
4f9d128e34
|
New asm functions mpn_copyi for core2/penryn
|
2009-08-14 09:03:07 +00:00 |
|
jasonmoxham
|
a69bf92c40
|
New asm function nehalem mpn_store
|
2009-08-13 09:57:49 +00:00 |
|
jasonmoxham
|
59b98ca38f
|
New core2/penryn mpn_store
|
2009-08-13 08:59:39 +00:00 |
|