Brian Gladman
|
768a1699f3
|
switch skylake assembler for mullow_n_basecase
|
2017-02-16 20:08:59 +00:00 |
|
Brian Gladman
|
bde2739796
|
Merge branch 'master' of github.com:wbhart/mpir
|
2017-02-16 17:49:37 +00:00 |
|
Brian Gladman
|
1fd786c39c
|
add core2 and sandybridge assembler for mullow_n_basecase on WIndows
|
2017-02-16 17:41:41 +00:00 |
|
Brian Gladman
|
bd263c7bca
|
add mullow_n_basecase common assembler code to the Windows build
|
2017-02-16 12:58:49 +00:00 |
|
William Hart
|
167dd4c925
|
Update tuning values for broadwell.
|
2017-02-16 10:27:05 +00:00 |
|
William Hart
|
69d5e3e766
|
Merge branch 'master' of https://github.com/wbhart/mpir
|
2017-02-15 18:20:50 +00:00 |
|
William Hart
|
8e0ef9d2d6
|
Merge remote-tracking branch 'brian/master'
|
2017-02-15 18:19:18 +00:00 |
|
Brian Gladman
|
97477e4c1d
|
Merge branch 'master' of github.com:wbhart/mpir
|
2017-02-15 13:15:15 +00:00 |
|
Jean-Pierre Flori
|
28d2620ddf
|
Untrack autotools generated files.
|
2017-02-15 11:14:40 +00:00 |
|
William Hart
|
7819de4620
|
Added broadwell and fixed fat binary support, and fixed two yasm files.
|
2017-02-15 10:23:21 +00:00 |
|
Brian Gladman
|
b005717671
|
add latest tuning data
|
2017-02-14 17:47:19 +00:00 |
|
William Hart
|
58a7e98fcd
|
Run autoreconf.
|
2017-02-14 15:01:24 +00:00 |
|
William Hart
|
46d92b32f0
|
Add bulldozer tuning values.
|
2017-02-14 14:11:58 +00:00 |
|
William Hart
|
590abc8e20
|
Add haswell/avx tuning values.
|
2017-02-14 13:54:34 +00:00 |
|
William Hart
|
c52e0803e7
|
Add tuning values for Nehalem, Ivy Bridge, Skylake AVX.
|
2017-02-14 10:30:48 +00:00 |
|
William Hart
|
6b6213b58b
|
Fix missing semicolon and add tuning values for k102.
|
2017-02-13 17:27:54 +00:00 |
|
Jean-Pierre Flori
|
feb4d44508
|
Merge branch 'master' into powm
Conflicts:
configure.ac
gmp-impl.h
|
2017-02-13 16:42:13 +00:00 |
|
Brian Gladman
|
b961c90963
|
extend add_n/sub_n assembler code for Skylake AVX to provide add_nc/sub_nc
|
2017-02-09 12:26:38 +00:00 |
|
Brian Gladman
|
bd83f26a70
|
add Jens mpn_add_n and mpn_sub_n Skylake assembler to Windows
|
2017-02-03 16:29:45 +00:00 |
|
Brian Gladman
|
a0a04e75e3
|
add tuning for updated assembler code
|
2017-02-02 12:58:53 +00:00 |
|
Brian Gladman
|
5dd6dcb92d
|
Merge branch 'master' of github.com:wbhart/mpir
|
2017-02-02 12:45:38 +00:00 |
|
Alexander Kruppa
|
12ffe87475
|
Faster on Haswell, too
|
2017-01-31 15:18:18 +01:00 |
|
Alexander Kruppa
|
625c3ecc28
|
Parameters for AVX Skylake
|
2017-01-31 12:42:37 +01:00 |
|
Alexander Kruppa
|
d4453c1389
|
New parameters for AVX Haswell
|
2017-01-31 12:41:50 +01:00 |
|
Alex Kruppa
|
bc14d36599
|
Parameters for Bulldozer
|
2017-01-31 03:38:06 -08:00 |
|
Brian Gladman
|
bbc0f948a4
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2017-01-31 11:27:41 +00:00 |
|
Alexander Kruppa
|
43c4fdee30
|
Simplified hgcd2(), faster on Skylake
|
2017-01-30 10:29:07 +01:00 |
|
Alexander Kruppa
|
652854acd3
|
Plain-C nsumdiff_n function
|
2017-01-30 09:45:33 +01:00 |
|
Alexander Kruppa
|
2ce86fbd7e
|
Move mpn_hgcd_mul_matrix1_vector() to hgcd_matrix.c
|
2017-01-30 09:44:44 +01:00 |
|
Brian Gladman
|
ad24850c52
|
add haswell assembler code to WIndows x64 build
|
2017-01-22 22:41:01 +00:00 |
|
Brian Gladman
|
76e884a6a8
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2017-01-22 15:55:15 +00:00 |
|
Alexander Kruppa
|
578bf82f68
|
nsumdiff_n, computes -(a+b), a-b
|
2017-01-21 22:59:20 +01:00 |
|
Brian Gladman
|
f3f27ce9a2
|
add piledriver assembler code to Windows
|
2017-01-18 23:22:43 +00:00 |
|
Brian Gladman
|
3ce4ca48e3
|
add latest skylake AVX code to Windows
|
2017-01-18 13:09:22 +00:00 |
|
Brian Gladman
|
721da455a0
|
move haswell AVX code into a subdirectory
|
2017-01-18 11:56:24 +00:00 |
|
Brian Gladman
|
5d9df9b153
|
add karaadd and karasub for haswell to WIndows x64
|
2017-01-17 21:53:21 +00:00 |
|
Brian Gladman
|
0b24934325
|
Merge branch 'master' of https://github.com/akruppa/mpir
# Conflicts:
# mpn/x86_64/haswell/add_n.as
# mpn/x86_64/haswell/sub_n.as
# mpn/x86_64/skylake/add_n.as
# mpn/x86_64/skylake/sub_n.as
|
2017-01-17 09:05:55 +00:00 |
|
Alexander Kruppa
|
f82a093c18
|
mpn_sub_err1_n for AVX Skylake
2.25c/l
|
2017-01-17 06:47:38 +01:00 |
|
Alexander Kruppa
|
ff493ffc6a
|
mpn_add_err1_n for AVX Skylake
2.25c/l
|
2017-01-17 06:22:24 +01:00 |
|
Alexander Kruppa
|
e52db5b826
|
mul_1 by Nurmann
1.25c/l for large enough operands within L2
|
2017-01-16 23:23:00 +01:00 |
|
Alexander Kruppa
|
0c236c583a
|
add_n and sub_n by Nurmann, now reliably at 1c/l
|
2017-01-12 16:36:06 +01:00 |
|
Alexander Kruppa
|
366a80ddbc
|
Faster on Haswell, too
|
2017-01-11 20:26:09 +01:00 |
|
Alexander Kruppa
|
cf4c153842
|
Merge branch 'master' of github.com:akruppa/mpir
|
2017-01-11 20:19:45 +01:00 |
|
Alexander Kruppa
|
189047563e
|
karaadd that avoids rcl instructions
|
2017-01-10 01:27:36 +01:00 |
|
Alexander Kruppa
|
29e130da17
|
karasub that avoids rcl instructions and 3-component addresses
|
2017-01-10 01:27:22 +01:00 |
|
Brian Gladman
|
8edb5826fb
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2017-01-02 16:16:43 +00:00 |
|
Alex Kruppa
|
5fea3fd389
|
Use whichever existing function of MPIR or GMP is fastest for Bulldozer
|
2017-01-02 06:57:03 -08:00 |
|
Alex Kruppa
|
5c647dafc1
|
Add dummy IFDOS macro
|
2017-01-02 06:56:43 -08:00 |
|
Alex Kruppa
|
0595d7cc7c
|
Copied from core2/
The files differed only in one whitespace character
|
2017-01-02 06:32:54 -08:00 |
|
Alex Kruppa
|
8247a638e0
|
Copy SSE2 com_n from nehalem/
|
2016-12-31 12:26:35 -08:00 |
|