3191 Commits

Author SHA1 Message Date
Brian Gladman
b961c90963 extend add_n/sub_n assembler code for Skylake AVX to provide add_nc/sub_nc 2017-02-09 12:26:38 +00:00
Brian Gladman
bd83f26a70 add Jens mpn_add_n and mpn_sub_n Skylake assembler to Windows 2017-02-03 16:29:45 +00:00
Brian Gladman
a0a04e75e3 add tuning for updated assembler code 2017-02-02 12:58:53 +00:00
Brian Gladman
5dd6dcb92d Merge branch 'master' of github.com:wbhart/mpir 2017-02-02 12:45:38 +00:00
Alexander Kruppa
adec7e7871 Merge ../mpir.wbhart 2017-01-31 16:33:09 +01:00
Alexander Kruppa
12ffe87475 Faster on Haswell, too 2017-01-31 15:18:18 +01:00
Brian Gladman
93e4c1807e add latest changes from Alex 2017-01-31 13:43:07 +00:00
Alexander Kruppa
625c3ecc28 Parameters for AVX Skylake 2017-01-31 12:42:37 +01:00
Alexander Kruppa
d4453c1389 New parameters for AVX Haswell 2017-01-31 12:41:50 +01:00
Alex Kruppa
bc14d36599 Parameters for Bulldozer 2017-01-31 03:38:06 -08:00
Brian Gladman
bbc0f948a4 Merge branch 'master' of https://github.com/akruppa/mpir 2017-01-31 11:27:41 +00:00
Alexander Kruppa
dd530fa21c Add mpn_nsumdiff_n 2017-01-30 18:03:19 +01:00
Alexander Kruppa
43c4fdee30 Simplified hgcd2(), faster on Skylake 2017-01-30 10:29:07 +01:00
Alexander Kruppa
652854acd3 Plain-C nsumdiff_n function 2017-01-30 09:45:33 +01:00
Alexander Kruppa
2ce86fbd7e Move mpn_hgcd_mul_matrix1_vector() to hgcd_matrix.c 2017-01-30 09:44:44 +01:00
Brian Gladman
ad24850c52 add haswell assembler code to Windows x64 build 2017-01-22 22:41:01 +00:00
Brian Gladman
76e884a6a8 Merge branch 'master' of https://github.com/akruppa/mpir 2017-01-22 15:55:15 +00:00
Alexander Kruppa
6841694fa1 Use nsumdiff_n(), if available 2017-01-22 04:20:51 +01:00
Alexander Kruppa
578bf82f68 nsumdiff_n, computes -(a+b), a-b 2017-01-21 22:59:20 +01:00
Brian Gladman
f3f27ce9a2 add piledriver assembler code to Windows 2017-01-18 23:22:43 +00:00
Brian Gladman
3ce4ca48e3 add latest skylake AVX code to Windows 2017-01-18 13:09:22 +00:00
Brian Gladman
721da455a0 move haswell AVX code into a subdirectory 2017-01-18 11:56:24 +00:00
Brian Gladman
5d9df9b153 add karaadd and karasub for haswell to Windows x64 2017-01-17 21:53:21 +00:00
Brian Gladman
4d4e6d6793 Merge branch 'master' of https://github.com/BrianGladman/mpir 2017-01-17 20:32:10 +00:00
Brian Gladman
7ce43db8e3 adjustments for Visual Studio 2017 2017-01-17 20:31:34 +00:00
Brian Gladman
9ae65471e6 keep up to date 2017-01-17 20:29:22 +00:00
Brian Gladman
0b24934325 Merge branch 'master' of https://github.com/akruppa/mpir
# Conflicts:
#	mpn/x86_64/haswell/add_n.as
#	mpn/x86_64/haswell/sub_n.as
#	mpn/x86_64/skylake/add_n.as
#	mpn/x86_64/skylake/sub_n.as
2017-01-17 09:05:55 +00:00
Alexander Kruppa
f82a093c18 mpn_sub_err1_n for AVX Skylake
2.25c/l
2017-01-17 06:47:38 +01:00
Alexander Kruppa
ff493ffc6a mpn_add_err1_n for AVX Skylake
2.25c/l
2017-01-17 06:22:24 +01:00
Alexander Kruppa
e52db5b826 mul_1 by Nurmann
1.25c/l for large enough operands within L2
2017-01-16 23:23:00 +01:00
wbhart
ded6b88c96 Merge pull request #185 from thofma/master
Adjust to new sed version
2017-01-13 08:44:50 +01:00
Tommy Hofmann
fdb590023f Adjust to new sed version
Change [:space:] to [[:space:]].
sed requires character classes to be inside square brackets.
As of sed 4.3, missing the brackets raises an error.
2017-01-13 08:30:56 +01:00
Alexander Kruppa
0c236c583a add_n and sub_n by Nurmann, now reliably at 1c/l 2017-01-12 16:36:06 +01:00
Alexander Kruppa
366a80ddbc Faster on Haswell, too 2017-01-11 20:26:09 +01:00
Alexander Kruppa
cf4c153842 Merge branch 'master' of github.com:akruppa/mpir 2017-01-11 20:19:45 +01:00
Alexander Kruppa
189047563e karaadd that avoids rcl instructions 2017-01-10 01:27:36 +01:00
Alexander Kruppa
29e130da17 karasub that avoids rcl instructions and 3-component addresses 2017-01-10 01:27:22 +01:00
Brian Gladman
8edb5826fb Merge branch 'master' of https://github.com/akruppa/mpir 2017-01-02 16:16:43 +00:00
Alex Kruppa
5fea3fd389 Use whichever existing function of MPIR or GMP is fastest for Bulldozer 2017-01-02 06:57:03 -08:00
Alex Kruppa
5c647dafc1 Add dummy IFDOS macro 2017-01-02 06:56:43 -08:00
Alex Kruppa
0595d7cc7c Copied from core2/
The files differed only in one whitespace character
2017-01-02 06:32:54 -08:00
Alex Kruppa
8247a638e0 Copy SSE2 com_n from nehalem/ 2016-12-31 12:26:35 -08:00
Brian Gladman
45322b6277 add GPL headers to assembler code 2016-12-22 22:23:25 +00:00
Brian Gladman
959308dd5f add new assembler code to Windows 2016-12-22 17:50:17 +00:00
Brian Gladman
93c1cda4d8 Merge branch 'master' of https://github.com/BrianGladman/mpir 2016-12-22 17:34:46 +00:00
Brian Gladman
ac5ed04440 add latest assembler code to Windows 2016-12-22 17:31:36 +00:00
Brian Gladman
3ad5d241b8 add latest assembler code to Windows 2016-12-22 16:53:24 +00:00
Brian Gladman
5167ce8705 Merge branch 'master' of https://github.com/akruppa/mpir 2016-12-21 13:51:52 +00:00
Alexander Kruppa
82b062537b Merge branch 'master' of github.com:akruppa/mpir 2016-12-21 13:15:51 +01:00
Alexander Kruppa
a781118371 Use local label names 2016-12-21 01:25:24 +01:00