Brian Gladman
|
bbc0f948a4
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2017-01-31 11:27:41 +00:00 |
|
Alexander Kruppa
|
dd530fa21c
|
Add mpn_nsumdiff_n
|
2017-01-30 18:03:19 +01:00 |
|
Alexander Kruppa
|
43c4fdee30
|
Simplified hgcd2(), faster on Skylake
|
2017-01-30 10:29:07 +01:00 |
|
Alexander Kruppa
|
652854acd3
|
Plain-C nsumdiff_n function
|
2017-01-30 09:45:33 +01:00 |
|
Alexander Kruppa
|
2ce86fbd7e
|
Move mpn_hgcd_mul_matrix1_vector() to hgcd_matrix.c
|
2017-01-30 09:44:44 +01:00 |
|
Brian Gladman
|
ad24850c52
|
add haswell assembler code to Windows x64 build
|
2017-01-22 22:41:01 +00:00 |
|
Brian Gladman
|
76e884a6a8
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2017-01-22 15:55:15 +00:00 |
|
Alexander Kruppa
|
6841694fa1
|
Use nsumdiff_n(), if available
|
2017-01-22 04:20:51 +01:00 |
|
Alexander Kruppa
|
578bf82f68
|
nsumdiff_n, computes -(a+b), a-b
|
2017-01-21 22:59:20 +01:00 |
|
Brian Gladman
|
f3f27ce9a2
|
add piledriver assembler code to Windows
|
2017-01-18 23:22:43 +00:00 |
|
Brian Gladman
|
3ce4ca48e3
|
add latest skylake AVX code to Windows
|
2017-01-18 13:09:22 +00:00 |
|
Brian Gladman
|
721da455a0
|
move haswell AVX code into a subdirectory
|
2017-01-18 11:56:24 +00:00 |
|
Brian Gladman
|
5d9df9b153
|
add karaadd and karasub for haswell to Windows x64
|
2017-01-17 21:53:21 +00:00 |
|
Brian Gladman
|
4d4e6d6793
|
Merge branch 'master' of https://github.com/BrianGladman/mpir
|
2017-01-17 20:32:10 +00:00 |
|
Brian Gladman
|
7ce43db8e3
|
adjustments for Visual Studio 2017
|
2017-01-17 20:31:34 +00:00 |
|
Brian Gladman
|
9ae65471e6
|
keep up to date
|
2017-01-17 20:29:22 +00:00 |
|
Brian Gladman
|
0b24934325
|
Merge branch 'master' of https://github.com/akruppa/mpir
# Conflicts:
# mpn/x86_64/haswell/add_n.as
# mpn/x86_64/haswell/sub_n.as
# mpn/x86_64/skylake/add_n.as
# mpn/x86_64/skylake/sub_n.as
|
2017-01-17 09:05:55 +00:00 |
|
Alexander Kruppa
|
f82a093c18
|
mpn_sub_err1_n for AVX Skylake
2.25c/l
|
2017-01-17 06:47:38 +01:00 |
|
Alexander Kruppa
|
ff493ffc6a
|
mpn_add_err1_n for AVX Skylake
2.25c/l
|
2017-01-17 06:22:24 +01:00 |
|
Alexander Kruppa
|
e52db5b826
|
mul_1 by Nurmann
1.25c/l for large enough operands within L2
|
2017-01-16 23:23:00 +01:00 |
|
Alexander Kruppa
|
0c236c583a
|
add_n and sub_n by Nurmann, now reliably at 1c/l
|
2017-01-12 16:36:06 +01:00 |
|
Alexander Kruppa
|
366a80ddbc
|
Faster on Haswell, too
|
2017-01-11 20:26:09 +01:00 |
|
Alexander Kruppa
|
cf4c153842
|
Merge branch 'master' of github.com:akruppa/mpir
|
2017-01-11 20:19:45 +01:00 |
|
Alexander Kruppa
|
189047563e
|
karaadd that avoids rcl instructions
|
2017-01-10 01:27:36 +01:00 |
|
Alexander Kruppa
|
29e130da17
|
karasub that avoids rcl instructions and 3-component addresses
|
2017-01-10 01:27:22 +01:00 |
|
Brian Gladman
|
8edb5826fb
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2017-01-02 16:16:43 +00:00 |
|
Alex Kruppa
|
5fea3fd389
|
Use whichever existing function of MPIR or GMP is fastest for Bulldozer
|
2017-01-02 06:57:03 -08:00 |
|
Alex Kruppa
|
5c647dafc1
|
Add dummy IFDOS macro
|
2017-01-02 06:56:43 -08:00 |
|
Alex Kruppa
|
0595d7cc7c
|
Copied from core2/
The files differed only in one whitespace character
|
2017-01-02 06:32:54 -08:00 |
|
Alex Kruppa
|
8247a638e0
|
Copy SSE2 com_n from nehalem/
|
2016-12-31 12:26:35 -08:00 |
|
Brian Gladman
|
45322b6277
|
add GPL headers to assembler code
|
2016-12-22 22:23:25 +00:00 |
|
Brian Gladman
|
959308dd5f
|
add new assembler code to Windows
|
2016-12-22 17:50:17 +00:00 |
|
Brian Gladman
|
93c1cda4d8
|
Merge branch 'master' of https://github.com/BrianGladman/mpir
|
2016-12-22 17:34:46 +00:00 |
|
Brian Gladman
|
ac5ed04440
|
add latest assembler code to Windows
|
2016-12-22 17:31:36 +00:00 |
|
Brian Gladman
|
3ad5d241b8
|
add latest assembler code to Windows
|
2016-12-22 16:53:24 +00:00 |
|
Brian Gladman
|
5167ce8705
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2016-12-21 13:51:52 +00:00 |
|
Alexander Kruppa
|
82b062537b
|
Merge branch 'master' of github.com:akruppa/mpir
|
2016-12-21 13:15:51 +01:00 |
|
Alexander Kruppa
|
a781118371
|
Use local label names
|
2016-12-21 01:25:24 +01:00 |
|
Alexander Kruppa
|
54816efdfd
|
Move AVX2-dependent files to avx/ subdirectories
|
2016-12-20 23:57:20 +01:00 |
|
Alexander Kruppa
|
4f46342830
|
addmul_1 for Skylake from GMP 6.1.1
|
2016-12-20 23:45:03 +01:00 |
|
Alexander Kruppa
|
c608c88dca
|
Improve dummy macros
|
2016-12-20 23:44:42 +01:00 |
|
Alexander Kruppa
|
72fe382864
|
sqr_basecase for Skylake from GMP 6.1.1
|
2016-12-20 06:32:39 +01:00 |
|
Alexander Kruppa
|
3e249beaf2
|
Add dummy defines for macros used by GMP
|
2016-12-20 06:32:14 +01:00 |
|
Alexander Kruppa
|
5ae24aef66
|
mul_basecase for Skylake from GMP 6.1.1
|
2016-12-20 05:15:54 +01:00 |
|
Alexander Kruppa
|
f28068172d
|
add_n and sub_n
1c/l most of the time, but sometimes gets into a bad "mode" where performance
degrades to as much as 1.2c/l
|
2016-12-20 05:14:15 +01:00 |
|
Brian Gladman
|
3ef16e3f7c
|
correct typos in two assembler file names
|
2016-12-18 16:56:42 +00:00 |
|
Brian Gladman
|
207ba39dc8
|
minor assembler code changes
|
2016-12-17 14:37:31 +00:00 |
|
Brian Gladman
|
55752e8061
|
add the revised add_n/sub_n assembler code to the Windows build
|
2016-12-13 14:10:48 +00:00 |
|
Brian Gladman
|
df53b304fb
|
Merge branch 'master' of https://github.com/akruppa/mpir
|
2016-12-13 13:21:32 +00:00 |
|
Alexander Kruppa
|
4ed54114e5
|
Add add_nc, sub_nc
|
2016-12-12 18:29:19 +01:00 |
|