should addsub return int or limb ?
should the new function be entered in the fat structure?
tune params on K8,K10,core2,Pentium D