2. tidy up format of Windows assembler code
2. Remove the now redundant 32 to 64 register mapping for mp_size_t inputs in Windows assembler