BabyBear (31-bit field, p = 231 − 227 + 1) and Grumpkin / BN254 scalar field (254-bit prime) — Montgomery multiplication on GPU