feat: 10×26-bit field arithmetic (Fe26)

Replace stdlib 4×64-bit Montgomery field with direct 10×26-bit
representation for secp256k1. All point arithmetic, batch affine
conversion, endomorphism, and verification now operate in Fe26.
~9% faster than the v0.0.1 baseline, using a safe normalize-on-output
strategy.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>