Cap typeset_val at 250; document training mix and splits in README
Val: typeset_val now sampled to 250 (was all ~1000), matching mathwriting_val.
Total val: ~1000 (250 mathwriting + 250 typeset + 500 mixed).
README: add effective training mix table with caps applied, validation
table with exact sample counts, and test split listing.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>