Add CANONICALIZATION.md — deep technical doc for research team review
Comprehensive analysis of the canonicalization pipeline:
- Exact algorithm walkthrough (rule-based + LLM-enhanced)
- Concrete output examples with real numbers from TaskFlow
- 10 specific shortcomings with root causes and impact analysis
- 6 deeper structural problems (coverage, confidence, hierarchy)
- 8 potential research directions for alternatives
- Evaluation criteria table with current baselines
- Ranked list of what we need help with
Written to give researchers enough context to propose
alternative approaches without reading the codebase.