Refactor pdf_to_md.py: extract CacheManager and ImageManager classes
Major refactoring to reduce complexity of 414-line main() function:
- Add ExtractionConfig and ExtractionResult dataclasses
- Extract CacheManager class (consolidates 10 cache functions)
- Extract ImageManager class with context manager for cleanup
Bug fixes:
- Fix --no-images emitting broken image links from cache fallback
- Fix fast_noimages cache entries never being reused (filter bug)
- Fix temp directory leak with --no-cache --stdout
Also adds missing --images-scale and --no-progress options to README.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>