Fix code review issues, refactor, and add README

Code review fixes:
- Fix --no-images flag to properly suppress cached images
- Fix image path joining bug (handle 'images/' prefix from Docling)
- Extend dependency check to cover all required packages
- Deduplicate images by xref in extract_images()
- Add fallback metadata lookup so the cache can be cleared for deleted PDFs
- Fix tempfile handle issue for Windows compatibility
- Fix mkdir to use parents=True
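
For reviewers, the path and tempfile fixes above can be sketched roughly as
follows. This is an illustration only, not the code in this commit: the helper
names (resolve_image_path, ensure_dir, write_temp_bytes) are hypothetical, and
the tempfile part assumes the Windows problem was a file that could not be
reopened by name while its handle was still open.

```python
import os
import tempfile
from pathlib import Path


def resolve_image_path(images_dir: Path, ref: str) -> Path:
    """Join an image reference onto images_dir (hypothetical helper).

    Assumes the reference may already start with 'images/', in which case
    the prefix is dropped so the result is not images/images/<name>.
    """
    ref = ref.replace("\\", "/")
    prefix = "images/"
    if ref.startswith(prefix):
        ref = ref[len(prefix):]
    return images_dir / ref


def ensure_dir(path: Path) -> Path:
    """Create path and any missing parents; succeed if it already exists."""
    path.mkdir(parents=True, exist_ok=True)
    return path


def write_temp_bytes(data: bytes, suffix: str = ".pdf") -> Path:
    """Write data to a temp file and return its path with the handle closed.

    Closing the handle before returning lets other code open the file by
    name, which Windows does not allow while the original handle is open.
    The caller is responsible for deleting the file.
    """
    fd, name = tempfile.mkstemp(suffix=suffix)
    with os.fdopen(fd, "wb") as fh:
        fh.write(data)
    return Path(name)
```

The same pattern applies wherever the extractor builds output paths: normalize
the reference first, create the directory tree, then write.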

Refactoring:
- Delete extract_pdf_accurate() (dead code that duplicated extract_pdf_docling())
- Delete test_docling.py (functionality covered by the main CLI)
- Rename docling_extractor.py to extractor.py (more generic name)
- Extract _save_docling_images() helper function

Documentation:
- Add README.md
- Update SKILL.md with correct dependencies
- Remove references to non-existent --chunked and --workers options
- Remove unverified accuracy percentage claims