Add DeepSeek-OCR-2 model code with eval_mode inference fix
modeling_deepseekocr2.py is the upstream model package with one local patch:
in the eval_mode branch of infer(), pass attention_mask=torch.ones_like(input_ids)
to generate() to suppress the spurious warning caused by pad_token_id == eos_token_id,
and reuse the already-computed _input_ids_cuda tensor in the decode step.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>