Conversation

@danilojsl (Contributor) commented Dec 8, 2025

Description

This PR extends `Reader2Image` to interoperate with `AutoGGUFVisionModel` by adding flexible handling of encoded vs. decoded image bytes and an optional prompt output column.

Key changes

  • Added a new parameter `useEncodedImageBytes` to control what the image result stores:

    • Encoded (compressed) file bytes, for models such as `AutoGGUFVisionModel` that run their own image decoder
    • A decoded pixel matrix, for models such as `Qwen2VLTransformer`
  • Added an `outputPromptColumn` parameter to optionally emit a separate prompt column containing text prompts as Spark NLP Annotations.
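To illustrate the distinction behind `useEncodedImageBytes`, here is a minimal, self-contained sketch (not the actual `Reader2Image` implementation; the function name and the plain-text PPM format are stand-ins so no external image decoder is needed). Encoded mode passes the original file bytes through untouched, while decoded mode parses them into a height × width × 3 pixel matrix:

```python
def load_image(raw: bytes, use_encoded_image_bytes: bool):
    """Hypothetical sketch of the encoded-vs-decoded switch.

    Returns the encoded file bytes unchanged, or a decoded pixel matrix,
    depending on use_encoded_image_bytes.
    """
    if use_encoded_image_bytes:
        # Encoded path: keep the compressed/encoded file bytes as-is,
        # suitable for models that decode images themselves.
        return raw

    # Decoded path: parse the bytes into a height x width x 3 matrix.
    # Plain-text PPM (P3) is used here only because it is trivially
    # parseable with the standard library.
    tokens = raw.decode("ascii").split()
    assert tokens[0] == "P3", "only plain-text PPM is handled in this sketch"
    width, height = int(tokens[1]), int(tokens[2])
    values = [int(t) for t in tokens[4:]]  # tokens[3] is the max color value
    return [
        [values[(y * width + x) * 3:(y * width + x) * 3 + 3]
         for x in range(width)]
        for y in range(height)
    ]

# A 2x1 image: one red pixel, one green pixel.
ppm = b"P3 2 1 255 255 0 0 0 255 0"
print(load_image(ppm, use_encoded_image_bytes=True) == ppm)  # True
print(load_image(ppm, use_encoded_image_bytes=False))  # [[[255, 0, 0], [0, 255, 0]]]
```

In the PR itself the same choice determines what the image result column holds; downstream vision models then either consume the raw file bytes directly or expect the already-decoded pixels.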

Motivation and Context

Completes the integration of `Reader2Image` with vision-language models (VLMs).

How Has This Been Tested?

Screenshots (if appropriate):

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • Code improvements with no or little impact
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.
  • I have read the CONTRIBUTING page.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

@danilojsl danilojsl marked this pull request as ready for review December 8, 2025 17:18
@danilojsl danilojsl requested a review from DevinTDHa December 8, 2025 17:18
