-
Notifications
You must be signed in to change notification settings - Fork 298
Add vision genai inference path for multi-file VLM evaluation #2488
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+359
−19
Merged
Changes from all commits
Commits
Show all changes
18 commits
Select commit
Hold shift + click to select a range
fffa1b8
Add vision genai inference path for multi-file VLM evaluation
jiafatom 120d0e7
Fix task_type_components_map to apply all component overrides
jiafatom 5bf9c89
Address review comments on vision genai inference
jiafatom bf8ca82
Fix EP mapping: skip CPUExecutionProvider for genai
jiafatom 398d655
Fix lint: remove unused import, unused loop var, use .values()
jiafatom ab380c3
Fix genai provider: use device field instead of ORT EP names
jiafatom e25cd2e
Cap max_length to 128 for vision VQA generation
jiafatom b7f46c9
Increase max_length cap to 4096 for vision genai inference
jiafatom df8756b
Address all review comments and fix lint errors
jiafatom 23fa91b
Add system_prompt support for vision VQA evaluation
jiafatom a144c5e
Add options_col support and extract leading number from predictions
jiafatom d51af37
Address review comments: extract helper, remove debug code, fix lint
jiafatom 7d0738e
Add opt-in number extraction for multiple-choice VQA tasks
jiafatom f3524f6
Re-trigger CI: flaky test_mnb_to_qdq failure unrelated to PR changes
jiafatom fa7a239
Address Copilot review: fix vision detection for empty dict, add unit…
jiafatom 57fb8b3
Address review: add JSON error handling, guard PIL import, fix file h…
jiafatom 9e6156e
Fix formatting: use parenthesized context managers
jiafatom d1b55a7
Fix formatting: collapse ImportError raise to single line
jiafatom File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.