Added Multiframe Inference for llama4+internvl #547

aditjadh · 2025-08-27T01:52:44Z

Summary
This pull request introduces multiframe inference (video summarization) capabilities to the following models:

OpenGVLab/InternVL3-8B
meta-llama/Llama-4-Scout-17B-16E

Details
Implemented support for processing multiple frames as input, enabling enhanced video understanding and summarization.
Updated model pipelines to handle sequential frame data efficiently.
Ensured compatibility with existing inference workflows and maintained performance benchmarks.

Motivation
Multiframe inference allows these models to better capture temporal context and generate more coherent and informative summaries from video inputs. This enhancement is particularly valuable for applications in video analysis, surveillance, and multimedia content summarization.

Added Multiframe Inference for llama4+internvl

263655f

aditjadh requested review from quic-rishinr, ochougul, quic-hemagnih and quic-amitraj as code owners August 27, 2025 01:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added Multiframe Inference for llama4+internvl #547

Added Multiframe Inference for llama4+internvl #547

Uh oh!

aditjadh commented Aug 27, 2025

Uh oh!

Uh oh!

Added Multiframe Inference for llama4+internvl #547

Are you sure you want to change the base?

Added Multiframe Inference for llama4+internvl #547

Uh oh!

Conversation

aditjadh commented Aug 27, 2025

Uh oh!

Uh oh!