InternVL benchmark result

First, I thank you very much for your contribution. 💯 💯 💯 

In MathVerse, You have proven that most MLLMs solve problems based on "Text Redundancy".

I saw that, in `InternVL` they scale up the vision encoder to reduce the gap between Visual and Textual information. And it's also achieved Top 1 in `MathVista`.

Can you provide the benchmark results of `InternVL` on the `MathVerse` dataset? I think it will add useful information to your hypothesis.

Reference papers:
> https://arxiv.org/pdf/2312.14238.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

InternVL benchmark result #2

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

InternVL benchmark result #2

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions