-
Notifications
You must be signed in to change notification settings - Fork 729
Pull requests: open-compass/VLMEvalKit
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Benchmark] Add support for LongDocURL
#1595
opened Jul 1, 2026 by
zhaowei-wang-nlp
Contributor
Loading…
Add defaultdict import to omnidocbench.py
#1585
opened Jun 27, 2026 by
shenyunhang
Contributor
Loading…
[Dataset] Add BenchCAD (CAD code-understanding VQA)
#1583
opened Jun 25, 2026 by
HaozheZhang6
Loading…
[Benchmark] Add support for OmniDocBench v1.6 benchmark
#1572
opened Jun 10, 2026 by
Helen1p
Loading…
[Feature] Support DiffSpot (fine-grained visual change detection on web UIs)
#1568
opened Jun 4, 2026 by
banyinjushi
Loading…
3 tasks done
[Feature] Add Rebellions (RBLN) NPU evaluation backend
#1566
opened Jun 2, 2026 by
rebel-hwkim
Loading…
[Benchmark] Add support for MaRVL, xGQA and ALM-Bench
#1564
opened Jun 1, 2026 by
inakiLakunza
Contributor
Loading…
[Fix] Video-MME-v2: improve acc calculation and data preparation
#1551
opened May 20, 2026 by
EliYuan30
Loading…
fix: guard choices[0] and message=None before content access
#1550
opened May 17, 2026 by
qizwiz
Loading…
[Cleanup] Remove unused Polygon3 dependency (#1528)
#1548
opened May 16, 2026 by
SHAI-Akshay-Tripathi
Contributor
Loading…
[Fix] Fix default judge model selection conflict in run.py and tools.py
#1532
opened May 6, 2026 by
TianhaoLiang2000
Collaborator
Loading…
[Fix] Fix judge intermediate result caching and resume support
#1531
opened May 6, 2026 by
TianhaoLiang2000
Collaborator
Loading…
[Benchmark] Add support for MMOral-Uni benchmark
#1527
opened Apr 26, 2026 by
isjinghao
Contributor
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-07-01.