Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
FlagEval
non-profit
https://flageval.baai.ac.cn/
Activity Feed
Follow
19
AI & ML interests
None defined yet.
Recent Activity
philokey
updated
a dataset
about 2 months ago
FlagEval/coco_val2014_sampled
philokey
authored
a paper
about 2 months ago
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
philokey
updated
a dataset
about 2 months ago
FlagEval/MeasureBench
View all activity
Team members
11
FlagEval
's datasets
13
Sort: Recently updated
FlagEval/ERQAPlus
Viewer
•
Updated
about 1 month ago
•
800
•
39
•
1
FlagEval/coco_val2014_sampled
Viewer
•
Updated
Nov 6
•
1k
•
53
FlagEval/MeasureBench
Viewer
•
Updated
Nov 3
•
2.44k
•
313
•
1
FlagEval/EmbodiedVerse-Bench
Viewer
•
Updated
Jun 25
•
2.04k
•
551
FlagEval/Where2Place
Viewer
•
Updated
May 29
•
100
•
420
FlagEval/SAT
Viewer
•
Updated
May 6
•
150
•
122
FlagEval/HMMT_2025
Viewer
•
Updated
May 6
•
30
•
674
•
1
FlagEval/ERQA
Viewer
•
Updated
Apr 22
•
400
•
1.13k
•
2
FlagEval/sub_spatial
Viewer
•
Updated
Apr 21
•
690
•
459
FlagEval/EmbSpatial-Bench
Viewer
•
Updated
Apr 21
•
3.64k
•
318
•
2
FlagEval/documentation-images
Viewer
•
Updated
Nov 13, 2024
•
3
•
177
FlagEval/CLCC_v1
Viewer
•
Updated
Jul 29, 2024
•
760
•
54
•
3
FlagEval/HalluDial
Updated
Jun 26, 2024
•
26
•
3