Scale AI logo
SEAL Logo

VISTA

Visual Language Understanding

Last updated: April 4, 2025

Performance Comparison

1

54.65±1.46

1

54.63±0.55

3

51.79±0.63

3

51.66±1.08

3

51.63±0.25

3

50.78±0.57

4

50.07±1.14

6

49.59±0.66

7

49.15±0.36

7

47.32±1.78

8

48.23±0.70

10

46.97±1.29

10

46.96±0.95

11

45.50±1.20

11

45.34±0.91

12

45.49±0.21

13

45.25±0.40

16

43.53±1.24

16

43.25±1.26

18

43.21±0.52

18

43.02±1.14

18

42.11±1.39

22

41.14±0.58

22

39.95±0.80

23

39.85±0.71

24

Claude 3.5 Sonnet (October 2024)

38.72±0.51

26

Claude 3.5 Sonnet (June 2024)

38.37±0.70

26

38.33±0.55

26

ChatGPT-4o-latest (November 2024)

37.99±0.48

26

Gemini 1.5 Pro

37.07±1.34

31

GPT-4o (August 2024)

34.94±0.23

31

34.59±1.12

31

Gemini 1.5 Flash 002

34.03±1.41

32

Pixtral Large (November 2024)

33.89±0.69

32

32.69±1.40

36

Qwen2-VL-72B-Instruct

28.56±1.37

36

Claude 3 Opus

27.82±0.55

38

26.55±0.35

38

Nova Pro

26.27±0.61

38

Pixtral 12B (September 2024)

25.97±0.74

38

Nova Lite

25.50±0.77

40

Llama 3.2 90B Vision Instruct

24.61±0.80

43

Llama 3.2 11B Vision-Instruct

20.47±0.15

44

Phi 3.5 Vision-Instruct

15.18±0.81