Scale Logo
SEAL Logo

Spanish

Deprecated (as of March 2025)

Last updated: March 20, 2025

Performance Comparison

1

Gemini 2.0 Pro (December 2024)

1176.00±38.00

2

o1 (December 2024)

1134.00±36.00

3

Gemini Pro Flash 2

1119.00±32.00

4

o1-preview

1111.00±27.00

5

Gemini 2.0 Flash Thinking (January 2025)

1108.00±34.00

6

Gemini 1.5 Pro (November 2024)

1105.00±30.00

7

GPT-4o (May 2024)

1084.00±24.00

8

o3-mini

1079.00±33.00

9

Gemini 1.5 Pro (May 2024)

1069.00±26.00

10

Gemini 1.5 Pro (August 27, 2024)

1067.00±23.00

11

GPT-4o (August 2024)

1067.00±26.00

12

GPT-4 (November 2024)

1034.00±31.00

13

Mistral Large 2

1032.00±24.00

14

GPT-4 Turbo Preview

1020.00±22.00

15

Gemini 1.5 Pro (April 2024)

1005.00±33.00

16

Claude 3.5 Sonnet (June 2024)

992.00±25.00

17

Aya Expanse 32B

983.00±30.00

18

Gemini 1.5 Flash

980.00±28.00

19

Gemma 2 27B

951.00±26.00

20

Llama 3.2 90B Vision Instruct

944.00±24.00

21

Claude 3 Opus

919.00±22.00

22

Llama 3.1 405B Instruct

915.00±23.00