Scale Logo
SEAL Logo

Arabic

Deprecated (as of March 2025)

Last updated: March 20, 2025

Performance Comparison

1

Gemini 1.5 Pro (August 27, 2024)

1147.00 ±33.00

2

Gemini 2.0 Pro (December 2024)

1138.00 ±29.00

3

Gemini 2.0 Flash Thinking (January 2025)

1120.00 ±29.00

4

o1 (December 2024)

1120.00 ±27.00

5

Gemini 1.5 Pro (November 2024)

1116.00 ±28.00

6

o3-mini

1093.00 ±30.00

7

Gemini Pro Flash 2

1090.00 ±28.00

8

o1-preview

1087.00 ±36.00

9

GPT-4o (August 2024)

1066.00 ±47.00

11

GPT-4 (November 2024)

1011.00 ±26.00

12

Claude 3.5 Sonnet (June 2024)

995.00 ±44.00

13

Mistral Large 2

970.00 ±54.00

14

Gemini 1.5 Flash

967.00 ±39.00

15

Aya 23 35B*

932.00 ±25.00

16

Aya Expanse 32B

1025.00 ±24.00

16

Llama 3.1 405B Instruct

875.00 ±55.00

17

Llama 3.3 70B Instruct

808.00 ±36.00

18

Jais Adapted 70B

787.00 ±25.00

19

Gemma 2 27B

661.00 ±29.00