It surpasses OpenAI o1-pro and claude 3.7 sonnet in the same test:
> o1-pro score 110
> o3-mini score 104
> 3.7 sonnet score 103
For reference, the average human IQ is defined as 100