Blind Tests: AI Music Edition

From The Goldborg Variations

Your results are anonymously logged for fun.

Which Model?
Opus 4.5 vs 4.6

Listen to a piece of AI-generated music and guess which model composed it. All models start from the same seed and iterate independently.

The models: Claude Opus 4.5, Gemini 3.1 Pro, ChatGPT 5.2, and Grok 4.1.

Have you taken this quiz before?
Prior exposure to the samples?

Which model made this?

Results

Results are anonymously logged for research.

First, listen to labeled examples from each model to calibrate your ear. Both examples come from the Goldberg Variations seed: each model evolved the same seed independently.

Opus 4.5 (Goldberg seed):
Opus 4.6 (Goldberg seed):

Have you taken this test before?
Prior exposure to the samples?

A

B

Which one is Opus 4.5?

Results
