Gemini 2.0 Flash Thinking 0121 successfully created the Double Snake Fight game. People were hyped that o3 mini could do it, but to be fair I gave it proper, detailed instructions of about 50-60 words. o3 mini high is less than 1% above Gemini Thinking in LiveBench math, so o3 mini medium is probably worse.

Is o3 mini medium (the free version in ChatGPT) better than Gemini 2.0 Flash Thinking? I think it's slightly worse rather than better, though o3 mini high, which is only for paid users, might be better.

LiveBench: www.livebench.ai
Snake fight: https://drive.google.com/file/d/1jqGMA0ZkXCTzeEpXD7QWWU0sfLwF9paJ/view?usp=drivesdk

Sorry, auto save got turned off automatically 😢, so I couldn't save it in AI Studio.