Cristina Criddle / Financial Times:
Meta, OpenAI, Microsoft, and other AI companies create their own internal benchmarks as new models approach or exceed 90% accuracy on existing public tests — Rapidly advancing technology is surpassing current methods of evaluating and comparing large language models
No comment yet, add your voice below!