Bilim ve AraştırmaAI Benchmark Rankings Are Fragile: Small Data Manipulation Can Rewire Leaderboards
New research reveals that popular AI model ranking platforms are dangerously susceptible to minor data tampering, with just 0.003% of removed user ratings capable of flipping top rankings. Experts warn that reliance on these leaderboards for model selection may be misleading and potentially harmful.




















