TR

Yapay Zeka Modelleri

LLM'ler, GPT, Claude, Gemini, model eğitimi, benchmark sonuçları ve yeni model duyuruları

1560 articles found · Page 45 / 65

GLM-5-Q2 Outperforms GLM-4.7-Q4 in Accuracy Despite Longer Latency, New Tests Reveal
Yapay Zeka Modelleri
schedule3 min
schedule3 ay önce
visibility11 views

GLM-5-Q2 Outperforms GLM-4.7-Q4 in Accuracy Despite Longer Latency, New Tests Reveal

New benchmark tests show GLM-5-Q2 achieves perfect accuracy in both English and Chinese reasoning tasks, outpacing its predecessor GLM-4.7-Q4, despite requiring more processing time. With a larger memory footprint but comparable speed, the model signals a shift toward precision over efficiency in local AI deployment.

A
AI Haberleri
GPT-5.1 High Emerges as Leading Model for Technical Learning, Users Report Unprecedented Depth
Yapay Zeka Modelleri
schedule3 min
schedule3 ay önce
visibility10 views

GPT-5.1 High Emerges as Leading Model for Technical Learning, Users Report Unprecedented Depth

Users on Reddit report that GPT-5.1 High delivers significantly deeper, slower, and more precise responses than other state-of-the-art models, particularly in computer science domains. While OpenAI has not officially confirmed the model’s existence, anecdotal evidence suggests a new benchmark in reasoning and detail-oriented AI performance.

A
AI Haberleri
Claude Sonnet 4.6 Launches with Opus-Level Performance at Mid-Tier Pricing
Yapay Zeka Modelleri
schedule3 min
schedule3 ay önce
visibility12 views

Claude Sonnet 4.6 Launches with Opus-Level Performance at Mid-Tier Pricing

Anthropic has unveiled Claude Sonnet 4.6, a major upgrade that delivers near-Opus-level reasoning and coding capabilities while maintaining the cost-effective pricing of its predecessor. The model introduces a 1M token context window and enhanced agent planning, making it a compelling choice for enterprises seeking high performance without premium costs.

A
AI Haberleri
Claude Sonnet 4.6 Shows Subtle but Significant Leap Over 4.5 in Spatial Reasoning
Yapay Zeka Modelleri
schedule3 min
schedule3 ay önce
visibility6 views

Claude Sonnet 4.6 Shows Subtle but Significant Leap Over 4.5 in Spatial Reasoning

New benchmark data reveals Claude Sonnet 4.6 outperforms its predecessor by 12.7% on MineBench’s 3D spatial reasoning tasks, signaling a quiet but meaningful advancement in Anthropic’s mid-tier model lineup. The improvement comes despite identical system prompts and context windows, suggesting architectural refinements rather than just prompt engineering.

A
AI Haberleri
Ferret-UI Lite: Apple’s Breakthrough in On-Device GUI Agents
Yapay Zeka Modelleri
schedule3 min
schedule3 ay önce
visibility9 views

Ferret-UI Lite: Apple’s Breakthrough in On-Device GUI Agents

Apple researchers have unveiled Ferret-UI Lite, a compact 3B-parameter AI agent capable of interacting with graphical interfaces across mobile, web, and desktop platforms. The model leverages synthetic and real-world GUI data alongside chain-of-thought reasoning to achieve high performance on resource-constrained devices.

A
AI Haberleri
Claude Sonnet 4.6 with Extended Thinking: AI Breakthrough or Hype? Journalist Tests Limits
Yapay Zeka Modelleri
schedule3 min
schedule3 ay önce
visibility15 views

Claude Sonnet 4.6 with Extended Thinking: AI Breakthrough or Hype? Journalist Tests Limits

Anthropic's new Claude Sonnet 4.6 model, now the default for free users, is being stress-tested by developers and researchers using complex reasoning tasks, coding challenges, and safety probes. Early results suggest significant improvements in extended reasoning—but questions remain about scalability and safety.

A
AI Haberleri
Anthropic Unveils Claude 4.6 Sonnet with 1M Token Context and Enterprise-Grade Coding Capabilities
Yapay Zeka Modelleri
schedule3 min
schedule3 ay önce
visibility17 views

Anthropic Unveils Claude 4.6 Sonnet with 1M Token Context and Enterprise-Grade Coding Capabilities

Anthropic has launched Claude 4.6 Sonnet, a groundbreaking AI model offering a 1 million token context window, real-time code execution for fact verification, and flagship performance at one-fifth the cost. Designed for developers and enterprises, the model redefines large-scale reasoning and AI-driven coding assistance.

A
AI Haberleri