AI Models

LLMs, GPT, Claude, Gemini, model training, benchmark results, and new model announcements

Hugging Face Unveils GLM-5: A Paradigm Shift from Vibe Coding to Agentic Engineering

The Hugging Face H4 team has revealed GLM-5, a groundbreaking AI model that moves beyond informal 'vibe coding' toward structured agentic engineering. Drawing from a technical report published on February 15, 2026, the model integrates autonomous reasoning, tool use, and dynamic task decomposition — signaling a new era in open-source LLM development.

AI Haberleri
ChatGPT’s 256K Context Window Now Standard Across Plans, Hidden Behind 'Thinking' Feature

OpenAI has quietly expanded the context window to 256K tokens for all ChatGPT users when the 'Thinking' mode is enabled, a change not reflected on its public pricing page. The change, first noted by Reddit users, lets ChatGPT process longer documents and more complex queries, although users are not explicitly told about it.

GPT-5.3 Codex (High) Underperforms on METR Benchmark, Raising Questions About AI Code Generation Capabilities

New benchmark results reveal that GPT-5.3 Codex (High) scored significantly below expectations on the METR code evaluation metric, challenging assumptions about the latest AI models' coding proficiency. Experts are now scrutinizing whether scaling alone can drive meaningful gains in real-world software development tasks.

Gemini 3.1 Pro Generates Complex Isometric SVG Scene, Showcasing AI’s Creative Leap

A Reddit user demonstrated that Google’s Gemini 3.1 Pro can generate a fully functional isometric 3D scene using only SVG code — no external tools or manual rendering. The feat, validated by benchmark data showing the model’s superior abstract reasoning, underscores AI’s growing capacity for creative technical execution.

Gemini 3.1 Pro Preview Breaks Benchmark Record with 98.4 Score on NYT Connections

Google's Gemini 3.1 Pro Preview has achieved a record-breaking 98.4% accuracy on the Extended NYT Connections benchmark, surpassing its predecessor and setting a new standard for AI reasoning. The milestone underscores rapid advancements in large language models' ability to handle complex semantic and contextual puzzles.

GLM-5 Exhibits Emergent 'Claude' Personality, Raising Questions About AI Training and Ethical Boundaries

New observations reveal that Zhipu's GLM-5 large language model adopts the behavioral traits of Anthropic's Claude when prompted to identify as it—bypassing its own censorship protocols. Experts are divided on whether this is an intentional design feature or an emergent artifact of training data contamination.

Qwen3.5 Coder Emerges as Surprising Powerhouse at Aggressive Quantization Levels

Even when quantized to Q2, far below the precision typically used for 30B-class models, Qwen3.5 Coder demonstrates unexpectedly robust coding performance, outperforming larger models in self-correction and one-shot task execution. Experts suggest its architecture prioritizes efficiency over scale, challenging conventional AI wisdom.

New AI Benchmarks Reveal Qwen3 Coder Next and Step 3.5 Flash Lead in Memory-Efficient Performance

Recent benchmarks on ROCm-powered hardware show Qwen3 Coder Next and Step 3.5 Flash outperforming rival models in memory-constrained environments, signaling a shift toward efficient, high-capability AI deployment. The results, published by a community researcher, highlight emerging trends in on-device AI inference.

Taalas Unveils Record-Breaking Llama 3.1 8B Chip, Raises $169M to Challenge Nvidia

Canadian AI hardware startup Taalas has launched a custom silicon chip that runs Llama 3.1 8B at an unprecedented 17,000 tokens per second, redefining inference speed. The company, backed by $169 million in funding, aims to disrupt Nvidia’s dominance in the AI chip market with its aggressively quantized architecture.

iPhone 14 Pro Max Achieves 46 Tokens/Second with BitNet AI Model, Redefining On-Device LLM Performance

A developer has successfully ported Microsoft’s BitNet, a 1-bit quantized AI model, to the iPhone 14 Pro Max, achieving 45–46 tokens per second—surpassing previous on-device benchmarks. The breakthrough demonstrates the potential for powerful, privacy-preserving AI to run directly on consumer smartphones without cloud dependency.

Claude Opus 4.6 Surpasses Predictions on METR Benchmark, Signals Exponential AI Progress

Claude Opus 4.6 has achieved unprecedented performance on METR’s 50%-time-horizon benchmark, outpacing all prior models and challenging established AI timelines. According to analysis from LessWrong, its progress suggests an accelerating trajectory in AI capability, with implications for safety, policy, and development.
