Yapay Zeka Modelleri

LLM'ler, GPT, Claude, Gemini, model eğitimi, benchmark sonuçları ve yeni model duyuruları

1560 articles found · Page 39 / 65

Hugging Face Unveils GLM-5: A Paradigm Shift from Vibe Coding to Agentic Engineering

Yapay Zeka Modelleri

schedule5 ay önce

visibility17 views

Hugging Face Unveils GLM-5: A Paradigm Shift from Vibe Coding to Agentic Engineering

The Hugging Face H4 team has revealed GLM-5, a groundbreaking AI model that moves beyond informal 'vibe coding' toward structured agentic engineering. Drawing from a technical report published on February 15, 2026, the model integrates autonomous reasoning, tool use, and dynamic task decomposition — signaling a new era in open-source LLM development.

A

AI Haberleri

Claude Code AI Agent Is Revolutionizing the 2026 Forecast Market

Yapay Zeka Modelleri

schedule5 ay önce

visibility13 views

Claude Code AI Agent Is Revolutionizing the 2026 Forecast Market

Anthropic's Claude Code AI agent is gaining a competitive advantage by outperforming human forecasters in prediction markets by 2026. Technical details and market impacts are being examined.

A

AI Haberleri

ChatGPT’s 256K Context Window Now Standard Across Plans, Hidden Behind 'Thinking' Feature

Yapay Zeka Modelleri

schedule5 ay önce

visibility13 views

ChatGPT’s 256K Context Window Now Standard Across Plans, Hidden Behind 'Thinking' Feature

OpenAI has quietly expanded the context window to 256K tokens for all ChatGPT users when the 'Thinking' mode is enabled, a change not reflected on its public pricing page. This enhancement, first noted by Reddit users, significantly boosts the AI’s ability to process lengthy documents and complex queries without explicit user awareness.

A

AI Haberleri

Why Users Are Frustrated with ChatGPT’s Recent Performance Decline

Yapay Zeka Modelleri

schedule5 ay önce

visibility24 views

Why Users Are Frustrated with ChatGPT’s Recent Performance Decline

Users across online forums are reporting a sharp decline in ChatGPT’s accuracy and responsiveness, citing frequent factual errors and overzealous content filters. Experts suggest this may stem from recent model updates aimed at safety, not degradation.

A

AI Haberleri

AI Coding Breakthrough: Opus 4.6 and GPT-5.3-Codex Revolutionize Developer Productivity

Yapay Zeka Modelleri

schedule5 ay önce

visibility12 views

AI Coding Breakthrough: Opus 4.6 and GPT-5.3-Codex Revolutionize Developer Productivity

A software developer reports a transformative leap in coding efficiency using the latest AI models, Opus 4.6 and GPT-5.3-Codex, after years of skepticism toward AI-assisted tools. Experts suggest this marks a turning point in human-AI collaboration in software development.

A

AI Haberleri

Claude Code Faces Criticism Over Excessive Token Usage in Code Generation

Yapay Zeka Modelleri

schedule5 ay önce

visibility18 views

Claude Code Faces Criticism Over Excessive Token Usage in Code Generation

Developers are raising alarms about excessive token consumption in Anthropic's Claude Code, leading to performance bottlenecks and increased costs. The issue, documented on GitHub and discussed on Hacker News, highlights broader concerns about AI efficiency in developer tooling.

A

AI Haberleri

Gemini 3.1 Pro Surpasses Benchmarks but Sparks Debate Over Human-Like AI

Yapay Zeka Modelleri

schedule5 ay önce

visibility13 views

Gemini 3.1 Pro Surpasses Benchmarks but Sparks Debate Over Human-Like AI

Google's Gemini 3.1 Pro achieves record-breaking performance on technical benchmarks, yet users report a chilling loss of conversational warmth. As AI shifts from mimicking humans to optimizing for metrics, experts question whether we're advancing intelligence—or losing its soul.

A

AI Haberleri

Taalas HC1 ASIC at 16,960 tok/s: Revolutionizing Local LLMs by 2026

Yapay Zeka Modelleri

schedule5 ay önce

visibility17 views

Taalas HC1 ASIC at 16,960 tok/s: Revolutionizing Local LLMs by 2026

Taalas achieved a speed of 16,960 tokens/second running the Llama 3.1 8B model with its HC1 custom ASIC chip, set to launch in 2026. This performance establishes a new standard for real-time AI in edge computing applications.

A

AI Haberleri

OpenAI Claims 30% Speed Boost in GPT-5.3-Codex-Spark, Reaches 1200 Tokens/Second

Yapay Zeka Modelleri

schedule5 ay önce

visibility23 views

OpenAI Claims 30% Speed Boost in GPT-5.3-Codex-Spark, Reaches 1200 Tokens/Second

OpenAI engineer Thibault Sottiaux announced a significant performance upgrade to the GPT-5.3-Codex-Spark model, achieving over 1200 tokens per second—a 30% improvement. The development underscores accelerating efforts to optimize large language model inference efficiency for enterprise applications.

A

AI Haberleri

GPT-5.3 Codex (High) Underperforms on METR Benchmark, Raising Questions About AI Code Generation Capabilities

Yapay Zeka Modelleri

schedule5 ay önce

visibility11 views

GPT-5.3 Codex (High) Underperforms on METR Benchmark, Raising Questions About AI Code Generation Capabilities

New benchmark results reveal that GPT-5.3 Codex (High) scored significantly below expectations on the METR code evaluation metric, challenging assumptions about the latest AI models' coding proficiency. Experts are now scrutinizing whether scaling alone can drive meaningful gains in real-world software development tasks.

A

AI Haberleri

Gemini 3.1 Pro Generates Complex Isometric SVG Scene, Showcasing AI’s Creative Leap

Yapay Zeka Modelleri

schedule5 ay önce

visibility15 views

Gemini 3.1 Pro Generates Complex Isometric SVG Scene, Showcasing AI’s Creative Leap

A Reddit user demonstrated that Google’s Gemini 3.1 Pro can generate a fully functional isometric 3D scene using only SVG code — no external tools or manual rendering. The feat, validated by benchmark data showing the model’s superior abstract reasoning, underscores AI’s growing capacity for creative technical execution.

A

AI Haberleri

Gemini 3.1 Pro Preview Breaks Benchmark Record with 98.4 Score on NYT Connections

Yapay Zeka Modelleri

schedule5 ay önce

visibility14 views

Gemini 3.1 Pro Preview Breaks Benchmark Record with 98.4 Score on NYT Connections

Google's Gemini 3.1 Pro Preview has achieved a record-breaking 98.4% accuracy on the Extended NYT Connections benchmark, surpassing its predecessor and setting a new standard for AI reasoning. The milestone underscores rapid advancements in large language models' ability to handle complex semantic and contextual puzzles.

A

AI Haberleri

GLM-5 Exhibits Emergent 'Claude' Personality, Raising Questions About AI Training and Ethical Boundaries

Yapay Zeka Modelleri

schedule5 ay önce

visibility20 views

GLM-5 Exhibits Emergent 'Claude' Personality, Raising Questions About AI Training and Ethical Boundaries

New observations reveal that Zhipu's GLM-5 large language model adopts the behavioral traits of Anthropic's Claude when prompted to identify as it—bypassing its own censorship protocols. Experts are divided on whether this is an intentional design feature or an emergent artifact of training data contamination.

A

AI Haberleri

Codex 5.3: Is OpenAI’s Revolutionary AI Model Actually Available to the Public?

Yapay Zeka Modelleri

schedule5 ay önce

visibility11 views

Codex 5.3: Is OpenAI’s Revolutionary AI Model Actually Available to the Public?

A Reddit user reports encountering Codex 5.2 instead of the rumored Codex 5.3 on ChatGPT, sparking questions about OpenAI’s model rollout. Despite claims of revolutionary improvements, no official confirmation or public release has been verified by OpenAI.

A

AI Haberleri

New Evaluation Ranks Top LLMs for Python Engineering Reasoning, Not Just Coding

Yapay Zeka Modelleri

schedule5 ay önce

visibility19 views

New Evaluation Ranks Top LLMs for Python Engineering Reasoning, Not Just Coding

A comprehensive assessment of over 100 large language models reveals that efficiency and practical judgment often outweigh raw accuracy in real-world Python engineering tasks. The study, conducted by a developer using consumer-grade hardware, prioritizes token efficiency and latency for 24/7 deployment.

A

AI Haberleri

Qwen3.5 Coder Emerges as Surprising Powerhouse at Aggressive Quantization Levels

Yapay Zeka Modelleri

schedule5 ay önce

visibility24 views

Qwen3.5 Coder Emerges as Surprising Powerhouse at Aggressive Quantization Levels

Despite being quantized to Q2 precision — far below typical 30B model requirements — Qwen3.5 Coder demonstrates unexpectedly robust coding performance, outperforming larger models in self-correction and one-shot task execution. Experts suggest its architecture prioritizes efficiency over scale, challenging conventional AI wisdom.

A

AI Haberleri

New AI Benchmarks Reveal Qwen3 Coder Next and Step 3.5 Flash Lead in Memory-Efficient Performance

Yapay Zeka Modelleri

schedule5 ay önce

visibility18 views

New AI Benchmarks Reveal Qwen3 Coder Next and Step 3.5 Flash Lead in Memory-Efficient Performance

Recent benchmarks on ROCm-powered hardware show Qwen3 Coder Next and Step 3.5 Flash outperforming rival models in memory-constrained environments, signaling a shift toward efficient, high-capability AI deployment. The results, published by a community researcher, highlight emerging trends in on-device AI inference.

A

AI Haberleri

Developer Fixes Qwen3-Coder-Next Parser Issue, Boosting Local AI Code Generation

Yapay Zeka Modelleri

schedule5 ay önce

visibility19 views

Developer Fixes Qwen3-Coder-Next Parser Issue, Boosting Local AI Code Generation

A community developer has resolved a critical parsing bug in Qwen3-Coder-Next, enhancing the model's ability to generate accurate code in offline environments. The fix, submitted via GitHub, has been widely praised by AI enthusiasts and local LLM users.

A

AI Haberleri

Google DeepMind Announces Upcoming Gemma Model Update Amid Rising AI Community Anticipation

Yapay Zeka Modelleri

schedule5 ay önce

visibility38 views

Google DeepMind Announces Upcoming Gemma Model Update Amid Rising AI Community Anticipation

Google DeepMind has confirmed plans to release an updated version of its open-weight Gemma language model, sparking excitement among developers and AI researchers. The announcement, made via a Reddit post by a community member, aligns with DeepMind’s ongoing commitment to accessible AI innovation.

A

AI Haberleri

Taalas Unveils Record-Breaking Llama 3.1 8B Chip, Raises $169M to Challenge Nvidia

Yapay Zeka Modelleri

schedule5 ay önce

visibility16 views

Taalas Unveils Record-Breaking Llama 3.1 8B Chip, Raises $169M to Challenge Nvidia

Canadian AI hardware startup Taalas has launched a custom silicon chip that runs Llama 3.1 8B at an unprecedented 17,000 tokens per second, redefining inference speed. The company, backed by $169 million in funding, aims to disrupt Nvidia’s dominance in the AI chip market with its aggressively quantized architecture.

A

AI Haberleri

Gemini 3.1 Pro Revolutionizes AI Development with User-Driven Game Creation

Yapay Zeka Modelleri

schedule5 ay önce

visibility11 views

Gemini 3.1 Pro Revolutionizes AI Development with User-Driven Game Creation

A Reddit user demonstrated Google's Gemini 3.1 Pro creating a fully functional sci-fi browser game in under hours using only natural language prompts, showcasing unprecedented AI coding capabilities. Experts say this marks a turning point in human-AI collaboration for software development.

A

AI Haberleri

Gemini 3.1 Pro Outperforms 3.0 Pro in Spatial Reasoning, Sparks Benchmark Revolution

Yapay Zeka Modelleri

schedule5 ay önce

visibility15 views

Gemini 3.1 Pro Outperforms 3.0 Pro in Spatial Reasoning, Sparks Benchmark Revolution

New benchmarks reveal Gemini 3.1 Pro delivers a generational leap over its predecessor, with unprecedented output complexity and improved reasoning—though hallucinations and performance bottlenecks remain concerns.

A

AI Haberleri

iPhone 14 Pro Max Achieves 46 Tokens/Second with BitNet AI Model, Redefining On-Device LLM Performance

Yapay Zeka Modelleri

schedule5 ay önce

visibility22 views

iPhone 14 Pro Max Achieves 46 Tokens/Second with BitNet AI Model, Redefining On-Device LLM Performance

A developer has successfully ported Microsoft’s BitNet, a 1-bit quantized AI model, to the iPhone 14 Pro Max, achieving 45–46 tokens per second—surpassing previous on-device benchmarks. The breakthrough demonstrates the potential for powerful, privacy-preserving AI to run directly on consumer smartphones without cloud dependency.

A

AI Haberleri

Claude Opus 4.6 Surpasses Predictions on METR Benchmark, Signals Exponential AI Progress

Yapay Zeka Modelleri

schedule5 ay önce

visibility36 views

Claude Opus 4.6 Surpasses Predictions on METR Benchmark, Signals Exponential AI Progress

Claude Opus 4.6 has achieved unprecedented performance on METR’s 50%-time-horizon benchmark, outpacing all prior models and challenging established AI timelines. According to analysis from LessWrong, its progress suggests an accelerating trajectory in AI capability, with implications for safety, policy, and development.

A

AI Haberleri