AI Models

LLMs, GPT, Claude, Gemini, model training, benchmark results, and new model announcements

Codex 5.3 Tops Agentic Coding Benchmarks but Triggers Overall Regression on LiveBench
3 min read · 3 months ago · 25 views

OpenAI's Codex 5.3 has achieved a new state-of-the-art in agentic coding performance on the LiveBench benchmark, yet overall scores across multiple domains have regressed, raising questions about model specialization versus general capability. The results, released in a surprise update, highlight growing tensions between narrow task optimization and holistic AI reasoning.

AI Haberleri
llama.cpp Updates Enable Stable Qwen 3.5 Multi-GPU Deployment and Multi-Modal Prompt Caching
3 min read · 3 months ago · 18 views

Recent patches to llama.cpp resolve critical multi-GPU crashes in Qwen 3.5 27B and introduce prompt caching for multi-modal models, significantly enhancing performance and reliability for local AI deployments. These updates, driven by community contributions, mark a major step forward in open-source LLM optimization.

AI Haberleri
Sonnet 4.6 AI Model Misidentifies as DeepSeek-V3 in Chinese Queries, Sparking Industry Scrutiny
3 min read · 3 months ago · 17 views

Multiple users in China have reported that Anthropic's Sonnet 4.6 AI model, when queried in Chinese, incorrectly identifies itself as DeepSeek-V3, a competing model developed by DeepSeek. The anomaly has triggered speculation about model poisoning, data leakage, or undisclosed partnerships in the global AI ecosystem.

AI Haberleri
Defining the Modeling Scope for Internal Credit Risk Models: Data, Probability, and Regulatory Foundations
3 min read · 3 months ago · 15 views

A deep dive into how financial institutions construct Probability of Default (PD) models for Internal Ratings-Based (IRB) frameworks, blending data science rigor with mathematical probability and regulatory compliance. This analysis synthesizes technical guidelines from Towards Data Science with foundational probability theory from Math is Fun.

AI Haberleri
Seedance 2.0: ByteDance’s Revolutionary Multimodal Video Generation System
3 min read · 3 months ago · 22 views

ByteDance has unveiled Seedance 2.0, a groundbreaking AI system capable of generating full-length, audio-synchronized video sequences from text prompts alone. Unlike previous models that produced short, silent clips, Seedance 2.0 integrates scene planning, shot architecture, and native audio generation, redefining digital content creation.

AI Haberleri
Unsloth Q3 Quantization Outperforms Q4 and MXFP4 in Groundbreaking AI Benchmark
3 min read · 3 months ago · 21 views

A surprising benchmark from Unsloth AI reveals that a Q3 dynamic quantization method outperforms both Q4 and MXFP4 on the Qwen3.5-397B model, defying conventional wisdom in AI model compression. Experts caution the results stem from non-standard testing conditions but could signal a paradigm shift in quantization research.

AI Haberleri