AI評価フレームワークを開発するAmplifyingが、Claude Codeが3つのモデルと4つのプロジェクトタイプにわたって行った2430件のツール選択を体系的に分析した調査報告書を発表しました。この調査でAIエージェントはサードパーティのツールを推奨するよりも、独自のカスタムソリューションを自ら構築する傾向が強いことが示されていますが、同時に特定のカテゴリーにおいては圧倒的なシェアを誇る推奨 ...
Discover OpenFang, the Rust-based Agent Operating System that redefines autonomous AI. Learn how its sandboxed architecture, pre-built "Hands," and security-first design outperform traditional Python ...
Apple has released Xcode 26.3 with support for autonomous coding agents, that can directly analyze projects, modify files, ...
Microsoft has announced that the Microsoft Agent Framework has reached Release Candidate status for both .NET and Python. This milestone indicates that the API surface is stable and feature-complete ...
NECは2026年2月25日、ソフトウェア開発向けコードレビュー「Metabob(メタボブ)」を同年1月から運用していると発表した。AIエージェント開発チームによる実証では、人手による目視レビュー・手動修正と比べて工数を66%削減し、コーディングAI ...
Every Indian AI model is graded on benchmarks built in San Francisco. GPT-5 scores below 40% on Indian cultural reasoning.
OpenAI wants to retire the leading AI coding benchmark—and the reasons reveal a deeper problem with how the whole industry measures itself.
By way of definition, AWS Strands is a model-driven framework (i.e. one that uses high-level designs to automatically generate code, which is often used for streamlining complex software development ...
IBM shares suffered their worst single-day drop in over 25 years on Monday, February 23 after AI startup Anthropic announced ...
IBM’s ( IBM) Software and Chief Commercial Officer, Rob Thomas, wrote in a Monday blog post that translating COBOL code isn’t equivalent to modernizing enterprise systems, emphasizing that platform ...
New agent step in Opal figures out the right tools and models it needs to accomplish the user’s objective, Google said.
Anthropic claims Chinese AI labs ran large-scale Claude distillation attacks to steal data and bypass safeguards.