ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

New

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analy

AI Research Tools

8.6 (60.458 score)

freemium

Visit Tool

Overview

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM — ingested from rss

Similar Tools

Cognito

Freemium

Predicting model behavior before release by simulating deployment

Industrial policy for the Intelligence Age

Freemium

View all in AI Research Tools →

Verified Info

Added to directory6/25/2026

CategoryAI Research Tools

Pricing modelfreemium

Ratings & Reviews

Rate ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

Alternatives to ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

View All

Check out real-life AI prototypes from the Futures Lab.

Freemium

<img src="https://storage.googleapis.com/gweb-uniblog-publish-prod/images/FutureLabs_social.max-600x600.format-webp.w

AI Research ToolsCompare →

Helping build shared standards for advanced AI

Freemium

OpenAI helps build shared standards for advanced AI, supporting evaluation frameworks, safety practices, and global coop

AI Research ToolsCompare →

NotebookLM for Google Workspace

Freemium

AI research assistant that organizes and synthesizes your documents.

AI Research ToolsCompare →

OlmoEarth v1.1: A more efficient family of Earth observation models

Freemium

OlmoEarth v1.1: A more efficient family of Earth observation models — ingested from rss

AI Research ToolsCompare →

NotebookLM Canvas

Freemium

Visual workspace that transforms research notes into interactive diagrams.

AI Research ToolsCompare →

NotebookLM (Google)

Freemium

AI research assistant that turns documents into insights and audio

AI Research ToolsCompare →