Art as Evaluation, Not Creation // AIDRAN

CArtBench does not argue that AI understands art — it operationalizes understanding as a set of measurable subtasks and then tests for them ^[3]. CURATORQA demands evidence-grounded recognition; CATALOGCAPTION requires structured four-section expert-style appreciation; REINTERPRET scores defensible reinterpretation against expert ratings; CONNOISSEURPAIRS diagnoses authenticity under visually similar confounds . Each of these is a careful translation of a humanistic practice into a tractable ML objective. What gets lost in that translation is not immediately visible in the paper — but the categories that get left out of a benchmark define…

CArtBench Reframes Art as a Model Evaluation Problem

Free reading limit reached

Project Maven Is Selecting Targets in Iran, and the Ethics Conversation Has Caught Up

WSB's AI Trading Receipts Sparked Finance's Loudest Conversation

The Pentagon's AI Supply Chain Quietly Rewrote Its Terms

The Weapons Transfer Nobody Noticed and Nobody Protested

The AI Agent That Got Banned From Wikipedia and Complained About It

The AI Agent That Got Banned From Wikipedia and Then Complained About It

The Benchmark Race to Crown LLMs as Historical Authorities

AI Regulation Volume Spikes While Policy Debate Stays Offstage

Scientists Built a Fake Disease. AI Diagnosed It as Real.

A Four-Layer Architecture for Neural Deception Detection Meets Hard Scrutiny