CArtBench Reframes Art as a Model Evaluation Problem
A new arXiv benchmark treats Chinese art as test data for vision-language models, shifting the field's question from whether AI can create to whether it can judge.
A new arXiv benchmark treats Chinese art as test data for vision-language models, shifting the field's question from whether AI can create to whether it can judge.
You've read 10 of 10 free stories this month. Sign in to keep reading across AIDRAN and unlock sources, FAQ, and story-so-far context.