4 days ago
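High schooler uses Minecraft to evaluate SOTA models! Claude currently leads, with DeepSeek close behind
AI keeps setting new benchmark records, yet it cannot reliably count how many times the letter "r" appears in "strawberry", repeatedly failing at problems that look simple to humans. This contrast has spurred a wave of creative evaluations, such as MC-Bench, developed by a high school student, which assesses AI capabilities through a Minecraft block-building "arena" mode. This new evaluation paradigm may ...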