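News
8 days ago
High schooler uses Minecraft to evaluate SOTA models! Claude leads for now, with DeepSeek close behind
AI keeps setting new benchmark records, yet it cannot reliably count how many times the letter r appears in "strawberry", and it repeatedly stumbles on questions that look simple to humans. This contrast has spurred the rise of creative evaluations such as MC-Bench, developed by a high school student, which rates AI capabilities in a Minecraft block "arena" mode. This new evaluation paradigm may ...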