Deepseek Benchmark - Search News

News

2don MSN

Figuring out which AI model is right for you is harder than you think

AI models are numerous and confusing to navigate, but the benchmarks used to measure their performance are also challenging.

12d

DeepSeek-GRM: Introducing an Enhanced AI Reasoning Technique

Researchers from DeepSeek and Tsinghua University say combining two techniques improves the answers the large language model ...

3don MSN

OpenAI Beats DeepSeek On Sentence-level Reasoning

ChatGPT and other AI chatbots based on large language models are known to occasionally make things up, including scientific and legal citations ...

10d

These Chinese AI Companies Could Be The Next DeepSeek

DeepSeek’s meteoric rise put the spotlight on artificial intelligence from China. Here are the other buzzy Chinese AI ...

InfoWorld11d

What misleading Meta Llama 4 benchmark scores show enterprise leaders about evaluating AI performance claims

AI benchmarking is critical to determine performance, but results can be irrelevant to enterprise workflows; enterprise ...

Morningstar12d

Microsoft is quietly using DeepSeek's technology. What that means for AI stocks.

Microsoft's diversification from OpenAI speaks volumes about the future of AI investing When I wrote about DeepSeek's remarkable AI breakthrough in January, I didn't expect to see my predictions ...

New York Post4d

US mulls penalties to block DeepSeek from buying American technology

The Trump administration is weighing penalties that would block China’s DeepSeek from buying U.S. technology and is debating barring Americans’ access to its services, the New York Times ...

InfoWorld9d

Vector Institute aims to clear up confusion about AI model performance

DeepSeek and OpenAI’s o1 models performed the best across the various benchmarks, but all models still struggle in a range of ...

13don MSN

Forget ChatGPT? China’s DeepSeek is working on smarter, self-improving AI models

Chinese startup DeepSeek, led by Liang Wenfeng, is developing generative reward modeling (GRM) to enhance AI efficiency and ...

Now it’s TikTok parent ByteDance’s turn for a reasoning AI: enter Seed-Thinking-v1.5!

It achieved an 8.0% higher win rate over DeepSeek R1, suggesting that its strengths generalize beyond just logic or math-heavy challenges.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results