In partnership with

DeepSeek r2 Leaks!

The TLDR
Leaked details about DeepSeek's upcoming R2 model reveal a 1.2T-parameter mixture-of-experts architecture that could slash AI costs by 97% compared to GPT-4 Turbo. Running on Huawei chips instead of NVIDIA GPUs, R2 could dramatically expand access to high-performance AI and reshape the AI hardware landscape.

An internal paper that has been circulating in specialist forums since the weekend suggests that DeepSeek could radically reduce the costs of AI services with its upcoming R2 model.

R2 combines a so-called “mixture-of-experts” architecture with 1.2 trillion parameters, of which only 78 billion are actively calculated. According to the leak, this technology reduces the price per processed token by 97 percent compared to GPT-4 Turbo. The basis is a 5.2-petabyte dataset from legal, financial and patent data; the training runs on Huawei's Ascend-910B chips with 82 percent utilization and achieves around 512 PetaFLOPS - almost half an ExaFLOP.

This opens up two opportunities for the AI community: firstly, the running costs for research, start-ups and open source projects are drastically reduced. Secondly, R2 shows that competitive models are also possible without NVIDIA GPUs - a step towards greater technological independence.

If DeepSeek confirms the figures, developers could soon be using powerful models locally or in inexpensive clouds. Which application would you be the first to try out?

Why it matters: R2 promises affordable high-performance AI and thus paves the way for broader participation in modern AI technology. At the same time, it shifts the balance of power in the global AI hardware race.

Boring slides are out.

Most slide decks look the same—static, boring, and forgettable.

Leave them behind with Prezi AI, the AI presentation software that builds unique, dynamic presentations just from your prompt. It’s fast, intuitive, and designed to fit your ideas perfectly.

Stand out with Prezi’s unique format that uses zooming, cinematic movement to wow your audience.

Graph of the Day

AI is on the rise in every sector.

Morgan Stanley: AI demand remains robust despite market volatility

Morgan Stanley describes concerns about a decline in AI investment as unfounded. Analyst Joseph Moore highlights the continued strong demand for GPUs, particularly for inference tasks, despite recent market weakness in AI stocks. The introduction of more efficient language models such as DeepSeek and new US tariffs have put short-term pressure on companies such as Nvidia. Nevertheless, Moore predicts strong revenue growth for Nvidia once supply bottlenecks, particularly for H20 chips, are resolved. This underlines the structural importance of AI infrastructure for future economic growth.

Bank of England warns of systemic risks from AI in the financial sector

The Bank of England identifies potential systemic risks from the increased use of AI in banks and insurance companies. While AI can increase efficiency, common weaknesses in widely used models could lead to misjudgements of risk and credit misallocations. There is also a risk that AI-driven trading strategies could lead to synchronized market movements during periods of stress, jeopardizing stability. Dependence on a small number of AI service providers also increases operational risk. The Bank emphasizes the need for flexible and forward-looking monitoring in order to meet these challenges.

Bank of America Invests $4 Billion in AI and New Technologies

Bank of America plans to invest $4 billion in AI and new technologies in 2025, representing almost one-third of its total technology budget. Over 90% of employees are already using the internal AI assistant, leading to a reduction of IT support requests by more than half. AI tools are boosting efficiency in areas such as development, training, customer service, and customer engagement. These investments highlight the strategic importance of AI for future competitiveness and efficiency improvements in the banking sector.

Poll of the Day

Will the financial market systematically face more problems due to increasing AI?

Login or Subscribe to participate

In The News

AI Models Are Moving From Helpers to True Delegates

Newer AI models like OpenAI's o3 and o4-mini aren't just improving on benchmarks — they're fundamentally changing what tasks can be fully delegated to them. Unlike earlier versions like GPT-3.5 and GPT-4, these models use integrated tools to independently manage complex workflows. With minimal human input, o3 can now execute detailed tasks from general instructions. This marks a major shift toward AI acting as autonomous collaborators rather than simple assistants.

Anthropic Launches Research on AI Model Experiences

Anthropic has begun a cautious research program to explore whether advanced AI models could have experiences of their own. While the idea of "model welfare" remains highly uncertain and controversial, the company believes it’s important to investigate.

Tencent Launches Hunyuan 3D AI Engine v2.5

Tencent’s Hunyuan 3D AI Creation Engine v2.5 brings a major leap in ultra-high-definition 3D modeling, featuring a 10B parameter geometric model and high-quality PBR textures. With improved animation tools, flexible pipelines, and expanded free access, it’s set to transform 3D creation for games, AR/VR, and more.

Quote of the Day

Hi All,

Thank you for reading. We would be delighted if you shared the newsletter with your friends! We look forward to expanding the newsletter in the future with even more specialized topics. Until then, follow us on social media to stay up to date.

Cheers,
Dan

Keep Reading

No posts found