In partnership with

Dear Readers,

The limits of what AI models can achieve are currently shifting noticeably. With Qwen3-Max, Alibaba presents a language model that not only impresses with its sheer size, but also scores points for its stability and long-term capability. It's a signal: agentic systems that independently control complex processes are moving from a promise for the future to practical reality – and in doing so, changing how we think about work, decision-making, and creativity.

In this issue, we take a look at what Qwen3-Max can really do, why hedge funds like Magnetar are shifting billions into AI infrastructure, and how central banks around the world are incorporating AI tools into their daily practices. We also offer exciting insights into new benchmarks, market trends, and fresh debates from the scene. Anyone who wants to understand how power, capital, and technology are being rearranged will find the common thread here – it's worth reading on.

In Today’s Issue:

💵 Alibaba just dropped a massive new AI model with over a trillion parameters.

📈 A major hedge fund just made a 56% return betting on the AI infrastructure boom.

🇦🇺 Even Australia's central bank is now using a custom AI to help make policy decisions.

🇺🇸 The US Federal Reserve is also using AI, but don't worry, it's not setting interest rates (yet).

And more AI goodness…

All the best,

AI Progress Report

GPT-5 Pro Tackles the Hardest Problems

A developer shared their experience of GPT-5 Pro solving a complex coding problem in just 10 minutes that other AI assistants couldn't handle, highlighting its impressive capabilities on difficult, real-world tasks.

Grok gets Background Thinking

According to a new report, xAI is developing "background thinking" capabilities for Grok, allowing it to process and reason about information continuously.

New AI Test: Telling Time

A new visual reasoning benchmark called ClockBench has been introduced, which reveals a massive gap between human and AI performance on the simple task of reading an analog clock.

This talks about behind-the-scenes look at the creation of OpenAI's Codex, detailing its origin story, surprising user adoption patterns, and its profound impact on the future of software engineering.

A Google DeepMind employee disagrees with Geoffrey Hinton's thesis that AI will make the vast majority poorer and a few incredibly rich.

More and more coders are switching from Claude Code to OpenAI's Codex.

Qwen3-Max-Preview Released!

The Takeaway

👉 With over a trillion parameters, Qwen3-Max-Preview is one of the largest freely accessible models and is now available for use via API.

👉 The model improves instruction following, handling of long dialogues, and use of tools.

👉 This opens up new opportunities for developers and companies to build more stable agent-based workflows and complex automations.

👉 The release underscores that scaling brings not only theoretical but also practical advances.

Alibaba has unveiled its largest language model to date, Qwen3-Max-Preview (Instruct), with more than a trillion parameters and direct access via API or Qwen Chat. The model is designed to follow instructions much better, conduct longer dialogues without interruptions, and work more reliably on multi-step tasks or in conjunction with tools. With this release, Alibaba is taking a clear step toward more productive, agentic AI workflows.

For the community, this means that the long-awaited “scaling” is finally showing tangible benefits. Such a large model is not only a proof of performance, but also opens up more practical applications – for example, in long-term contexts, in multilingual applications, or in the field of AI assistants that can control processes independently. When a system clearly understands commands and implements them consistently, prompt engineering becomes less of an art and more of a fine-tuning exercise.

The outlook is exciting: such models could soon be able to independently plan multiple steps, prepare decisions, and relieve the burden on humans without the need to monitor every single input. It is a preview of orchestrated AI systems that organize entire processes in the background like co-pilots.

Why it matters: Qwen3-Max Preview marks a decisive step in the development of agentic systems. Huge models show that they not only impress on paper, but can also provide greater stability, accuracy, and automation in everyday life.

Sources:

Marketing ideas for marketers who hate boring

The best marketing ideas come from marketers who live it.

That’s what this newsletter delivers.

The Marketing Millennials is a look inside what’s working right now for other marketers. No theory. No fluff. Just real insights and ideas you can actually use—from marketers who’ve been there, done that, and are sharing the playbook.

Every newsletter is written by Daniel Murray, a marketer obsessed with what goes into great marketing. Expect fresh takes, hot topics, and the kind of stuff you’ll want to steal for your next campaign.

Because marketing shouldn’t feel like guesswork. And you shouldn’t have to dig for the good stuff.

Old But Unbeaten: Gemini 2.5 Pro Remains #1 in LMArena Text

Magnetar makes massive investment in AI infrastructure

Hedge fund Magnetar has invested around $500 million in CoreWeave, a leading provider of AI computing capacity. Since its IPO in March 2025, the stock has more than doubled, giving the fund a return of 56%. The shift from traditional credit strategies to high-growth AI investments shows how hedge funds are structurally tapping into new risk profiles – a signal for capital markets where artificial intelligence infrastructure is increasingly setting the pace.

Central banks are upgrading with AI

The Reserve Bank of Australia is testing an AI system that processes over 200,000 analytical documents and 40 years of monetary policy experience. Combined with text analyses of liaison data from 25 years, this creates a tool that does not replace decisions, but prepares them in a much more informed manner. This could enable central banks to respond more quickly and accurately to economic shocks. Its use marks an institutional change: data intelligence is moving closer to the center of monetary policy practice.

The Fed uses AI - with clear limits

The US Federal Reserve also uses AI, but not for interest rate decisions. According to Governor Lisa Cook, the technology primarily supports the analysis of protocols, programming, and research into financial risks. In the long term, the increase in productivity could help to reduce inflationary pressure, while the high initial investment could create price pressure in the short term. AI is thus establishing itself as an analytical tool in the day-to-day work of central banks – a digital upgrade without any loss of control.

Which one do you prefer: OpenAIs Codex or Claude Code?

Go from AI overwhelmed to AI savvy professional

AI will eliminate 300 million jobs in the next 5 years.

Yours doesn't have to be one of them.

Here's how to future-proof your career:

  • Join the Superhuman AI newsletter - read by 1M+ professionals

  • Learn AI skills in 3 mins a day

  • Become the AI expert on your team

How'd We Do Today?

Superintelligence+ unlocks the full potential of AI! For just €5/month, gain access to a special Saturday edition of our newsletter packed with in-depth AI research. Join our exclusive Discord community to connect with fellow AI enthusiasts. Plus, participate in giveaways for a chance to win coveted access and invites to cutting-edge AI software and tools.

Reply

or to participate

Keep Reading

No posts found