- Superintelligence.
- Posts
- This Week in AI: GPT-4o Native Image Generation
This Week in AI: GPT-4o Native Image Generation
Your Weekly Dose of AI News
In The News
OpenAI Reveals Native Image Generation!
OpenAI introduces native image generation in GPT-4o, creating a unified system for text and visuals. The technology excels at rendering accurate text in images and processing complex multi-concept prompts simultaneously.
Google With Major Update: Gemini 2.5Google's Gemini 2.5 Pro leads LMArena rankings with exceptional performance on challenging benchmarks. The model achieves top scores on GPQA and AIME 2025 without requiring costly techniques, while processing massive context windows of up to one million tokens. | New DeepSeek v3 Update Shocks The World!DeepSeek releases V3-0324, a 685-billion parameter open-source model running efficiently at 20 tokens per second on Mac hardware. The MIT-licensed technology approaches Claude 3.7 Sonnet's performance while making cutting-edge AI more accessible. |
Builder’s Corner
Run AI Agents Completely Offline NowYou can now run AI agents entirely on your local computer without cloud services. Combining Gemma 3, Smolagents, and LM Studio creates a free alternative. | ChatGPT Simplifies AI Cartoon CreationChatGPT enables creation of consistent AI cartoons across multiple camera angles with simple prompts. The system eliminates complex techniques previously required for animation consistency. |
Demos to Blow Grandma’s Mind
Figure's Humanoid Achieves Natural Walking
Figure's 02 humanoid robot achieves natural human-like walking using an end-to-end neural network. The system was trained in a high-fidelity physics simulation environment.
China's Kepler Reveals Advanced Robot HandKepler showcases the Forerunner K2 robot with highly advanced hands featuring 11 degrees of freedom. Each finger contains 25 tactile sensors while supporting 33-pound payloads. | MagicBot Personalizes Car Shopping ExperienceMagicBot serves as an AI car shopping assistant that analyzes users' personalities and interests. The system provides personalized vehicle recommendations based on individual profiles |
Products of the Week
Reve Image Beats Midjourney, Google
Reve Image dominates the Artificial Analysis Image Arena, outperforming Midjourney, Imagen 3, and FLUX. The model excels in text rendering and prompt adherence.
Omni-Modal AI with Qwen ChatQwen Chat introduces real-time voice and video conversations powered by the new open-source omni-model, Qwen2.5-Omni-7B. The omni-model handles text, audio, image, and video with simultaneous thinking and speaking capabilities. | Kling AI Upgrades Elements with New FeaturesKling AI upgrades Elements with faster generation and improved prompt understanding for better overall image quality results. The update adds new Endframes and Extend features for enhanced creative capabilities. |
Hi All,
Thank you for reading. We would be delighted if you shared the newsletter with your friends! We look forward to expanding the newsletter in the future with even more specialized topics. Until then, follow us on social media to stay up to date.
Cheers,
Dan