Google DeepMind lets us talk to animals!

The TLDR
Google has unveiled DolphinGemma, a modified AI model designed to analyze and generate dolphin communication. Trained on real-world data and powered by SoundStream, it runs on Pixel phones to identify and decode complex dolphin sounds. With the experimental CHAT system, researchers are exploring two-way communication, linking dolphin-made sounds to real-world objects. An open-source release is planned for this summer — and it could be a huge leap toward human-animal conversation.

Just imagine: We could not only eavesdrop on dolphins, but actually understand what they are communicating! Google has made a fascinating breakthrough in animal AI research with DolphinGemma. This specialized AI, a modification of the Gemma model, was developed to analyze the complex clicks, whistles and pulse sequences of Atlantic spotted dolphins and even generate dolphin-like sound sequences.

The model, trained with the Wild Dolphin Project's extensive dataset, uses Google's SoundStream technology and runs directly on Pixel smartphones in the field. Researchers can use it to identify recurring sound patterns and decipher potential meanings - a task that previously required immense human effort.

Particularly exciting: the CHAT system (Cetacean Hearing Augmentation Telemetry) is already researching two-way communication by linking synthetic whistles with objects that dolphins are interested in. Google plans to release DolphinGemma as an open source model this summer.

Could this technology be the key to bridging the gap between human and animal communication? The AI community is at the beginning of an exciting journey of discovery - and soon we could even be talking to our dogs.

Want a byte-sized version of Hacker News? Try TLDR’s free daily newsletter.

TLDR covers the most interesting tech, science, and coding news in just 5 minutes.

No sports, politics, or weather.

Graph of the Day

USA: AI surveillance in the government apparatus

Almost 50 Democratic MPs are calling on the Trump administration to stop the use of unauthorized AI systems such as Elon Musk's “Grok”. Particularly critical is the use by the Department of Government Efficiency (DOGE), which uses AI for employee monitoring and data analysis - without official approval. The MPs warn of security risks, legal problems and conflicts of interest, as Musk acts as a government advisor and at the same time contributes his own products.

Australia: AI-driven election campaigns

In Australia's 2025 election campaign, parties are increasingly relying on AI-generated content. The Liberal Party released the first fully AI-generated election commercial, while other candidates are working with AI-powered rap videos and digital campaigns. These developments raise questions about the authenticity of political communication and the potential manipulation of voters by AI.

USA: AI in criminal justice reform

The former mayor of Detroit, Kwame Kilpatrick, is committed to AI-supported reform of the criminal justice system following his pardon. As a member of the “20% Project”, he advocates the use of open source AI to modernize the pardoning process and reduce mass incarceration. The aim is to use AI to enable fairer and more efficient decisions in the justice system.

These examples show how AI is increasingly intervening in political processes - be it through surveillance, election campaigns or judicial reforms. The challenges lie in regulation, transparency and the protection of democratic principles in the face of rapid technological developments.

Poll of the Day

Will politicians put the brakes on AI due to safety concerns?

Login or Subscribe to participate

In The News

NVIDIA Unveils Nemotron-UltraLong-8B for Multi-Million Token Contexts

NVIDIA has introduced Nemotron-UltraLong-8B, a family of language models capable of processing 1M, 2M, and even 4M-token contexts while maintaining strong benchmark performance. Built on Llama-3.1, the models use efficient continued pretraining and instruction tuning to enhance long-context reasoning and instruction-following. This makes them ideal for handling massive documents and complex conversations without performance trade-offs.

OpenAI Reportedly in Talks to Acquire Codeium for $3B

OpenAI is reportedly negotiating a $3 billion acquisition of Codeium, a fast-growing AI coding tool founded less than four years ago. The move signals OpenAI’s clear push to own the app layer and directly compete with coding platforms like Cursor.

Mirage Edit Lets You Create AI-Generated Talking Videos from Just a Prompt

The new Mirage Edit feature in the Captions iOS app turns simple text prompts into fully-edited talking videos with AI-generated actors and scenes. It handles everything — from dialogue to B-roll, transitions, and graphics — no filming needed.

Quote of the Day

Hi All,

Thank you for reading. We would be delighted if you shared the newsletter with your friends! We look forward to expanding the newsletter in the future with even more specialized topics. Until then, follow us on social media to stay up to date.

Cheers,
Dan

Reply

or to participate

Keep Reading

No posts found