Google’s DolphinGemma AI Bridges Communication Gap Between Humans and Dolphins

April 14, 2025 Google has introduced DolphinGemma, an AI model developed in collaboration with Georgia Tech and the Wild Dolphin Project (WDP), aiming to decode dolphin vocalizations and facilitate two-way communication between humans and dolphins.

DolphinGemma is trained on decades of underwater audio and video data collected by WDP, focusing on Atlantic spotted dolphins. The model identifies patterns in dolphin vocal sequences and can generate realistic dolphin-like sounds. Operating as an audio-in, audio-out system, it predicts subsequent dolphin sounds similarly to how language models predict the next word in human text.

The AI system is based on Google’s lightweight Gemma models and utilizes SoundStream for audio representation. With approximately 400 million parameters, DolphinGemma is compact enough to run on Pixel phones used in field research.

This advancement could revolutionize our understanding of dolphin communication, providing insights into their social behaviors and potentially enabling interactive exchanges between humans and dolphins. By identifying recurring sound patterns and sequences, researchers hope to uncover hidden structures and meanings within dolphin vocalizations.

Google plans to release DolphinGemma as an open model this summer, allowing researchers worldwide to adapt it for studying other cetacean species. Fine-tuning will be necessary for species with different vocal patterns, but the open-source nature of the model facilitates such adaptations.

 

Top Stories

Related Articles

October 21, 2025 Apple CEO Tim Cook wrapped up a six-day visit to China with a pledge to bring the more...

October 10, 2025 OpenAI may soon set up shop in Canada as the ChatGPT maker explores how it can contribute more...

October 7, 2025 A McMaster University researcher has used artificial intelligence to identify a new antibiotic compound that could transform more...

August 25, 2025 xAI has announced that they are issuing and Open-Source version of Grok 2.5 — a move likely intended more...

Jim Love

Jim is an author and podcast host with over 40 years in technology.

Share:
Facebook
Twitter
LinkedIn