WAXAL is an open-source speech database designed to support the development of voice-based artificial intelligence for African languages.WAXAL is an open-source speech database designed to support the development of voice-based artificial intelligence for African languages.

Google joins push to localise AI for African languages with speech database

3 min read

Google has collaborated with African universities and research institutions to launch WAXAL, an open-source speech database designed to support the development of voice-based artificial intelligence for African languages. 

African institutions, including Makerere University in Uganda, the University of Ghana, Digital Umuganda in Rwanda, and the African Institute for Mathematical Sciences (AIMS), participated in the data collection for this initiative. The dataset provides foundational data for 21 Sub-Saharan African languages, including Hausa, Luganda, Yoruba, and Acholi.

WAXAL is designed to support the development of speech recognition systems, voice assistants, text-to-speech tools, and other voice-enabled applications across sectors such as education, healthcare, agriculture, and public services.

“This dataset provides the critical foundation for students, researchers, and entrepreneurs to build technology on their own terms, in their own languages,” said Aisha Walcott-Bryantt, Head of Google Research Africa

WAXAL’s launch comes amid growing efforts across Africa to develop language technologies that reflect local cultures and realities. 

In September 2025, the Nigerian government unveiled N-ATLAS, an open-source language model capable of recognising and transcribing spoken words and generating text, in Yoruba, Hausa, Igbo, and Nigerian-accented English. 

Similar initiatives are emerging in the private sector, where startups such as  South Africa’s Lelapa AI are building tools like Vulavula, which offers speech recognition, translation, and sentiment analysis. 

By making this speech dataset openly accessible, WAXAL provides the fuel for a growing wave of homegrown efforts to bring African languages into the digital age.

Although Sub-Saharan Africa is home to more than 2,000 languages, reports suggest that fewer than 5% of those languages have the resources needed for Natural Language Processing (NLP), which allows computers to understand and comprehend human language. This lack of representation in training datasets limits the effectiveness of speech recognition and text-to-speech systems for African users.  

Developed over three years with funding and technical support from Google, WAXAL addresses a major gap in global AI development.

WAXAL provides speech data for 21 Sub-Saharan African languages, including Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Swahili, and Yoruba. The dataset contains more than 11,000 hours of speech drawn from nearly two million individual recordings. 

Under the project’s partnership model, contributing institutions retain ownership of the data they collected, while making it openly available to researchers and developers worldwide.

“For AI to have a real impact in Africa, it must speak our languages and understand our contexts,” Joyce Nakatumba-Nabende, Senior Lecturer at Makerere University’s School of Computing and Information Technology, said. 

“The WAXAL dataset gives our researchers the high-quality data they need to build speech technologies that reflect our unique communities.”

Get The Best African Tech Newsletters In Your Inbox

Subscribe
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.
Tags:

You May Also Like

Fed Decides On Interest Rates Today—Here’s What To Watch For

Fed Decides On Interest Rates Today—Here’s What To Watch For

The post Fed Decides On Interest Rates Today—Here’s What To Watch For appeared on BitcoinEthereumNews.com. Topline The Federal Reserve on Wednesday will conclude a two-day policymaking meeting and release a decision on whether to lower interest rates—following months of pressure and criticism from President Donald Trump—and potentially signal whether additional cuts are on the way. President Donald Trump has urged the central bank to “CUT INTEREST RATES, NOW, AND BIGGER” than they might plan to. Getty Images Key Facts The central bank is poised to cut interest rates by at least a quarter-point, down from the 4.25% to 4.5% range where they have been held since December to between 4% and 4.25%, as Wall Street has placed 100% odds of a rate cut, according to CME’s FedWatch, with higher odds (94%) on a quarter-point cut than a half-point (6%) reduction. Fed governors Christopher Waller and Michelle Bowman, both Trump appointees, voted in July for a quarter-point reduction to rates, and they may dissent again in favor of a large cut alongside Stephen Miran, Trump’s Council of Economic Advisers’ chair, who was sworn in at the meeting’s start on Tuesday. It’s unclear whether other policymakers, including Kansas City Fed President Jeffrey Schmid and St. Louis Fed President Alberto Musalem, will favor larger cuts or opt for no reduction. Fed Chair Jerome Powell said in his Jackson Hole, Wyoming, address last month the central bank would likely consider a looser monetary policy, noting the “shifting balance of risks” on the U.S. economy “may warrant adjusting our policy stance.” David Mericle, an economist for Goldman Sachs, wrote in a note the “key question” for the Fed’s meeting is whether policymakers signal “this is likely the first in a series of consecutive cuts” as the central bank is anticipated to “acknowledge the softening in the labor market,” though they may not “nod to an October cut.” Mericle said he…
Share
BitcoinEthereumNews2025/09/18 00:23
While Shiba Inu and Turbo Chase Price, 63% APY Staking Puts APEMARS at the Forefront of the Best Meme Coin Presale 2026 – Stage 6 Ends in 3 Days!

While Shiba Inu and Turbo Chase Price, 63% APY Staking Puts APEMARS at the Forefront of the Best Meme Coin Presale 2026 – Stage 6 Ends in 3 Days!

What if your meme coin investment could generate passive income without selling a single token? Shiba Inu climbed 4.97% as 207 billion tokens left exchanges. Turbo
Share
Coinstats2026/02/04 03:15
SUI Price Is Down 80%: Price Nears Level Bulls Cannot Afford to Lose

SUI Price Is Down 80%: Price Nears Level Bulls Cannot Afford to Lose

SUI price has quietly slipped into a zone that usually decides everything. Charts show an 80% drop from the peak, yet the market is no longer moving fast. This
Share
Captainaltcoin2026/02/04 03:00