background Layer 1

Yandex SpeechKit learned to synthesize and recognize speech in Uzbek language

The Yandex Cloud platform has developed neural networks that can recognize speech and play it back in Uzbek. Companies both in Russia and Uzbekistan can already use the new language in the Yandex SpeechKit service to create voice assistants, call center automation and speech analytics.

Customizing synthesis

Speech synthesis models are trained on a pre-recorded voice of a real speaker. At the same time, they are able to synthesize speech not only from the text, but also from its phoneme record, which specifies all the features of sounds in words. This will allow clients to correct the synthesis pronunciation of individual words - complex names, names of goods, borrowed expressions. To do this, they will need to specify the phoneme breakdown of the desired word in the text using special syntax. 
 
Large datasets for speech recognition 

Just like for speech synthesis, for recognition neural networks learn from examples. To train them, Yandex Cloud experts have collected a dataset with thousands of hours of audio and their transcripts, including short and long phrases, as well as names, addresses, dates and numbers.

Uzbek Latin

 Neural networks for the synthesis and recognition of Uzbek work with the Latin alphabet, which has been used in Uzbekistan for more than 20 years. The algorithms had to learn some of the sound features of the letters. For example, the letter "X" denotes a hard [x] in the alphabet, but in foreign words can be pronounced as [xs].  

"Scenarios related to artificial intelligence, particularly speech technology, are now actively developing in Uzbekistan. One of our partners estimates that the speech technology market in Uzbekistan can reach up to 395 million minutes of processed speech per year. Appearance of the new sixteenth language in Yandex SpeechKit is a big step for creating new dialog scenarios for both Uzbek and Russian companies," - said Grigory Atrepiev, head of products in Yandex Cloud.
 
The models are available on the website and can be customized using standard API tools. Synthesis and recognition of the Uzbek language with the help of Yandex SpeechKit is already used by a number of companies, including Kolesa Group, for example. Partner companies including TargetAI, JustAI, Unistar Digital and Tomoru are also already testing and integrating the technology into their solutions.

In recent years, the term “Internet of Things” (IoT) has become one of the most popular in the tech world. It promised to connect billions of devices. The next stage was the Industrial Internet of Things (IIoT), which brought revolutionary changes to production processes. Today, a new term is coming onto the scene – AIoT (Artificial Intelligence of Things), combining the capabilities of the Internet of Things with the power of AI.

Sixty-eight IT executives from 15 countries, including Brazil, India, USA, Saudi Arabia, Turkey, Kazakhstan, and Indonesia participated in the project.

June 2024 was a very dynamic month for AI and data analytics market, marked by significant events, product launches, and industry insights.

Artem Kalivanov, CIO of Beeline Uzbekistan, discussed IT project management in terms of project definition, management models like Waterfall and Agile methodologies, and the importance of Lean in process management.

We use cookies for analytical purposes and to deliver you the best experience with our website. Continuing to the site, you agree to the Cookie Policy.