Yandex SpeechKit learns to synthesize and recognize speech in Uzbek

Yandex Cloud has developed neural networks that can recognize and synthesize speech in Uzbek. Companies in both Russia and Uzbekistan can already use the new language in the Yandex SpeechKit service to build voice assistants, automate call centers and run speech analytics.

Customizing synthesis

Speech synthesis models are trained on pre-recorded speech from a real speaker. They can synthesize speech not only from plain text but also from its phonemic transcription, which specifies exactly how each sound in a word is pronounced. This lets clients correct the pronunciation of individual words, such as complex proper names, product names and borrowed expressions. To do this, they specify the phoneme breakdown of the word directly in the text using a special syntax.
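As a rough illustration of the idea, the sketch below replaces selected words in a text with a phoneme breakdown before the text is sent to synthesis. It does not reproduce the actual SpeechKit syntax: the "[[" and "]]" delimiters, the function name apply_pronunciation_overrides and the phoneme string are all assumptions made for the example, and the service documentation should be consulted for the real markup.

```python
import re


def apply_pronunciation_overrides(text, overrides, open_mark="[[", close_mark="]]"):
    """Replace whole words in `text` with a delimited phoneme breakdown.

    `overrides` maps a word to its phoneme string. The delimiters are
    hypothetical placeholders, not the documented SpeechKit markup.
    """
    if not overrides:
        return text

    # Try longer words first so a short key never shadows a longer one.
    words = sorted(overrides, key=len, reverse=True)
    pattern = re.compile(
        r"(?<!\w)(" + "|".join(re.escape(w) for w in words) + r")(?!\w)",
        re.IGNORECASE,
    )
    # Case-insensitive lookup from the matched word to its phoneme string.
    lookup = {word.lower(): phonemes for word, phonemes in overrides.items()}

    def substitute(match):
        return f"{open_mark}{lookup[match.group(1).lower()]}{close_mark}"

    return pattern.sub(substitute, text)


# Illustrative phoneme string only; not real SpeechKit phoneme notation.
marked = apply_pronunciation_overrides(
    "Xerox printer", {"Xerox": "k s e r o k s"}
)
```

Keeping the overrides in a dictionary separate from the text means the same corrections can be reused across prompts, and matching on whole words avoids rewriting a longer word that merely contains an overridden one.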
 
Large datasets for speech recognition 

As with speech synthesis, the recognition networks learn from examples. To train them, Yandex Cloud specialists collected a dataset of thousands of hours of audio with matching transcripts, covering short and long phrases as well as names, addresses, dates and numbers.

Uzbek Latin

The neural networks for Uzbek synthesis and recognition work with the Latin alphabet, which has been used in Uzbekistan for more than 20 years. The models had to learn how the pronunciation of certain letters varies. For example, the letter "X" denotes a hard [x] in the Uzbek alphabet, but in foreign words it can be pronounced as [ks].

"Scenarios involving artificial intelligence, and speech technology in particular, are now actively developing in Uzbekistan. One of our partners estimates that the speech technology market in Uzbekistan could reach as much as 395 million minutes of processed speech per year. The addition of a sixteenth language to Yandex SpeechKit is a big step toward creating new dialog scenarios for both Uzbek and Russian companies," said Grigory Atrepiev, head of products at Yandex Cloud.
 
The models are available on the website and can be customized using standard API tools. A number of companies, including Kolesa Group, already use Yandex SpeechKit for Uzbek synthesis and recognition. Partner companies including TargetAI, JustAI, Unistar Digital and Tomoru are also testing the technology and integrating it into their solutions.

