Picture for Yerbolat Khassanov

Yerbolat Khassanov

Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration

Add code
May 25, 2023
Figure 1 for Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration
Figure 2 for Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration
Figure 3 for Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration
Viaarxiv icon

Improving short-video speech recognition using random utterance concatenation

Add code
Oct 28, 2022
Figure 1 for Improving short-video speech recognition using random utterance concatenation
Figure 2 for Improving short-video speech recognition using random utterance concatenation
Figure 3 for Improving short-video speech recognition using random utterance concatenation
Figure 4 for Improving short-video speech recognition using random utterance concatenation
Viaarxiv icon

KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics

Add code
Jan 15, 2022
Figure 1 for KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics
Figure 2 for KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics
Figure 3 for KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics
Figure 4 for KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics
Viaarxiv icon

KazNERD: Kazakh Named Entity Recognition Dataset

Add code
Nov 26, 2021
Figure 1 for KazNERD: Kazakh Named Entity Recognition Dataset
Figure 2 for KazNERD: Kazakh Named Entity Recognition Dataset
Figure 3 for KazNERD: Kazakh Named Entity Recognition Dataset
Figure 4 for KazNERD: Kazakh Named Entity Recognition Dataset
Viaarxiv icon

A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data

Add code
Oct 23, 2021
Figure 1 for A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data
Figure 2 for A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data
Figure 3 for A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data
Figure 4 for A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data
Viaarxiv icon

A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English

Add code
Aug 03, 2021
Figure 1 for A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English
Figure 2 for A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English
Figure 3 for A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English
Viaarxiv icon

USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments

Add code
Jul 30, 2021
Figure 1 for USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments
Figure 2 for USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments
Figure 3 for USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments
Figure 4 for USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments
Viaarxiv icon

KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset

Add code
Apr 26, 2021
Figure 1 for KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Figure 2 for KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Figure 3 for KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Figure 4 for KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Viaarxiv icon

SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams

Add code
Dec 18, 2020
Figure 1 for SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams
Figure 2 for SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams
Figure 3 for SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams
Figure 4 for SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams
Viaarxiv icon

A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline

Add code
Sep 22, 2020
Figure 1 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 2 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 3 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 4 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Viaarxiv icon