GigaAM

Форк
0

8 месяцев назад
8 месяцев назад
7 месяцев назад
7 месяцев назад
8 месяцев назад
README.md

GigaAM: the family of open-source acoustic models for speech processing

plot

Table of contents

GigaAM

GigaAM (Giga Acoustic Model) is a Conformer-based wav2vec2 foundational model (around 240M parameters). We trained GigaAM on nearly 50 thousand hours of diversified speech audio in the Russian language.

Resources:

GigaAM-CTC

GigaAM-CTC is an Automatic Speech Recognition model. We fine-tuned the GigaAM Encoder with Connectionist Temporal Classification using the NeMo toolkit on publicly available Russian labeled data:

datasetsize, hoursweight
Golos12270.6
SOVA3690.2
Russian Common Voice2070.1
Russian LibriSpeech930.1

Resources:

The following table summarizes the performance of different models in terms of Word Error Rate on open Russian datasets:

modelparametersGolos CrowdGolos FarfieldOpenSTT YoutubeOpenSTT Phone callsOpenSTT AudiobooksMozilla Common VoiceRussian LibriSpeech
Whisper-large-v31.5B17.414.511.131.217.05.39.0
NeMo Conformer-RNNT120M2.67.224.033.817.02.813.5
GigaAM-CTC242M3.15.718.425.615.11.78.1

GigaAM-Emo

GigaAM-Emo is an acoustic model for Emotion Recognition. We fine-tuned the GigaAM Encoder on the Dusha dataset.

Resources:

The following table summarizes the performance of different models on the Dusha dataset:

CrowdPodcast
Unweighted AccuracyWeighted AccuracyMacro F1-scoreUnweighted AccuracyWeighted AccuracyMacro F1-score
DUSHA baseline
(MobileNetV2 + Self-Attention)
0.830.760.770.890.530.54
АБК (TIM-Net)0.840.770.780.900.500.55
GigaAM-Emo0.900.870.840.900.760.67

Описание

Foundational Model for Speech Recognition Tasks

Языки

Markdown

Сообщить о нарушении

Использование cookies

Мы используем файлы cookie в соответствии с Политикой конфиденциальности и Политикой использования cookies.

Нажимая кнопку «Принимаю», Вы даете АО «СберТех» согласие на обработку Ваших персональных данных в целях совершенствования нашего веб-сайта и Сервиса GitVerse, а также повышения удобства их использования.

Запретить использование cookies Вы можете самостоятельно в настройках Вашего браузера.