原文:語音算法閱讀之SpecAugment

論文: SpecAugment: A Simple Data Augmentation Methodfor Automatic Speech Recognition 思想: SpecAugment是一種log梅爾聲譜層面上的數據增強方法,可以將模型訓練的過擬合問題轉化為欠擬合問題,以便通過大網絡和長時訓練策略來緩解欠擬合問題,提升語音識別效果 模型: 輸入特征:log梅爾聲譜 聲譜增強:將log ...

2020-09-16 23:09 0 1603 推薦指數:

查看詳情

語音識別算法閱讀之LAS

LAS:   listen, attented and spell,Google 思想:   sequence to sequence的思想,模型分為encoder和dec ...

Mon Sep 14 00:00:00 CST 2020 0 883
語音識別算法閱讀之DFSMN

論文: Deep-FSMN for Large Vocabulary Continuous Speech Recognition 思想:   對於大詞匯量語音識別,往往需要更深的網絡結構,但是當FSMN[1]或cFSMN[2]的結構很深時容易引發剃度消失和爆炸問題 ...

Thu Sep 17 05:51:00 CST 2020 0 863
語音識別算法閱讀之EESEN

論文:   EESEN:END-TO-END SPEECH RECOGNITION USING DEEP RNN MODELS AND WFST-BASED DECODING ...

Wed Sep 16 07:06:00 CST 2020 0 770
語音識別算法閱讀之CTC

論文:   CTC:Connectionist Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural Networks 思想:   語音識別中,一般包含語音 ...

Sun Sep 13 23:36:00 CST 2020 0 752
語音識別算法閱讀之TDNN-F

論文:   A time delay neural network architecture for efficient modeling of longtemporal contexts ...

Thu Sep 17 07:21:00 CST 2020 0 1843
語音識別算法閱讀之transformer-transducer(facebook)

論文:   TRANSFORMER-TRANSDUCER:END-TO-END SPEECH RECOGNITION WITH SELF-ATTENTION 思想:   1)借助RNN-T在語音識別上的優勢,通過tranformer替換RNN-T中的RNN結構,實現 ...

Thu Sep 17 06:58:00 CST 2020 0 1449
 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM