Original article: Reading Speech Recognition Algorithms: LAS

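LAS: Listen, Attend and Spell (Google). Idea: LAS follows the sequence-to-sequence approach. The model has two parts, an encoder and a decoder: the encoder first converts an input sequence of arbitrary length into a fixed-length feature representation, which is then fed to the decoder and converted into an output sequence of arbitrary length. Compared with traditional sequence-to-sequence models, LAS introduces an attention mechanism in the decoder, letting the model ...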
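To make the attention step in the decoder concrete, here is a minimal sketch in plain Python. The teaser above is truncated and does not say which scoring function is used, so the dot-product score, the function names `softmax` and `attend`, and the toy vectors are all assumptions chosen for illustration, not the method of the original post or paper.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(decoder_state, encoder_states):
    """Dot-product attention (an assumed scoring function).

    decoder_state:  list of floats, the current decoder hidden state
    encoder_states: list of equally sized lists, one per encoder time step
    Returns (weights, context), where context is the weighted sum of
    the encoder states.
    """
    scores = [
        sum(d * h for d, h in zip(decoder_state, h_t))
        for h_t in encoder_states
    ]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    context = [
        sum(w * h_t[i] for w, h_t in zip(weights, encoder_states))
        for i in range(dim)
    ]
    return weights, context


# Toy usage: the decoder state is aligned with the first encoder step,
# so that step should receive almost all of the attention weight.
weights, context = attend([10.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
print(weights, context)
```

At each output step the decoder would recompute these weights, so different output symbols can focus on different parts of the input rather than relying on a single summary vector.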

2020-09-13 16:00


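Reading Speech Recognition Algorithms: DFSMN

Paper: Deep-FSMN for Large Vocabulary Continuous Speech Recognition. Idea: Large-vocabulary speech recognition often needs a deeper network structure, but when an FSMN[1] or cFSMN[2] is made very deep, it is prone to vanishing and exploding gradients ...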

Thu Sep 17 05:51:00 CST 2020
Reading Speech Recognition Algorithms: EESEN

Paper: EESEN: END-TO-END SPEECH RECOGNITION USING DEEP RNN MODELS AND WFST-BASED DECODING ...

Wed Sep 16 07:06:00 CST 2020
Reading Speech Recognition Algorithms: CTC

Paper: CTC: Connectionist Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural Networks. Idea: Speech recognition generally involves speech ...

Sun Sep 13 23:36:00 CST 2020
Reading Speech Recognition Algorithms: TDNN-F

Paper: A time delay neural network architecture for efficient modeling of long temporal contexts ...

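Thu Sep 17 07:21:00 CST 2020
Reading Speech Recognition Algorithms: transformer-transducer (facebook)

Paper: TRANSFORMER-TRANSDUCER: END-TO-END SPEECH RECOGNITION WITH SELF-ATTENTION. Idea: 1) Building on the strengths of RNN-T in speech recognition, replace the RNN structure in RNN-T with a transformer to achieve ...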

Thu Sep 17 06:58:00 CST 2020
 