通過google cloud API 使用 WaveNet

本文轉載自查看原文 2018-05-15 18:24 1175 TTS/ machine learning

Cloud Text-to-Speech 中使用了WaveNet，用於TTS，頁面上有Demo。目前是BETA版

使用方法

注冊及認證參考：Quickstart: Text-to-Speech
安裝google clould 的python庫
安裝 Google Cloud Text-to-Speech API Python 依賴（Dependencies），參見github說明
----其中包括了，安裝pip install google-cloud-texttospeech==0.1.0

為了implicit調用，設置環境變量GOOGLE_APPLICATION_CREDENTIALS到你的API Key（json文件），完成后重啟

python腳本：text到mp3

# [START tts_synthesize_text]
def synthesize_text(text):
    """Synthesizes speech from the input string of text."""
    from google.cloud import texttospeech
    client = texttospeech.TextToSpeechClient()

    input_text = texttospeech.types.SynthesisInput(text=text)

    # Note: the voice can also be specified by name.
    # Names of voices can be retrieved with client.list_voices().
    voice = texttospeech.types.VoiceSelectionParams(
        language_code='en-US',
        ssml_gender=texttospeech.enums.SsmlVoiceGender.FEMALE)

    audio_config = texttospeech.types.AudioConfig(
        audio_encoding=texttospeech.enums.AudioEncoding.MP3)

    response = client.synthesize_speech(input_text, voice, audio_config)

    # The response's audio_content is binary.
    with open('output.mp3', 'wb') as out:
        out.write(response.audio_content)
        print('Audio content written to file "output.mp3"')
# [END tts_synthesize_text]

WaveNet特性

目前支持的6種voice type

參數說明

https://cloud.google.com/text-to-speech/docs/reference/rest/v1beta1/text/synthesize#audioconfig

input_text

voice

audio_config

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 使用 PUTTY 操作 Google Cloud Google Maps API的使用 Google 字體API的基本使用 Google Map API 使用總結如何使用google地圖的api（整理）國內使用Google Maps JavaScript API 國內“Google 地圖 API”使用教程 2.5.4、Google Analytics高級應用——API的使用 Google 地圖 API V3 使用入門 Android Google Map API使用的八個步驟