Automatic microphone recording in Java

Reposted article · 2018-02-27 16:54 · Tags: recording / speech recognition

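I have recently been working on speech recognition using Baidu's SDK. The SDK only covers the recognition part, but I also need to save the audio to a file, and I want that file to be generated automatically whenever sound reaches the microphone.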

The code first:

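import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.File;
import java.io.IOException;

import javax.sound.sampled.AudioFileFormat;
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioInputStream;
import javax.sound.sampled.AudioSystem;
import javax.sound.sampled.DataLine;
import javax.sound.sampled.TargetDataLine;

public class EngineeCore {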

    String filePath = "E:\\voice\\voice_cache.wav";

    AudioFormat audioFormat;
    TargetDataLine targetDataLine;
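    // Checked by the capture thread; volatile so that a change made on another thread is visible
    volatile boolean flag = true;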


    private void stopRecognize() {
        flag = false;
        targetDataLine.stop();
        targetDataLine.close();
    }

    private AudioFormat getAudioFormat() {
        float sampleRate = 16000;
        // 8000,11025,16000,22050,44100
        int sampleSizeInBits = 16;
        // 8,16
        int channels = 1;
        // 1,2
        boolean signed = true;
        // true,false
        boolean bigEndian = false;
        // true,false
        return new AudioFormat(sampleRate, sampleSizeInBits, channels, signed, bigEndian);
    }// end getAudioFormat


    private void startRecognize() {
        try {
            // Get the audio format to capture with
            audioFormat = getAudioFormat();
            DataLine.Info dataLineInfo = new DataLine.Info(TargetDataLine.class, audioFormat);
            targetDataLine = (TargetDataLine) AudioSystem.getLine(dataLineInfo);

            // Create a thread that captures the microphone
            // data and start it. The thread runs until a
            // stretch of silence is detected or flag is
            // set to false. This method returns right
            // after starting the thread.
            flag = true;
            new CaptureThread().start();
        } catch (Exception e) {
            e.printStackTrace();
        } // end catch
    }// end startRecognize method

    class CaptureThread extends Thread {
        public void run() {
            File audioFile = new File(filePath);

            // Threshold that a byte value must exceed to count as sound
            int weight = 2;
            // Counter of consecutive quiet chunks, used to decide when to stop
            int downSum = 0;

            ByteArrayInputStream bais = null;
            ByteArrayOutputStream baos = new ByteArrayOutputStream();
            AudioInputStream ais = null;
            try {
                targetDataLine.open(audioFormat);
                targetDataLine.start();
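                byte[] fragment = new byte[1024];

                while (flag) {
                    // read() may return fewer bytes than requested, so keep the count
                    int bytesRead = targetDataLine.read(fragment, 0, fragment.length);
                    if (bytesRead <= 0) {
                        continue;
                    }
                    byte last = fragment[bytesRead - 1];
                    // Start storing bytes once the last byte exceeds weight (sound detected);
                    // after that, every chunk is stored without checking the last byte again
                    if (Math.abs(last) > weight || baos.size() > 0) {
                        baos.write(fragment, 0, bytesRead);
                        System.out.println("first: " + fragment[0] + ", last: " + last + ", length: " + bytesRead);
                        // Check whether the speech has stopped
                        if (Math.abs(last) <= weight) {
                            downSum++;
                        } else {
                            System.out.println("reset counter");
                            downSum = 0;
                        }
                        // More than 20 quiet chunks in a row means no sound for a while (the value can be changed)
                        if (downSum > 20) {
                            System.out.println("stop recording");
                            break;
                        }
                    }
                }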

                // Wrap the recorded bytes in an audio input stream
                audioFormat = getAudioFormat();
                byte[] audioData = baos.toByteArray();
                bais = new ByteArrayInputStream(audioData);
                ais = new AudioInputStream(bais, audioFormat, audioData.length / audioFormat.getFrameSize());
                // Write the audio to the target file
                System.out.println("writing audio file");
                AudioSystem.write(ais, AudioFileFormat.Type.WAVE, audioFile);
                downSum = 0;
                stopRecognize();

            } catch (Exception e) {
                e.printStackTrace();
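            } finally {
                // Close the streams (either may still be null if an error occurred earlier)
                try {
                    if (ais != null) {
                        ais.close();
                    }
                    if (bais != null) {
                        bais.close();
                    }
                    baos.reset();
                } catch (IOException e) {
                    e.printStackTrace();
                }
            }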

        }// end run
    }// end inner class CaptureThread

Next, a test:

    public static void main(String[] args) {
        EngineeCore engineeCore = new EngineeCore();
        engineeCore.startRecognize();
    }
}// end class EngineeCore

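When a louder sound reaches the microphone, the absolute value of the first or last byte of the array read from targetDataLine becomes larger. Which position matters depends on parameters of the audio format such as bigEndian: with 16-bit little-endian samples, as used here, the last byte of the array is the high-order byte of the last sample. When the incoming volume is low, the absolute value becomes smaller.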
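To illustrate how byte order decides which byte carries the loudness, here is a minimal sketch that decodes 16-bit PCM samples and returns the largest absolute value. The class `PcmLevel` and the method `peak` are hypothetical names introduced for this example; they are not part of the code above, which only inspects a single byte.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

final class PcmLevel {
    /** Returns the largest absolute 16-bit sample value in the first len bytes of buf. */
    static int peak(byte[] buf, int len, boolean bigEndian) {
        // Ignore a trailing odd byte so only whole 16-bit samples are decoded
        ByteBuffer bb = ByteBuffer.wrap(buf, 0, len - (len % 2))
                .order(bigEndian ? ByteOrder.BIG_ENDIAN : ByteOrder.LITTLE_ENDIAN);
        int peak = 0;
        while (bb.remaining() >= 2) {
            // Widen to int before abs so that -32768 does not overflow
            peak = Math.max(peak, Math.abs((int) bb.getShort()));
        }
        return peak;
    }

    public static void main(String[] args) {
        // Sample value 1000 is 0x03E8: little-endian stores E8 03, big-endian stores 03 E8
        byte[] little = {(byte) 0xE8, 0x03};
        byte[] big = {0x03, (byte) 0xE8};
        System.out.println(peak(little, little.length, false));
        System.out.println(peak(big, big.length, true));
    }
}
```

Decoding whole samples is less sensitive to a single noisy byte than checking only the last byte of a chunk, at the cost of scanning the entire buffer.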

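At that point recording starts. The audio data read from targetDataLine is stored in a ByteArrayOutputStream. Once the volume has stayed below the threshold for a while, the program treats this as silence and ends the recording. The byte array is then taken out of the ByteArrayOutputStream, converted to audio, and saved to a local file.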
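The start and stop rule described above can be isolated from the audio API as a small state machine. The following is only a sketch: `SilenceGate`, `accept`, and `stopIndex` are hypothetical names, and the per-chunk "level" stands in for whatever loudness measure is used (the code above uses the absolute value of the last byte).

```java
final class SilenceGate {
    private final int threshold;       // level above which a chunk counts as sound
    private final int maxQuietChunks;  // quiet chunks tolerated before stopping
    private boolean recording = false;
    private int quietChunks = 0;

    SilenceGate(int threshold, int maxQuietChunks) {
        this.threshold = threshold;
        this.maxQuietChunks = maxQuietChunks;
    }

    /** Feeds the level of one chunk; returns true once recording should stop. */
    boolean accept(int level) {
        boolean loud = level > threshold;
        if (!recording) {
            recording = loud; // recording starts with the first loud chunk
            return false;
        }
        // A loud chunk resets the counter, a quiet one increments it
        quietChunks = loud ? 0 : quietChunks + 1;
        return quietChunks > maxQuietChunks;
    }

    /** Returns the index of the chunk at which recording stops, or -1 if it never stops. */
    static int stopIndex(int[] levels, int threshold, int maxQuietChunks) {
        SilenceGate gate = new SilenceGate(threshold, maxQuietChunks);
        for (int i = 0; i < levels.length; i++) {
            if (gate.accept(levels[i])) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        int[] levels = {0, 0, 50, 60, 1, 1, 1};
        System.out.println(stopIndex(levels, 2, 2));
    }
}
```

With a threshold of 2 and a limit of 20 quiet chunks, this follows the same rule as the capture loop: recording ends on the 21st consecutive quiet chunk after sound was first detected.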

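Note: the byte array read from targetDataLine cannot be passed directly to a speech recognition service such as Baidu's. It has to be converted to an audio file first; the byte array obtained by reading that audio file can then be used for recognition.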
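The conversion described in the note might look like the sketch below: raw PCM bytes are written to a temporary WAV file with `AudioSystem.write`, and the file is read back as a byte array. `WavBytes` and `toWavFileBytes` are hypothetical names, the temporary file is an assumption of this example (the code above writes to a fixed path), and whether a particular recognition SDK accepts the result is not verified here.

```java
import java.io.ByteArrayInputStream;
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;

import javax.sound.sampled.AudioFileFormat;
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioInputStream;
import javax.sound.sampled.AudioSystem;

final class WavBytes {
    /** Writes raw PCM data to a temporary WAV file and returns the bytes of that file. */
    static byte[] toWavFileBytes(byte[] pcm, AudioFormat format) throws IOException {
        File tmp = File.createTempFile("voice_cache", ".wav");
        try (AudioInputStream ais = new AudioInputStream(
                new ByteArrayInputStream(pcm), format, pcm.length / format.getFrameSize())) {
            AudioSystem.write(ais, AudioFileFormat.Type.WAVE, tmp);
            // The file now holds a WAV header followed by the PCM data
            return Files.readAllBytes(tmp.toPath());
        } finally {
            Files.deleteIfExists(tmp.toPath());
        }
    }

    public static void main(String[] args) throws IOException {
        // Same format as getAudioFormat(): 16 kHz, 16-bit, mono, signed, little-endian
        AudioFormat format = new AudioFormat(16000f, 16, 1, true, false);
        byte[] pcm = new byte[3200]; // 0.1 s of silence, a stand-in for captured data
        byte[] wav = toWavFileBytes(pcm, format);
        System.out.println(new String(wav, 0, 4, StandardCharsets.US_ASCII));
        System.out.println(wav.length > pcm.length);
    }
}
```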
