Google Cloud Speech To Text giving 0 results

【Question title】: Google Cloud Speech To Text Giving 0 result
【Posted】: 2019-05-22 06:47:30
【Question】:

I am using the Google Cloud Speech-to-Text API from Java.

When I call speechClient.recognize, I get 0 results.

pom.xml:

<dependency>
    <groupId>com.google.cloud</groupId>
    <artifactId>google-cloud-speech</artifactId>
    <version>0.80.0-beta</version>
</dependency>

Java code:

import java.io.FileInputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.List;
import com.google.api.gax.core.FixedCredentialsProvider;
import com.google.auth.oauth2.GoogleCredentials;
import com.google.cloud.speech.v1.RecognitionAudio;
import com.google.cloud.speech.v1.RecognitionConfig;
import com.google.cloud.speech.v1.RecognitionConfig.AudioEncoding;
import com.google.cloud.speech.v1.RecognizeResponse;
import com.google.cloud.speech.v1.SpeechClient;
import com.google.cloud.speech.v1.SpeechRecognitionAlternative;
import com.google.cloud.speech.v1.SpeechRecognitionResult;
import com.google.cloud.speech.v1.SpeechSettings;
import com.google.protobuf.ByteString;

public class SpeechToText {

    public static void main(String[] args) {

        // Instantiates a client
        try {

            String jsonFilePath = System.getProperty("user.dir") + "/serviceaccount.json";
            FileInputStream credentialsStream = new FileInputStream(jsonFilePath);
            GoogleCredentials credentials = GoogleCredentials.fromStream(credentialsStream);
            FixedCredentialsProvider credentialsProvider = FixedCredentialsProvider.create(credentials);

            SpeechSettings speechSettings = 
                    SpeechSettings.newBuilder()
                        .setCredentialsProvider(credentialsProvider)
                        .build();       

            SpeechClient speechClient = SpeechClient.create(speechSettings);

            //SpeechClient speechClient = SpeechClient.create();

            // The path to the audio file to transcribe         
            String fileName = System.getProperty("user.dir") + "/call-recording-790.opus";

            // Reads the audio file into memory
            Path path = Paths.get(fileName);
            byte[] data = Files.readAllBytes(path);
            ByteString audioBytes = ByteString.copyFrom(data);

            System.out.println(path.toAbsolutePath());

            // Builds the sync recognize request
            RecognitionConfig config = RecognitionConfig.newBuilder().setEncoding(AudioEncoding.LINEAR16)
                    .setSampleRateHertz(8000).setLanguageCode("en-US").build();

            RecognitionAudio audio = RecognitionAudio.newBuilder().setContent(audioBytes).build();

            System.out.println("recognize builder");

            // Performs speech recognition on the audio file
            RecognizeResponse response = speechClient.recognize(config, audio);
            List<SpeechRecognitionResult> results = response.getResultsList();

            System.out.println(results.size()); // ***** HERE 0

            for (SpeechRecognitionResult result : results) {

                // There can be several alternative transcripts for a given chunk of speech.
                // Just use the
                // first (most likely) one here.
                SpeechRecognitionAlternative alternative = result.getAlternativesList().get(0);
                System.out.printf("Transcription: %s%n", alternative.getTranscript());
            }
        } catch (Exception e) {
            System.out.println(e);
        }
    }
}

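In the code above, results.size() is 0. When I upload the same opus file to the demo at https://cloud.google.com/speech-to-text/, it returns the transcript correctly.

So why does the recognize call return zero results?

【Comments】:

Tags: java google-cloud-platform google-speech-api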

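【Answer 1】:

There are three likely reasons for Speech-to-Text to return an empty response:

The audio is not clear.
The audio is not intelligible.
The audio is not using the proper encoding.

As far as I can tell, reason 3 is the most likely cause of your problem: the code declares AudioEncoding.LINEAR16, while the file being sent is call-recording-790.opus. To fix this, see this page for how to verify the encoding of your audio file. The encoding must match the parameters you send in the InitialRecognizeRequest (in the synchronous call used here, the RecognitionConfig passed to speechClient.recognize).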
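
One way to check what the file really contains, before choosing an encoding, is to look at its first few bytes. The sketch below is an illustration only and is not part of the original answer: the class name AudioFormatSniffer and its helper methods are hypothetical, it uses only the Java standard library, and it recognizes just three well-known signatures (Ogg pages begin with "OggS" and an Opus stream carries "OpusHead" in its first page, WAV files begin with "RIFF" followed by "WAVE" at offset 8, and FLAC files begin with "fLaC"). The returned strings are the names of the matching AudioEncoding constants. Mapping WAV to LINEAR16 is an assumption, since a WAV container can also hold other codecs.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper: guesses the audio container from its leading bytes.
class AudioFormatSniffer {

    // Returns the name of the AudioEncoding constant the header suggests,
    // or "UNKNOWN" when none of the known signatures is present.
    public static String guessEncoding(byte[] header) {
        if (startsWith(header, 0, "OggS")) {
            // Ogg container: an Opus stream has "OpusHead" in its first page.
            return indexOf(header, "OpusHead") >= 0 ? "OGG_OPUS" : "UNKNOWN";
        }
        if (startsWith(header, 0, "RIFF") && startsWith(header, 8, "WAVE")) {
            // Assumption: 16-bit PCM. A WAV file may hold other codecs too.
            return "LINEAR16";
        }
        if (startsWith(header, 0, "fLaC")) {
            return "FLAC";
        }
        return "UNKNOWN";
    }

    // True if the ASCII bytes of magic appear in data at the given offset.
    public static boolean startsWith(byte[] data, int offset, String magic) {
        byte[] m = magic.getBytes(StandardCharsets.US_ASCII);
        if (offset < 0 || data.length < offset + m.length) {
            return false;
        }
        for (int i = 0; i < m.length; i++) {
            if (data[offset + i] != m[i]) {
                return false;
            }
        }
        return true;
    }

    // Index of the first occurrence of magic in data, or -1 if absent.
    public static int indexOf(byte[] data, String magic) {
        for (int i = 0; i < data.length; i++) {
            if (startsWith(data, i, magic)) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) throws IOException {
        // Defaults to the file name used in the question.
        String fileName = args.length > 0
                ? args[0]
                : System.getProperty("user.dir") + "/call-recording-790.opus";
        byte[] all = Files.readAllBytes(Paths.get(fileName));
        // Only the first 64 bytes are needed for these signatures.
        byte[] header = Arrays.copyOf(all, Math.min(all.length, 64));
        System.out.println(guessEncoding(header));
    }
}
```

If this reports OGG_OPUS for the recording, the setEncoding(AudioEncoding.LINEAR16) call in the question would need to become setEncoding(AudioEncoding.OGG_OPUS), and the sample rate passed to setSampleRateHertz should match the rate the file was actually recorded at.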

【Comments】: