Answer edit history

Revision 2: Content added (by test)
CHANGED
@@ -41,3 +41,353 @@
If you run into anything you don't understand along the way, just ask. I think I can offer some advice.
---

Addendum: how to set up the Azure Cognitive Speech SDK
This assumes you have already created a Cognitive Services resource in your own Azure account.

See the Qiita article linked above for how to create one.
1. Clone Microsoft's official Unity sample project (`git clone https://github.com/Azure-Samples/cognitive-services-speech-sdk.git`):

[https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/quickstart/csharp/unity/from-microphone](https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/quickstart/csharp/unity/from-microphone)

(Alternatively, download the repository below as a zip and use quickstart/csharp/unity/from-microphone.)

[https://github.com/Azure-Samples/cognitive-services-speech-sdk](https://github.com/Azure-Samples/cognitive-services-speech-sdk)
2. Download `Microsoft.CognitiveServices.Speech.xxx.unitypackage` and import it: [https://aka.ms/csspeech/unitypackage](https://aka.ms/csspeech/unitypackage)
3. Replace Assets/Scripts/`HelloWorld.cs` with the contents below.

Changes from the sample: the subscription key and region were written directly in the script, so they are now exposed as [SerializeField] fields that can be set in the Unity Editor. Japanese recognition was also enabled. (The code's own comments point to StartContinuousRecognitionAsync() for long-running recognition; a sketch of that variant is included after step 5.)
```C#
//
// Copyright (c) Microsoft. All rights reserved.
// Licensed under the MIT license. See LICENSE.md file in the project root for full license information.
//
// <code>
using UnityEngine;
using UnityEngine.UI;
using Microsoft.CognitiveServices.Speech;
#if PLATFORM_ANDROID
using UnityEngine.Android;
#endif
#if PLATFORM_IOS
using UnityEngine.iOS;
using System.Collections;
#endif

public class HelloWorld : MonoBehaviour
{
    // Hook up the two properties below with a Text and Button object in your UI.
    public Text outputText;
    public Button startRecoButton;

    private object threadLocker = new object();
    private bool waitingForReco;
    private string message;

    private bool micPermissionGranted = false;

#if PLATFORM_ANDROID || PLATFORM_IOS
    // Required to manifest microphone permission, cf.
    // https://docs.unity3d.com/Manual/android-manifest.html
    private Microphone mic;
#endif

    [SerializeField] string subscriptionKey = "xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx";
    [SerializeField] string serviceRegion = "japaneast";

    public async void ButtonClick()
    {
        // Creates an instance of a speech config with specified subscription key and service region.
        // Replace with your own subscription key and service region (e.g., "westus").
        var config = SpeechConfig.FromSubscription(subscriptionKey, serviceRegion);
        var lang = SourceLanguageConfig.FromLanguage("ja-JP");

        // Make sure to dispose the recognizer after use!
        using (var recognizer = new SpeechRecognizer(config, lang))
        {
            lock (threadLocker)
            {
                waitingForReco = true;
            }

            // Starts speech recognition, and returns after a single utterance is recognized. The end of a
            // single utterance is determined by listening for silence at the end or until a maximum of 15
            // seconds of audio is processed. The task returns the recognition text as result.
            // Note: Since RecognizeOnceAsync() returns only a single utterance, it is suitable only for single
            // shot recognition like command or query.
            // For long-running multi-utterance recognition, use StartContinuousRecognitionAsync() instead.
            var result = await recognizer.RecognizeOnceAsync().ConfigureAwait(false);

            // Checks result.
            string newMessage = string.Empty;
            if (result.Reason == ResultReason.RecognizedSpeech)
            {
                newMessage = result.Text;
            }
            else if (result.Reason == ResultReason.NoMatch)
            {
                newMessage = "NOMATCH: Speech could not be recognized.";
            }
            else if (result.Reason == ResultReason.Canceled)
            {
                var cancellation = CancellationDetails.FromResult(result);
                newMessage = $"CANCELED: Reason={cancellation.Reason} ErrorDetails={cancellation.ErrorDetails}";
            }

            lock (threadLocker)
            {
                message = newMessage;
                waitingForReco = false;
            }
        }
    }

    void Start()
    {
        if (outputText == null)
        {
            UnityEngine.Debug.LogError("outputText property is null! Assign a UI Text element to it.");
        }
        else if (startRecoButton == null)
        {
            message = "startRecoButton property is null! Assign a UI Button to it.";
            UnityEngine.Debug.LogError(message);
        }
        else
        {
            // Continue with normal initialization, Text and Button objects are present.
#if PLATFORM_ANDROID
            // Request to use the microphone, cf.
            // https://docs.unity3d.com/Manual/android-RequestingPermissions.html
            message = "Waiting for mic permission";
            if (!Permission.HasUserAuthorizedPermission(Permission.Microphone))
            {
                Permission.RequestUserPermission(Permission.Microphone);
            }
#elif PLATFORM_IOS
            if (!Application.HasUserAuthorization(UserAuthorization.Microphone))
            {
                Application.RequestUserAuthorization(UserAuthorization.Microphone);
            }
#else
            micPermissionGranted = true;
            message = "Click button to recognize speech";
#endif
            startRecoButton.onClick.AddListener(ButtonClick);
        }
    }

    void Update()
    {
#if PLATFORM_ANDROID
        if (!micPermissionGranted && Permission.HasUserAuthorizedPermission(Permission.Microphone))
        {
            micPermissionGranted = true;
            message = "Click button to recognize speech";
        }
#elif PLATFORM_IOS
        if (!micPermissionGranted && Application.HasUserAuthorization(UserAuthorization.Microphone))
        {
            micPermissionGranted = true;
            message = "Click button to recognize speech";
        }
#endif

        lock (threadLocker)
        {
            if (startRecoButton != null)
            {
                startRecoButton.interactable = !waitingForReco && micPermissionGranted;
            }
            if (outputText != null)
            {
                outputText.text = message;
            }
        }
    }
}
// </code>
```
4. From the resource you created, paste `キー1` (Key 1) into `subscriptionKey`. The `場所` (Location) field corresponds to `serviceRegion`, so change it if it is not japaneast.
![Image description](2d0868f322cff3abb9cc4176eb76a664.png)
5. In Build Settings, change the Target Platform to Android and build, then check it on an actual device. (Windows or iOS works too.)
![Image description](f42c63ce85e8268f8bf7115cf7062f29.jpeg)
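
As mentioned at step 3: the comments in HelloWorld.cs note that RecognizeOnceAsync() captures only a single utterance and point to StartContinuousRecognitionAsync() for long-running recognition. Below is a minimal sketch of that variant, reusing the key, region, and ja-JP settings from the script above; the class name, the 30-second duration, and the Console output are placeholder assumptions, not part of the sample.

```C#
using System.Threading.Tasks;
using Microsoft.CognitiveServices.Speech;

// Hypothetical sketch: continuous (multi-utterance) recognition.
public static class ContinuousRecognitionSketch
{
    public static async Task RunAsync(string subscriptionKey, string serviceRegion)
    {
        var config = SpeechConfig.FromSubscription(subscriptionKey, serviceRegion);
        var lang = SourceLanguageConfig.FromLanguage("ja-JP");

        using (var recognizer = new SpeechRecognizer(config, lang))
        {
            // Fires once per completed utterance instead of returning a single result.
            recognizer.Recognized += (s, e) =>
            {
                if (e.Result.Reason == ResultReason.RecognizedSpeech)
                {
                    System.Console.WriteLine($"RECOGNIZED: {e.Result.Text}");
                }
            };

            recognizer.Canceled += (s, e) =>
            {
                System.Console.WriteLine($"CANCELED: Reason={e.Reason}");
            };

            await recognizer.StartContinuousRecognitionAsync().ConfigureAwait(false);

            // Keep listening for 30 seconds (an arbitrary placeholder), then stop.
            await Task.Delay(30000).ConfigureAwait(false);
            await recognizer.StopContinuousRecognitionAsync().ConfigureAwait(false);
        }
    }
}
```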

Revision 1: Formatting fix (by test)
CHANGED
@@ -1,4 +1,4 @@
->Is the flow: use a speech recognition API to turn the speech into text, match it inside a switch statement, and change the animation via the code in the matching case?
+> Is the flow: use a speech recognition API to turn the speech into text, match it inside a switch statement, and change the animation via the code in the matching case?

@@ -8,7 +8,7 @@

->I don't understand "how to use the API in the first place to achieve the processing above"
+> I don't understand "how to use the API in the first place to achieve the processing above"
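
For reference, the flow quoted above (speech to text, then a switch statement whose matching case changes the animation) follows naturally once HelloWorld.cs produces `result.Text`. Here is a minimal hypothetical sketch; the SpeechToAnimation class, the Animator trigger names, and the phrase list are all illustrative assumptions. It mirrors the sample's lock-and-Update() pattern because ConfigureAwait(false) may leave Unity's main thread, and Animator must only be touched on the main thread.

```C#
using UnityEngine;

// Hypothetical sketch: map recognized text to Animator triggers.
// The text is handed over from the recognition callback, then applied
// in Update() on Unity's main thread.
public class SpeechToAnimation : MonoBehaviour
{
    [SerializeField] Animator animator; // assign the character's Animator in the Editor

    private readonly object threadLocker = new object();
    private string pendingText;

    // Call this with result.Text after a successful recognition.
    public void OnRecognized(string text)
    {
        lock (threadLocker)
        {
            pendingText = text;
        }
    }

    void Update()
    {
        string text;
        lock (threadLocker)
        {
            text = pendingText;
            pendingText = null;
        }
        if (text == null) return;

        // ja-JP results usually include punctuation, so strip it before matching.
        var normalized = text.Replace("。", "").Replace("、", "");

        switch (normalized)
        {
            case "こんにちは": // "hello"
                animator.SetTrigger("Wave"); // hypothetical trigger parameter
                break;
            case "ジャンプ": // "jump"
                animator.SetTrigger("Jump"); // hypothetical trigger parameter
                break;
            default:
                Debug.Log($"No animation mapped for: {text}");
                break;
        }
    }
}
```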