Question edit history

4
Added a note that I am also asking about this on another site.

````diff
@@ -158,7 +158,9 @@
 
 I have compressed this into model.tar.gz and placed it on S3.
 
-
+As this is an urgent matter, I am also asking about it on another site.
+I will share any progress here as well.
+https://ja.stackoverflow.com/questions/91142/sagemaker%e7%92%b0%e5%a2%83%e3%81%ab%e3%81%a6%e3%83%87%e3%83%97%e3%83%ad%e3%82%a4%e3%81%ab%e5%a4%b1%e6%95%97%e3%81%99%e3%82%8b
 
 If anyone knows how to resolve this, I would appreciate your advice.
 
````
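For context on the compression step mentioned above, here is a minimal sketch of building and uploading such an archive. The bucket name and object key are placeholders (assumptions), not values from the question.

```python
# Minimal sketch: package the model directory into model.tar.gz and upload it
# to S3. "my-example-bucket" and the key "models/model.tar.gz" are placeholders.
import tarfile

import boto3

with tarfile.open("model.tar.gz", "w:gz") as tar:
    # arcname="." keeps config.yaml, the .pth files, etc. at the archive root,
    # rather than nesting them under a "model/" directory inside the tarball.
    tar.add("model/", arcname=".")

boto3.resource("s3").Bucket("my-example-bucket").upload_file(
    "model.tar.gz", "models/model.tar.gz"
)
```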
3
Added the entry_point code.

````diff
@@ -26,6 +26,131 @@
 predictor = pytorch_model.deploy(instance_type='ml.t2.2xlarge', initial_instance_count=1)
 ```
 
+※ The inference code passed as entry_point:
+```inference.py
+import os
+import time
+
+import boto3
+import numpy as np
+import pyopenjtalk
+import torch
+from espnet2.bin.tts_inference import Text2Speech
+from espnet2.tasks.tts import TTSTask
+from espnet2.text.token_id_converter import TokenIDConverter
+
+import text_processing as texp
+
+prosodic = True
+
+model_dir = "model/"
+vocoder_dir = "vocoder/"
+CONTENT_TYPE = "text/plain"
+
+train_config = "model/config.yaml"
+model_file = "model/50epoch.pth"
+
+# Specify the vocoder.
+vocoder_tag = "parallel_wavegan/jsut_hifigan.v1"
+vocoder_config = "vocoder/config.yaml"
+vocoder_file = "vocoder/50epoch.pth"
+
+
+def model_fn(model_dir):
+    # Load the trained ESPnet2 TTS model from the extracted model archive.
+    print(model_dir + "config.yaml")
+    print(model_dir + "100epoch.pth")
+    model = Text2Speech.from_pretrained(
+        train_config=model_dir + "config.yaml",
+        model_file=model_dir + "100epoch.pth",
+        vocoder_tag=vocoder_tag,
+        device="cpu",
+        speed_control_alpha=1.0,
+        noise_scale=0.333,
+        noise_scale_dur=0.333,
+    )
+    return model
+
+
+def input_fn(request_body, content_type=CONTENT_TYPE):
+    # The request body is ignored for now; a fixed test string is used instead.
+    input_data = "あいうえお"
+    return input_data
+
+
+def predict_fn(input_data, model):
+    x = "デモテキスト"
+
+    token_id_converter = TokenIDConverter(
+        token_list=model.train_args.token_list,
+        unk_symbol="<unk>",
+    )
+
+    # Convert the input text to token IDs when prosodic symbols are used.
+    text = x
+    if prosodic:
+        tokens = texp.a2p(x)
+        text_ints = token_id_converter.tokens2ids(tokens)
+        text = np.array(text_ints)
+    else:
+        print("\nResult of pyopenjtalk_accent_with_pause analysis:")
+        print(texp.text2yomi(x), "\n")
+
+    # Synthesis.
+    with torch.no_grad():
+        start = time.time()
+        data = model(text)
+        wav = data["wav"]
+    rtf = (time.time() - start) / (len(wav) / model.fs)
+    print(f"RTF = {rtf:5f}")
+
+    if not os.path.isdir("generated_wav"):
+        os.makedirs("generated_wav")
+
+    np_wav = wav.view(-1).cpu().numpy()
+
+    # Write the generated waveform as a 16-bit WAV file.
+    fs = 48000
+    print("Writing output at sampling rate", fs)
+    from scipy.io.wavfile import write
+    samplerate = fs
+    amplitude = np.iinfo(np.int16).max
+    data = amplitude * np_wav / np.max(np.abs(np_wav))
+    write("espnet/egs2/jsut/tts1/generated_wav/" + x + ".wav",
+          samplerate, data.astype(np.int16))
+
+    # Upload the generated file to the S3 bucket.
+    s3 = boto3.resource('s3')
+    bucket = s3.Bucket('alterly-source')
+    bucket.upload_file("espnet/egs2/jsut/tts1/generated_wav/" + x + ".wav",
+                       "source/" + x + ".wav")
+
+
+# Local smoke test of the three handler functions.
+input_object = input_fn("あいうえお", "text/plain")
+model = model_fn(model_dir)
+prediction = predict_fn(input_object, model)
+```
+
 
 ### Supplementary information
 The directory structure before compression is shown below.
````
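One thing worth checking against the structure above: when an entry_point script is supplied, the SageMaker Python SDK generally repacks model.tar.gz so that the model artifacts sit at the archive root and the inference code lives under code/. A rough sketch using the file names from the question; the layout itself is an assumption to verify against the SageMaker documentation for the framework version in use:

```
model.tar.gz
├── config.yaml       # ESPnet training config
├── 100epoch.pth      # trained weights loaded in model_fn
└── code/
    └── inference.py  # entry_point script
```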
2
Deleted a leftover part of the default question template.

````diff
@@ -27,7 +27,7 @@
 ```
 
 
-### Supplementary information
+### Supplementary information
 The directory structure before compression is shown below.
 
 
````
1
Fixed some typos.

````diff
@@ -9,9 +9,9 @@
 Please specify --force/-f option to overwrite the model archive output file.
 See -h/--help for more details./.sagemaker/mms/models/model
 ERROR - %s already exists.
+
+※ The deployment code follows.
 ```
-###Deployment code
-```enter the language here
 from sagemaker import get_execution_role
 from sagemaker.pytorch.model import PyTorchModel
 
````
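At this point in the history the question shows only the imports of the deploy section; for orientation, a deploy call with the SageMaker Python SDK generally looks like the sketch below. The model_data URI, framework_version, and py_version are assumptions, not values confirmed by the question; only the instance type appears in the question itself.

```python
# Minimal sketch of the deploy step, assuming the archive layout sketched
# earlier. model_data, framework_version, and py_version are placeholders.
from sagemaker import get_execution_role
from sagemaker.pytorch.model import PyTorchModel

pytorch_model = PyTorchModel(
    model_data="s3://my-example-bucket/models/model.tar.gz",  # placeholder URI
    role=get_execution_role(),
    entry_point="inference.py",
    framework_version="1.8.1",  # assumed; should match the training setup
    py_version="py3",
)
predictor = pytorch_model.deploy(
    instance_type="ml.t2.2xlarge",  # from the question
    initial_instance_count=1,
)
```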