transformersのTFBertForQuestionAnsweringの出力が何を意味するか

transformers:TFBertForQuestionAnswering
上記サイトのTFBertForQuestionAnsweringのExampleについて質問です。

Python
1import tensorflow as tf
2from transformers import BertTokenizer, TFBertForQuestionAnswering
3
4tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
5model = TFBertForQuestionAnswering.from_pretrained('bert-base-uncased')
6input_ids = tf.constant(tokenizer.encode("Hello, my dog is cute", add_special_tokens=True))[None, :]  # Batch size 1
7outputs = model(input_ids)
8start_scores, end_scores = outputs[:2]

このコードにおいて、start_scores,end_scoresは何を意味しているのですか？

行動規範の内容に同意します

回答1件

ベストアンサー

ドキュメントにあるように、TFBertForQuestionAnsweringは入力したテキストの中から応答にあたる箇所を抽出します。

Bert Model with a span classification head on top for extractive question-answering tasks like SQuAD (a linear layers on top of the hidden-states output to compute span start logits and span end logits).

よって、start_scoresとend_scoresは抽出箇所の始点と終点のlogitsです。つまり、start_scoresとend_scoresは入力テキストと同じ長さの値をもち、それらの値はテキスト内の各tokenが始点もしくは終点である確率 (正確にはlogits) です。start_scoresの中で最も大きい値のインデックス番号から、end_scoresの中で最も大きい値のインデックス番号まで入力テキストから抜き出してquestion-answeringのタスクを実行します。

投稿2020/02/23 16:03