Python Scikit-learn RandomForestでラベル別の精度を出力したい

PythonのRandomForestで複数ラベルの分類を行なっています。
分類器の精度はF値を用いて測っているのですが、ラベルごとの分類精度を出す手法があれば教えていただきたいです。
末尾は分類及び精度出しに用いているコードです。

現状考えている方法としては、予測値(pred)を出した後、
①　test_Yとpredを合体
②　test_Yのラベル別のデータを新規のdataframeに格納
③　ラベル別に分けたdataframeのtest_Yにあたる列とpredにあたる列でf1_scoreを実行
④　ラベル別の精度が得られる
というものです。
ですが、交差検証を行っている為データ数が膨大で、以上の内容を簡易に出来るモジュールなどがありましたら教えていただきたいです。
よろしくお願いいたします。

Python
1from sklearn.ensemble import RandomForestClassifier
2clf = RandomForestClassifier(random_state=0)
3clf = clf.fit(train_X, train_y)
4pred = clf.predict(test_X)
5
6from sklearn.metrics import f1_score
7f1_score (pred,test_y,average='macro')

行動規範の内容に同意します

回答1件

ベストアンサー

classification_reportを用いてはいかがでしょうか。

sklearn.metrics.classification_report — scikit-learn 0.21.3 documentation

シンプルな使用例。

python
1ort pandas as pd
2from sklearn.datasets import load_iris
3from sklearn.svm import SVC
4from sklearn.model_selection import train_test_split
5from sklearn.metrics import classification_report
6
7iris = load_iris()
8clf = SVC(gamma="scale")
9
10X_train, X_test, y_train, y_test = train_test_split(iris.data, iris.target, test_size=0.4, stratify=iris.target, random_state=0)
11
12clf.fit(X_train, y_train)
13pred = clf.predict(X_test)
14
15print(classification_report(y_test, pred, target_names=iris.target_names))
16""" =>
17              precision    recall  f1-score   support
18
19      setosa       1.00      1.00      1.00        20
20  versicolor       1.00      0.95      0.97        20
21   virginica       0.95      1.00      0.98        20
22
23    accuracy                           0.98        60
24   macro avg       0.98      0.98      0.98        60
25weighted avg       0.98      0.98      0.98        60
26
27"""
28
29report_dict = classification_report(y_test, pred, target_names=iris.target_names, output_dict=True)
30print(report_dict)
31""" =>
32{'virginica': {'precision': 0.9523809523809523, 'recall': 1.0, 'support': 20, 'f1-score': 0.975609756097561}, 'macro avg': {'precision': 0.9841269841269842, 'recall': 0.9833333333333334, 'support': 60, 'f1-score': 0.9833229101521784}, 'versicolor': {'precision': 1.0, 'recall': 0.95, 'support': 20, 'f1-score': 0.9743589743589743}, 'setosa': {'precision': 1.0, 'recall': 1.0, 'support': 20, 'f1-score': 1.0}, 'accuracy': 0.9833333333333333, 'weighted avg': {'precision': 0.9841269841269842, 'recall': 0.9833333333333333, 'support': 60, 'f1-score': 0.9833229101521785}}
33"""
34
35report_df = pd.DataFrame(report_dict)
36print(report_df)
37""" =>
38           accuracy  macro avg  setosa  versicolor  virginica  weighted avg
39f1-score   0.983333   0.983323     1.0    0.974359   0.975610      0.983323
40precision  0.983333   0.984127     1.0    1.000000   0.952381      0.984127
41recall     0.983333   0.983333     1.0    0.950000   1.000000      0.983333
42support    0.983333  60.000000    20.0   20.000000  20.000000     60.000000
43"""