回答編集履歴

2020/04/15 04:45

投稿

スコア21960

answer CHANGED Viewed

@@ -58,4 +58,8 @@
      y     train      test
 2  1.0  0.750623  0.249377 ← ほぼ 75:25 の割合になってる
 1  1.0  0.747475  0.252525 ← ほぼ 75:25 の割合になってる
-```
+```
+----
+不明点があれば追記しますので、コメントしてください

あ

2020/04/15 04:45

投稿

スコア21960

answer CHANGED Viewed

@@ -22,7 +22,7 @@
 # 層化抽出なし
 #############################
-train, test = train_test_split(y)
+train, test = train_test_split(y)  # 75]25 の割合で分割 (デフォルト)
 print("層化抽出なし")
 df = pd.DataFrame({"y": pd.value_counts(y), "train": pd.value_counts(train), "test": pd.value_counts(test)})
 print(df)
@@ -32,7 +32,7 @@
 # 層化抽出あり
 #############################
-train, test = train_test_split(y, stratify=y)
+train, test = train_test_split(y, stratify=y)  # 75]25 の割合で分割 (デフォルト)
 print("層化抽出あり")
 df = pd.DataFrame({"y": pd.value_counts(y), "train": pd.value_counts(train), "test": pd.value_counts(test)})
@@ -49,13 +49,13 @@
 2  401    307    94
 1   99     68    31
      y     train      test
-2  1.0  0.765586  0.234414
+2  1.0  0.765586  0.234414 ← ほぼ 75:25 の割合になってる
-1  1.0  0.686869  0.313131
+1  1.0  0.686869  0.313131 ← 75:25 の割合になってない
 層化抽出あり
      y  train  test
 2  401    301   100
 1   99     74    25
      y     train      test
-2  1.0  0.750623  0.249377
+2  1.0  0.750623  0.249377 ← ほぼ 75:25 の割合になってる
-1  1.0  0.747475  0.252525
+1  1.0  0.747475  0.252525 ← ほぼ 75:25 の割合になってる
 ```