Python 3.x question: I don't really understand what this error means

Question edit history

1

Updated the relevant code

2023/08/25 03:44

Posted

Score: 1

test CHANGED

File without changes

test CHANGED

@@ -22,18 +22,6 @@
 ### Relevant source code
 ```python
-dataset = datasets.load(dataset_name='statsbomb', match_id=match_id)
-```
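-### What I tried
-I looked at loadin.py, but there was no StatsBombInputs class. I also checked kloppy's GitHub repository but could not find it there.
-### Additional information (framework/tool versions, etc.)
-Please enter more detailed information here.
-For reference, here is the rest of the code as well.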
 !pip install statsbomb
 !pip install kloppy==3.12.0
@@ -49,5 +37,60 @@
 BASE_URL = 'https://raw.githubusercontent.com/statsbomb/open-data/master/data'
+comps_df = sb.Competitions().get_dataframe()
+def get_matches_df(competition_id, season_id=None, comps_df=None):
+    """Return match information.
+        Args:
+            - competition_id(int) : competition id
+            - season_id(int, default=None) : season id
+            - comps_df(pd.DataFrame, default=None) : competition information
+        Returns:
+            pd.DataFrame : match information
+    """
+    # If season_id is given, download that season's JSON data and convert it to a DataFrame.
+    # Otherwise, loop over the season_ids found in the competition information, downloading and converting each one.
+    if season_id:
+        matches_df = pd.DataFrame(requests.get(f'{BASE_URL}/matches/{competition_id}/{season_id}.json').json())
+    else:
+        # List comprehension: one DataFrame per season, concatenated into one
+        matches_df = pd.concat([pd.DataFrame(requests.get(f'{BASE_URL}/matches/{competition_id}/{season_id}.json').json()) for season_id in comps_df[comps_df.competition_id==competition_id].season_id.tolist()])
+    # At this point some cells of matches_df hold dicts, which is hard to analyze, so split them into separate columns.
+    c_list = ['competition', 'season', 'home_team', 'away_team', 'stadium', 'competition_stage']
+    if competition_id == 53:
+        c_list.remove('stadium')
+    for c in c_list:
+        if c in ['stadium', 'competition_stage']:
+            key_list = ['id', 'name']
+            c_fixed_list = [f'{c}_{k}' for k in key_list]
+        else:
+            key_list = [f'{c}_{k}' for k in ['id', 'name']]
+            c_fixed_list = key_list
+        for k, c_fixed in zip(key_list, c_fixed_list):
+            matches_df[c_fixed] = matches_df[c].apply(lambda x: x[k] if type(x)==dict else None)
+    # Keep only the required columns as the final result.
+    matches_df = matches_df.drop(c_list+['metadata','referee'], axis=1).sort_values('match_date').reset_index(drop=True)
+    return matches_df
+competition_id = 53
+season_id = 106
+get_matches_df(competition_id=competition_id, season_id=season_id, comps_df=comps_df)
 match_id = 3835319
+dataset = datasets.load(dataset_name='statsbomb', match_id=match_id)
+```
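+### What I tried
+I looked at loadin.py, but there was no StatsBombInputs class. I also checked kloppy's GitHub repository but could not find it there.
+### Additional information (framework/tool versions, etc.)
+Please enter more detailed information here.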
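
The column-flattening step inside `get_matches_df` can be tried on its own, without downloading anything. The following is only a minimal sketch: the sample rows are made up (not real StatsBomb data) and the helper name `flatten_dict_column` is hypothetical, introduced here for illustration. It copies selected keys of a dict-valued column into flat columns and leaves `None` where a cell is not a dict.

```python
import pandas as pd

# Hypothetical sample rows shaped like nested match records (not real data).
matches_df = pd.DataFrame({
    "match_id": [1, 2],
    "home_team": [
        {"home_team_id": 10, "home_team_name": "A"},
        {"home_team_id": 11, "home_team_name": "B"},
    ],
    "stadium": [{"id": 100, "name": "X"}, None],  # second match has no stadium
})


def flatten_dict_column(df, column, keys, new_names):
    """Copy selected keys of a dict-valued column into flat columns, then drop it."""
    out = df.copy()
    for key, new_name in zip(keys, new_names):
        # Bind key as a default argument so each lambda reads its own key.
        out[new_name] = out[column].apply(
            lambda cell, key=key: cell.get(key) if isinstance(cell, dict) else None
        )
    return out.drop(columns=[column])


flat_df = flatten_dict_column(
    matches_df, "home_team",
    keys=["home_team_id", "home_team_name"],
    new_names=["home_team_id", "home_team_name"],
)
flat_df = flatten_dict_column(
    flat_df, "stadium",
    keys=["id", "name"],
    new_names=["stadium_id", "stadium_name"],
)
print(flat_df)
```

Using `isinstance(cell, dict)` rather than `type(cell) == dict` also accepts dict subclasses, and the missing stadium in the second row ends up as a missing value instead of raising an error.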