I want to resolve a Python TypeError

Question

### Background

I am trying to analyze the tweets of Progress Bar 2023 (@ProgressBar202_) on Twitter. I collected the tweet information in the following format.

```Python
tweets = [
    "Progress Bar 2023 @ProgressBar202_ · 2023年1月1日 2023 is 1% complete. 1,865 7.7万 61.3万",
    "Progress Bar 2023 @ProgressBar202_ · 2023年1月1日 2022 is 100% complete. 902 5.9万 50.5万",
    "Progress Bar 2023 @ProgressBar202_ · 2022年12月28日 2022 is 99% complete. 1,083 4.8万 28.2万",
    "Progress Bar 2023 @ProgressBar202_ · 2022年12月25日 2022 is 98% complete. 577 2.1万 19.9万",
    ...
]
```

The full version of the data above is posted here.

### What I want to achieve

I tried to parse the collected information with a regex and turn it into a table with pandas, using the code below.

```Python
import re
import pandas as pd
table = {"year": [], "percent": [], "reply": [], "RT": [], "fab": []}

def parse(x):
    if x[-1] == "万":  # "万" means units of 10,000, e.g. "7.7万" -> 77000
        return int(float(x[:-1]) * 10000)
    elif "," in x:  # thousands separator, e.g. "1,865" -> 1865
        return int(x.replace(",",""))
    else:
        return int(x)

for tw in tweets:
    m = re.search(r"(\d+).* is (\d+)%.* (.*) (.*) (.*)", tw)
    table["year"].append(int(m[1]))
    table["percent"].append(int(m[2]))
    table["reply"].append(parse(m[3]))
    table["RT"].append(parse(m[4]))
    table["fab"].append(parse(m[5]))

df = pd.DataFrame(table)
```

I then want to visualize the result with the following code.

```Python
import matplotlib.pyplot as plt

plt.figure(figsize=(15, 10))

plt.subplot(2,1,1)
for year in [2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023]:
    data = df[df.year == year].sort_values("percent")
    plt.plot(data.percent, data.RT, ".-", label="%d"%year)
plt.legend()
plt.xlabel("%")
plt.xticks(range(0, 101, 10))
plt.title("# of RTs")

plt.subplot(2,1,2)
for year in [2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023]:
    data = df[df.year == year].sort_values("percent")
    plt.plot(data.percent, data.fab, ".-", label="%d"%year)
plt.legend()
plt.xlabel("%")
plt.xticks(range(0, 101, 10))
plt.title("# of fabs")

plt.show()
```

### Problem / error message

When I run the parsing code above, I get the following error message.

```
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
 in
     13 for tw in tweets:
     14     m = re.search(r'(\d+).* is (\d+)%.* (.*) (.*) (.*)', tw)
---> 15     table["year"].append(int(m[1]))
     16     table["percent"].append(int(m[2]))
     17     table["reply"].append(parse(m[3]))

TypeError: 'NoneType' object is not subscriptable
```

### What I tried

I checked the type of `m` and confirmed that it is `NoneType`. I also searched Stack Overflow and similar sites but could not find a solution. Any help would be appreciated.

Accepted Answer

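The question elides part of the data, but could `tweets` contain an entry that the regular expression does not match? `re.search` returns `None` when nothing matches, and indexing `None` with `m[1]` raises exactly this `TypeError`.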
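
One way to check this, as a minimal sketch: reuse the pattern from the question, test each result for `None` before indexing it, and collect the tweets that fail so they can be inspected. The helper name `split_matches` and the second sample tweet are hypothetical and made up for illustration; the entries that actually fail in the full data may look different.

```python
import re

# Same pattern as in the question.
PATTERN = re.compile(r"(\d+).* is (\d+)%.* (.*) (.*) (.*)")

def split_matches(tweets):
    """Return (matches, unmatched) so that failing tweets can be inspected."""
    matches, unmatched = [], []
    for i, tw in enumerate(tweets):
        m = PATTERN.search(tw)
        if m is None:
            # re.search found nothing: remember the index and text, do not index m
            unmatched.append((i, tw))
        else:
            matches.append(m)
    return matches, unmatched

sample = [
    # First entry is taken from the question.
    "Progress Bar 2023 @ProgressBar202_ · 2023年1月1日 2023 is 1% complete. 1,865 7.7万 61.3万",
    # Hypothetical entry with no "N is M%" text, assumed only for this demo.
    "Progress Bar 2023 @ProgressBar202_ · 2022年12月31日 Happy New Year!",
]

matches, unmatched = split_matches(sample)
for i, tw in unmatched:
    print(f"no match at index {i}: {tw!r}")
```

Running the same helper on the full `tweets` list should show which entries the pattern misses; those can then be skipped in the loop or handled with an adjusted pattern.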
