pythonのtweepyでTwitterをスクレイピングしwordcloudで表示したい

前提

サイトURL→https://python-man.club/python_get_tweet_morphological-analysis/
サイトに載っていたプログラムをそのままコピーした後
必要なものをpipでインストールしましたがエラーが多く
自分なりにプログラムを修正しても上手くいかず困っています。
python完全初心者です。よろしくお願いします。

実現したいこと

pythonのライブラリ『tweepy』を使ってTwitterをスクレイピング
した後、wordcloudを用いて使用頻度の高い単語などを表示したい。

エラーメッセージ

[('G', 2), ('123', 2), ('ゲーム', 2), ('https', 2), ('://', 2), ('t', 2), ('.', 2), ('co', 2), ('/', 2), ('年', 2), ('イメージ', 2), ('@', 1), ('ba', 1), ('0797', 1), ('naj', 1), ('残念', 1), ('はずれ', 1), ('たる', 1), ('毎日', 1), ('挑戦', 1)]

ソースコード(個人情報は文字列ffffに変更してます)

python
1import tweepy
2from datetime import datetime,timezone
3import pytz
4import pandas as pd
5import collections
6import MeCab
7import datetime
8import matplotlib.pyplot as plt  
9from wordcloud import WordCloud
10import seaborn as sns
11sns.set(font='yuminl.ttf')
12
13CONSUMER_KEY = 'ffff'
14CONSUMER_SECRET = 'ffff'
15ACCESS_TOKEN = 'ffff'
16ACCESS_SECRET = 'ffff'
17
18auth = tweepy.OAuthHandler(CONSUMER_KEY, CONSUMER_SECRET)
19auth.set_access_token(ACCESS_TOKEN, ACCESS_SECRET)
20api = tweepy.API(auth)
21
22search_results = api.search_tweets(q="ゲーム", result_type="recent",tweet_mode='extended',count=50)
23
24tw_data = []
25for tweet in search_results:
26    #tweet_dataの配列に取得したい情報を入れていく
27    tw_data.append([
28        tweet.id,
29        tweet.full_text,
30        tweet.favorite_count, 
31        tweet.retweet_count, 
32        tweet.user.id, 
33        tweet.user.screen_name,
34        tweet.user.name,
35        tweet.user.description,
36        tweet.user.friends_count,
37        tweet.user.followers_count,
38        tweet.user.following,
39        tweet.user.profile_image_url,
40        tweet.user.profile_background_image_url,
41        tweet.user.url
42                       ])
43
44#取り出したデータをpandasのDataFrameに変換
45#CSVファイルに出力するときの列の名前を定義
46labels=[
47    'ツイートID',
48    'ツイート本文',
49    'いいね数',
50    'リツイート数',
51    'ID',
52    'ユーザー名',
53    'アカウント名',
54    '自己紹介文',
55    'フォロー数',
56    'フォロワー数',
57    '自分のフォロー状況',
58    'アイコン画像URL',
59    'ヘッダー画像URL',
60    'WEBサイト'
61    ]
62
63#tw_dataのリストをpandasのDataFrameに変換
64df = pd.DataFrame(tw_data,columns=labels)
65
66df1=df.iat[2,1]
67df2=df.iat[3,1]
68tw_text=df1 + df2
69f=open('text.txt','w',encoding='UTF-8')
70f.write(str(tw_text))
71f.close
72
73#f= open("text.txt", 'r', encoding='UTF-8') 
74#text=f.read()
75#f.close()
76text=tw_text
77tagger =MeCab.Tagger()
78tagger.parse('')
79node = tagger.parseToNode(text)
80
81word_list=[]
82while node:
83    word_type = node.feature.split(',')[0]
84    if word_type in ["名詞",'代名詞']:
85        word_list.append(node.surface)
86    node=node.next
87word_chain=' '.join(word_list)
88
89c=collections.Counter(word_list)
90font_path='C:/Windows/Fonts/yuminl.ttf'
91words = ['https','t','co','自民','し','w','そう', 'ない', 'いる', 'する', 'まま', 'よう', 'てる', 'なる', 'こと', 'もう', 'いい', 'ある', 'ゆく', 'れる', 'ん', 'の']
92result = WordCloud(width=800, height=600, background_color='white', 
93                   font_path=font_path,regexp=r"[\w']+", 
94                   stopwords=words).generate(word_chain)
95result.to_file("./wordcloud_sample1.png")
96print(c.most_common(20))
97fig = plt.subplots(figsize=(8, 10))
98
99sns.set(font="Hiragino Maru Gothic Pro",context="talk",style="white")
100sns.countplot(y=word_list,order=[i[0] for i in c.most_common(20)],palette="Blues_r")

試したこと

エラーが表示されるたびにエラー行部分を修正したり削除したりしていましたが最終的に行き詰まってしまいました。

melian

2022/12/26 11:50

「ツイート時刻」と「アカウント作成日時」が足りないです。

質問をすることでしか得られない、回答やアドバイスがある。

15分調べてもわからないことは、質問しよう！

前提

実現したいこと

エラーメッセージ

ソースコード(個人情報は文字列ffffに変更してます)

試したこと

関連した質問