Python/ヤフーニューストップトピックスのテキストを抜き出したい

Python初心者です。
ヤフーのニューストップトピックスのテキストを抜き出したいと思います。
クラスの指定方法とfind_allのタグ指定方法がよくわかりません。
わかる方教えてください。

≪プログラム≫

import requests
from bs4 import BeautifulSoup

load_url = "https://www.yahoo.co.jp/"
html = requests.get(load_url)
soup = BeautifulSoup(html.content, "html.parser")

topic = soup.find(class_="2j0udhv5jERZtYzddeDwcv")
for element in topic.find_all("li"):
print(element.text)

≪エラー≫

AttributeError Traceback (most recent call last)
<ipython-input-18-9f80f06abdda> in <module>
7
8 topic = soup.find(class_="2j0udhv5jERZtYzddeDwcv")
----> 9 for element in topic.find_all("li"):
10 print(element.text)
11

AttributeError: 'NoneType' object has no attribute 'find_all'

行動規範の内容に同意します

回答2件

tagではなくclassの指定が誤っているようです。

投稿2019/12/05 02:15

john_doe_

総合スコア354

１．Pythonの基本的なコードの書き方をマスターしましょう。
topic = soup.find(class_="2j0udhv5jERZtYzddeDwcv")のあたり、意味不明。

２．得たHTMLをよく見ましょう。

Python
1import requests
2from bs4 import BeautifulSoup
3
4load_url = "https://www.yahoo.co.jp/"
5html = requests.get(load_url)
6
7with open("foo.html","wb") as f:
8  f.write(html.content)
9
10soup = BeautifulSoup(html.content, "html.parser") 
11～～～

で、foo.htmlファイルの中をよく見ましょう。

投稿2019/11/23 07:57

otn

総合スコア85893

Tochan

2019/11/23 16:07

コメントありがとうございます。初心者でコードの書き方もわからず質問してすみません。ヤフーのニューストップトピックスのテキストを抜き出したいのですが、 ↓のコードの後はどうしたらいいでしょうか？すみませんが、よろしくお願いいたします。 ------------ import requests from bs4 import BeautifulSoup load_url = "https://www.yahoo.co.jp/" html = requests.get(load_url) with open("foo.html","wb") as f: f.write(html.content) soup = BeautifulSoup(html.content, "html.parser") ------------