python：BeautifulSoup４スクレイピングによる「ValueError: not enough values to unpack」エラーの際の処理について

前提・実現したいこと

現在、SUUMOのWEBサイトから物件情報をスクレイピングしようと考えております。
以下の様にプログラムを作成いたしました。
name, price, addrress, access, size, madori, age = content.find_all("dd")
の部分でインデックスをつけ辞書に格納したいのですが、下記エラーメッセージがでてしまいます。
知識不足で恐縮ではございますが、解決方法をご教示いただけないでしょうか。

※”ｄｄ”タグを変数contentに格納し、print(content)で出力すると正常に出力されます。

発生している問題・エラーメッセージ

ValueError: not enough values to unpack (expected 7, got 0)

該当のソースコード

Python
1url = "https://suumo.jp/jj/bukken/ichiran/JJ010FJ001/?ar=090&bs=011&ta=40&jspIdFlg=patternShikugun&kb=1&kt=9999999&mb=0&mt=9999999&ekTjCd=&ekTjNm=&tj=0&cnb=0&cn=9999999&pn={}"
2
3d_list = []
4
5for i in range(1, 3):
6    print("d_listの大きさ", len(d_list))
7    
8    target_url = url.format(i)
9    
10    print(target_url)
11    
12    r = requests.get(target_url)
13
14    sleep(1)
15
16    soup = BeautifulSoup(r.text)
17
18    for contents in soup.find_all("div", class_= "property_unit-content"):
19        for content in contents.find_all("dd"):
20            name, price, addrress, access, size, madori, age = content.find_all("dd")
21
22            d = {
23            "name" : name.text,
24            "price" : price.text,
25            "addrress" : addrress.text,
26            "access" : access.text,
27            "size" : size.text,
28            "madori" : madori.text,
29            "age" : age.text,
30            }
31        
32            d_list.append(d)

試したこと

スペルの確認など

補足情報（FW/ツールのバージョンなど）

ここにより詳細な情報を記載してください。

行動規範の内容に同意します

回答2件

取得すべきデータは、contentsで得られたddのリストのテキスト部分なので、二重にループさせる必要はありません。

そして、実際のデータは、間取り(madori)と築年月(age)の間にバルコニー(の面積)のデータが入っていますから、取得する際にはそれを考慮する必要があります。

それぞれの内容のテキストを取り出し、(一部のテキストに改行が含まれているので)stripでそれを取り除き、対応するキー(nameなど)と結びつけて辞書にし、リストに辞書を登録するコードは以下のように書けます。

Python
1    keys = ["name", "price", "addrress", "access", "size", "madori", "balcony", "age"]
2    for contents in soup.find_all("div", class_="property_unit-content"):
3        d = {key: value for key, value in zip(keys, map(lambda x: x.text.strip(), contents.find_all("dd")))}
4        print(d)
5        d_list.append(d)

投稿2021/09/20 23:32

Daregada

総合スコア11990

ベストアンサー

name, price, addrress, access, size, madori, age = content.find_all("dd")
の部分で、あなたはcontent.find_all("dd")が7要素のリストを返すと思っていまうが、実際に返ったのはそうでないと言うことです。
print(content.find_all("dd"))してみましょう。

dd要素の中にまたdd要素があるとは考えにくいのですが、そのあたり大丈夫でしょうか？

Python
1
2    for contents in soup.find_all("div", class_= "property_unit-content"):
3        name, price, addrress, access, size, madori, age = contents.find_all("dd")