Answer edit history

Added error details

2018/12/19 08:14

Posted

Score 1286

@@ -1,3 +1,15 @@
+# Error details
+http://blog.pyq.jp/entry/Python_kaiketsu_180516
+The indentation appears to be inconsistent, so please check it.
 Scraping with pandas is easier.
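As a rough illustration of the indentation check suggested in the error details, the sketch below compiles a snippet and reports the line where Python first finds an indentation problem. The helper name `find_indentation_error` and the sample strings are hypothetical and are not taken from the question's code.

```python
def find_indentation_error(source: str):
    """Return (line_number, message) for the first indentation problem, or None.

    Hypothetical helper for illustration only. Other syntax errors are
    deliberately not caught here and will propagate to the caller.
    """
    try:
        # compile() only parses the source; nothing is executed.
        compile(source, "<snippet>", "exec")
    except IndentationError as exc:  # TabError is a subclass of IndentationError
        return exc.lineno, exc.msg
    return None


# Consistent indentation: both body lines use four spaces.
good = "if True:\n    x = 1\n    y = 2\n"

# Inconsistent indentation: the third line is indented deeper than the second.
bad = "if True:\n    x = 1\n      y = 2\n"

print(find_indentation_error(good))
print(find_indentation_error(bad))
```

Running the same check on the failing script should point at the first line whose indentation does not match the lines around it, which is usually where the fix is needed.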

Addendum

2018/12/19 08:14

@@ -23,3 +23,73 @@
 dfs[7]
 ```
+# Addendum
+Scraping with Colaboratory
+https://imabari.hateblo.jp/entry/2018/04/16/172117
+The code from this post worked for me after changing only the URL.
+```python
+import pandas as pd
+import requests
+from bs4 import BeautifulSoup
+url = 'https://www.river.go.jp/kawabou/ipDamGaikyo.do?init=init&areaCd=89&prefCd=4001&townCd=&gamenId=01-0903&fldCtlParty=no'
+headers = {
+    'User-Agent':
+    'Mozilla/5.0 (Windows NT 10.0; WOW64; Trident/7.0; rv:11.0) like Gecko'
+}
+r = requests.get(url, headers=headers)
+r.raise_for_status()  # stop here on an HTTP error so df is never left undefined
+soup = BeautifulSoup(r.content, 'html5lib')
+result = [[
+    x.get_text(strip=True) for x in y.find_all(['th', 'td'])
+] for y in soup.select('body > div.gaikyoCntt > table > tbody > tr')]
+df = pd.DataFrame(data=result[1:], columns=result[0])
+df.set_index('ダム名', inplace=True)
+df
+```