質問編集履歴

お礼追加

2018/08/10 12:39

投稿

Yukiya025

スコア86

title CHANGED Viewed

File without changes

body CHANGED Viewed

@@ -140,4 +140,37 @@
   File "python_ex289_ex293.py", line 13, in scrape
     html = r.text()
 TypeError: 'unicode' object is not callable
+```
+# もう一度アドバイスをもらって完成
+[umyu様](https://teratail.com/users/umyu) にスタックトレースの読み方まで教えていただいての完成です(*≧∀≦)
+`html = r.text()`部分は括弧をとりましたが、それでも既視感のあるエラー (AttributeErrorだったかな?) がまた出て堂々巡りｺﾛｺﾛ ⌒((:з)⌒((ε:)⌒((:3に入った感があったので心が折れ、[案2: beautifulsoup4を使う](https://teratail.com/questions/140261#reply-212163) に切替え、`html = r.read()`に戻しましたorz
+[https://news.google.com](https://news.google.com)をスクレイピング対象にしていましたが、urlにhtmlがまったく入っていなかったので、BeautifulSoup4のドキュメントを対象にしました。
+```python
+# ファイル名: python_ex289_ex293.py
+# -*-coding:utf-8-*-
+# 手本: https://github.com/calthoff/self_taught/blob/master/python_ex289.py/
+import urllib2
+from bs4 import BeautifulSoup
+class Scraper:
+    def __init__(self, site):
+        self.site = site
+    def scrape(self):
+        r = urllib2.urlopen(self.site) # urlopen関数を実行するとResponseオブジェクトが返される。
+        html = r.read()
+        parser = "html.parser"
+        sp = BeautifulSoup(html, parser)
+        for tag in sp.find_all("a"):
+            url = tag.get("href")
+            if url is None:
+                continue
+            if "html" in url:
+                print("\n" + url)
+news = "https://www.crummy.com/software/BeautifulSoup/bs4/doc/"
+Scraper(news).scrape()
 ```

TypeError: 'unicode' object is not callable内容追記

2018/08/10 12:38

投稿

Yukiya025

スコア86

title CHANGED Viewed

File without changes

body CHANGED Viewed

@@ -132,4 +132,12 @@
 2. `parser = "html.parser"`はPython2では使えないようなので`parser = HTMLParser`に変更
 3. `AttributeError: 'Response' object has no attribute 'read'`と出たので`html = r.read()`を`html = r.text()`に変更。
-しかしエラー「`TypeError: 'unicode' object is not callable`」で白旗 (´；ω；｀)
+しかしエラー「`TypeError: 'unicode' object is not callable`」で白旗 (´；ω；｀)
+```
+Traceback (most recent call last):
+  File "python_ex289_ex293.py", line 25, in <module>
+    Scraper(news).scrape()
+  File "python_ex289_ex293.py", line 13, in scrape
+    html = r.text()
+TypeError: 'unicode' object is not callable
+```

新たなエラー追記

2018/08/09 02:03

投稿

Yukiya025

スコア86

title CHANGED Viewed

File without changes

body CHANGED Viewed

@@ -91,4 +91,45 @@
 news = "https://news.google.com"
 Scraper(news).scrape()
-```
+```
+# 案1を採用
+[umyuさんの回答](https://teratail.com/questions/140261#reply-212163)から案1を採用し、`from bs3 import BeautifulSoup3`を`from BeautifulSoup import BeautifulSoup`に変更しました。おかげさまで`ImportError: No module named bs3` のエラーはなくなりました(≧∀≦)
+その他いろいろエラーがあったので、解決できる部分は解決 (多分?) したのですが、エラー 「`TypeError: 'unicode' object is not callable`」が解決できませんorz
+**只今のコード**
+```python:web.py
+# -*-coding:utf-8-*-
+# https://github.com/calthoff/self_taught/blob/master/python_ex289.py/
+from BeautifulSoup import BeautifulSoup
+import requests
+import urllib3
+from HTMLParser import HTMLParser
+class Scraper:
+    def __init__(self, site): # __init__メソッドはスクレイピング対象のURLを受け取る。
+        self.site = site
+    def scrape(self):
+        r = requests.get(self.site)
+        html = r.text()
+        parser = HTMLParser
+        sp = BeautifulSoup(html, parser)
+        for tag in sp.find_all("a"):
+            url = tag.get("href")
+            if url is None:
+                continue
+            if "html" in url:
+                print("\n" + url)
+news = "https://news.google.com"
+Scraper(news).scrape()
+```
+##エラーからの対応策概要
+1. urlopenが使えないようなのでrequests.getに変更
+2. `parser = "html.parser"`はPython2では使えないようなので`parser = HTMLParser`に変更
+3. `AttributeError: 'Response' object has no attribute 'read'`と出たので`html = r.read()`を`html = r.text()`に変更。
+しかしエラー「`TypeError: 'unicode' object is not callable`」で白旗 (´；ω；｀)