回答率: 85.30%

質問するログイン新規登録

トップに関する質問 OCR機能を用いて、画像内の対象文字列をクリックしたい

編集履歴

質問編集履歴

2

認識時間の検証結果を追加

2021/02/24 17:10

投稿

スコア21

title CHANGED Viewed

File without changes

body CHANGED Viewed

@@ -88,4 +88,37 @@
 認識した画像を確認するとこんな感じです。
-![イメージ説明](0bfc2a4ffdb0df2b32785fc3daca8be8.png)
+![イメージ説明](0bfc2a4ffdb0df2b32785fc3daca8be8.png)
+### 追記(画像処理方法と文字認識時間検証)
+下記３種類の画像認識方法について、
+認識画像と認識時間を追記しました。
+sakuramochi_py 様の見解通り３種類の組み合わせが一番認識時間が
+短い結果となりました。
+①：フルカラー
+②：①＋グレースケール
+③：②＋２値化
+④：③＋反転
+![イメージ説明](da9027ce3fad6fb030d9fc7b8f553294.png)
+```Python
+img = cv2.imread("./img/eng.png")
+#グレースケールに変換
+gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
+cv2.imwrite("./img/eng.png",gray)
+#2値化
+img = cv2.imread("./img/eng.png")
+threshold = 105
+ret,img_thresh = cv2.threshold(img, threshold, 255, cv2.THRESH_BINARY)
+cv2.imwrite("./img/eng.png",img_thresh)
+#色反転
+img = cv2.imread("./img/eng.png")
+img_invert = cv2.bitwise_not(img)
+cv2.imwrite("./img/eng.png",img_invert)
+```

1

解決方法を追記

2021/02/24 17:10

投稿

スコア21

title CHANGED Viewed

File without changes

body CHANGED Viewed

@@ -42,4 +42,50 @@
 pip:21.0.1
 opencv-python:4.5.1.48
 pyocr:0.8
-PyAutoGUI:0.9.52
+PyAutoGUI:0.9.52
+### 解決方法
+sakuramochi_py 様にご教授頂き、下記コードで実現できました。
+```Python
+import pyautogui as pg
+import time
+import cv2
+import pyautogui as pg
+pg.press('win')
+time.sleep(2)
+sc = pg.screenshot(region=(50, 100, 500, 700)) #始点x,y、幅、高さ
+sc.save('./img/img.png')
+lang = 'eng'
+img_path = './img/{}.png'.format(lang)
+img = Image.open(img_path)
+out_path = './img/{}_{}.png'
+word_boxes = tool.image_to_string(
+    img,
+    lang=lang,
+    builder=pyocr.builders.WordBoxBuilder(tesseract_layout=6)
+)
+out = cv2.imread(img_path)
+for d in word_boxes:
+    print(d.content)
+    print(d.position)
+    cv2.rectangle(out, d.position[0], d.position[1], (0, 0, 255), 2) #d.position[0]は認識した文字の左上の座標,[1]は右下
+    cv2.imwrite(out_path.format(lang, 'word_boxes'), out)
+    x1,y1 = d.position[0]
+    x2,y2 = d.position[1]
+    if(d.content=='Anaconda3'): #Anacondaのアイコンを認識したらクリックする
+        x3 = (x1+x2)/2+50
+        y3 = (y1+y2)/2+100
+        pg.click(x3,y3)
+```
+認識した画像を確認するとこんな感じです。
+![イメージ説明](0bfc2a4ffdb0df2b32785fc3daca8be8.png)