Home >Backend Development >Python Tutorial >How to convert images to text in python

How to convert images to text in python

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBforward: 2023-04-20 11:07:062132browse

python image to text

I used python Tesseract-OCR to make a small tool for image to text conversion, and the GUI design uses the control of the tkinter library

See the picture below for the interface and effect:

How to convert images to text in python

How to convert images to text in python ##

#进一步优化  1. 底部添加label   2.对识别后的文本处理，去空格


from PIL import Image as PImage
from PIL import ImageTk
import pytesseract
from tkinter import *
from tkinter import filedialog
from tkinter.scrolledtext import ScrolledText
import re

# 将图片内容翻译为文字，显示在文本框内
def trans():
    contents.delete(&#39;1.0&#39;, END)
    transTxt = pytesseract.image_to_string(PImage.open(filePath.get()),lang=&#39;chi_sim&#39;)
    #对transTxt进行处理  去空格，换行符去重
    transTxt = transTxt.strip(&#39;\n\r&#39;)   #无参数可以删除开头结尾的空格\n\t\r
    print(transTxt)
    contents.insert( INSERT, transTxt.replace(&#39; &#39;,&#39;&#39;).replace(&#39;\n\n&#39;,&#39;\n&#39;).replace(&#39;\r&#39;,&#39;&#39;))

#打开图片文件，显示路径，并将图片展现
def openfile():
    filename.delete(&#39;1.0&#39;, END)
    filePath.set(filedialog.askopenfilename())
    filename.insert(1.0,filePath.get())
    org_img = PImage.open(filePath.get())
    #调整图片显示大小 600*800
    w,h = org_img.size
    if w>600:
        h=int(h*600/w)
        w=600
    if h>800:
        w=int(w*800/h)
        h=800
    img = ImageTk.PhotoImage(org_img.resize((w,h)))
    showPic.config(image=img)
    showPic.image = img       #保持一个引用才能显示图片，tkinter的BUG
    

#设置主窗口
top = Tk()
top.title("OCR图片转文字  引擎：Tesseract-OCR  Made by: kaivis")
#top.iconbitmap("./pic/y1.ico")
top.geometry("1200x800")

filePath=StringVar()

bt_img1 = ImageTk.PhotoImage( file= "./pic/Outbox1.png")
bt_img2 = ImageTk.PhotoImage( file= "./pic/bt_img2.png")

#第一个窗体
frame1 = Frame (top, relief=RAISED, borderwidth=2)
frame1.pack(side=TOP, fill=BOTH,  ipady=5, expand=0)
Label(frame1,height=1,text="图片路径：").pack(side=LEFT)
filename = Text(frame1,height=2)
filename.pack(side=LEFT,padx=1, pady=0,expand=True, fill=X)
Button(frame1,text="打开文件", image=bt_img1, command=openfile).pack(side=LEFT,padx=5, pady=0)
Button(frame1,text="中文识别", image=bt_img2, command=trans).pack(side=LEFT,padx=5, pady=0)

#第二个窗体
frame2 = Frame (top, relief=RAISED, borderwidth=2)
frame2.pack (side=LEFT, fill=BOTH,  expand=1)
Label(frame2,text=&#39;图片显示：&#39;,borderwidth=5).pack(side=TOP,padx=20,pady=5)
showPic = Label(frame2,text=&#39;图片显示区&#39;)
showPic.pack(side=BOTTOM,expand=1,fill=BOTH)

#第三个窗体
frame3 = Frame (top)
frame3.pack (side=RIGHT, fill=BOTH,  expand=1)
#contents = ScrolledText(frame3)
Label(frame3,text=&#39;识别结果：&#39;,borderwidth=5).pack(side=TOP,padx=20,pady=10)
contents = Text(frame3,font=(&#39;Arial&#39;,15))
contents.pack(side=TOP,expand=1,fill=BOTH)
Label(frame3,text=&#39;Copyright 2021 baidu.com ALL Rights Reserved&#39;,borderwidth=5).pack(side=BOTTOM,padx=20,pady=10)

top.mainloop()

Problems:

The recognition rate is not high, and it is even more difficult to achieve a high accuracy for compact Chinese characters. Is there a better OCR engine?
The recognized text has been processed Space processing, text can be further optimized, especially redundant line breaks need to be processed

python screenshot to text function

Since when looking for information on the Internet, I often encounter articles When copying is impossible, in order to quickly copy the text you want, I want to write a python program to realize the function of converting screenshots to text.

How to convert images to text in python

1. Idea

First of all, you must have the function of recording the keyboard (let the program know that you are taking a screenshot) - the keyboard library needs to receive the image after taking the screenshot - ImageGrab library After obtaining the image, text recognition is required - Baidu AI Text Recognition API

2. Implementation

2.1 Import related libraries

How to convert images to text in python

2.2 Create a class and write a function to save screenshots

Since I am using the screenshot software that comes with win10, the screenshot hotkey is ‘win shift s’. You can freely change it according to different screenshot software.

How to convert images to text in python

2.3 Write a function to convert images to text

First go to Baidu Intelligent Cloud official website to apply for an image recognition API.

How to convert images to text in python

Write the parameters into the program:

How to convert images to text in python

Write the text conversion function:

How to convert images to text in python

2.5 Run

Create a class when using it, just call two functions:

How to convert images to text in python

2.6 Effect

Run the program and take a screenshot from an article in Baidu Library:

How to convert images to text in python

The result is as follows:

How to convert images to text in python

Note:

It can be seen from the running results of 2.6 that the effect is still good. Perfectly solved my current needs.

The above is the detailed content of How to convert images to text in python. For more information, please follow other related articles on the PHP Chinese website!

Python copy ocr

Statement：

This article is reproduced at:yisu.com. If there is any infringement, please contact admin@php.cn delete

Previous article：How to implement Python’s lazy import-lazy importNext article：How to implement Python’s lazy import-lazy import

See more

How to convert images to text in python

python image to text

Related articles