Home  >  Article  >  Backend Development  >  Gosseract is not running

Gosseract is not running

王林
王林forward
2024-02-09 09:09:221035browse

Gosseract 未运行

php小编西瓜推荐的光学字符识别(OCR)工具Gosseract,是一款功能强大的开源OCR库。它基于Tesseract引擎,能够识别多种语言的文字,并提供了简单易用的接口。然而,有些用户在使用Gosseract时可能会遇到"未运行"的问题。本文将为大家详细解答这个问题,并给出解决方案,帮助大家顺利运行Gosseract工具,实现高效的OCR文字识别。

问题内容

# github.com/otiai10/gosseract/v2 tessbridge.cpp:5:10:致命错误:leptonica/allheaders.h:没有这样的文件或目录 5 | #include b74a7b014bd36528c1c094b07f39ce0d | ^~~~~~~~~~~~~~~~~~~~~~~~~ 编译终止。

下面是我写的代码

它给出了一个错误。我重新安装了 tesseract-i,它仍然给出相同的错误

package main

import (
    "fmt"

    "github.com/otiai10/gosseract/v2"
)

func main() {

    client := gosseract.NewClient()

    defer client.Close()

    client.SetImage("C:\\Users\\labusers\\Downloads\\khan.png")

    text, _ := client.Text()
    fmt.Println(text)

}

解决方法

安装 tesseract 可能很棘手。以下 dockerfile 有效:

FROM golang:1.14.9

RUN cat /etc/os-release
# Output: Debian GNU/Linux 10 (buster)
RUN apt-get -qy update

RUN apt-get install -qy libleptonica-dev libtesseract-dev
RUN apt-get install -qy libtool m4 automake cmake pkg-config
RUN apt-get install -qy libicu-dev libpango1.0-dev libcairo-dev

RUN cd /opt && git clone https://github.com/tesseract-ocr/tesseract
WORKDIR /opt/tesseract
RUN git reset --hard 4.1.1
RUN ./autogen.sh &&\
    ./configure --enable-debug LDFLAGS="-L/usr/local/lib" CFLAGS="-I/usr/local/include"
RUN make -j 8
RUN make install && ldconfig
RUN tesseract --version

ENV TESSDATA_PREFIX=/usr/local/share/tessdata
ENV TESSDATA_REPO=https://github.com/tesseract-ocr/tessdata_best
WORKDIR ${TESSDATA_PREFIX}
RUN wget -q ${TESSDATA_REPO}/raw/4.1.0/eng.traineddata

dockerfile 从源代码构建 tesseract,以便您可以选择 tesseract 版本。这个dockerfile是我2年前使用gosseract库时写的。

The above is the detailed content of Gosseract is not running. For more information, please follow other related articles on the PHP Chinese website!

Statement:
This article is reproduced at:stackoverflow.com. If there is any infringement, please contact admin@php.cn delete