問題描述:
在centos7 中做python的pytersseract驗證碼識別,在pytesseract.image_to_string時報錯,經查解決辦法是,升級tesseract。在centos中自帶了tesseract3.03,github最新的穩定版本是 3.05.01。升級過程如下
-
準備工作
yum install gcc-c++ yum install autoconf automake libtool yum install autoconf-archive yum install pkg-config yum install libpng12-dev yum install libjpeg8-dev yum install libtiff5-dev yum install zlib1g-dev
-
安裝Leptonica庫
#centos 自帶版本低于1.74,需要安裝1.74及以上版本 wget http://www.leptonica.org/source/leptonica-1.74.4.tar.gz tar zxf leptonica-1.74.4.tar.gz ./configure make && make install #安裝完成后,默認路徑是/use/local/lib #使pkg-config讀取這個路徑 export PKG_CONFIG_PATH=$PKG_CONFIG_PATH:/usr/local/lib/pkgconfig
-
安裝tesseract
wget https://github.com/tesseract-ocr/tesseract/archive/3.05.01.tar.gz tar -zxf tesseract-3.05.01.tar.gz ./autogen.sh ./configure make && make install