Issues filed for tesseract-ocr/tesseract

View Full Project
Do you use tesseract? Leave a review!

Rate of open issues in the last 60 days

tesseract open issues (View Closed Issues)
  • almost 2 years CMake hangs at "Performing 71 checks using 8 threads"
  • almost 2 years Memory leak of EDGEPT objects
  • almost 2 years Minor win32 sintax issues under Windows unicode
  • almost 2 years Replace all NULLs with nullptr
  • almost 2 years Resolution information in PNG files is ignored
  • almost 2 years Failure to box മ character in malayalam
  • almost 2 years Telugu - Simple characters are not being recognized
  • almost 2 years APPLY_BOXES: boxfile line FAILURE! Couldn't find a matching blob for മ in malayalam
  • almost 2 years Buffer overflow in proto_evidence_
  • almost 2 years Training issue : APPLY_BOXES failure : Couldn't find a matching blob
  • almost 2 years Are there more PSM modes than are listed in the help/wiki - 11 and 12?
  • almost 2 years Font detection broken for segmention_mode > PSM_SINGLE_WORD
  • about 2 years Speckled Documents Create Psychological Case for Tesseract
  • about 2 years Error in boxClipToRectangle: box outside rectangle
  • about 2 years macOS regression due to "Fix Cygwin compatibility"
  • about 2 years C-API for OSResults (orientation and script results) is not a stable ABI
  • about 2 years Minimum Pango version
  • about 2 years Creating ALTO [enhancement]
  • about 2 years text2image: comma in font name
  • about 2 years User patterns using bazaar config do not work
  • about 2 years Text is garbled in pdf.js (Cygwin / UB Mannheim binaries)
  • about 2 years peculiarities when running text2image on windows
  • about 2 years Glyphless font in pdf leads to spaces between characters
  • about 2 years non-word recognition worsened/disimproved since tesseract v3.0.4 ?
  • over 2 years Dramatically different results with O1 and O2 optimizations in clang
  • over 2 years [For the record] Tesseract 3.01 crash on specific table like columns
  • over 2 years Add version information to training tools
  • over 2 years Confuse about dump "tessnoimages.png" when segmenting page and detect orientation.
  • over 2 years Compiled 3.03 and 3.04 with VS2013, Memory Leaks detected
  • over 2 years good accuracy but too slow, how to improve Tesseract speed
  • over 2 years Segmentation fault when set variable "classify_enable_adaptive_matcher" = 0
  • over 2 years Underline attribute unsupported
  • over 2 years different results when same image is lossless-encoded at different bpp
  • over 2 years Arabic language (right to left in writing) stored (left to right) after create PDF Searchable
  • over 2 years “no best words!!” on mixed language (fra+ara) items
  • over 2 years Multi-page TIFF buffering is broken
  • almost 3 years unicharambigs man page missing v2 format
  • almost 3 years Inconsistent results with VERY similar images
  • almost 3 years Symbol level bounding box information in hOCR output
  • almost 3 years OpenCL error codes, then junk output -- possibly a build issue?
  • about 3 years Completion of error handling
  • about 3 years Complete build options for Pthread API
  • about 3 years Remove unnecessary null pointer checks
  • about 3 years reserved identifier violation
  • about 3 years OS X Yosemite with OpenCL: --enable-opencl now compiles but OCR fails
  • about 3 years OpenCL segfault
  • about 3 years Cube and combined modes doesn't work in 3.03
  • almost 2 years reading from images other than specified image filename!
  • over 1 year LSTM: Devanagari - Visarga being recognized as colon
  • over 1 year Releasing version 3.05
  • over 1 year PDF Output for Pipes in Windows
  • over 1 year LSTM: Training: Invalid network layer type:
  • over 1 year Once more unto Ghostscript mangling Tesseract-produced PDFs
  • over 1 year Change appveyor schedule to daily build
  • over 1 year Removing the legacy OCR Engine
  • over 1 year Add 'topics' to this repo
  • over 1 year Output format txt has different result from output format pdf in v3.05.00dev
  • over 1 year LSTM: Training - Deserialize header failed:
  • over 1 year 'makebox' does not put tab characters for new line
  • over 1 year HOCR: x_font missing, x_fsize broken in 4.00-alpha
  • over 1 year LSTM: incorrect recognition with multilanguage text in one line
  • over 1 year LSTM: khmer is not working with --oem 1
  • over 1 year LSTM: Words dropped during Kannada recognition
  • over 1 year Does Tesseract Actually Deskew the Image?
  • over 1 year Dropping words when trying with Telugu language
  • over 1 year LSTM: Training - Box file format
  • over 1 year LSTM: Words dropped during Devanagari recognition
  • over 1 year Training Wiki Updates and Request for Info
  • over 1 year Improve textline finding for Arabic and other languages with many diacritics
  • over 1 year LSTM: Indic - length of the compressed codes
  • over 1 year How to get the unicharset back out from the lstm?
  • over 1 year Box File disorder, Arabic Language
  • over 1 year OCR recognition improvement for single word(s) behind a specially colored background
  • over 1 year LSTM: Training - Eval not run from trainer
  • over 1 year Inverse text problem found by Viewerdebugging

tesseract closed issues

  • almost 2 years Tesseract cannot recognize clean webpage screenshot
  • almost 2 years Unable to process chalkboard writings
  • almost 2 years Is tesseract-ocr currently using lstm?
  • almost 2 years Error when compiling Tesseract 3.01 on RHEL5
  • almost 2 years Use api to show the confidence for each of the characters in image
  • almost 2 years Laravel 5.1 empty string return
  • almost 2 years Memory leaks
  • almost 2 years Unable to init CubeRecoContext object - Hindi Language (hin)
  • almost 2 years Please close milestones for 3.04
  • almost 2 years Usage in android project
  • almost 2 years leptonica library with pdf support (>= 1.71) is missing
  • almost 2 years In which class characters recognized?
  • about 2 years roadmap/changelog for upcoming releases?
  • about 2 years ParagraphInfo always sets justification to unknown
  • about 2 years english the api works well but if i use the arabic trained data the app crashes
  • about 2 years id < this->size():Error:Assert failed:in file unicharset.cpp, line 278
  • about 2 years Can I use 3.0.5tessdata to replace 3.0.2 tessdata?
  • about 2 years Simple digit picture return empty result
  • about 2 years How to share language data between multiple instances
  • about 2 years Windows VC 2015 compilation failing
  • about 2 years Compilation warning "Struct 'ETEXT_DESC' was previously declared as a class"
  • about 2 years CMakeLists.txt caused missing "allheaders.h" error
  • about 2 years MSYS2 pacman error
  • about 2 years not recognizing simple image
  • about 2 years Latest trained data file crashed upon init when using tesseract 3.02 for several languages
  • about 2 years text2image: --fonts_dir=
  • about 2 years used vs2010 complice for eeror:
  • about 2 years PKG_CHECK_MODULES on Ubuntu 14.04.5
  • about 2 years Text2image SIGSEGV on Linux
  • about 2 years 1 Text2Image.exe binary please?
  • about 2 years it is use nerual network ?
  • about 2 years tesseract cannot detect large texts.
  • about 2 years Using uzn file, output from command line and programmatically using .net dll are different.
  • about 2 years Not working with ÆØÅ (uppercase)
  • almost 2 years tesseract-dbg install problem
  • over 1 year error while running tesseract
  • over 1 year Segmentation fault when use through python(tesserocr)
  • over 1 year where is include folder and lib for opencv build
  • over 1 year Error configuring during build on OSX
  • over 1 year PDF output: odd spaces on OSX preview
  • over 1 year incorrect character box positions
  • over 1 year LSTM: Training - reduced learning rate
  • over 1 year LSTM: Training: failed to write checkpoint
  • over 1 year compiling using vs2013 -library not found
  • over 1 year JP2 files not working
  • over 1 year Did OCR post-processing in 2015, now that feature seems to be gone
  • over 1 year Request: new 4.0.0-alpha windows binaries
  • over 1 year During training using text file to fine-tune error: could not find font names...Please correct font arg..
  • over 1 year Linker errors while trying to compile Tesseract project
  • over 1 year Linking failure with --disable-graphics
  • over 1 year [ANDROID] imagedata.cpp doesn't compile
  • over 1 year Is that possible to get the text from the price tag in that photo?
  • over 1 year Segmentation fault while trying to train tesseract
  • over 1 year Allow attaching of box and tif files to issues