Issues filed for tesseract-ocr/tesseract

View Full Project
Do you use tesseract? Leave a review!

Rate of open issues in the last 60 days

tesseract open issues (View Closed Issues)
  • over 1 year CMake hangs at "Performing 71 checks using 8 threads"
  • over 1 year Memory leak of EDGEPT objects
  • over 1 year Minor win32 sintax issues under Windows unicode
  • over 1 year Replace all NULLs with nullptr
  • over 1 year Resolution information in PNG files is ignored
  • over 1 year Failure to box മ character in malayalam
  • over 1 year Telugu - Simple characters are not being recognized
  • over 1 year APPLY_BOXES: boxfile line FAILURE! Couldn't find a matching blob for മ in malayalam
  • over 1 year Buffer overflow in proto_evidence_
  • over 1 year Training issue : APPLY_BOXES failure : Couldn't find a matching blob
  • over 1 year Are there more PSM modes than are listed in the help/wiki - 11 and 12?
  • over 1 year Font detection broken for segmention_mode > PSM_SINGLE_WORD
  • over 1 year Speckled Documents Create Psychological Case for Tesseract
  • over 1 year Error in boxClipToRectangle: box outside rectangle
  • over 1 year macOS regression due to "Fix Cygwin compatibility"
  • over 1 year C-API for OSResults (orientation and script results) is not a stable ABI
  • over 1 year Minimum Pango version
  • over 1 year Creating ALTO [enhancement]
  • over 1 year text2image: comma in font name
  • over 1 year User patterns using bazaar config do not work
  • over 1 year Text is garbled in pdf.js (Cygwin / UB Mannheim binaries)
  • over 1 year peculiarities when running text2image on windows
  • almost 2 years Glyphless font in pdf leads to spaces between characters
  • almost 2 years non-word recognition worsened/disimproved since tesseract v3.0.4 ?
  • almost 2 years Dramatically different results with O1 and O2 optimizations in clang
  • about 2 years [For the record] Tesseract 3.01 crash on specific table like columns
  • about 2 years Add version information to training tools
  • about 2 years Confuse about dump "tessnoimages.png" when segmenting page and detect orientation.
  • about 2 years Compiled 3.03 and 3.04 with VS2013, Memory Leaks detected
  • about 2 years good accuracy but too slow, how to improve Tesseract speed
  • about 2 years Segmentation fault when set variable "classify_enable_adaptive_matcher" = 0
  • about 2 years Underline attribute unsupported
  • about 2 years different results when same image is lossless-encoded at different bpp
  • about 2 years Arabic language (right to left in writing) stored (left to right) after create PDF Searchable
  • about 2 years “no best words!!” on mixed language (fra+ara) items
  • about 2 years Multi-page TIFF buffering is broken
  • over 2 years unicharambigs man page missing v2 format
  • over 2 years Inconsistent results with VERY similar images
  • over 2 years Symbol level bounding box information in hOCR output
  • over 2 years OpenCL error codes, then junk output -- possibly a build issue?
  • over 2 years Completion of error handling
  • over 2 years Complete build options for Pthread API
  • over 2 years Remove unnecessary null pointer checks
  • over 2 years reserved identifier violation
  • over 2 years OS X Yosemite with OpenCL: --enable-opencl now compiles but OCR fails
  • almost 3 years OpenCL segfault
  • almost 3 years Cube and combined modes doesn't work in 3.03
  • over 1 year reading from images other than specified image filename!
  • about 1 year LSTM: Devanagari - Visarga being recognized as colon
  • about 1 year Releasing version 3.05
  • about 1 year PDF Output for Pipes in Windows
  • about 1 year LSTM: Training: Invalid network layer type:
  • about 1 year Once more unto Ghostscript mangling Tesseract-produced PDFs
  • about 1 year Change appveyor schedule to daily build
  • about 1 year Removing the legacy OCR Engine
  • about 1 year Add 'topics' to this repo
  • about 1 year Output format txt has different result from output format pdf in v3.05.00dev
  • about 1 year LSTM: Training - Deserialize header failed:
  • about 1 year 'makebox' does not put tab characters for new line
  • about 1 year HOCR: x_font missing, x_fsize broken in 4.00-alpha
  • about 1 year LSTM: incorrect recognition with multilanguage text in one line
  • about 1 year LSTM: khmer is not working with --oem 1
  • about 1 year LSTM: Words dropped during Kannada recognition
  • about 1 year Does Tesseract Actually Deskew the Image?
  • over 1 year Dropping words when trying with Telugu language
  • over 1 year LSTM: Training - Box file format
  • over 1 year LSTM: Words dropped during Devanagari recognition
  • over 1 year Training Wiki Updates and Request for Info
  • over 1 year Improve textline finding for Arabic and other languages with many diacritics
  • over 1 year LSTM: Indic - length of the compressed codes
  • over 1 year How to get the unicharset back out from the lstm?
  • over 1 year Box File disorder, Arabic Language
  • over 1 year OCR recognition improvement for single word(s) behind a specially colored background
  • over 1 year LSTM: Training - Eval not run from trainer
  • over 1 year Inverse text problem found by Viewerdebugging

tesseract closed issues

  • over 1 year Tesseract cannot recognize clean webpage screenshot
  • over 1 year Unable to process chalkboard writings
  • over 1 year Is tesseract-ocr currently using lstm?
  • over 1 year Error when compiling Tesseract 3.01 on RHEL5
  • over 1 year Use api to show the confidence for each of the characters in image
  • over 1 year Laravel 5.1 empty string return
  • over 1 year Memory leaks
  • over 1 year Unable to init CubeRecoContext object - Hindi Language (hin)
  • over 1 year Please close milestones for 3.04
  • over 1 year Usage in android project
  • over 1 year leptonica library with pdf support (>= 1.71) is missing
  • over 1 year In which class characters recognized?
  • over 1 year roadmap/changelog for upcoming releases?
  • over 1 year ParagraphInfo always sets justification to unknown
  • over 1 year english the api works well but if i use the arabic trained data the app crashes
  • over 1 year id < this->size():Error:Assert failed:in file unicharset.cpp, line 278
  • over 1 year Can I use 3.0.5tessdata to replace 3.0.2 tessdata?
  • over 1 year Simple digit picture return empty result
  • over 1 year How to share language data between multiple instances
  • over 1 year Windows VC 2015 compilation failing
  • over 1 year Compilation warning "Struct 'ETEXT_DESC' was previously declared as a class"
  • over 1 year CMakeLists.txt caused missing "allheaders.h" error
  • over 1 year MSYS2 pacman error
  • over 1 year not recognizing simple image
  • over 1 year Latest trained data file crashed upon init when using tesseract 3.02 for several languages
  • over 1 year text2image: --fonts_dir=
  • over 1 year used vs2010 complice for eeror:
  • over 1 year PKG_CHECK_MODULES on Ubuntu 14.04.5
  • over 1 year Text2image SIGSEGV on Linux
  • over 1 year 1 Text2Image.exe binary please?
  • over 1 year it is use nerual network ?
  • over 1 year tesseract cannot detect large texts.
  • over 1 year Using uzn file, output from command line and programmatically using .net dll are different.
  • over 1 year Not working with ÆØÅ (uppercase)
  • over 1 year tesseract-dbg install problem
  • about 1 year error while running tesseract
  • about 1 year Segmentation fault when use through python(tesserocr)
  • about 1 year where is include folder and lib for opencv build
  • about 1 year Error configuring during build on OSX
  • about 1 year PDF output: odd spaces on OSX preview
  • about 1 year incorrect character box positions
  • about 1 year LSTM: Training - reduced learning rate
  • about 1 year LSTM: Training: failed to write checkpoint
  • about 1 year compiling using vs2013 -library not found
  • about 1 year JP2 files not working
  • about 1 year Did OCR post-processing in 2015, now that feature seems to be gone
  • about 1 year Request: new 4.0.0-alpha windows binaries
  • about 1 year During training using text file to fine-tune error: could not find font names...Please correct font arg..
  • about 1 year Linker errors while trying to compile Tesseract project
  • about 1 year Linking failure with --disable-graphics
  • about 1 year [ANDROID] imagedata.cpp doesn't compile
  • about 1 year Is that possible to get the text from the price tag in that photo?
  • over 1 year Segmentation fault while trying to train tesseract
  • over 1 year Allow attaching of box and tif files to issues