ocre: easy OCR (and free/libre)
v0.042
Download
- ocre...tgz (for Linux).
- Debian: you can add these lines to /etc/apt/sources.list:
    deb ftp://lem.eui.upm.es/pub/lemdeb squeeze main
    deb-src ftp://lem.eui.upm.es/pub/lemdeb squeeze main
- ocre...deb (as above, for i386 Debian).
- ocre...rpm (as above, for Red Hat, ...).
ocre works with greyscale images.
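A colour scan has to be reduced to grey levels before it can be used. The following is a minimal sketch in Python, assuming 8-bit RGB pixels and the common ITU-R BT.601 luma weights. It is not part of ocre, and the name rgb_to_grey is only illustrative.

```python
def rgb_to_grey(pixels):
    """Convert rows of (r, g, b) tuples (0-255) into rows of grey levels (0-255).

    Assumption: ITU-R BT.601 luma weights; other weightings are equally valid.
    """
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in pixels
    ]


if __name__ == "__main__":
    # One row with a white, a black and a pure red pixel.
    print(rgb_to_grey([[(255, 255, 255), (0, 0, 0), (255, 0, 0)]]))
```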
Languages:
English,
Euskara/Basque,
French,
German,
Polish,
Portuguese,
Russian,
Spanish
License:
GPL
- input: PGM/PBM file
- output: Unicode or ASCII characters on standard output
- process:
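Since the input is a PGM file, here is a minimal sketch of how a grid of grey levels can be encoded as a binary PGM ("P5") image in Python. The function name encode_pgm and the file name page.pgm are hypothetical; the sketch assumes 8-bit samples (maximum value 255) and says nothing about how ocre itself reads the file or how it is invoked.

```python
def encode_pgm(grey_rows):
    """Encode rows of grey levels (0-255) as a binary PGM (P5) byte string."""
    height = len(grey_rows)
    width = len(grey_rows[0]) if height else 0
    if any(len(row) != width for row in grey_rows):
        raise ValueError("all rows must have the same width")
    # Header: magic number, width and height, maximum grey value.
    header = f"P5\n{width} {height}\n255\n".encode("ascii")
    # Body: one byte per pixel, row by row; bytes() rejects values outside 0-255.
    body = bytes(value for row in grey_rows for value in row)
    return header + body


if __name__ == "__main__":
    # Write a tiny 2x2 test image; "page.pgm" is only an example name.
    with open("page.pgm", "wb") as f:
        f.write(encode_pgm([[0, 255], [128, 64]]))
```

PBM, the other accepted input, is the one-bit (black and white) member of the same format family and has its own header and pixel layout, which this sketch does not cover.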
This version:
(42)
Small change in segmentation.
Some errors have been corrected.
Next versions:
more errors corrected, better recognition. :-)
Esperanto,
larger dictionaries,
multiple columns,
scans of open books,
more Cyrillic,
simple zoning.
You can subscribe to be notified of new releases.
Missing (wanted)
- revision (better use of the dictionary, trigrams, ...)
- robustness
- table recognition, formula recognition, spreadsheet recognition, ...
- zoning
- interface with:
- output to:
  - LaTeX, HTML, Gnumeric, ...
- ...
In the long term
I want to teach the computer to read. :-)
OCR websites
... It's a long way to go
Updated by
Luis José Cearra Zabala
Tue Jun 19 13:23:06 CEST 2012