What's the OCR use case?

Identifying or extracting info from AP invoices or other documents?

Not sure if you knew, but GoAnywhere is now owned by HelpSystems.

You could use GoAnywhere to receive documents, and our AutoMate Windows process automation software has an OCR component that can help extract info from the images.

Check it out and feel free to download a 30-day trial or reach out if you want to talk backend document processing.

http://www.helpsystems.com/automate

Regards,


Richard Schoen
Director of Document Management
e. richard.schoen@xxxxxxxxxxxxxxx
p. 952.486.6802
w. helpsystems.com

----------------------------------------------------------------------

message: 1
date: Tue, 19 Jul 2016 17:45:39 -0600
from: Gordon Schneider <gordon.schneider@xxxxxxx>
subject: Re: JAVA400-L Digest, Vol 14, Issue 40

Kristen was a great help. I learned a few things, which is always a good thing.

The next step is to extract text from the extracted images. I have done some preliminary work on it and will see whether Tesseract OCR will work for us.
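For anyone following along, here is a rough sketch of how that next step might be driven from Java. It assumes a `tesseract` executable is installed and on the PATH of whatever machine processes the images, which this thread has not confirmed. The class name TesseractRunner and its methods are made up for illustration. The only Tesseract behavior relied on is the basic command form `tesseract <image> <outputbase>`, which writes the recognized text to `<outputbase>.txt`.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: shells out to the tesseract command-line tool.
// Assumes "tesseract" is on the PATH; nothing in this thread confirms that.
public class TesseractRunner {

    // Builds the command "tesseract <image> <outputbase>".
    // Tesseract appends ".txt" to the output base name itself.
    public static List<String> buildCommand(String imagePath, String outputBase) {
        return Arrays.asList("tesseract", imagePath, outputBase);
    }

    // Runs tesseract, passes its console output through, and returns its exit code.
    public static int run(String imagePath, String outputBase)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(imagePath, outputBase))
                .inheritIO()
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 2) {
            System.err.println("usage: java TesseractRunner <image> <outputbase>");
            return;
        }
        int exitCode = run(args[0], args[1]);
        System.out.println("tesseract exited with code " + exitCode);
    }
}
```

The image path would be one of the files written by the PDFBox ExtractImages step. Whether the OCR quality is good enough would still need to be tested on real documents.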

Thanks

Gordon

On Jul 19, 2016, at 11:00 AM, java400-l-request@xxxxxxxxxxxx wrote:


Today's Topics:

1. RE: JAVA400-L Digest, Vol 14, Issue 36 (Kristen Henry)


----------------------------------------------------------------------

message: 1
date: Mon, 18 Jul 2016 17:40:44 -0600
from: "Kristen Henry" <klhnry@xxxxxxxxxxxx>
subject: RE: JAVA400-L Digest, Vol 14, Issue 36

Hi,

Gordon and I met, and by testing in Qshell I was able to get the Java
command working by adding the path "/java/PDFBox/" to the classpath
and to the other files, and by changing the semicolon to a colon in the classpath
(Qshell uses the Unix-style colon as the classpath separator; the semicolon is the
Windows convention).
It could be run in place if you cd to the /java/PDFBox directory, but
by adding the path, the class can be run from other commands or apps.

java -Dsun.java2d.cmm=sun.java2d.cmm.kcms.KcmsServiceProvider -cp
/java/PDFBox/pdfbox-app-2.0.2.jar:/java/PDFBox/lib/*
/java/PDFBox/org.apache.pdfbox.tools.PDFBox ExtractImages
/java/PDFBox/Maxfield.pdf

Success!
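
The separator problem above can also be avoided when a classpath is assembled in code rather than typed by hand. The sketch below is only an illustration: the class name ClasspathBuilder is made up, and the two entries are the ones from the command above. It joins the entries with `File.pathSeparator`, which the JVM sets to a colon on Unix-like environments and to a semicolon on Windows.

```java
import java.io.File;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: builds a classpath string with an explicit separator.
public class ClasspathBuilder {

    // Joins the entries using the given separator.
    // Callers would normally pass File.pathSeparator.
    public static String join(List<String> entries, String separator) {
        return String.join(separator, entries);
    }

    public static void main(String[] args) {
        // The two classpath entries used in the command above.
        List<String> entries = Arrays.asList(
                "/java/PDFBox/pdfbox-app-2.0.2.jar",
                "/java/PDFBox/lib/*");

        // Uses the separator of the platform the JVM is running on.
        System.out.println(join(entries, File.pathSeparator));
    }
}
```

The resulting string could then be passed to the -cp option when launching the PDFBox tool from another program.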

Kristen





