× The internal search function is temporarily non-functional. The current search engine is no longer viable and we are researching alternatives.
As a stop gap measure, we are using Google's custom search engine service.
If you know of an easy to use, open source, search engine ... please contact support@midrange.com.



If you're just trying to get a text version of a PDF, then I can give a
couple examples of how we've done it. We initially used an open source
application called PDFBox to dump the contents to an IFS text file (example
source snippet below).

A few years ago, I rewrote the application to use another open source tool
called iText, because we wanted to extract data from the PDF labels by
position on the page. We did this, because the text dump had become
unreliable when the customer changed content.

// Run a java application that converts PDF files into text.
QCMDEXC('JAVA CLASS(org.pdfbox.'

+ 'ExtractText) PARM(' + apos

+ PDDir + PDFFile + apos + ' ' + apos

+ tmpDir + TXTFile + apos + ')'

+ ' CLASSPATH(' + apos

+ '/java/pdfbox/PDFBox-0.7.2.jar'

+ apos + ')'

+ ' JOB(*GEN) OUTPUT(*NONE)'

:1024);



// Convert ASCII file into EBCIDC (however its spelled)
QCMDEXC('CPY OBJ(' + apos + tmpDir

+ TXTFile + apos + ')'

+ ' TOOBJ('+apos + tmpDir + TXTFile

+ apos+ ') DTAFMT(*TEXT)'

+ ' REPLACE(*YES) FROMCODPAG(819)'

+ ' TOCODEPAGE(37)'

:1024);

You are correct. I need to read data from the PDF and process it.


As an Amazon Associate we earn from qualifying purchases.

This thread ...

Replies:

Follow On AppleNews
Return to Archive home page | Return to MIDRANGE.COM home page

This mailing list archive is Copyright 1997-2024 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].

Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.