× The internal search function is temporarily non-functional. The current search engine is no longer viable and we are researching alternatives.
As a stop gap measure, we are using Google's custom search engine service.
If you know of an easy to use, open source, search engine ... please contact support@midrange.com.



Hello Don,

Am 04.05.2022 um 06:55 schrieb Don Brown via MIDRANGE-L <midrange-l@xxxxxxxxxxxxxxxxxx>:

Just enquiring if anyone has solved this issue and would share/advise how they achieved it.

There's an OpenSource Package called "poppler-utils", from the Poppler PDF library project. It contains a pdftotext command with some options about how to preserve the given layout.

https://en.wikipedia.org/wiki/Poppler_(software)

I don't know if this is available in PASE, though.

Together with a little Regex-Magic, extracting only data and leaving probable headers and footers aside, this should be fairly straightforward.

:wq! PoC


As an Amazon Associate we earn from qualifying purchases.

This thread ...

Replies:

Follow On AppleNews
Return to Archive home page | Return to MIDRANGE.COM home page

This mailing list archive is Copyright 1997-2024 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].

Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.