I have installed the tika java application from apache.org on our power server running IBM I version 7.2 and on my PC running Windows 10. We get a word document from one of our vendors. We need to extract the text from it to process the document.

I have this working on my PC by using the following command:

C:\Users\gord\Downloads>java -jar tika-app-1.14.jar -t "C:\Users\gord\Documents\Goralta Invoice.doc" > "C:\Users\gord\Documents\Goralta Invoice.txt"

The next step was to get the same result on our Power Server. I ran the following command first

JAVA CLASS('/java/Tika/tika-app-1.14.jar') PARM('-t' '/java/Tika/Goralta Invoice.doc')

It displays the text it extracted on the screen. Great. So I know the java program is working on our system. The last step is to get the results to be ported to a file.

I have tried many different combinations but I cannot get it to work. Here is an example of what I have tried and the error we are getting.

JAVA CLASS('/java/Tika/tika-app-1.14.jar') PARM('-t' '"/java/Tika/Goralta Invoice.doc" > "/java/Tika/Goralta Invoice.txt"')

Exception in thread "main" java.net.MalformedURLException: no protocol: "/java/Tika/Goralta Invoice.doc" > "/java/Tika/Goralta In
at java.net.URL.<init>(URL.java:609)
at java.net.URL.<init>(URL.java:506)
at java.net.URL.<init>(URL.java:455)
at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:472)
at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:145)
Java program completed with exit code 1

I am not a Java programmer. We use Java tools that make things easier for us as RPG programmers. Any help you can provide will be very much appreciated.

Gordon Schneider
Trans Am Piping Products Ltd.

As an Amazon Associate we earn from qualifying purchases.

This thread ...


Follow On AppleNews
Return to Archive home page | Return to MIDRANGE.COM home page

This mailing list archive is Copyright 1997-2022 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].

Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.