|
-----Message d'origine-----
De : rpg400-l-bounces@xxxxxxxxxxxx
[mailto:rpg400-l-bounces@xxxxxxxxxxxx] De la part de Mike Cunningham
Envoyé : mercredi 22 septembre 2010 14:47
À : RPG programming on the IBM i / System i
Objet : RE: Convert HTML to plain text
I wish but on. This will be something that will be with us
for years. What would really be nice is a printer that
understands html. Then just spool out the html generated
using your favorite way to do html and send it to a spool
file and let the driver format and print it. For the printer
manufacture it would probably be any harder than PJL or PDF
or Postscript
-----Original Message-----
From: rpg400-l-bounces@xxxxxxxxxxxx
[mailto:rpg400-l-bounces@xxxxxxxxxxxx] On Behalf Of John McKay
Sent: Tuesday, September 21, 2010 10:31 AM
To: RPG programming on the IBM i / System i
Subject: Re: Convert HTML to plain text
If this is a once-off request, life gets simpler ...
... display the HTML page as normal in a browser, copy the
page using ctrl-c then paste and the work is done for you.
Regards,
John McKay mba
----- Original Message -----
From: "Mike Cunningham" <mike.cunningham@xxxxxxx>
To: "RPG programming on the IBM i / System i" <rpg400-l@xxxxxxxxxxxx>
Sent: Tuesday, September 21, 2010 3:01 PM
Subject: RE: Convert HTML to plain text
Thanks Vern. Looked at the samples, might go that wayalthough while it
does work I find it very difficult to get the code to workand coming back
later to look at it in 6 months I get totally lost withwhat it is doing
and how/why it works. /<\s*\/?\s*span\s*.*?>/g makes nosense to my brain.
But that is my brains problem and might be the bestsolution. I usually
prefer to go with code that is a bit more "wordy".database. It
Here is a very simple sample of what is in the field in the
is not a full html page from <html> to </html> it is just asnippet of
code in html format. This one only has <BR> to deal withbut the code
could have <span>s and <li>s and a few other goodies ifthey format the
text to look nicer (indents, bullet lists, etc) It willnever be the full
suite of html, only a small subset which makes it easier.each of the
"Students in baccalaureate programs will take coursework in
following categories: CulturalDiversity,Science/Technology/Society and
Writing Enriched. Requirements are met in this programwith:<br />DIV: MGT
410<br />WEC: MGT 410, MGT 490<br />STS: Approved STSelective in Liberal
Arts core (see catalog)<br />Directed Electives in thisprogram are: ACC
230, 310, CIM 428, MGT 116, 247, 315, 320, 340, 350, MKT243, 251, 260,
QAL 101<br />(** At least 3 credits of the 6 must be 300-400 levelAN ASSOCIATE
courses)<br /><br />STUDENTS WILL ENTER THIS PROGRAM WITH
DEGREE WITH A TECHNOLOGY EMPHASIS. Current programsineligible for this
degree are BM, RM, GS, & IS. Courses used in theassociate degree
cannot be used in the final 4 semesters."chucks or a <br>,
I did the code already to put this into 80 characters
whichever occurs first and wrote a line to the print filewhich resulted
in this.each of the
Students in baccalaureate programs will take coursework in
followingand Writing
categories: Cultural Diversity,Science/Technology/Society
Enriched.428, MGT 116,
Requirements are met in this program with:
DIV: MGT 410
WEC: MGT 410, MGT 490
STS: Approved STS elective in Liberal Arts core (see catalog)
Directed Electives in this program are: ACC 230, 310, CIM
247, 315RM, GS, &
, 320, 340, 350, MKT 243, 251, 260, QAL 101
(** At least 3 credits of the 6 must be 300-400 level courses)
STUDENTS WILL ENTER THIS PROGRAM WITH AN ASSOCIATE DEGREE WITH A
TECHNOLOGY EMPH
ASIS. Current programs ineligible for this degree are BM,
IS. Cothe word wrap.
urses used in the associate degree cannot be used in the final 4
semesters.
Can easily deal with the & but I think the fun will be
I may need to parse the string into words and join wordstogether until
the length of what is joined plus the length of the nextword exceeds 80,
then write a line.[mailto:rpg400-l-bounces@xxxxxxxxxxxx]
-----Original Message-----
From: rpg400-l-bounces@xxxxxxxxxxxx
On Behalf Of Vern Hambergmethinks. I
Sent: Tuesday, September 21, 2010 8:51 AM
To: RPG programming on the IBM i / System i
Subject: Re: Convert HTML to plain text
Mike
No code, just questions!
Is the HTML in a PF or in a STMF? The latter is preferable,
can see a couple things to do - first look for closing tags- scan for
"</" - and the scan back for the matching opening tag. Thentake on the
unary (my term) tags like <br>.<head> and not the
There's also the need, perhaps, to take out <html> and
contents of <body>.everything
Or maybe, as I get from another site, it's enough to strip
between "<" and ">" in that order - unless you havecomparison operators
in there! Sites in the google below discuss these issues.http://weblogs.asp.net/rosherove/archive/2003/05/13/6963.aspx
I did a quick google on "strip html tags". One link -
- discusses
using regular expressions. Another -http://nadeausoftware.com/articles/2007/09/php_tip_how_strip_h
tml_tags_web_page
- discusses issues about text you still want inside some tags.an appropriate
Looks as if grep or sed or the like could do the work, with
expression. And those are callable from RPG or CL through QSH.strip off all the
HTH
Vern
On 9/21/2010 7:13 AM, Mike Cunningham wrote:
Would anyone happen to have RPG code to take HTML and
normal printtags and just have plain text that would be printed using
page and alsofiles? I have a form that needs to be displayed on a web
collected usingprinted from an RPG application. Part of the form is data
a variablea rich-text editor on a web page that is stored as HTML in
put in thelength field. Works great when the form is on a webpage as it is a
what-you-see-is-what-you-get function. Any special editing
Problem isrich-text editor shows on the web page exactly as entered.
file to antaking that html code and printing it using a normal print
not be too bad.outq then the printer. Stripping out the html tags might
hardest. The printDealing with<br> tags and<p> tags and<ul><li> can be a bit more
challenging but I think word wrap is going to be the
break a wordfile line is 80 characters and I need to be sure to not
see if anyon!between lines. Some tricky code and I thought I would just
(RPG400-L) mailing
e !
might have done this already and would share their code.
Thanks
--
This is the RPG programming on the IBM i / System i
list To post a message email: RPG400-L@xxxxxxxxxxxx To subscribe,(RPG400-L) mailing
unsubscribe, or change list options,
visit: http://lists.midrange.com/mailman/listinfo/rpg400-l
or email: RPG400-L-request@xxxxxxxxxxxx
Before posting, please take a moment to review the archives at
http://archive.midrange.com/rpg400-l.
--
This is the RPG programming on the IBM i / System i
list
To post a message email: RPG400-L@xxxxxxxxxxxx
To subscribe, unsubscribe, or change list options,
visit: http://lists.midrange.com/mailman/listinfo/rpg400-l
or email: RPG400-L-request@xxxxxxxxxxxx
Before posting, please take a moment to review the archives
at http://archive.midrange.com/rpg400-l.
--
This is the RPG programming on the IBM i / System i
(RPG400-L) mailing list
To post a message email: RPG400-L@xxxxxxxxxxxx
To subscribe, unsubscribe, or change list options,
visit: http://lists.midrange.com/mailman/listinfo/rpg400-l
or email: RPG400-L-request@xxxxxxxxxxxx
Before posting, please take a moment to review the archives
at http://archive.midrange.com/rpg400-l.
--
This is the RPG programming on the IBM i / System i
(RPG400-L) mailing list
To post a message email: RPG400-L@xxxxxxxxxxxx
To subscribe, unsubscribe, or change list options,
visit: http://lists.midrange.com/mailman/listinfo/rpg400-l
or email: RPG400-L-request@xxxxxxxxxxxx
Before posting, please take a moment to review the archives
at http://archive.midrange.com/rpg400-l.
As an Amazon Associate we earn from qualifying purchases.
This mailing list archive is Copyright 1997-2024 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].
Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.