|
On Jul 30, 2020, at 8:53 AM, Paul Nicolay <paul.nicolay@xxxxxxxxxx> wrote:
Joep,
I was under the impression that %LEN is not unicode aware and would return the number of bytes instead of characters however the manual states the following;
For character, graphic, or UCS-2 expressions the value returned is the number of characters in the value
of the expression. For variable-length values, such as the value returned from a built-in function or a
variable-length field, the value returned by %LEN is the current length of the character, graphic, or
UCS-2 value.
Based on this it looks indeed like a bug ?
Kind regards,
Paul
________________________________________
From: RPG400-L <rpg400-l-bounces@xxxxxxxxxxxxxxxxxx> on behalf of Joep Beckeringh via RPG400-L <rpg400-l@xxxxxxxxxxxxxxxxxx>
Sent: Thursday, July 30, 2020 14:09
To: rpg400-l@xxxxxxxxxxxxxxxxxx
Cc: Joep Beckeringh
Subject: Re: %Subst and %len with UTF-8 data
Yes, I know. But in the case of ChrUtf the binary value contains 3,
while the string exists of 1 character (€) that uses 3 bytes. And the
documentation states that %len returns the number of characters.
Joep Beckeringh
Pantheon Automatisering BV
Op 30-7-2020 om 13:13 schreef Birgitta Hauser:
A varying length field is always preceeded with an (invisible) 2 (or 4) Byte Binary value which represents the total length (number of characters) of the content in the varying length field.--
Mit freundlichen Grüßen / Best regards
Birgitta Hauser
"Shoot for the moon, even if you miss, you'll land among the stars." (Les Brown)
"If you think education is expensive, try ignorance." (Derek Bok)
"What is worse than training your staff and losing them? Not training them and keeping them!"
„Train people well enough so they can leave, treat them well enough so they don't want to.“ (Richard Branson)
-----Original Message-----
From: RPG400-L <rpg400-l-bounces@xxxxxxxxxxxxxxxxxx> On Behalf Of Joep Beckeringh via RPG400-L
Sent: Donnerstag, 30. Juli 2020 12:25
To: rpg400-l@xxxxxxxxxxxxxxxxxx
Cc: Joep Beckeringh <joep.beckeringh@xxxxxxxxxx>
Subject: %Subst and %len with UTF-8 data
Hello All,
While experimenting with CCSID(*UTF8) I found the following:
- I have a character variable BufIn with CCSID(*UTF8); containing (amongst others) a Euro-sign (€; x'E282AC' in UTF-8)
- I have a variable length (length 10) character variable ChrUtf with
CCSID(*UTF8)
- When I try to get the Euro-sign into ChrUtf with 'ChrUtf =
%subst(BufIn: pos: 1)' ChrUtf contains x'0001E2'.
I am inclined to consider this a bug.
- I have another variable EurUtf - char(3) ccsid(*utf8) - which I initialized with X'E282AC' (€)
- After 'ChrUtf = EurUtf' ChrUtf contains x'0003E282AC' and %len(ChrUtf) yields 3; the number of bytes in the string; not the number of characters.
I am inclined to consider this a bug as well.
I guess we need Barbara Morris to shed some light on this.
Joep Beckeringh
Pantheon Automatisering BV
--
This is the RPG programming on IBM i (RPG400-L) mailing list To post a message email: RPG400-L@xxxxxxxxxxxxxxxxxx To subscribe, unsubscribe, or change list options,
visit: https://lists.midrange.com/mailman/listinfo/rpg400-l
or email: RPG400-L-request@xxxxxxxxxxxxxxxxxx
Before posting, please take a moment to review the archives at https://archive.midrange.com/rpg400-l.
Please contact support@xxxxxxxxxxxxxxxxxxxx for any subscription related questions.
Help support midrange.com by shopping at amazon.com with our affiliate link: https://amazon.midrange.com
This is the RPG programming on IBM i (RPG400-L) mailing list
To post a message email: RPG400-L@xxxxxxxxxxxxxxxxxx
To subscribe, unsubscribe, or change list options,
visit: https://lists.midrange.com/mailman/listinfo/rpg400-l
or email: RPG400-L-request@xxxxxxxxxxxxxxxxxx
Before posting, please take a moment to review the archives
at https://archive.midrange.com/rpg400-l.
Please contact support@xxxxxxxxxxxxxxxxxxxx for any subscription related questions.
Help support midrange.com by shopping at amazon.com with our affiliate link: https://amazon.midrange.com
--
This is the RPG programming on IBM i (RPG400-L) mailing list
To post a message email: RPG400-L@xxxxxxxxxxxxxxxxxx
To subscribe, unsubscribe, or change list options,
visit: https://lists.midrange.com/mailman/listinfo/rpg400-l
or email: RPG400-L-request@xxxxxxxxxxxxxxxxxx
Before posting, please take a moment to review the archives
at https://archive.midrange.com/rpg400-l.
Please contact support@xxxxxxxxxxxxxxxxxxxx for any subscription related questions.
Help support midrange.com by shopping at amazon.com with our affiliate link: https://amazon.midrange.com
As an Amazon Associate we earn from qualifying purchases.
This mailing list archive is Copyright 1997-2024 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].
Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.