× The internal search function is temporarily non-functional. The current search engine is no longer viable and we are researching alternatives.
As a stop gap measure, we are using Google's custom search engine service.
If you know of an easy to use, open source, search engine ... please contact support@midrange.com.



Scott Klement wrote:
pattern = '(\S\S{40,}|.{1,40})(\s+|$)' + x'00';

The next element of the array matches your first subexpression. Subexpressions are the things in parenthesis. Your first one is (\S\S{40,}|.{1,40}) -- which means 40 or more of the letter S or 1-40 of any character.

Not to put words in Aaron's mouth, but I suspect that when he's using \s and \S he's looking for whitespace and non-whitespace respectively. Do the C regex APIs support Extended Regular Expression syntax, specifically the shorthand character classes? [1]

If I'm right about Aaron's intention with \s and \S, then I'm a bit confused about the '\S\S{40,} part. I think that if the search string contained 41 or more consecutive non-whitespace characters, that would be what was matched. I haven't tested this.

HTH,
Adam

[1] http://en.wikipedia.org/wiki/Regular_expression#POSIX_Extended_Regular_Expressions

As an Amazon Associate we earn from qualifying purchases.

This thread ...

Follow-Ups:
Replies:

Follow On AppleNews
Return to Archive home page | Return to MIDRANGE.COM home page

This mailing list archive is Copyright 1997-2024 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].

Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.