Ticket #696 (closed Support Requests: Works For Me)

Opened 11 years ago

Last modified 11 years ago

replacing IsBasicLatin in extanded syntax

Reported by: nobody Owned by: nobody
Milestone: Component: None
Version: None Severity:
Keywords: Cc:


I'm trying to create an expression that replaces the \p
{IsBasicLatin}, that is [\x00-\x7F]. I manage to do it 
with PERL syntax, but not with 'extended' or 'awk'. It 
simply doesn't find what I want it to find.

The expressions I've tried are:


I don't understand the problem, or how I can make it 

Thanks, Moddy.


Change History

comment:1 Changed 11 years ago by johnmaddock

  • Status changed from assigned to closed
Logged In: YES 

In the docs for POSIX regular expressions here:

It says: 

Character ranges:

For example [a-c] will match any single character in the
range 'a' to 'c'.  By default, for POSIX-Extended regular
expressions, a character x is within the range y to z, if it
collates within that range; THIS RESULTS IN LOCAL SPECIFIC
BEHAVIOUR .  This behavior can be turned off by unsetting
the collate option flag - in which case whether a character
appears within a range is determined by comparing the code
points of the characters only.

So use boost::regex::extended & ~boost::regex::collate as
the syntax type to force character ranges to be independent
of the locale.

HTH, John.

Add a comment

Modify Ticket

Change Properties
<Author field>
as closed
The resolution will be deleted. Next status will be 'reopened'

E-mail address and user name can be saved in the Preferences.

Note: See TracTickets for help on using tickets.