diff options
author | Karl Williamson <khw@cpan.org> | 2015-02-17 15:03:32 -0700 |
---|---|---|
committer | Karl Williamson <khw@cpan.org> | 2015-02-19 22:55:01 -0700 |
commit | 64935bc6975bb01af403817752e88d6540c8711d (patch) | |
tree | 6a1619ac46d1501a5b296a2b69d9af0b98db8c58 /regcomp.sym | |
parent | 0e0b935601a8b7a2c56653412a94a36f986bc34f (diff) | |
download | perl-64935bc6975bb01af403817752e88d6540c8711d.tar.gz |
Add qr/\b{gcb}/
A function implements seeing if the space between any two characters is
a grapheme cluster break. Afer I wrote this, I realized that an array
lookup might be a better implementation, but the deadline for v5.22 was
too close to change it. I did see that my gcc optimized it down to
an array lookup.
This makes the implementation of \X go from being complicated to
trivial.
Diffstat (limited to 'regcomp.sym')
-rw-r--r-- | regcomp.sym | 16 |
1 files changed, 8 insertions, 8 deletions
diff --git a/regcomp.sym b/regcomp.sym index c20c5aaad1..7daa241dba 100644 --- a/regcomp.sym +++ b/regcomp.sym @@ -43,15 +43,15 @@ GPOS GPOS, no ; Matches where last m//g left off. # in regcomp.c uses the enum value of the modifier as an offset from the /d # version. The complements must come after the non-complements. # BOUND, POSIX and their complements are affected, as well as EXACTF. -BOUND BOUND, no ; Match "" at any word boundary using native charset rules for non-utf8 -BOUNDL BOUND, no ; Match "" at any locale word boundary -BOUNDU BOUND, no ; Match "" at any word boundary using Unicode rules -BOUNDA BOUND, no ; Match "" at any word boundary using ASCII rules +BOUND BOUND, no ; Match "" at any word boundary using native charset rules for non-utf8, otherwise Unicode rules +BOUNDL BOUND, no ; Match "" at any boundary of a given type using locale rules +BOUNDU BOUND, no ; Match "" at any boundary of a given type using Unicode rules +BOUNDA BOUND, no ; Match "" at any boundary of a given type using ASCII rules # All NBOUND nodes are required by code in regexec.c to be greater than all BOUND ones -NBOUND NBOUND, no ; Match "" at any word non-boundary using native charset rules for non-utf8 -NBOUNDL NBOUND, no ; Match "" at any locale word non-boundary -NBOUNDU NBOUND, no ; Match "" at any word non-boundary using Unicode rules -NBOUNDA NBOUND, no ; Match "" at any word non-boundary using ASCII rules +NBOUND NBOUND, no ; Match "" at any word non-boundary using native charset rules for non-utf8, otherwise Unicode rules +NBOUNDL NBOUND, no ; Match "" at any boundary of a given type using locale rules +NBOUNDU NBOUND, no ; Match "" at any boundary of a given type using using Unicode rules +NBOUNDA NBOUND, no ; Match "" at any boundary of a given type using using ASCII rules #* [Special] alternatives: REG_ANY REG_ANY, no 0 S ; Match any one character (except newline). |