diff options
author | Carlo Marcelo Arenas Belón <carenas@gmail.com> | 2023-01-06 19:34:56 -0800 |
---|---|---|
committer | Jim Meyering <meyering@fb.com> | 2023-01-07 18:24:51 -0800 |
commit | 5e3b760f65f13856e5717e5b9d935f5b4a615be3 (patch) | |
tree | 84050b2bd44f892b69288bc1b12fbcdd23826d71 /THANKS.in | |
parent | 45e1158a4bb44e507239274535290db61dd27577 (diff) | |
download | grep-5e3b760f65f13856e5717e5b9d935f5b4a615be3.tar.gz |
pcre: use UCP in UTF mode
This fixes a serious bug affecting word-boundary and word-constituent regular
expressions when the desired match involves non-ASCII UTF8 characters.
* src/pcresearch.c: Set PCRE2_UCP together with PCRE2_UTF
* tests/pcre-utf8-w: New file.
* tests/Makefile.am (TESTS): Add it.
* NEWS (Bug fixes): Mention this.
* THANKS.in: Add Gro-Tsen and Karl Petterson.
Reported by Gro-Tsen https://twitter.com/gro_tsen/status/1610972356972875777
via Karl Pettersson in https://github.com/PCRE2Project/pcre2/issues/185
This bug was present from grep-2.5, when --perl-regexp (-P) support was added.
Diffstat (limited to 'THANKS.in')
-rw-r--r-- | THANKS.in | 2 |
1 files changed, 2 insertions, 0 deletions
@@ -35,6 +35,7 @@ Gerald Stoller gerald_stoller@hotmail.com Grant McDorman grant@isgtec.com Greg Boyd gboyd.ccsf@gmail.com Greg Louis glouis@dynamicro.on.ca +Gro-Tsen https://twitter.com/gro_tsen Guglielmo 'bond' Bondioni g.bondioni@libero.it H. Merijn Brand h.m.brand@hccnet.nl Harald Hanche-Olsen hanche@math.ntnu.no @@ -50,6 +51,7 @@ Joel N. Weber II devnull@gnu.org John Hughes john@nitelite.calvacom.fr Jorge Stolfi stolfi@dcc.unicamp.br Karl Heuer kwzh@gnu.org +Karl Petterson karl.pettersson@klpn.se Kaveh R. Ghazi ghazi@caip.rutgers.edu Kazuro Furukawa furukawa@apricot.kek.jp Keith Bostic bostic@bsdi.com |