summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorEli Zaretskii <eliz@gnu.org>2021-05-30 13:20:02 +0300
committerEli Zaretskii <eliz@gnu.org>2021-05-30 13:20:02 +0300
commit5dfa5e26dd58c6e8aa135c81ecfdbda92725556e (patch)
tree626326cecd538891a8aaa8fa759dffc47542626b
parent30e5d93ee1b938c207c4f5ec015ba2633cf423cd (diff)
downloademacs-5dfa5e26dd58c6e8aa135c81ecfdbda92725556e.tar.gz
Improve documentation of regexp ranges
* doc/lispref/searching.texi (Regexp Special): Document the effect of using octal escapes in regexp ranges. (Bug#17758)
-rw-r--r--doc/lispref/searching.texi9
1 files changed, 8 insertions, 1 deletions
diff --git a/doc/lispref/searching.texi b/doc/lispref/searching.texi
index 8b900da616f..1ee4be7dd13 100644
--- a/doc/lispref/searching.texi
+++ b/doc/lispref/searching.texi
@@ -363,7 +363,7 @@ preceding expression either once or not at all. For example,
@anchor{Non-greedy repetition}
@item @samp{*?}, @samp{+?}, @samp{??}
@cindex non-greedy repetition characters in regexp
-These are @dfn{non-greedy} variants of the operators @samp{*}, @samp{+}
+are @dfn{non-greedy} variants of the operators @samp{*}, @samp{+}
and @samp{?}. Where those operators match the largest possible
substring (consistent with matching the entire containing expression),
the non-greedy variants match the smallest possible substring
@@ -438,6 +438,13 @@ including newline. However, a reversed range should always be from
the letter @samp{z} to the letter @samp{a} to make it clear that it is
not a typo; for example, @samp{[+-*/]} should be avoided, because it
matches only @samp{/} rather than the likely-intended four characters.
+
+@item
+If the end points of a range are raw 8-bit bytes (@pxref{Text
+Representations}), or if the range start is ASCII and the end is a raw
+byte (as in @samp{[a-\377]}), the range will match only ASCII
+characters and raw 8-bit bytes, but not non-ASCII characters. This
+feature is intended for searching text in unibyte buffers and strings.
@end enumerate
Some kinds of character alternatives are not the best style even