Update Scintilla regular expression info for v1.77 (character

classes, ASCII escaping, character sets containing square
brackets peculiarities). Adapted from SciTE doc.



git-svn-id: https://geany.svn.sourceforge.net/svnroot/geany/trunk@3443 ea778897-0a13-0410-b9d1-a72fbfd435f5
This commit is contained in:
Nick Treleaven 2009-01-03 16:37:47 +00:00
parent 9a89cd2a51
commit 3accd528ca
3 changed files with 313 additions and 259 deletions

View File

@ -22,6 +22,10 @@
src/editor.c, src/symbols.c:
Remove geanyobject.h includes.
Use GObject instead of unused GeanyObject argument.
* doc/geany.txt, doc/geany.html:
Update Scintilla regular expression info for v1.77 (character
classes, ASCII escaping, character sets containing square
brackets peculiarities). Adapted from SciTE doc.
2009-01-02 Nick Treleaven <nick(dot)treleaven(at)btinternet(dot)com>

File diff suppressed because it is too large Load Diff

View File

@ -1062,6 +1062,8 @@ POSIX-like, as described in the table below.
(1) Searching backwards with regular expressions is not supported.
(2) \\r and \\n are never matched because regular expression
searches are made line per line (stripped of end-of-line chars).
(3) The POSIX '?' regular expression character for optional
matching is not supported.
**In a regular expression, the following characters are interpreted:**
@ -1081,17 +1083,41 @@ POSIX-like, as described in the table below.
\\> This matches the end of a word.
\\c A backslash followed by d, D, s, S, w or W, becomes a
character class (both inside and outside sets []).
* d: decimal digits
* D: any char except decimal digits
* s: whitespace (space, \\t \\n \\r \\f \\v)
* S: any char except whitespace (see above)
* w: alphanumeric & underscore (changed by user setting)
* W: any char except alphanumeric & underscore (see above)
\\x This allows you to use a character x that would otherwise have
a special meaning. For example, \\[ would be interpreted as [
and not as the start of a character set. Use \\\\ for a literal
backslash.
[...] This indicates a set of characters, for example, [abc] means
any of the characters a, b or c. You can also use ranges, for
example [a-z] for any lower case character.
\\xHH A backslash followed by x and two hexa digits, becomes the
character whose Ascii code is equal to these digits. If not
followed by two digits, it is 'x' char itself.
[^...] The complement of the characters in the set. For example,
[^A-Za-z] means any character except an alphabetic character.
[...] Matches one of the characters in the set. If the first
character in the set is ^, it matches the characters NOT in
the set, i.e. complements the set. A shorthand S-E (start
dash end) is used to specify a set of characters S up to E,
inclusive. The special characters ] and - have no special
meaning if they appear as the first chars in the set. To
include both, put - first: [-]A-Z] (or just backslash them).
Examples::
[-]|] matches these 3 chars
[]-|] matches from ] to | chars
[a-z] any lowercase alpha
[^-]] any char except - and ]
[^A-Z] any char except uppercase alpha
[a-zA-Z] any alpha
^ This matches the start of a line (unless used inside a set, see
above).
@ -1105,12 +1131,9 @@ $ This matches the end of a line.
Saam, Saaam and so on.
====== ============================================================
Partial POSIX compatibility
```````````````````````````
Note that the POSIX '?' regular expression character for optional
matching is not supported by the Find and Replace dialogs.
.. note::
This table is adapted from Scintilla and SciTE documentation,
distributed under the `License for Scintilla and SciTE`_.