mirror of
https://github.com/neovim/neovim
synced 2025-07-15 16:51:49 +00:00
vim-patch:6f85cec: runtime(python): update rendering of Unicode named literals in syntax script
This change:
* enforces that the alias starts with a letter
* allows the other words in an alias to be separated by either a space
or a hyphen, but not both or double separators
* allows only a letter after space, possibly followed by letters or
digits
* allows both letters and digits after a hyphen
Tested with:
a = '\N{Cyrillic Small Letter Zhe} is pronounced as zh in pleasure'
b = '\N{NO-BREAK SPACE} is needed here'
# ... other tests here
r = '\N{HENTAIGANA LETTER E-1} is a Japanese hiragana letter archaic ye'
s = '\N{CUNEIFORM SIGN NU11 TENU} is a correction alias'
t = '\N{RECYCLING SYMBOL FOR TYPE-1 PLASTICS} base shape is a triangle'
print(a)
print(b)
print(r)
print(s)
print(t)
The tests confirm the behavior and are selected from real Unicode
tables/aliases to check these combinations based on the specification.
fixes: vim/vim#17323
closes: vim/vim#17735
6f85cec4fb
Co-authored-by: Zvezdan Petkovic <zpetkovic@acm.org>
This commit is contained in:
committed by
Christian Clason
parent
3e7f5d95aa
commit
444a8b3ec6
@ -160,7 +160,8 @@ syn match pythonEscape "\\\o\{1,3}" contained
|
|||||||
syn match pythonEscape "\\x\x\{2}" contained
|
syn match pythonEscape "\\x\x\{2}" contained
|
||||||
syn match pythonEscape "\%(\\u\x\{4}\|\\U\x\{8}\)" contained
|
syn match pythonEscape "\%(\\u\x\{4}\|\\U\x\{8}\)" contained
|
||||||
" Python allows case-insensitive Unicode IDs: http://www.unicode.org/charts/
|
" Python allows case-insensitive Unicode IDs: http://www.unicode.org/charts/
|
||||||
syn match pythonEscape "\\N{\a\+\%(\s\a\+\)*}" contained
|
" The specification: https://www.unicode.org/versions/Unicode16.0.0/core-spec/chapter-4/#G135165
|
||||||
|
syn match pythonEscape "\\N{\a\+\%(\%(\s\a\+[[:alnum:]]*\)\|\%(-[[:alnum:]]\+\)\)*}" contained
|
||||||
syn match pythonEscape "\\$"
|
syn match pythonEscape "\\$"
|
||||||
|
|
||||||
" It is very important to understand all details before changing the
|
" It is very important to understand all details before changing the
|
||||||
|
Reference in New Issue
Block a user