| Summary: | sed: в локали UTF8 точка в регулярном выражении не матчит не-UTF8 символы | ||
|---|---|---|---|
| Product: | Sisyphus | Reporter: | Vladislav Zavjalov <slazav> |
| Component: | sed | Assignee: | placeholder <placeholder> |
| Status: | CLOSED NOTABUG | QA Contact: | qa-sisyphus |
| Severity: | normal | ||
| Priority: | P3 | CC: | glebfm, ldv, php-coder, placeholder, vt |
| Version: | unstable | ||
| Hardware: | all | ||
| OS: | Linux | ||
|
Description
Vladislav Zavjalov
2009-10-22 19:51:03 MSD
В том же info sed наткнулся:
`z'
This command empties the content of pattern space. It is usually
the same as `s/.*//', but is more efficient and works in the
presence of invalid multibyte sequences in the input stream.
POSIX mandates that such sequences are _not_ matched by `.', so
that there is no portable way to clear `sed''s buffers in the
middle of the script in most multibyte locales (including UTF-8
locales).
Так что это известно и считается правильным. Закрываю как NOTABUG, сочувствую обитателям локали utf8 :)
|