<?xml version="1.0" encoding="UTF-8" ?>

<bugzilla version="5.2"
          urlbase="https://bugzilla.altlinux.org/"
          
          maintainer="jenya@basealt.ru"
>

    <bug>
          <bug_id>22032</bug_id>
          
          <creation_ts>2009-10-22 19:51:03 +0400</creation_ts>
          <short_desc>sed: в локали UTF8 точка в регулярном выражении не матчит не-UTF8 символы</short_desc>
          <delta_ts>2010-12-13 03:09:42 +0300</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>4</classification_id>
          <classification>Development</classification>
          <product>Sisyphus</product>
          <component>sed</component>
          <version>unstable</version>
          <rep_platform>all</rep_platform>
          <op_sys>Linux</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>NOTABUG</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P3</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Vladislav Zavjalov">slazav</reporter>
          <assigned_to name="placeholder@altlinux.org">placeholder</assigned_to>
          <cc>glebfm</cc>
    
    <cc>ldv</cc>
    
    <cc>php-coder</cc>
    
    <cc>placeholder</cc>
    
    <cc>vt</cc>
          
          <qa_contact>qa-sisyphus</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>101993</commentid>
    <comment_count>0</comment_count>
    <who name="Vladislav Zavjalov">slazav</who>
    <bug_when>2009-10-22 19:51:03 +0400</bug_when>
    <thetext>#!/bin/bash

printf &quot;\xF7\xCf\xD4&quot; | LC_ALL=ru_RU.UTF8 sed -e &apos;s/./X/g&apos;

# ожидаемый вывод &quot;ХХХ&quot;, а выводит исходную строчку без изменений
# при том, что в в info sed написано &quot;Matches any character&quot;</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>116283</commentid>
    <comment_count>1</comment_count>
    <who name="Vladislav Zavjalov">slazav</who>
    <bug_when>2010-12-13 03:09:42 +0300</bug_when>
    <thetext>В том же info sed наткнулся:

`z&apos;
     This command empties the content of pattern space.  It is usually
     the same as `s/.*//&apos;, but is more efficient and works in the
     presence of invalid multibyte sequences in the input stream.
     POSIX mandates that such sequences are _not_ matched by `.&apos;, so
     that there is no portable way to clear `sed&apos;&apos;s buffers in the
     middle of the script in most multibyte locales (including UTF-8
     locales).

Так что это известно и считается правильным. Закрываю как NOTABUG, сочувствую обитателям локали utf8 :)</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>