Re: unicode on Linux

From: Benjamin Peterson (ben@jbrowse.com)
Date: Tue Oct 21 2003 - 14:19:10 CST


Edward H. Trager wrote:
> and to my knowledge Windows does not yet have grep at all ...

Oh, a curse on Bill Gates and his newfangled Micro$loth systems :) To
_my_ knowledge, however...

There's cygwin.

Or better yet, no cygwin! http://unxutils.sourceforge.net/

Or, GNU grep! http://gnuwin32.sourceforge.net/
There's a different build of GNU grep here:
http://members.ozemail.com.au/~crn/grep.html

Or, you could use the MS equivalent, findstr, which works on multibyte
characters provided the current code page matches the file's encoding
(i.e. you have to set the code page to 932 to make it work on a
Shift-JIS file, and so on). You'd think you could use it on UTF-8 by
setting the code page to 65001, but that doesn't work for me. On the
other hand, it does recurse into directories.
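
For comparison, here is a rough sketch of the same idea with the
encoding passed in explicitly instead of being taken from the code
page. It is written in Python, which is my choice for illustration and
not one of the tools listed here; the names grep_file and grep_tree are
made up for this example. Undecodable bytes are replaced rather than
treated as errors, which is an assumption about what you would want.

```python
import re
from pathlib import Path


def grep_file(path, pattern, encoding):
    """Yield (line_number, line) for each line of `path` matching `pattern`.

    The file is decoded with the caller-supplied `encoding` (for example
    "shift_jis" or "utf-8"); nothing is inferred from the environment.
    """
    regex = re.compile(pattern)
    # errors="replace" keeps the scan going past bytes that do not decode.
    with open(path, encoding=encoding, errors="replace") as handle:
        for number, line in enumerate(handle, start=1):
            if regex.search(line):
                yield number, line.rstrip("\n")


def grep_tree(root, pattern, encoding):
    """Recurse into `root` and search every regular file beneath it."""
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            for number, line in grep_file(path, pattern, encoding):
                yield path, number, line
```

Calling grep_tree("docs", "some pattern", "shift_jis") would walk a
(hypothetical) docs directory and yield the path, line number and text
of every matching line, assuming every file under it is Shift-JIS.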

Or, there's the DJGPP version of grep: http://www.delorie.com/djgpp/

And related to it, there's the version that uses the PW32 project:
http://pw32.sourceforge.net/

Or, there's cgrep and jgrep; but I don't know what particular encodings
they work with and I don't have the URLs to hand.

Or, there's a modified GNU grep here:
http://www.interlog.com/~tcharron/grep.html

...and so on. I usually use unxutils.

-- 
  Benjamin Peterson
  bjsp123@imap.cc

