[Gllug] Find non-7-bit characters in files

Ian Norton bredroll at darkspace.org.uk
Thu Jun 16 17:24:06 UTC 2005


On Thu, Jun 16, 2005 at 06:02:44PM +0100, Richard Jones wrote:
> Here's a small Thursday afternoon puzzler for everyone.
> 
> I hae a large number of files (HTML files in fact, not that it
> matters).  A clueless^Wevil web monkey^Wdesigner has hidden bytes in
> them that are in the range 0x80 - 0xff, so the files aren't valid
> UTF-8.
> 
> I want to find those characters.  Preferably quickly from the command
> line.
> 
> I tried various combinations of egrep with the [:print:] character,
> but to no avail.
> 

convmv

http://j3e.de/linux/convmv/

enjoy

-- 
Ian Norton-Badrul

-- 
Gllug mailing list  -  Gllug at gllug.org.uk
http://lists.gllug.org.uk/mailman/listinfo/gllug




More information about the GLLUG mailing list