<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<style type="text/css" id="owaParaStyle" style="display: none; ">P {margin-top:0;margin-bottom:0;}</style>
</head>
<body dir="ltr" fpstyle="1" aria-label="Message body" tabindex="0" style="">
<div name="divtagdefaultwrapper" id="divtagdefaultwrapper" style="font-family: Calibri,Arial,Helvetica,sans-serif; font-size: 12pt; color: #000000; margin: 0">
Yeah, in general editors or other tools will put in place-holders for non printable characters. The ^M's are carriage returns (left by Windows) and can be safely ignored, what I saw was a null character good text editors shouldn't choke up on either of those
but the null characters would explain the encoding strangeness. You can look for non-printable characters using greps character range:
<div><br>
</div>
<div> grep -aP '[\x7f-\xff]' AZ021306-FOIArefund-list.txt</div>
<div><br>
</div>
<div>If you're interested in fixing this to be recognized as a text file, you can combined that with `nl`:</div>
<div><br>
</div>
<div> nl AZ021306-FOIArefund-list.txt | grep -aP '[\x7f-\xff]'<br>
<div><br>
</div>
<div>which should get you a line number. From there use a "programmer's text editor" (vim, emacs, sublime text, ect.) to removing the offending character.</div>
<div><br>
</div>
<div>Hope that helps.</div>
<div><br>
<br>
<div name="divtagdefaultwrapper" style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:; margin:0">
<div style="font-family:Tahoma; font-size:13px">Paul Mooring
<div>Operations Engineer</div>
<div>www.opscode.com</div>
</div>
</div>
</div>
<br>
<div style="color: rgb(40, 40, 40); ">
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> plug-discuss-bounces@lists.phxlinux.org <plug-discuss-bounces@lists.phxlinux.org> on behalf of Michael Havens <bmike1@gmail.com><br>
<b>Sent:</b> Thursday, July 25, 2013 3:02 PM<br>
<b>To:</b> Main PLUG discussion list<br>
<b>Subject:</b> Re: I got a zip file from the government but it won't unzip properly</font>
<div> </div>
</div>
<div>cool..... vi opens it. I do not see any '^@' characters but I see a lot of '^M' characters. I see a few '@' characters. So I do not need to worry about those characters.... only the non ascii ones?<br clear="all">
<div>:-)~MIKE~(-:</div>
<br>
<br>
<div class="gmail_quote">On Thu, Jul 25, 2013 at 2:42 PM, Paul Mooring <span dir="ltr">
<<a href="mailto:paul@opscode.com" target="_blank">paul@opscode.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex; border-left:1px #ccc solid; padding-left:1ex">
<div dir="ltr">
<div name="divtagdefaultwrapper" style="font-size:12pt; margin:0; font-family:Calibri,Arial,Helvetica,sans-serif">
There's some null characters (look like ^@ in vim) in this file causing your problem, run `file -I <file_name>` and you'll see it's charset is binary. I found some null chars on line 155 (grep for non-ascii characters to find them). You can remove them with
sed or tr and be on your happy way.<br>
<div><br>
<br>
<div name="divtagdefaultwrapper">
<div style="font-family:Tahoma; font-size:13px">Paul Mooring
<div>Operations Engineer</div>
<div><a href="http://www.opscode.com" target="_blank">www.opscode.com</a></div>
</div>
</div>
</div>
<br>
<div style="color:rgb(40,40,40)">
<hr style="display:inline-block; width:98%">
<div dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b>
<a href="mailto:plug-discuss-bounces@lists.phxlinux.org" target="_blank">plug-discuss-bounces@lists.phxlinux.org</a> <<a href="mailto:plug-discuss-bounces@lists.phxlinux.org" target="_blank">plug-discuss-bounces@lists.phxlinux.org</a>> on behalf of Michael
Havens <<a href="mailto:bmike1@gmail.com" target="_blank">bmike1@gmail.com</a>><br>
<b>Sent:</b> Thursday, July 25, 2013 2:09 PM<br>
<b>To:</b> <a href="mailto:plug-discuss@phxlinux.org" target="_blank">plug-discuss@phxlinux.org</a><br>
<b>Subject:</b> I got a zip file from the government but it won't unzip properly</font>
<div> </div>
</div>
<div>
<div class="h5">
<div>It seems to unzip fine but the document that opens freezes libre writer and when I try to open it with a text editor it complains about encoding. here is the file:
<div><br>
</div>
<div><a href="https://www.dropbox.com/s/q5x24uzhyrv3mo1/az-foiarefund-list.zip" target="_blank">https://www.dropbox.com/s/q5x24uzhyrv3mo1/az-foiarefund-list.zip</a></div>
<div><br>
</div>
<div>Any ideas how to open it?<br clear="all">
<div>:-)~MIKE~(-:</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<br>
---------------------------------------------------<br>
PLUG-discuss mailing list - <a href="mailto:PLUG-discuss@lists.phxlinux.org">PLUG-discuss@lists.phxlinux.org</a><br>
To subscribe, unsubscribe, or to change your mail settings:<br>
<a href="http://lists.phxlinux.org/mailman/listinfo/plug-discuss" target="_blank">http://lists.phxlinux.org/mailman/listinfo/plug-discuss</a><br>
</blockquote>
</div>
<br>
</div>
</div>
</div>
</div>
</body>
</html>