[File] ASCII text detected as data
Aman Sharma
amansha at kth.se
Mon Mar 17 18:26:54 UTC 2025
Hi,
Thanks for the response! I had overlooked that because I was too fixated on the diff.
If this is the case, shouldn't the output of file be `Unicode text, UTF-8 text`? I verified this with Python as well: the following succeeds with encoding="utf-8", but it fails with encoding="ascii".
```
import codecs
codecs.open("ref", encoding="utf-8", errors="strict").readlines()
```
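To make that check concrete without the original files, here is a minimal sketch using the byte sequence Christos quoted below. The helper names `decodes_as` and `control_bytes` are my own, and the trailing 0x1e byte is appended purely for illustration (it stands in for the record separator Vadim found with xxd). This only shows what Python's decoders accept; it is not a claim about how file itself classifies the input.

```python
# Sample bytes from the thread ("Ga \xc3\xa9\xc3\xa9n stap verder"), with a
# 0x1e byte appended here as an assumption, for illustration only.
sample = b"Ga \xc3\xa9\xc3\xa9n stap verder\x1e"


def decodes_as(data: bytes, encoding: str) -> bool:
    """Return True if data decodes strictly under the given encoding."""
    try:
        data.decode(encoding, errors="strict")
        return True
    except UnicodeDecodeError:
        return False


def control_bytes(data: bytes) -> list[tuple[int, int]]:
    """Return (offset, value) pairs for C0 control bytes, ignoring tab, LF and CR."""
    allowed = {0x09, 0x0A, 0x0D}  # hypothetical allow-list of common whitespace
    return [(i, b) for i, b in enumerate(data) if b < 0x20 and b not in allowed]


print(decodes_as(sample, "ascii"))  # the \xc3\xa9 pairs are outside ASCII
print(decodes_as(sample, "utf-8"))  # the same bytes are valid UTF-8
print(control_bytes(sample))        # offsets of control bytes such as 0x1e
```

Note that the 0x1e byte decodes without error under both encodings, so a strict decode alone would not reveal it.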
Had it been some
Regards,
Aman Sharma
PhD Student
KTH Royal Institute of Technology
School of Electrical Engineering and Computer Science (EECS)
Department of Theoretical Computer Science (TCS)
<http://www.kth.se> <https://www.kth.se/profile/amansha>
https://algomaster99.github.io/
________________________________
From: Christos Zoulas <christos at zoulas.com>
Sent: Saturday, March 15, 2025 6:33:03 PM
To: Aman Sharma
Cc: file at astron.com
Subject: Re: [File] ASCII text detected as data
They contain non-ASCII characters, e.g.:
Ga \xc3\xa9\xc3\xa9n stap verder
christos
> On Mar 15, 2025, at 12:16 PM, Vadim <vad.mailingfile at vad.cx> wrote:
>
> Hi,
>
> > xxd -s 0x5a30 -l 16 -g 1 reb
> > xxd -s 0x5ed0 -l 16 -g 1 ref
>
> shows a 0x1e byte, which is the ASCII control character "record separator" (RS).
>
> xxd highlights UTF-8 characters in red too.
>
> Cheers,
> Vadim
> --
> File mailing list
> File at astron.com
> https://mailman.astron.com/mailman/listinfo/file