[File] [PATCH] Magdir/archive for CAZIP compressed file *.??_

Jörg Jenderek joerg.jen.der.ek at gmx.net
Sat Oct 30 00:18:50 UTC 2021


Hello,

some time ago I installed an old DOS program. Its installation
directory contains files whose file name extension ends with an
underscore.

When I run file command version 5.41 on such compressed files and on
the related unpacked files, I get output like:

BLINKER.WRI:  Microsoft Write 3.0 (Windows) Document,
	      115 pages,
	      Free T-shirts and up to 50% savings!
BLINKER.WR_:  data
CAOSETUP.EX_: data
CLIPDEFS.H:   C source, ASCII text, with CRLF line terminators
CLIPDEFS._:   data
CLIPPER.EXE:  MS-DOS executable, MZ for MS-DOS
CLIPPER.EX_:  data
FILEIO.CH:    ASCII text, with CRLF line terminators
FILEIO.C_:    data
SALES.DBF:    FoxBase+/dBase III DBF, 8 records * 31,
	      update-date 93-11-18, at offset 97
	      1st record "Susan Smith"
SALES.DB_:    data

For comparison I ran the file format identification utility
TrID (see https://mark0.net/soft-trid-e.html). It identifies
compressed examples like BLINKER.WR_ as "CAZIP compressed file" by
definition caz.trid.xml (see appended cazip_trid-v.txt.gz).
The reported file name extension seems to be wrong or incomplete.
Unfortunately the reference URL mentioned there is no longer valid.
Luckily I found a page about CAZIP on the Archive Team file formats
web site. This information is expressed by comment lines like:
# URL:		http://fileformats.archiveteam.org/wiki/CAZIP
# Reference:	http://mark0.net/download/triddefs_xml.7z
#		defs/c/caz.trid.xml

The file naming convention looks like the one used for MS Compress
files, so I add the new lines inside Magdir/archive after the MS
Compress archive section.

According to the Identifiers section of that documentation page I add
magic lines like:
0	string	\x0D\x0A\x1ACAZIP	CAZIP compressed file
!:mime	application/x-compress-cazip
!:ext	??_/?_/_
Instead of the generic MIME type application/octet-stream I chose a
user-defined one.
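
As a rough illustration of what the magic entry above tests, here is a
small Python sketch. It is not part of the patch; the function names
are made up for this example, and reading "?" in the !:ext line as
"any single character" is an assumption of the sketch.

```python
from fnmatch import fnmatchcase

# Signature from the proposed magic entry: CR, LF, Ctrl-Z, then "CAZIP"
# at offset 0 of the file.
CAZIP_MAGIC = b"\x0d\x0a\x1aCAZIP"

# Extension patterns taken from the "!:ext" line. Treating "?" as a
# one-character wildcard is an assumption of this sketch.
CAZIP_EXT_PATTERNS = ("??_", "?_", "_")


def looks_like_cazip(data: bytes) -> bool:
    """Return True if the buffer starts with the 8-byte CAZIP signature."""
    return data.startswith(CAZIP_MAGIC)


def has_cazip_style_extension(name: str) -> bool:
    """Return True if the file name extension matches ??_, ?_ or _."""
    _, dot, ext = name.rpartition(".")
    if not dot:
        return False  # no extension at all
    return any(fnmatchcase(ext, pat) for pat in CAZIP_EXT_PATTERNS)
```

The extension test alone is not sufficient, because MS Compress files
follow the same naming convention; only the signature check
distinguishes the two.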

After applying the above mentioned modifications with patch
file-5.41-archive-cazip.diff, all my inspected CAZIP compressed
files are described. The output now looks like:

BLINKER.WRI:  Microsoft Write 3.0 (Windows) Document,
	      115 pages,
	      Free T-shirts and up to 50% savings!
BLINKER.WR_:  CAZIP compressed file
CAOSETUP.EX_: CAZIP compressed file
CLIPDEFS.H:   ASCII text, with CRLF line terminators
CLIPDEFS._:   CAZIP compressed file
CLIPPER.EXE:  MS-DOS executable, MZ for MS-DOS
CLIPPER.EX_:  CAZIP compressed file
FILEIO.CH:    ASCII text, with CRLF line terminators
FILEIO.C_:    CAZIP compressed file
SALES.DBF:    FoxBase+/dBase III DBF, 8 records * 31,
	      update-date 93-11-18, at offset 97
	      1st record "Susan Smith"
SALES.DB_:    CAZIP compressed file

I hope my diff file can be applied in a future version of the file
utility.

With best wishes
Jörg Jenderek
--
Jörg Jenderek

-------------- next part --------------
A non-text attachment was scrubbed...
Name: trid-v-dc42.txt.gz
Type: application/x-gzip
Size: 539 bytes
Desc: not available
URL: <https://mailman.astron.com/pipermail/file/attachments/20211030/118fb638/attachment.bin>
-------------- next part --------------
--- file-5.41/magic/Magdir/archive.old	2021-08-30 09:10:26 +0000
+++ file-5.41/magic/Magdir/archive	2021-10-29 23:50:18 +0000
@@ -684,10 +684,21 @@
 >0	string	SZ\x20		MS Compress archive data, QBasic variant
 !:mime	application/x-ms-compress-sz
 !:ext	??$
 >>8	ulelong	>0		\b, original size: %u bytes
 
+# Summary:	CAZIP compressed file
+# From:		Joerg Jenderek
+# URL:		http://fileformats.archiveteam.org/wiki/CAZIP
+# Reference:	http://mark0.net/download/triddefs_xml.7z/defs/c/caz.trid.xml
+# Note:		Format is distinct from CAZIPXP compressed
+0	string	\x0D\x0A\x1ACAZIP	CAZIP compressed file
+#!:mime	application/octet-stream
+!:mime	application/x-compress-cazip
+# like: BLINKER.WR_ CLIPDEFS._ CAOSETUP.EX_ CLIPPER.EX_ FILEIO.C_
+!:ext	??_/?_/_
+
 # Summary:	FTCOMP compressed archive
 # From:		Joerg Jenderek
 # URL:		http://fileformats.archiveteam.org/wiki/FTCOMP
 # Reference:    http://mark0.net/download/triddefs_xml.7z/defs/a/ark-ftcomp.trid.xml
 # Note:		called by TrID "FTCOMP compressed archive"
-------------- next part --------------
A non-text attachment was scrubbed...
Name: file-5.41-archive-cazip.diff.sig
Type: application/octet-stream
Size: 706 bytes
Desc: not available
URL: <https://mailman.astron.com/pipermail/file/attachments/20211030/118fb638/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: cazip_trid-v.txt.gz
Type: application/x-gzip
Size: 992 bytes
Desc: not available
URL: <https://mailman.astron.com/pipermail/file/attachments/20211030/118fb638/attachment-0001.bin>