[File] [PATCH] of Magdir/wordprocessors for older Microsoft Works documents (*.wps)

Jörg Jenderek joerg.jen.der.ek at gmx.net
Thu May 23 23:36:38 UTC 2019


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


Hello,
some days ago i inspect some Microsoft Works documents with file
extension wps. I run version 5.37 on older documents. Such documents
are only described as "data".

The file identifying tool TrID ( http://mark0.net/soft-trid-e.html )
describes inspected examples as "Microsoft Works for DOS document".

Droid, the UK government national archives program describes such
examples more preciously as "Microsoft Works Word Processor 1-3 for
DOS and 2 for Windows" by fmt/163.

So i add lines to Magdir/wordprocessors. Some Information is found on
fileformats.archiveteam.org website. So i add comment line like
# fileformats.archiveteam.org/wiki/Microsoft_Works_Word_Processor

According to that site start magic is expressed by line
 0	ubeshort	=0x01FE
More test lines are generated by looking for patterns inside
TriD definition wps-works-dos.trid.xml found in packed database
triddefs_xml.7z. So second magic line becomes like
 >112 ubeshort =0x0100 	Microsoft Works 1-3 (DOS) or 2 (Windows)
Apparently this is the offset where the title of document is stored.
This is displayed by line like
 >>0x100	string		x		%s

Mime type and file name extension are shown by lines
 !:mime	application/vnd-ms-works
 !:ext	wps
According to macdisk website also apple id is shown by line
 !:apple	????AWWP

After applying the above mentioned modifications by patch
file-5.37-wordprocessors-wps.diff then such old MS Works documents are
identified and described like:

Das regeln wir schon.wps:
	Microsoft Works 1-3 (DOS) or 2 (Windows) document
	DAS REGELN WIR SCHON
The Great Khan Game.wps:
	Microsoft Works 1-3 (DOS) or 2 (Windows) document
	THE GREAT KHAN GAME
Alpha Omega.wps:
	Microsoft Works 1-3 (DOS) or 2 (Windows) document
	Alpha Omega
Gateway to the Stars.wps:
	Microsoft Works 1-3 (DOS) or 2 (Windows) document
	*** GATEWAY TO THE STARS ***

Newer Version of Microsoft Works documents are based on Microsoft
Compound File format. I will handle this in a future session.

I hope my diff file can be applied in future version of
file utility.

With best wishes
Jörg Jenderek
- --
Jörg Jenderek








-----BEGIN PGP SIGNATURE-----
Comment: Using GnuPG with Thunderbird - https://www.enigmail.net/

iF0EARECAB0WIQS5/qNWKD4ASGOJGL+v8rHJQhrU1gUCXOcuXQAKCRCv8rHJQhrU
1uGoAJsEDzR5fDc5OY5EofeM3sK4xlNdtgCglKRsZklE+0vyZji9HebUsfxvMwM=
=zw5e
-----END PGP SIGNATURE-----
-------------- next part --------------
--- file-5.37/magic/Magdir/wordprocessors.old	2019-04-19 00:42:27 +0000
+++ file-5.37/magic/Magdir/wordprocessors	2019-05-23 20:07:35 +0000
@@ -14,2 +14,17 @@
 
+# URL:	http://fileformats.archiveteam.org/wiki/Microsoft_Works_Word_Processor
+# reference:	http://mark0.net/download/triddefs_xml.7z
+#		/defs/w/wps-works-dos.trid.xml
+# From:	Joerg Jenderek
+# Note:	older non OLE 2 Compound based versions
+0	ubeshort	=0x01FE
+>112	ubeshort	=0x0100		Microsoft Works 1-3 (DOS) or 2 (Windows) document
+# title like THE GREAT KHAN GAME
+>>0x100	string		x		%s
+!:mime	application/vnd-ms-works
+#!:mime	application/x-msworks
+# https://www.macdisk.com/macsigen.php
+!:apple	????AWWP
+!:ext	wps
+
 # Corel/WordPerfect
-------------- next part --------------
A non-text attachment was scrubbed...
Name: file-5.37-wordprocessors-wps.diff.sig
Type: application/octet-stream
Size: 95 bytes
Desc: not available
URL: <https://mailman.astron.com/pipermail/file/attachments/20190524/ccfd7dc9/attachment.obj>


More information about the File mailing list