[File] [PATCH] of Magdir/wordprocessors for Corel WordPerfect Writing Tools *.CBT *.CBD

Christos Zoulas christos at zoulas.com
Mon Aug 8 13:01:13 UTC 2022


Committed, thanks!

christos

> On Aug 6, 2022, at 5:44 PM, Jörg Jenderek <joerg.jen.der.ek at gmx.net> wrote:
> 
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Hello,
> 
> some days ago i send patches for DOS COM executables. One Syslinux
> COMboot variant use file name extension CBT instead of COM.
> 
> For control reason i look for other files with CBT extension on my
> systems.
> But there are dozen of such CBT files which are part of Corel
> WordPerfect Office suite. These are found in sub directory
> WritingTools inside WordPerfect program directory "c:\Program Files
> (x86)\Corel\WordPerfect Office 2021". The file names are like:
> Wt13cbede.cbt Wt13cbeit.cbt Wt13cbefr.cbt WT21cbede.cbt Wt13cbeEN.CBD
> WT21cbeEN.CBD. These start with 2 letter phrase WT followed by digits
> which corresponds to Word Perfect version. For version 2021 this
> digits are 21 and for an older version i found digits 13. The last
> capitals obviously correspond to used language. For English the file
> name extension is CBD whereas for all other languages it is CBT.
> 
> In the sub directory there exist more similar files but with other
> file name extensions like adv, hyd, icr, lex, mor and sav. The
> Writing Tools are used for spelling, grammar correction, thesaurus
> purpose in chosen language. Unfortunately i do not found for which
> part the CBT files are used. So i choose a "general" name for such
> CBT samples like "Writing Tools".
> 
> When running file command (version 5.42) on such examples and related
> files i get an output like:
> 
> WT21cbeEN.CBD: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbeEN.CBD: Corel WordPerfect: Unknown filetype 70, v1.0
> WT21cbede.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> WT21cbeit.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbeaf.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbede.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbedk.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbees.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbefr.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbeit.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbekd.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbenl.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbeno.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbepo.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> Wt13cbesv.cbt: Corel WordPerfect: Unknown filetype 70, v1.0
> 
> With --extension option only ??? is displayed. Furthermore with -i
> option for my samples only generic application/octet-stream is shown.
> 
> For comparison reason i also run the file format identification
> utility DROID ( See https://sourceforge.net/projects/droid/). This
> identifies all such examples wrong as "Comic Book Archive" by PUID
> fmt/1462 based on file name extension (See appended
> droid-wordperfect-cbt.csv.gz)
> 
> For comparison reason i run the file format identification utility
> TrID ( See https://mark0.net/soft-trid-e.html). This identifies all
> such examples with low rate as "WordPerfect (generic)" by
> wp-generic.trid.xml. And most examples are described with high rate
> as "WordPerfect Writing Tools data" by cbt-wp.trid.xml (See appended
> trid-wordperfect-cbt.txt.gz)
> 
> Unfortunately i found no information especially about file format
> specification for such WordPerfect CBT files. TrID list the used
> file name extension and often with -v option the related URL pointing
> to some information. This is expressed by comment lines inside
> Magdir/wordprocessors like:
> # URL:		https://en.wikipedia.org/wiki/WordPerfect
> # Reference:	https://github.com/OneWingedShark/WordPerfect/
> #		blob/master/doc/SDK_Help/FileFormats/
> #		WPFF_DocumentStructure.htm
> #		http://mark0.net/download/triddefs_xml.7z
> #		defs/w/wp-generic.trid.xml
> # 		defs/c/cbt-wp.trid.xml
> 
> The description happens inside Magdir/wordprocessors by starting like
> :
> 0	string	\xffWPC
> So we see that the first 4 bytes are the generic magic for all
> WordPerfect samples. By bytes at offset 8 and 9 sub classification is
> done. If sub class is not known as last step the sub class is shown
> by line like:
>>>> 9	byte	x	Corel WordPerfect: Unknown filetype %d
> So for my CBT examples i must insert before lines like:
>>> 9	byte	70	WordPerfect Writing Tools
> !:mime	application/x-wordperfect-cbt
> !:ext	cbd/cbt
> Instead of generic mime type application/octet-stream i show an user
> defined one.
> 
> After applying the above mentioned modifications by patch
> file-5.42-wordprocessors-cbt.diff then i get a more precise output
> like:
> WT21cbede.cbt: WordPerfect Writing Tools, v1.0
> WT21cbeit.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbeaf.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbede.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbedk.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbees.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbefr.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbeit.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbekd.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbenl.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbeno.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbepo.cbt: WordPerfect Writing Tools, v1.0
> Wt13cbesv.cbt: WordPerfect Writing Tools, v1.0
> WT21cbeEN.CBD: WordPerfect Writing Tools, v1.0
> Wt13cbeEN.CBD: WordPerfect Writing Tools, v1.0
> 
> I hope my diff file can be applied in future version of
> file utility.
> 
> With best wishes
> Jörg Jenderek
> - --
> Jörg Jenderek
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> -----BEGIN PGP SIGNATURE-----
> Comment: Using GnuPG with Thunderbird - https://www.enigmail.net/
> 
> iF0EARECAB0WIQS5/qNWKD4ASGOJGL+v8rHJQhrU1gUCYu5+OQAKCRCv8rHJQhrU
> 1nVqAKCD5wQl6USB+azcGsnSOVTw8uzcaQCdEcvi13Fu+0zoISBZfV8Pqh3Iqjo=
> =2FVI
> -----END PGP SIGNATURE-----
> <trid-wordperfect-cbt.txt.gz><droid-cbt-wordperfect.csv.gz><file-5_42-wordprocessors-cbt_diff.DEFANGED-11388><file-5_42-wordprocessors-cbt_diff_sig.DEFANGED-11389>--
> File mailing list
> File at astron.com
> https://mailman.astron.com/mailman/listinfo/file
> <sanitizer.log>

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 235 bytes
Desc: Message signed with OpenPGP
URL: <https://mailman.astron.com/pipermail/file/attachments/20220808/0a858b67/attachment.asc>


More information about the File mailing list