[File] [PATCH] of Magdir/wordprocessors for Corel WordPerfect dictionary advise *.ADV
Christos Zoulas
christos at zoulas.com
Mon Aug 8 12:59:42 UTC 2022
Applied thanks!
christos
> On Aug 7, 2022, at 2:24 AM, Jörg Jenderek <joerg.jen.der.ek at gmx.net> wrote:
>
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Hello,
>
> some days ago i send patch for Word Perfect CBT samples. These are
> found in sub directory WritingTools inside Word Perfect program
> directory "c:\Program Files (x86)\Corel\WordPerfect Office 2021".
> In the sub directory there exist more similar files but with other
> file name extensions like adv, hyd, icr, lex, mor and sav.
>
> For control reason i look for other Word Perfect files there.
> The ADV samples are used for giving advise to user. The file names
> are like:
> WT21de.adv Wt13de.adv Wt13es.adv Wt13fr.adv wt13us.adv
> These start with 2 letter phrase WT followed by digits
> which corresponds to Word Perfect version. For version 2021 this
> digits are 21 and for an older version i found digits 13. The last
> 2 capitals obviously correspond to used language. For English we
> get uk, de for German, fr for French, nl for Netherlands and so on.
> When running file command (version 5.42) on such examples i get an
> output like:
>
> WT21de.adv: Unknown Corel/Wordperfect product 34, file type 11, v6.0
> Wt13de.adv: Unknown Corel/Wordperfect product 34, file type 11, v6.0
> Wt13es.adv: Unknown Corel/Wordperfect product 34, file type 11, v6.0
> Wt13fr.adv: Unknown Corel/Wordperfect product 34, file type 11, v6.0
> Wt13nl.adv: Unknown Corel/Wordperfect product 34, file type 11, v6.0
> wt13kd.adv: Unknown Corel/Wordperfect product 34, file type 11, v6.0
> wt13uk.adv: Unknown Corel/Wordperfect product 34, file type 11, v6.0
> wt13us.adv: Unknown Corel/Wordperfect product 34, file type 11, v6.0
>
> With --extension option only ??? is displayed. Furthermore with -i
> option for my samples only generic application/octet-stream is shown.
>
> For comparison reason i run the file format identification utility
> TrID ( See https://mark0.net/soft-trid-e.html). This identifies all
> such examples with low rate as "WordPerfect (generic)" by
> wp-generic.trid.xml and the examples are described with high rate
> as "WordPerfect dictionary advise" by adv-wp.trid.xml
> (See appended trid-v-wordperfect-adv.txt.gz).
>
> Unfortunately i found no information especially about file format
> specification for such WordPerfect ADV files. TrID list the used
> file name extension and often with -v option the related URL pointing
> to some information. This is expressed by comment lines inside
> Magdir/wordprocessors like:
> # URL: https://en.wikipedia.org/wiki/WordPerfect
> # Reference: https://github.com/OneWingedShark/WordPerfect/
> # blob/master/doc/SDK_Help/FileFormats/
> # WPFF_DocumentStructure.htm
> # Reference: http://mark0.net/download/triddefs_xml.7z
> # defs/a/adv-wp.trid.xml
>
> The description happens inside Magdir/wordprocessors by starting like
> :
> 0 string \xffWPC
> So we see that the first 4 bytes are the generic magic for all
> WordPerfect samples. By bytes at offset 8 and 9 sub classification is
> done. If sub class is not known as last step the sub class is shown
> by line like:
>> 8 default x
>>> 8 byte x Unknown Corel/Wordperfect product %d,
>>>> 9 byte x file type %d
>
> So for my ADV examples i must insert before lines like:
>> 8 byte 34
>>> 9 byte 11 Corel WordPerfect dictionary advise
> !:mime application/x-wordperfect-adv
> !:ext adv
> Instead of generic mime type application/octet-stream i show an user
> defined one.
>
> According to unofficial WordPerfect File Format documentation at
> offset 16 pointer is stored. So when inspecting this area for ADV
> samples we get advise text depending on language like "This is too
> informal for most writing." for English examples. Unfortunately
> often some tags like 580A comes before pure text. So i show excerpt
> from such advise text by additional line like:
>>>> (16.s+16) string x (...%-.33s...)
>
> After applying the above mentioned modifications by patch
> file-5.42-wordprocessors-adv.diff then i get a more precise output
> like:
>
> WT21de.adv: Corel WordPerfect dictionary advise
> (...schen Dezimalzahlen und ganzen Za...), v6.0
> Wt13de.adv: Corel WordPerfect dictionary advise
> (...schen Dezimalzahlen und ganzen Za...), v6.0
> Wt13es.adv: Corel WordPerfect dictionary advise
> (...ica porcentaje debe ir inmediatam...), v6.0
> Wt13fr.adv: Corel WordPerfect dictionary advise
> (...rd_ en genre et en nombre dans ce...), v6.0
> Wt13nl.adv: Corel WordPerfect dictionary advise
> (... een hoofdletter. Als dit niet he...), v6.0
> wt13kd.adv: Corel WordPerfect dictionary advise
> (...schen Dezimalzahlen und ganzen Za...), v6.0
> wt13uk.adv: Corel WordPerfect dictionary advise
> (...s too informal for most writing.|...), v6.0
> wt13us.adv: Corel WordPerfect dictionary advise
> (...s too informal for most writing.|...), v6.0
>
> I hope my diff file can be applied in future version of
> file utility.
>
> With best wishes
> Jörg Jenderek
> - --
> Jörg Jenderek
> -----BEGIN PGP SIGNATURE-----
> Comment: Using GnuPG with Thunderbird - https://www.enigmail.net/
>
> iF0EARECAB0WIQS5/qNWKD4ASGOJGL+v8rHJQhrU1gUCYu74RwAKCRCv8rHJQhrU
> 1hgIAJ9Ddvoto6P3j4SXrTx0E4HzxtlK5QCeNBL+ghyLVZxnxTO87XU6O9dnQk8=
> =pPQr
> -----END PGP SIGNATURE-----
> <Nachrichtenteil als Anhang.DEFANGED-68600><file-5_42-wordprocessors-adv_diff.DEFANGED-68601><file-5_42-wordprocessors-adv_diff_sig.DEFANGED-68602><trid-v-wordperfect-adv.txt.gz>--
> File mailing list
> File at astron.com
> https://mailman.astron.com/mailman/listinfo/file
> <sanitizer.log>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 235 bytes
Desc: Message signed with OpenPGP
URL: <https://mailman.astron.com/pipermail/file/attachments/20220808/82c7dc5d/attachment.asc>
More information about the File
mailing list