[File] [PATCH] Magdir/database Mozilla Mork database *.MSF versus *.DAT *.MAB
Christos Zoulas
christos at zoulas.com
Mon Oct 9 15:55:53 UTC 2023
Committed, thanks.
christos
> On Oct 7, 2023, at 6:07 PM, Jörg Jenderek (GMX) <joerg.jen.der.ek at gmx.net> wrote:
>
> Hello,
>
> some month ago i migrate to Windows 10 on system. Therefore i must
> transfer also my mail stuff handled by thunderbird. So i look at files
> belonging to thunderbird.
> When running file command version 5.45 such Thunderbird samples i get an
> output like:
>
> Drafts.msf: Mozilla Mork database, version 1.4
> Drafts_new.msf: Mozilla Mork database, version 1.4
> INBOX.msf: Mozilla Mork database, version 1.4
> Trash.msf: Mozilla Mork database, version 1.4
> empty.mab: Mozilla Mork database, version 1.4
> fmt-612-signature-id-948.mab: exported SGML document, ASCII text
> , with no line terminators
> panacea.dat: Mozilla Mork database, version 1.4
>
> With option -i only generic text/plain and with option --extension only
> ??? is displayed.
>
> For comparison reason i run the file format identification utility
> TrID ( See https://mark0.net/soft-trid-e.html). Many of the MSF
> samples are described as "Mozilla Mail Summary file" by msf.trid.xml.
> The sample panacea.dat is described as "Mozilla Mail folder cache"
> by dat-mork.trid.xml with correct suffix. The real MAB samples are
> described as "Mozilla Address Book" by mab.trid.xml with correct suffix
> (See appended trid-v-mork.txt.gz).
>
> For comparison reason i also run the file format identification
> utility DROID ( See https://sourceforge.net/projects/droid/).
> Here all examples are described as "Mork" by PUID fmt/612
>
> TrID list the used file name extension and often with -v option the
> related URL pointing to used file format information.
>
> With the help of this tools i add more lines. So this is now expressed
> inside Magdir/database by additional comment lines like:
> # URL: http://fileformats.archiveteam.org/wiki/Mork
> # https://en.wikipedia.org/wiki/Mork_(file_format)
> # Reference: http://mark0.net/download/triddefs_xml.7z
> # defs/d/msf.trid.xml
> # defs/m/mab.trid.xml
> # defs/d/dat-mork.trid.xml
>
> In current Magdir/database the description happen by lines like:
> 0 string //\ <!--\ <mdb:mork:z\ v=" Mozilla Mork database
> >23 string x \b, version %.3s
>
> Instead of generic text/plain mime type i choose an user defined one.
> According to TrID i look for specific other keywords to do sub
> classification with different file name extensions. So this is now done
> by lines like:
>
> 0 string //\ <!--\ <mdb:mork:z\ v=" Mozilla Mork database
> !:mime text/x-mozilla-mork
> >23 string x \b, version %.3s
> >26 search/7516 mailboxName \b, Mail Summary file
> !:ext msf
> >26 search/192 addrbk \b, Address Book
> !:ext mab
> >26 search/210 indexingPriority \b, Mail folder cache
> !:ext dat
>
> After applying the above mentioned modifications by patch
> file-5.45-database-mork.diff then my Thunderbird samples are now
> described with more correct details. This now looks like:
> Drafts.msf: Mozilla Mork database, version 1.4
> , Mail Summary file
> Drafts_new.msf: Mozilla Mork database, version 1.4
> , Mail Summary file
> INBOX.msf: Mozilla Mork database, version 1.4
> , Mail Summary file
> Trash.msf: Mozilla Mork database, version 1.4
> , Mail Summary file
> empty.mab: Mozilla Mork database, version 1.4
> , Address Book
> fmt-612-signature-id-948.mab: ASCII text, with no line terminators
> panacea.dat: Mozilla Mork database, version 1.4
> , Mail folder cache
>
> I hope my diff file can be applied in future version of file
> utility.
>
> With best wishes,
> Jörg Jenderek
> --
> Jörg Jenderek
> <trid-v-mork.txt.gz><file-5_45-database-mork_diff.DEFANGED-47692><file-5_45-database-mork_diff_sig.DEFANGED-47693>--
> File mailing list
> File at astron.com
> https://mailman.astron.com/mailman/listinfo/file
> <sanitizer.log>
More information about the File
mailing list