[File] [PATCH] Magdir/database Mozilla Mork database *.MSF versus *.DAT *.MAB

Christos Zoulas christos at zoulas.com
Mon Oct 9 15:55:53 UTC 2023


Committed, thanks.

christos

> On Oct 7, 2023, at 6:07 PM, Jörg Jenderek (GMX) <joerg.jen.der.ek at gmx.net> wrote:
> 
> Hello,
> 
> Some months ago I migrated my system to Windows 10, so I also had to
> transfer my mail data handled by Thunderbird. That made me look at the
> files belonging to Thunderbird.
> When running the file command version 5.45 on such Thunderbird samples
> I get output like:
> 
> Drafts.msf:                   Mozilla Mork database, version 1.4
> Drafts_new.msf:               Mozilla Mork database, version 1.4
> INBOX.msf:                    Mozilla Mork database, version 1.4
> Trash.msf:                    Mozilla Mork database, version 1.4
> empty.mab:                    Mozilla Mork database, version 1.4
> fmt-612-signature-id-948.mab: exported SGML document, ASCII text
> 			      , with no line terminators
> panacea.dat:                  Mozilla Mork database, version 1.4
> 
> With option -i only the generic text/plain is displayed, and with
> option --extension only ??? is displayed.
> 
> For comparison I ran the file format identification utility
> TrID (see https://mark0.net/soft-trid-e.html). Many of the MSF
> samples are described as "Mozilla Mail Summary file" by msf.trid.xml.
> The sample panacea.dat is described as "Mozilla Mail folder cache"
> by dat-mork.trid.xml with the correct suffix. The real MAB samples are
> described as "Mozilla Address Book" by mab.trid.xml with the correct
> suffix (see the attached trid-v-mork.txt.gz).
> 
> For comparison I also ran the file format identification
> utility DROID (see https://sourceforge.net/projects/droid/).
> Here all examples are described as "Mork" by PUID fmt/612.
> 
> TrID lists the file name extension used and, with the -v option, often
> also the related URL pointing to information about the file format.
> 
> With the help of these tools I added more lines. The information is
> now expressed inside Magdir/database by additional comment lines like:
> # URL:		http://fileformats.archiveteam.org/wiki/Mork
> #		https://en.wikipedia.org/wiki/Mork_(file_format)
> # Reference:	http://mark0.net/download/triddefs_xml.7z
> #		defs/d/msf.trid.xml
> #		defs/m/mab.trid.xml
> #		defs/d/dat-mork.trid.xml
> 
> In the current Magdir/database the description is done by lines like:
> 0	string	//\ <!--\ <mdb:mork:z\ v="	Mozilla Mork database
> >23	string	x		\b, version %.3s
> 
> Instead of the generic text/plain MIME type I chose a user-defined one.
> Following TrID, I look for other specific keywords to do a
> sub-classification with different file name extensions. This is now
> done by lines like:
> 
> 0	string	//\ <!--\ <mdb:mork:z\ v="	Mozilla Mork database
> !:mime	text/x-mozilla-mork
> >23	string	x		\b, version %.3s
> >26	search/7516	mailboxName		\b, Mail Summary file
> !:ext						msf
> >26	search/192	addrbk			\b, Address Book
> !:ext						mab
> >26	search/210	indexingPriority	\b, Mail folder cache
> !:ext						dat
> 
> After applying the above-mentioned modifications with the patch
> file-5.45-database-mork.diff, my Thunderbird samples are described
> in more accurate detail. The output now looks like:
> Drafts.msf:                   Mozilla Mork database, version 1.4
> 			      , Mail Summary file
> Drafts_new.msf:               Mozilla Mork database, version 1.4
> 			      , Mail Summary file
> INBOX.msf:                    Mozilla Mork database, version 1.4
> 			      , Mail Summary file
> Trash.msf:                    Mozilla Mork database, version 1.4
> 			      , Mail Summary file
> empty.mab:                    Mozilla Mork database, version 1.4
> 			      , Address Book
> fmt-612-signature-id-948.mab: ASCII text, with no line terminators
> panacea.dat:                  Mozilla Mork database, version 1.4
> 			      , Mail folder cache
> 
> I hope my diff file can be applied in a future version of the file
> utility.
> 
> With best wishes,
> Jörg Jenderek
> --
> Jörg Jenderek
> <trid-v-mork.txt.gz><file-5_45-database-mork_diff.DEFANGED-47692><file-5_45-database-mork_diff_sig.DEFANGED-47693>
> <sanitizer.log>
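
For readers less familiar with magic(5) syntax, the sub-classification in
the quoted rules can be sketched roughly in Python. This is only an
illustration, not how file(1) is implemented: the function name
describe_mork and the SUBTYPES table are made up for this sketch, and the
handling of the search/N range is an approximation (a match may start at
any of N positions beginning at offset 26).

```python
from typing import List, Optional, Tuple

# Signature from the quoted rule; it is 23 bytes long, which is why the
# version string is read at offset 23.
MORK_MAGIC = b'// <!-- <mdb:mork:z v="'

# (keyword, search range, description suffix, extension), copied from the
# quoted ">26 search/N" lines. The table itself is a hypothetical helper.
SUBTYPES = [
    (b"mailboxName", 7516, "Mail Summary file", "msf"),
    (b"addrbk", 192, "Address Book", "mab"),
    (b"indexingPriority", 210, "Mail folder cache", "dat"),
]


def describe_mork(data: bytes) -> Optional[Tuple[str, List[str]]]:
    """Return (description, extensions) for Mork data, or None otherwise."""
    if not data.startswith(MORK_MAGIC):
        return None

    # ">23 string x  \b, version %.3s": print three characters at offset 23.
    version = data[23:26].decode("ascii", errors="replace")
    description = f"Mozilla Mork database, version {version}"
    extensions: List[str] = []

    for keyword, search_range, label, ext in SUBTYPES:
        start = 26
        # Approximation of search/N: the last allowed match start is
        # start + search_range - 1, so the slice ends len(keyword) later.
        end = start + search_range - 1 + len(keyword)
        if data.find(keyword, start, end) != -1:
            description += f", {label}"
            extensions.append(ext)

    return description, extensions
```

Calling describe_mork() on the first few kilobytes of a file is enough,
since the largest search range in the rules is 7516 bytes. All three
keyword tests are tried, as with same-level continuation lines in a magic
file, so a file containing more than one keyword would get more than one
suffix.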


