www.digitalmars.com         C & C++   DMDScript  

digitalmars.D.announce - mime - library for parsing shared MIME database

reply "FreeSlave" <freeslave93 gmail.com> writes:
Currently I'm working on mime library for D. Dub page: 
http://code.dlang.org/packages/mime
It can parse MIME database files, including binary ones, like 
mime.cache. It also has algorithms for mime type detecting by 
file name.

It's not fully implemented yet and does not have stable API. 
Issues and goals are listed on the github page: 
https://github.com/MyLittleRobo/mime

If someone is interested in the project, I would be glad to 
discuss interface and implementation details of the library.

If you don't know what is shared MIME database and why does it 
matter read this:
http://standards.freedesktop.org/shared-mime-info-spec/shared-mime-info-spec-latest.html#idm140625831778224
Aug 15 2015
parent reply Rikki Cattermole <alphaglosined gmail.com> writes:
On 16/08/2015 6:30 a.m., FreeSlave wrote:
 Currently I'm working on mime library for D. Dub page:
 http://code.dlang.org/packages/mime
 It can parse MIME database files, including binary ones, like
 mime.cache. It also has algorithms for mime type detecting by file name.

 It's not fully implemented yet and does not have stable API. Issues and
 goals are listed on the github page: https://github.com/MyLittleRobo/mime

 If someone is interested in the project, I would be glad to discuss
 interface and implementation details of the library.

 If you don't know what is shared MIME database and why does it matter
 read this:
 http://standards.freedesktop.org/shared-mime-info-spec/shared-mime-info-spec-latest.html#idm140625831778224
I had an mime implementation in Cmsed that basically was a hard coded file with a whole bunch of mime types along with file extensions. I would be interested in seeing if this can match it 1:1 for features, while not allocating. Say give me the mime type for payload. Possibly with its own override/addition csv files.
Aug 15 2015
parent "FreeSlave" <freeslave93 gmail.com> writes:
On Sunday, 16 August 2015 at 03:56:45 UTC, Rikki Cattermole wrote:
 On 16/08/2015 6:30 a.m., FreeSlave wrote:
 Currently I'm working on mime library for D. Dub page:
 http://code.dlang.org/packages/mime
 It can parse MIME database files, including binary ones, like
 mime.cache. It also has algorithms for mime type detecting by 
 file name.

 It's not fully implemented yet and does not have stable API. 
 Issues and
 goals are listed on the github page: 
 https://github.com/MyLittleRobo/mime

 If someone is interested in the project, I would be glad to 
 discuss
 interface and implementation details of the library.

 If you don't know what is shared MIME database and why does it 
 matter
 read this:
 http://standards.freedesktop.org/shared-mime-info-spec/shared-mime-info-spec-latest.html#idm140625831778224
I had an mime implementation in Cmsed that basically was a hard coded file with a whole bunch of mime types along with file extensions. I would be interested in seeing if this can match it 1:1 for features, while not allocating. Say give me the mime type for payload. Possibly with its own override/addition csv files.
This library focuses on shared MIME database used in freedesktop systems, usually for detecting file types in file managers to display appropriate icon and make correct choice of default application (well, that's another spec) to run on file. I don't think this suits the web world. Your "hardcoded" approach is what usually used in web. If I remember correctly mime types are hardcoded in Chromium too. But mime type is not only about extension. Generally pattern can be any glob pattern. That's why file managers can detect Makefile type as text/x-makefile, even though it does not have extension. Same for CMakeLists.txt - the preferred type is text/x-cmake, not just text/plain. Patterns are not alone. There're magic rules for the rescue when mime type can't be detected from the name of file. That's how Linux file managers differ shell script from python script even if both don't have extension (but they have leading comment like detect file with unknown pk3 extension (used in Quake III based games) as zip file. Because it's really just zip file by its contents. The whole shared MIME database thing is system and user dependent, so again it's not what you want to use for web, probably unless you manage the database yourself on the server. Even in this case patterns and magic rules are just hints. You can't rely on that to check if the uploading file is of the needed type. For example, if you want to validate image file, the only way to do it is to parse the whole file. Still you can use hints to cut off obviously invalid files. I did not target non-allocating code yet, though I believe it's possible to make MimeCache to not allocate when detecting file type.
Aug 16 2015