Matches .doc as application/vnd.ms-excel #40
Comments
In the matchers folder, in matchers/document.go, func Doc(), func Xls() and func Ppt() all check the same magic numbers.
Is this the reason why the MIME type always comes out different?
Microsoft Office container files are
@kumakichi Is it working?
@mateusmaaia It should work. If you find MS Office files that cannot be detected correctly, please let me know.
Hi there, the
@mateusmaaia @jeremywu0127 Oh, I was wrong: #48 only added support for docx/xlsx/pptx (and not very well), and left doc/xls/ppt untouched. So extra work is needed.
@jeremywu0127 Checking doc/xls/ppt will be a little complex. We can detect that a file is a Composite Document File V2 document, but we don't know which one it is (doc/xls/ppt). Maybe we can check the name of the creating application: if we get something like Microsoft Office Word, that's fine; but if the file was created by a non-MS-Office application, say WPS, we learn nothing. So this problem is not easy to resolve.
Yes, I tried a few things but ended up validating the Anyway, thanks! @kumakichi
You can detect them by checking the GUID of the root entry, as described in https://stackoverflow.com/questions/29211263/how-to-identify-doc-docx-pdf-xls-and-xlsx-based-on-file-header/48318648#48318648 and implemented in the Rust version: bojand/infer#38
I am trying to detect the MIME type of a .doc file, and the result I get is of type
File type: xls. MIME: application/vnd.ms-excel
or
File type: ppt. MIME: application/vnd.ms-powerpoint