| Age | Commit message (Collapse) | Author | |
|---|---|---|---|
| 2019-11-30 | Improve a bit ppt support | jvoisin | |
| 2019-11-30 | Improve a bit odt handling | jvoisin | |
| 2019-10-17 | Improve a bit the support for ppt files | jvoisin | |
| 2019-09-01 | Improve a bit the comments in the code | jvoisin | |
| This is related to the previous commit | |||
| 2019-09-01 | Remove nsid fields from MSOffice documents | jvoisin | |
| nsids are random identifiers, usually used to ease merging between documents, and can trivially be used for fingerprinting. | |||
| 2019-04-27 | Add tar archive support | jvoisin | |
| 2019-03-05 | Refactor {black,white}list into {block,allow}list | Brolf | |
| Closes #96 | |||
| 2019-02-08 | Improve a bit get_meta for libreoffice files | jvoisin | |
| 2019-02-07 | Use of the archive refactoring for the office documents too | jvoisin | |
| 2019-02-04 | Refactor a bit office get_meta handling | jvoisin | |
| This should make easier to get more metadata from archive-based file formats. | |||
| 2019-02-03 | Whenever possible, use bwrap for subprocesses | intrigeri | |
| This should closes #90 | |||
| 2018-10-25 | Implement get_meta() for archives | jvoisin | |
| 2018-10-12 | Bump mypy typing coverage | jvoisin | |
| 2018-10-05 | Improve both the typing and the comments | jvoisin | |
| 2018-10-04 | Trash word/people.xml in office files | jvoisin | |
| 2018-10-03 | Don't break office files for MS Office | jvoisin | |
| We didn't take the whitelist into account while removing dangling files from [Content_types].xml | |||
| 2018-10-03 | Improve mat2's cli reliability | jvoisin | |
| - Replace some class members by instance members - Don't thread the cleaning process anymore for now | |||
| 2018-10-02 | Use [Content_Types].xml to improve MS Office coverage | jvoisin | |
| 2018-10-02 | fix typo | georg | |
| 2018-10-01 | Files processed via MAT2 are now accepted without warnings by MS Office | jvoisin | |
| 2018-09-30 | Please mypy | jvoisin | |
| 2018-09-30 | Remove dangling references in MS Office's [Content_types].xml | jvoisin | |
| 2018-09-24 | Second pass of minor formatting | jvoisin | |
| 2018-09-24 | Fix some minor formatting issues | jvoisin | |
| 2018-09-24 | Implement rsid stripping for office files | jvoisin | |
| MS Office XML rsid is a "unique identifier used to track the editing session when the physical character representing this section mark was last formatted." See the following links for details: - https://msdn.microsoft.com/en-us/library/office/documentformat.openxml.wordprocessing.previoussectionproperties.rsidrpr.aspx - https://blogs.msdn.microsoft.com/brian_jones/2006/12/11/whats-up-with-all-those-rsids/. | |||
| 2018-09-24 | Lexicographical sort on xml attributes for office files | jvoisin | |
| In XML, the order of the attributes shouldn't be meaningful, however, MS Office sorts attributes for a given XML tag differently than LibreOffice. | |||
| 2018-09-06 | Split office and archives | jvoisin | |
| 2018-09-05 | Unknown Members: make policy use an Enum | Daniel Kahn Gillmor | |
| Closes #60 Note: this changeset also ensures that clean.cleaned.docx is removed up after the pytest is over. | |||
| 2018-09-05 | Remove defusedxml support and document why | jvoisin | |
| 2018-09-05 | Improve the previous commit | jvoisin | |
| 2018-09-04 | office: try all members, even when one fails | Daniel Kahn Gillmor | |
| the end result will be the same -- an abort -- but the user will get to see all the warnings for a particular file, instead of getting them one at a time. | |||
| 2018-09-04 | document all unknown/unhandlable files even on abort | Daniel Kahn Gillmor | |
| This makes it easy to get a list of all files that mat2 doesn't know how to handle, without having to choose -u keep or -u omit. | |||
| 2018-09-04 | office: create policy for what to do about unknown members | Daniel Kahn Gillmor | |
| previously, encountering an unknown member meant that any parser of this type would abort. now, the user can set parser.unknown_member_policy to either 'omit' or 'keep' if they don't want the current action of 'abort' note that this causes pylint to complain about branching depth for remove_all() because of the nuanced error-handling. I've disabled this check. | |||
| 2018-09-01 | Fix a minor formatting issue | jvoisin | |
| 2018-09-01 | Logging cleanup | dkg | |
| 2018-07-19 | Improve the code's documentation | jvoisin | |
| 2018-07-19 | Minor simplification in how we're handling xml for office files | jvoisin | |
| 2018-07-10 | Remove `print` from libmat, and use the `logging` module instead | jvoisin | |
| This should close #28 | |||
| 2018-07-09 | Make pylint even happier | jvoisin | |
| 2018-07-08 | Fix some pep8 issues spotted by pyflakes | jvoisin | |
| 2018-07-08 | Achieve 100% coverage! | jvoisin | |
| 2018-07-08 | Bump coverage for office files and fix some related crashes | jvoisin | |
| 2018-07-08 | Silence a mypy's stupid warning | jvoisin | |
| 2018-07-08 | Add defusedxml as an (optional) way to prevent XML-based attacks | jvoisin | |
| Those attacks are DoS-only. | |||
| 2018-07-07 | Fix a mistake in office file revisions handling | jvoisin | |
| 2018-07-02 | Improve a bit the formatting of the code thanks to pyflakes3 | jvoisin | |
| 2018-07-01 | Remove docx revisions | jvoisin | |
| 2018-07-01 | MAT2 is now cleaning revisions from odt files! | jvoisin | |
| 2018-07-01 | Remove the thumbnails from libreoffice files | jvoisin | |
| 2018-06-27 | Massively simplify how we're cleaning office files | jvoisin | |
