From 5027bba6c2c1c7546cfc0eb953de38181b7e4e7b Mon Sep 17 00:00:00 2001 From: jvoisin Date: Mon, 7 Jan 2013 22:37:54 +0100 Subject: Refactor the get_sharedir function, and move datas to a proper data folder --- MAT/FORMATS | 43 +++++++++++++++++++++++++++++++++---------- 1 file changed, 33 insertions(+), 10 deletions(-) (limited to 'MAT/FORMATS') diff --git a/MAT/FORMATS b/MAT/FORMATS index c497524..3f0751c 100644 --- a/MAT/FORMATS +++ b/MAT/FORMATS @@ -2,6 +2,7 @@ Portable Network Graphics .png + image/png full textual metadata + date removal of harmful fields is done with hachoir @@ -10,14 +11,19 @@ Jpeg .jpeg, .jpg - full + image/jpeg + partial comment + exif/photoshop/adobe removal of harmful fields is done with hachoir + Canon Raw tags : + http://www.sno.phy.queensu.ca/~phil/exiftool/TagNames/CanonRaw.html + Open Document .odt, .odx, .ods, ... + application/opendocument full a meta.xml file removal of the meta.xml file @@ -26,6 +32,7 @@ Office Openxml .docx, .pptx, .xlsx, ... + application/officeopenxml full a docProps folder containings xml metadata files removal of the docProps folder @@ -34,18 +41,21 @@ Portable Document Fileformat .pdf + application/pdf full a lot - rendering of the pdf file on a cairo surface with the help of - poppler in order to remove all the internal metadata, - then removal of the remaining metadata fields of the pdf itself with - pdfrw (the next version of python-cairo will support metadata, - so we should get rid of pdfrw) + rendering of the PDF file on a cairo surface with the help of + poppler in order to remove all the internal metadata. + For now, cairo create some metadata. + They can be remove if you install either exiftool, or python-pdfrw. + The next version of python-cairo will support PDF metadata. + Tape ARchive .tar, .tar.bz2, .tar.gz + application/x-tar, application/x-gzip, application/x-bzip2 full metadata from the file itself, metadata from the file contained into the archive, and metadata added by tar to the file at then @@ -58,11 +68,11 @@ Zip .zip - .partial + application/zip + partial metadata from the file itself, metadata from the file contained into the archive, and metadata added by zip to the file when added to - the archive. - + the archive. extraction of each file, treatement of the file, add treated file to a new archive. When the new archive is complete, remove all his metadata metadata added by zip itself to internal files @@ -70,7 +80,8 @@ MPEG Audio - .mp3, .mp2, .mp1 + .mp3, .mp2, .mp1, .mpa + audio/mpeg full id3 removal of harmful fields is done with hachoir @@ -79,6 +90,7 @@ Ogg Vorbis .ogg + audio/vorbis full Vorbis removal of harmful fields is done with mutagen @@ -87,8 +99,19 @@ Free Lossless Audio Codec .flac + audio/x-flac full Flac, Vorbis removal of harmful fields is done with mutagen + + + Torrent + .torrent + application/torrent + full + torrent + remove all the compromizing metadata with a heavily tuned version + of the bencode lib by Petru Paled + -- cgit v1.3