Comment on Elsevier
veganpizza69@lemmy.world 2 weeks ago
Purge metadata, convert PDF to rendered graphics (including bitmaps), add OCR layer.
Comment on Elsevier
veganpizza69@lemmy.world 2 weeks ago
Purge metadata, convert PDF to rendered graphics (including bitmaps), add OCR layer.
xenoclast@lemmy.world 2 weeks ago
There are tools for this already… but it sure would be nice to have a Firefox plugin that scrubs all metadata on downloads by default.
(Note I’m hoping this exists and someone will Um, Actually me)
nearhat@lemmy.world 2 weeks ago
It’s a multi step process, but if you still have the XPS Viewer from windows 10, you can ‘print’ the file to XPS, then open it in the XPS Viewer and ‘print’ to PDF using your favourite print to pdf solution. That strips the metadata but doesn’t rasterize everything.
purplemonkeymad@programming.dev 2 weeks ago
I feel like why not just print to pdf from your pdf viewer?
nearhat@lemmy.world 2 weeks ago
I tried that before, but was unsuccessful in clearing out metadata. Whatever options I tried, PDF-to-PDF just output an identical file with a different name.
lastweakness@lemmy.world 2 weeks ago
You could write a script to automatically watch for new files in a folder and strip metadata from every file i guess. I had done something like that for images way before.