@[email protected] to Programmer [email protected] • 2 months agoDOGE employeelemmy.worldimagemessage-square92fedilinkarrow-up1558arrow-down112cross-posted to: [email protected]
arrow-up1546arrow-down1imageDOGE employeelemmy.world@[email protected] to Programmer [email protected] • 2 months agomessage-square92fedilinkcross-posted to: [email protected]
minus-squarelime!linkfedilinkEnglish14•edit-21 month ago$ pandoc doc.pdf -o doc.txt Edit: welp, pandoc can’t do that. pdftotext it is.
minus-square@[email protected]linkfedilinkEnglish2•edit-22 months agomagick file.jpg file.html Imagemagick be converting anything into anything (Actually in this case, it make an html file and a png file which is referenced in html file and html page displays it)
minus-squarelime!linkfedilinkEnglish2•2 months agonot really a good way to get the text out of a pdf though. then again, turns out neither is pandoc.
minus-square@[email protected]linkfedilink1•2 months agoI thought pandoc didn’t support from PDF, only to?!
minus-squarelime!linkfedilinkEnglish2•2 months agodamn it, you’re right. should probably have checked that…
minus-square@[email protected]linkfedilink1•2 months agoDon’t worry, I didn’t know either and had to check to check too :P
$ pandoc doc.pdf -o doc.txt
Edit: welp, pandoc can’t do that.
pdftotext
it is.magick file.jpg file.html
Imagemagick be converting anything into anything (Actually in this case, it make an html file and a png file which is referenced in html file and html page displays it)
not really a good way to get the text out of a pdf though. then again, turns out neither is pandoc.
I thought pandoc didn’t support from PDF, only to?!
damn it, you’re right. should probably have checked that…
Don’t worry, I didn’t know either and had to check to check too :P