566
DOGE employee (lemmy.world)
submitted 4 months ago by [email protected] to c/[email protected]
you are viewing a single comment's thread
view the rest of the comments
[-] [email protected] 14 points 4 months ago* (last edited 4 months ago)

$ pandoc doc.pdf -o doc.txt

Edit: welp, pandoc can't do that. pdftotext it is.

[-] [email protected] 2 points 4 months ago* (last edited 4 months ago)
magick file.jpg file.html

Imagemagick be converting anything into anything (Actually in this case, it make an html file and a png file which is referenced in html file and html page displays it)

[-] [email protected] 2 points 4 months ago

not really a good way to get the text out of a pdf though. then again, turns out neither is pandoc.

[-] [email protected] 1 points 4 months ago

I thought pandoc didn’t support from PDF, only to?!

[-] [email protected] 2 points 4 months ago

damn it, you're right. should probably have checked that...

[-] [email protected] 1 points 4 months ago

Don’t worry, I didn’t know either and had to check to check too :P

this post was submitted on 07 Feb 2025
566 points (97.8% liked)

Programmer Humor

24317 readers
444 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

founded 2 years ago
MODERATORS