Show HN: Nano PDF – A CLI Tool to Edit PDFs with Gemini's Nano Banana

▲

Show HN: Nano PDF – A CLI Tool to Edit PDFs with Gemini's Nano Banana(github.com)

53 points byGavCo2 hours ago |9 comments

> Converts an image to a single-page PDF with a hidden text layer using Tesseract. This is the 'State Preservation' step.

Does this mean the text only pdf page is transformed into an image that covers the full page, but the text is still under there. So, any machine based extraction would still get the text, but would probably loose all the bounding box information and regular users cannot just use their mouse to select text anymore?

▲lxe1 hour ago

This is nuts and I absolutely love this. So you convert the PDF into image, edit the image, then convert the image back into a PDF.

▲treetalker1 hour ago

I'd love to see clearer examples: a video, or original pdf / command / result pdf. Very cool!

▲shevis55 minutes ago

A side effect of replacing entire pages with images is that the file size will expand dramatically. Most PDFs only contain a couple of images

▲sultson1 hour ago

How cool! It's frustrating how tedious many PDF workflows still are. I've been building something similar in this space[0], but web-based where you visually specify the area to edit. The biggest issue for now is the cost per edit as the Pro version amounts to roughly $0.15/image. However, with some finessing, the original Nano Banana seems to do a great job as well. Have you explored UI-based approaches yourself by any chance?

[0]https://docusera.com/

▲itsmevictor1 hour ago

Very nice! I wonder whether that could be used to get LLMs to annotate pdfs. Say an "agentic" CLI like Claude Code or Gemini-cli reviews a pdf and finds typos, could it use this to annotate the pdf like underlining them in red or something of that sort? That could be nice.

▲John787878131 minutes ago

Love this.

After several iterations of edits, would the image quality decrease?

▲mentalgear1 hour ago

Nice - but consider adding an animated screengrap like: https://github.com/pythops/oryx

▲yoavm11 minutes ago

Please don't add an animated gif to your README. Nothing worse than an autoplaying video with no controls, that has 10 frames but takes 5.4MB to download. Github supports normal video files. It allows the user to rewind or pause, and it results in a much smaller file size.

▲ThrowawayTestr58 minutes ago

I recently tried to change a single word in a PDF and nearly tore my hair out (thank you LibreOffice) I'll definitely keep this in mind for next time, thank you.

▲tkfoss37 minutes ago

Try photopea next time