PDF Notes

Russell Bateman
last update:


Open HTML to PDF

Here's a Java library that converts HTML/CSS to PDF: https://github.com/danfickle/openhtmltopdf


Apache Tika

Then there's Apache Tika, a content analysis toolkit that can recognize a PDF file from its contents (rather than from its file extension) as well as extract text and graphics from it.
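
To make "detect from its contents" concrete, here's a minimal sketch of the underlying idea using only the JDK (Java 11 or later). It is not Tika's implementation or API. It only checks for the "%PDF-" header that PDF files begin with, whereas Tika's detection covers many formats and is more thorough. The class name PdfSniffer and the method looksLikePdf are made up for this illustration.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper for illustration only; this is not part of Apache Tika.
class PdfSniffer {
    // A PDF file begins with the ASCII header "%PDF-" followed by a version number.
    private static final byte[] PDF_MAGIC = "%PDF-".getBytes(StandardCharsets.US_ASCII);

    /** Returns true if the given leading bytes start with the PDF header. */
    static boolean looksLikePdf(byte[] head) {
        if (head.length < PDF_MAGIC.length) {
            return false; // too short to contain the header
        }
        return Arrays.equals(Arrays.copyOf(head, PDF_MAGIC.length), PDF_MAGIC);
    }

    /** Reads only the first few bytes of the file and checks them; the extension is ignored. */
    static boolean looksLikePdf(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            return looksLikePdf(in.readNBytes(PDF_MAGIC.length));
        }
    }

    public static void main(String[] args) throws IOException {
        // Usage: java PdfSniffer.java file1 file2 ...
        for (String arg : args) {
            Path file = Paths.get(arg);
            System.out.println(file + ": " + (looksLikePdf(file) ? "looks like a PDF" : "not a PDF"));
        }
    }
}
```

Because the check looks at the bytes and not the name, a PDF renamed to report.txt is still recognized and a text file renamed to report.pdf is not. For real work, Tika itself is the better choice since it also handles the text extraction.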