I’m looking for a self hosted solution to this problem:

I want to create a full text search index from a collection of PDF manuals (text, not images, I don’t care about OCR here). There is a UI to search for text matches in documents, and clicking a search hit opens the PDF scrolled to where the search hit is (bonus points if the search hit is hilighted)

That’s a very specific problem and I don’t know if there is an existing solution that does exactly what you want.

paperless-ngx does a lot of the things you ask for, it lets you upload pdfs, does OCR and gives you full text search via a web ui. It’s just not made specifically for manuals and it does not highlight the search hits or scrolls to them.

Create a post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

  • Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.
  • No spam posting.
  • Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.
  • Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
  • No trolling.

Resources:

> Any issues on the community? Report it using the report flag.

> Questions? DM the mods!

  • 1 user online
  • 218 users / day
  • 9 users / week
  • 244 users / month
  • 841 users / 6 months
  • 0 subscribers
  • 542 Posts
  • 8.93K Comments
  • Modlog