Re: PDF reader
- Posted by Spock Sep 29, 2015
gwalters said...
Do we have a PDF reader somewhere? I would like to be able to read a PDF version of a simple text printout. As it happens, I cannot save the printout as a plain txt file, only as a PDF. I don't see anything in the archives that could do that. Suggestions would be appreciated.
I use pdftotext for exactly this sort of task. In my office it gets a heavy workout: up to 1000 PDFs processed each day. Once the text is extracted I run a regular expressions library (my own, of course) over the data to pull out what I need. I used to try to work out the coordinates of certain fields, etc., but in the end I found a context-based approach much better.
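To give a rough idea of the context-based approach, here is a minimal sketch in Python (it is not the library mentioned above, just an illustration). It assumes the pdftotext command-line tool is installed and on the PATH. The function names, field labels, and sample text are all made up for the example.

```python
import re
import subprocess


def pdf_to_text(pdf_path):
    """Return the text of a PDF by calling the pdftotext CLI.

    Assumption: pdftotext is installed and on the PATH.
    """
    # "-layout" asks pdftotext to keep the physical layout of the page;
    # "-" as the output file sends the text to stdout.
    result = subprocess.run(
        ["pdftotext", "-layout", pdf_path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def extract_fields(text, labels):
    """Context-based extraction: locate each label and take the rest of its line.

    No page coordinates are used; a field is found by the text around it.
    Labels that do not occur in the text are simply left out of the result.
    """
    fields = {}
    for label in labels:
        pattern = rf"^[ \t]*{re.escape(label)}[ \t]*:[ \t]*(.+?)[ \t]*$"
        match = re.search(pattern, text, re.MULTILINE)
        if match:
            fields[label] = match.group(1)
    return fields


# Hypothetical text, standing in for what pdftotext might produce.
SAMPLE = """\
Invoice No: 12345
   Date:   2015-09-29
Total: 99.50
"""

if __name__ == "__main__":
    print(extract_fields(SAMPLE, ["Invoice No", "Date", "Total"]))
```

With a real file the two steps would be chained, for example extract_fields(pdf_to_text("printout.pdf"), ["Invoice No", "Total"]), where printout.pdf is a placeholder name. Because each field is matched by its label rather than its position, the extraction keeps working if the layout shifts a little between documents.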
EDIT: Whoa! Didn't see the date: 2013
Spock