parsing - PDFClown - Highlight words in PDF

September 15, 2014

I have a requirement to search for a set of strings in a PDF and, if they are found, highlight them. I followed the example at the link below to read a PDF page, extract its text, and highlight words in the PDF. I can parse the PDF and extract the text:

    // 2.1. Extract the page text!
    Map<Rectangle2D,List<ITextString>> textStrings = textExtractor.extract(page);

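Reading the text works, but the issue is this: the PDF page has two paragraphs laid out as columns, and the extracted strings in "textStrings" show the first three lines read from the first column (first paragraph), then the next three lines read from the second column (second paragraph). That order is not correct. Is there a way to make the parser read the first paragraph completely, then the second paragraph, and then, if the page has an index or references section below them, that section as a third paragraph?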
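To make the desired order concrete, here is a minimal sketch of the reordering step in plain Java. It does not use the PDF Clown API: `Fragment`, `readingOrder`, `splitX`, and `columnsBottomY` are hypothetical names introduced only for illustration. It assumes that each extracted line has already been reduced to its left x, top y, and text, that y grows downward, and that the column boundary and the bottom of the column area are known for the page.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class ColumnOrder {
    // Hypothetical holder for one extracted line: left x, top y, and its text.
    static final class Fragment {
        final double x;
        final double y;
        final String text;

        Fragment(double x, double y, String text) {
            this.x = x;
            this.y = y;
            this.text = text;
        }
    }

    // Region 0: left column, region 1: right column,
    // region 2: anything at or below columnsBottomY (e.g. an index section).
    static int region(Fragment f, double splitX, double columnsBottomY) {
        if (f.y >= columnsBottomY) {
            return 2;
        }
        return f.x < splitX ? 0 : 1;
    }

    // Returns the texts sorted by region first, then top to bottom within each region.
    static List<String> readingOrder(List<Fragment> fragments,
                                     double splitX,
                                     double columnsBottomY) {
        List<Fragment> sorted = new ArrayList<>(fragments);
        sorted.sort(Comparator
                .comparingInt((Fragment f) -> region(f, splitX, columnsBottomY))
                .thenComparingDouble(f -> f.y));
        List<String> texts = new ArrayList<>();
        for (Fragment f : sorted) {
            texts.add(f.text);
        }
        return texts;
    }

    public static void main(String[] args) {
        // Lines in the interleaved order described above (assumed coordinates).
        List<Fragment> extracted = new ArrayList<>();
        extracted.add(new Fragment(50, 100, "left line 1"));
        extracted.add(new Fragment(320, 100, "right line 1"));
        extracted.add(new Fragment(50, 115, "left line 2"));
        extracted.add(new Fragment(320, 115, "right line 2"));
        extracted.add(new Fragment(50, 400, "index line"));
        for (String line : readingOrder(extracted, 300, 350)) {
            System.out.println(line);
        }
    }
}
```

The sketch only sorts; the highlighting would still use the original boxes, so reordering the text for searching should not affect where the highlights land. The fixed `splitX` and `columnsBottomY` values are the weak point, since a real page would need them measured or detected.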

I would appreciate any kind of help!

Thanks!
