**PDF Compare**

Zarathustra

I need to compare a bunch of PDFs, then find and output where the duplicates occur.

The challenge is that the duplicate pages will not necessarily be in the same order.

I have tried DiffPDF (http://www.qtrac.eu/diffpdf.html), which is free and very good, but it only compares two files at a time.

The PDFs I need to compare are all in the same directory. What I essentially need is something that compares each PDF in the directory against all the others and lists where the duplicates occur, e.g.

Test1.pdf -> Page 1 is the same as Test10.pdf on Page 23 & the same as Test25.pdf on Page 19 etc.

Any ideas, anyone have a solution?!?
 
Sheesh, no idea.

What about converting everything to text, then examining the content with something else (or writing an app to do it for you)? There are a lot of text comparison tools.

I've used this before:
A-PDF Text Extractor Command line (PTCMD) - was $20 or so, I think. Gives a good text output with X/Y coordinates of where each text is found on the page.
http://www.a-pdf.com/
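
For what it's worth, the "convert to text, then compare" idea could be sketched roughly like this in Python. This is only a minimal sketch under some assumptions: it uses the third-party pypdf library for text extraction (not a tool mentioned in this thread), it treats two pages as duplicates when their text matches after collapsing whitespace and ignoring case, and the function names (`extract_pages`, `page_key`, `find_duplicate_pages`, `report`) are made up for illustration. It will not help with scanned, image-only pages that have no extractable text.

```python
import hashlib
import re
from collections import defaultdict
from pathlib import Path


def extract_pages(pdf_path):
    """Return one text string per page (assumption: pypdf is installed)."""
    from pypdf import PdfReader  # third-party; imported lazily on purpose
    return [page.extract_text() or "" for page in PdfReader(str(pdf_path)).pages]


def page_key(text):
    """Hash the page text after collapsing whitespace and lowercasing."""
    normalized = re.sub(r"\s+", " ", text).strip().lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def find_duplicate_pages(pages_by_file):
    """Return groups of (filename, page_number) whose text is identical.

    pages_by_file maps a filename to its list of page texts.
    Page order inside each file does not matter, only the page content.
    """
    seen = defaultdict(list)
    for name, pages in pages_by_file.items():
        for number, text in enumerate(pages, start=1):
            if text.strip():  # blank pages would all "match", so skip them
                seen[page_key(text)].append((name, number))
    return [locations for locations in seen.values() if len(locations) > 1]


def report(directory):
    """Print duplicate pages for every *.pdf file in the given directory."""
    files = sorted(Path(directory).glob("*.pdf"))
    pages_by_file = {f.name: extract_pages(f) for f in files}
    for (name, number), *others in find_duplicate_pages(pages_by_file):
        matches = " & ".join(f"{n} on Page {p}" for n, p in others)
        print(f"{name} -> Page {number} is the same as {matches}")
```

Calling `report("path/to/pdfs")` would then print lines in the same shape as the example in the first post. Because every page is hashed once and grouped by hash, each file does not have to be diffed against every other file pairwise.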
 