Extract text from PDF. PDF viewer

  • I am bewildered by the number of different WP tools that I seem to have on my disk. :)

    I only have 2 requirements:

    1- How to view, in Delphi, a PDF file from my application?

    2- How to extract all the text in that file so that I can do searches?

    Thank you,

    Alfred

  • Hello,

    Thank you for that information. I have another couple of questions:

    1- When are you hoping to ship version 3?

    2- Is it OK to incorporate this product into a program that has many users?

    Alfred

    • Official post

    Hi,

    1) I hope to have it within 14 days.

    2) You may incorporate the product subject to the license restrictions (no printer driver, no general PDF tool), provided that ALL developers who took part in developing that product are licensed.

    Our components are not licensed per development machine; they are always licensed by the number of developers involved, directly or indirectly.

    So if you have a medical application developed by 15 developers and only one person works on the reports with the PDF preview of the stored PDF documents, you still need a SITE license.

    Regards,
    Julian

    • Official post

    WPViewPDF V3 is available now - it can even save the text in RTF format.

    The demo uses this code to demonstrate:

    Code
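    f := TFileStream.Create(SaveDialog1.FileName, fmCreate);
    try
      // The extension of the chosen file name is passed as the second argument
      s := WPViewPDF1.GetPageText(pageNo - 1,
             ExtractFileExt(SaveDialog1.FileName));
      // Length(s) is a character count; it equals the byte count
      // only when s is a single-byte string
      if s <> '' then f.Write(s[1], Length(s));
    finally
      f.Free;
    end;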
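
    For the search requirement from the first post, the same page-by-page call could feed a simple text search. The following is only a minimal sketch under stated assumptions: TPageTextFunc, FindFirstPage and FakePageText are hypothetical names that are not part of WPViewPDF, and the page count is passed in by the caller because this thread does not show how the component reports it. In a real application the callback would wrap WPViewPDF1.GetPageText with whatever format argument returns plain text.

    ```pascal
    program SearchPagesSketch;

    {$IFDEF FPC}{$MODE DELPHI}{$ENDIF}
    {$APPTYPE CONSOLE}

    uses
      SysUtils;

    type
      // Hypothetical callback: returns the text of one zero-based page.
      // In a real application it would wrap WPViewPDF1.GetPageText.
      TPageTextFunc = function(PageIndex: Integer): string;

    // Returns the zero-based index of the first page whose text contains
    // Needle (case-insensitive), or -1 if no page matches.
    function FindFirstPage(GetText: TPageTextFunc; PageCount: Integer;
      const Needle: string): Integer;
    var
      i: Integer;
      LowerNeedle: string;
    begin
      Result := -1;
      LowerNeedle := AnsiLowerCase(Needle);
      for i := 0 to PageCount - 1 do
        if Pos(LowerNeedle, AnsiLowerCase(GetText(i))) > 0 then
        begin
          Result := i;
          Exit;
        end;
    end;

    // Stand-in data so the sketch runs without the component installed.
    function FakePageText(PageIndex: Integer): string;
    begin
      case PageIndex of
        0: Result := 'Title page';
        1: Result := 'Invoice number 4711';
      else
        Result := '';
      end;
    end;

    begin
      // Search the two fake pages; the match is on the second page (index 1).
      WriteLn(FindFirstPage(FakePageText, 2, 'invoice'));
    end.
    ```

    Passing the page source as a callback keeps the search logic independent of the component, so it can be tried out before the viewer is wired in.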
  • Thank you for letting me know.

    I downloaded version 2 and tried it out. Unfortunately, the text it produced was of very poor quality, and it took 10 times as long to process the file as another tool that I tested alongside it.

    You can find the file I used for testing here:


    I hope you can work out why this file caused it such problems.