PDF reader - iText PdfReader not able to read PDF


I am not able to read a PDF file using iText's PdfReader. The PDF appears to be valid when I open it.

URL of the PDF: http://www.fundslibrary.co.uk/fundslibrary.dataretrieval/documents.aspx?type=fund_class_kiid&id=f096b13b-3d0e-4580-8d3d-87cf4d002650&user=fidelitydocumentreport

The PDF in question is encrypted.

According to the PDF specification,

Encryption applies to all strings and streams in the document's PDF file, with the following exceptions:

  • The values for the ID entry in the trailer
  • Any strings in an Encrypt dictionary
  • Any strings that are inside streams such as content streams and compressed object streams, which themselves are encrypted

Later on, the specification describes special cases in which the document-level metadata stream is not encrypted either, or in which only attachments are encrypted.


The cross-reference stream dictionary of the PDF looks like this:

<< /Root 101 0 R /Info 63 0 R /xref(stream) /Encrypt 103 0 R /ID[<d034de62220e1cbc2642ac517f0fe9c7><d034de62220e1cbc2642ac517f0fe9c7>] /Type/XRef /W[1 3 2] /Index[0 107] /Size 107 /Length 642 >>

As you can see, there is a non-encrypted string here, (stream), which is neither the value of the ID entry, nor in an Encrypt dictionary, nor inside a stream. Furthermore, the aforementioned special cases do not apply here either.

Thus, the file violates the PDF specification here. Therefore, this file is not a valid PDF.
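To make this check reproducible, here is a minimal sketch in plain Java (no iText involved) that lists the literal strings, i.e. the (...) objects, in a dictionary snippet. The class and method names are made up for illustration, and the scanner is deliberately simple: it handles nested parentheses and backslash escapes but ignores comments and stream data. It only locates the strings; whether a located string is readable plaintext, as (stream) is here, still has to be judged by eye.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of iText: finds literal strings in a PDF dictionary snippet.
class LiteralStringFinder {

    /** Returns the contents of every literal string "(...)" in the given snippet. */
    static List<String> findLiteralStrings(String dict) {
        List<String> found = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int depth = 0; // nesting depth of parentheses; 0 means "outside any string"
        for (int i = 0; i < dict.length(); i++) {
            char c = dict.charAt(i);
            if (depth > 0 && c == '\\' && i + 1 < dict.length()) {
                // Inside a string: keep the escape sequence verbatim and skip the escaped char.
                current.append(c).append(dict.charAt(++i));
            } else if (c == '(') {
                // The outermost '(' opens the string; nested ones belong to its content.
                if (depth++ > 0) current.append(c);
            } else if (c == ')' && depth > 0) {
                if (--depth == 0) {
                    found.add(current.toString());
                    current.setLength(0);
                } else {
                    current.append(c);
                }
            } else if (depth > 0) {
                current.append(c);
            }
        }
        return found;
    }

    public static void main(String[] args) {
        // The cross-reference stream dictionary quoted above.
        String dict = "<< /Root 101 0 R /Info 63 0 R /xref(stream) /Encrypt 103 0 R "
                + "/ID[<d034de62220e1cbc2642ac517f0fe9c7><d034de62220e1cbc2642ac517f0fe9c7>] "
                + "/Type/XRef /W[1 3 2] /Index[0 107] /Size 107 /Length 642 >>";
        System.out.println(findLiteralStrings(dict));
    }
}
```

The two ID values are hex strings (<...>), so the scanner does not report them; the only literal string it finds in this dictionary is the one in question.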


Furthermore, according to the PDF specification,

The last line of the file shall contain only the end-of-file marker, %%EOF.

The file at hand ends like this:

[Screenshot of the last lines of the file]

Thus, the last line of the file contains something other than the end-of-file marker (which is in the line before), namely the bytes 0x06 and 0x0c.

The file, therefore, violates the PDF specification here, too.
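The end-of-file rule can be checked mechanically as well. The following is a rough sketch in plain Java under a strict reading of the rule: the last line must be exactly %%EOF, with at most one trailing end-of-line (CR, LF, or CR LF) tolerated. The class name, the method name, and the sample bytes in main are assumptions for illustration; the sample only imitates the tail described above and is not copied from the actual file.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of iText: strict check of the PDF end-of-file rule.
class EofMarkerCheck {

    /** True if the last line of the data is exactly "%%EOF" (one trailing EOL is tolerated). */
    static boolean endsWithEofMarker(byte[] data) {
        int end = data.length;
        // Tolerate a single trailing end-of-line: LF, CR, or CR LF.
        if (end > 0 && data[end - 1] == '\n') end--;
        if (end > 0 && data[end - 1] == '\r') end--;
        // Walk back to the start of the last line.
        int start = end;
        while (start > 0 && data[start - 1] != '\n' && data[start - 1] != '\r') start--;
        String lastLine = new String(data, start, end - start, StandardCharsets.ISO_8859_1);
        return lastLine.equals("%%EOF");
    }

    public static void main(String[] args) {
        // Made-up tail of a well-formed file.
        byte[] good = "startxref\n116\n%%EOF\n".getBytes(StandardCharsets.ISO_8859_1);
        // Made-up tail imitating the file at hand: %%EOF followed by a line holding 0x06 0x0c.
        byte[] bad = "startxref\n116\n%%EOF\r\n\u0006\u000c".getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(endsWithEofMarker(good));
        System.out.println(endsWithEofMarker(bad));
    }
}
```

To run it against a real file, the bytes could be loaded with java.nio.file.Files.readAllBytes. Many PDF readers are more lenient than this check and merely search for %%EOF near the end of the file, so a failure here shows a specification violation, not necessarily the reason a particular reader rejects the file.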

