From: sklett on 5 May 2008 14:34

I need to bang out a quick application to extract CCITT-compressed TIFF images from a ton of PDFs. I've used PDFSharp in the past to work with PDFs, but it doesn't support the PDF /CCITTFaxDecode filter.

I've googled the obvious terms to try to find code samples or information about how to accomplish this, but haven't found anything at all.

If anyone here has experience working with PDFs and extracting TIFFs, could you possibly help with the following questions (some are very newbie questions)?

1) Does PDF store image data in a special PDF format, or wrapped in any other objects? In other words, I thought I would be able to write the image stream from the PDF straight to disk and get a TIFF image (hah!), but that isn't the case. Is the image data wrapped in an additional format?

2) Does anyone know of any (free) libraries that can decompress the CCITT codec? I'm not sure whether it's Group 3 or Group 4; I imagine that is in the header of the image data?

Any info greatly appreciated,
Steve
From: sklett on 5 May 2008 15:04

I've found and read the relevant sections of the PDF specification. It appears that the data is NOT wrapped in any additional structures; it's basically a stream of CCITT G3/G4 encoded data.

My challenge now is how to handle that encoded data. I'm wondering whether I can create an Image object from a MemoryStream (the stream from the PDF) and then save it with the proper encoding.

If anyone has experience decoding CCITT-encoded data, I would still really appreciate any tips or help. I will post back if I can get this working.

"sklett" <s(a)s.com> wrote in message news:em3hI9trIHA.4716(a)TK2MSFTNGP06.phx.gbl...
> I need to bang out a quick application to extract CCITT compressed TIF
> images from a ton of PDFs. I've used PDFSharp in the past to work with
> PDFs but it doesn't have support for the PDF /CCITTFaxDecode filter.
>
> I've googled for the obvious terms to try to find some code samples or
> information about how to accomplish what I want but haven't found anything
> at all.
> If anyone here has experience in working with PDFs and extracting TIFFs
> could you possibly help with the following questions (some are very newbie
> questions)
>
> 1) Does PDF store image data in a special PDF format or wrapped in any
> other objects? In other words, I thought I would just be able to write
> the image stream from the PDF to disk and it would result in a TIFF image
> (hah!) but this isn't the case, I'm wondering if the image data is wrapped
> in an additional format?
>
> 2) Anyone know of any (free) libraries that can decompress the CCITT
> codec? I'm not sure if it's group 3 or 4, I imagine that is in the header
> of the image data?
>
> Any info greatly appreciated,
> Steve
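
Since the stream is raw CCITT data with no container, one common approach is to leave it compressed and prepend a minimal TIFF header so that an ordinary TIFF reader can open it. The sketch below is in Python for brevity, not the .NET code the thread is about, and the function names are made up for illustration. It assumes the image is a single strip and that the width, height, /K and /BlackIs1 values have already been read from the PDF image dictionary and its /DecodeParms (in the PDF spec, /K < 0 means Group 4, /K = 0 Group 3 one-dimensional, /K > 0 Group 3 mixed). The photometric mapping is a guess that may need flipping if the image also carries a /Decode array.

```python
import struct


def ccitt_compression_for_k(k):
    """Map the PDF /K parameter to (TIFF Compression, T4Options or None)."""
    if k < 0:
        return 4, None              # Group 4 (T.6), no T4Options tag
    return 3, (1 if k > 0 else 0)   # Group 3 (T.4); bit 0 set = 2-D coding


def wrap_ccitt_in_tiff(data, width, height, k, black_is_1=False):
    """Prepend a minimal little-endian TIFF header to raw CCITT data.

    Hypothetical helper: `data` is the undecoded /CCITTFaxDecode stream,
    treated as one strip covering the whole image.
    """
    compression, t4_options = ccitt_compression_for_k(k)
    # IFD entries as (tag, type, count, value); type 3 = SHORT, 4 = LONG.
    # Tags must appear in ascending order.
    entries = [
        (256, 4, 1, width),                   # ImageWidth
        (257, 4, 1, height),                  # ImageLength
        (258, 3, 1, 1),                       # BitsPerSample
        (259, 3, 1, compression),             # Compression
        (262, 3, 1, 1 if black_is_1 else 0),  # Photometric (assumed mapping)
        (273, 4, 1, 0),                       # StripOffsets, patched below
        (277, 3, 1, 1),                       # SamplesPerPixel
        (278, 4, 1, height),                  # RowsPerStrip
        (279, 4, 1, len(data)),               # StripByteCounts
    ]
    if t4_options is not None:
        entries.append((292, 4, 1, t4_options))  # T4Options (Group 3 only)

    # 8-byte file header + 2-byte entry count + 12 bytes per entry
    # + 4-byte "next IFD" offset; the strip data starts right after that.
    header_size = 8 + 2 + 12 * len(entries) + 4

    ifd = struct.pack("<H", len(entries))
    for tag, field_type, count, value in entries:
        if tag == 273:
            value = header_size
        ifd += struct.pack("<HHLL", tag, field_type, count, value)
    ifd += struct.pack("<L", 0)  # no further IFDs

    # "II" = little-endian, 42 = TIFF magic, 8 = offset of the first IFD.
    return struct.pack("<2sHL", b"II", 42, 8) + ifd + data
```

Writing the returned bytes to a .tif file should give something a TIFF-capable viewer or library can decode, so no separate CCITT decoder is needed. The same header layout can be produced from any language that can write bytes to a stream.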