From: Woody on
On 02/07/2010 10:18, Jochem Huhmann wrote:
> usenet(a)alienrat.co.uk (Woody) writes:
>
>> A PDF can not be set up in such a way to let you know what a piece of
>> information means.
>>
>> Really at a basic level it is quite useless for structure. Even a table
>> is a collection of words with lines drawn in it - there isn't the
>> constructs inside a pdf to let the information within a table relate to
>> itself.
>>
>> Once information has gone into a PDF, it is just words, any structure it
>> has is completely lost.
>>
>> It is a purely display only format.
>
> There is tagged PDF though, which tries to address exactly this problem.

I would say that it is really starting from the wrong end. Wouldn't it
be better to start with something else that had structure, and put the
display layout into it?

You can embed full XML structures inside a PDF (or anything really,
videos, viruses, whatever), but is that not just trying to re-engineer
something to cope with something it was never designed (and contains no
structure) to do?
Effectively you would just have the information twice

--
Woody
From: Jochem Huhmann on
Woody <usenet(a)alienrat.co.uk> writes:

>> There is tagged PDF though, which tries to address exactly this problem.
>
> I would say that it is really starting from the wrong end. Wouldn't it
> be better to start with something else that had structure, and put the
> display layout into it?

If you start out with just content and structure (like an XML document
in DocBook or such) and generate a tagged PDF from it this point is
somewhat moot, I think.

> You can embed full XML structures inside a PDF (or anything really,
> videos, viruses, whatever), but is that not just trying to re-engineer
> something to cope with something it was never designed (and contains no
> structure) to do?
> Effectively you would just have the information twice

If you first generate a PDF and *then* try to put the structure into it
afterwards, yes. But if you're starting out with a different format
anyway and PDF is just one target format, creating tagged PDF is not a
bad idea at all. At least this allows screenreaders to work, text to be
re-flown, etc.



Jochem

--
"A designer knows he has arrived at perfection not when there is no
longer anything to add, but when there is no longer anything to take away."
- Antoine de Saint-Exupery
From: D.M. Procida on
Jochem Huhmann <joh(a)gmx.net> wrote:

> > Once information has gone into a PDF, it is just words, any structure it
> > has is completely lost.
> >
> > It is a purely display only format.
>
> There is tagged PDF though, which tries to address exactly this problem.

PDF is basically built upon PostScript, a language intended to be
interpreted by laser printers. All it does it tell the device: make
such-and-such a mark here.

It seems like the worst possible starting-point for trying to preserve
semantic content, never mind achieve accessibility.

Daniele
From: Jochem Huhmann on
real-not-anti-spam-address(a)apple-juice.co.uk (D.M. Procida) writes:

> Jochem Huhmann <joh(a)gmx.net> wrote:
>
>> > Once information has gone into a PDF, it is just words, any structure it
>> > has is completely lost.
>> >
>> > It is a purely display only format.
>>
>> There is tagged PDF though, which tries to address exactly this problem.
>
> PDF is basically built upon PostScript, a language intended to be
> interpreted by laser printers. All it does it tell the device: make
> such-and-such a mark here.
>
> It seems like the worst possible starting-point for trying to preserve
> semantic content, never mind achieve accessibility.

I'm certainly not advocating to use PDF as starting point for anything.
PDF is almost always created from something else (which may very well
have some structure in it) and if you have to create PDF anyway, a
structured PDF is better than an unstructured PDF.


Jochem

--
"A designer knows he has arrived at perfection not when there is no
longer anything to add, but when there is no longer anything to take away."
- Antoine de Saint-Exupery
From: Pd on
Woody <usenet(a)alienrat.co.uk> wrote:

> > There's no way a book containing large art prints can be rendered in a
> > form that can be enjoyed on an e-book reader in the same way.
>
> ok, I will give you that. Didn't occur to me as I wouldn't be looking at
> large art prints. However pictures in general I wouldn't have a problem
> with on an iPad style screen (the proper ereaders are very bad at
> pictures), just not crammed into a proper book form.

Plus computer screens can be better at displaying full colour pictures -
especially once we get into 300+ pixels per inch. One pixel on a screen
gives you the full range of colours in the available gamut (is that a
redundancy?), whereas a CMYK image printed at 300dpi takes a lot more
dots to give the illusion of the same colour. A book of large art prints
would need to be printed at a minimum of 600dpi to give the same detail
as an iPad screen, depending on the process used.

--
Pd