From: Lars Enderin on
Jukka Lahtinen wrote:
> Live DVB <livedvb(a)gmail.com> writes:
>
>> Well in the end, after few more hours of work I changed URIEncoding to
>> "windows-1250" at my server.xml, also changed all utf-8 to
>> windows-1250. I then changed the letters with their ascii(hex?)
>
> BAD change. You should use ISO-8859-1 instead, it's a universal standard
> unlike windows-anything.
>
ISO-8859-1 doesn't work for all European languages. There are other
ISO-8859-? variants.

From: bugbear on
Lars Enderin wrote:
> Jukka Lahtinen wrote:
>> Live DVB <livedvb(a)gmail.com> writes:
>>
>>> Well in the end, after few more hours of work I changed URIEncoding to
>>> "windows-1250" at my server.xml, also changed all utf-8 to
>>> windows-1250. I then changed the letters with their ascii(hex?)
>> BAD change. You should use ISO-8859-1 instead, it's a universal standard
>> unlike windows-anything.
>>
> ISO-8859-1 doesn't work for all European languages. There are other
> ISO-8859-? variants.
>

I'm confused. In 2010, coding a website under Java, why
would anyone use anything other than utf-8, giving
full Unicode support?

BugBear
From: RedGrittyBrick on
On 28/04/2010 09:06, bugbear wrote:
> Lars Enderin wrote:
>> Jukka Lahtinen wrote:
>>> Live DVB <livedvb(a)gmail.com> writes:
>>>
>>>> Well in the end, after few more hours of work I changed URIEncoding to
>>>> "windows-1250" at my server.xml, also changed all utf-8 to
>>>> windows-1250. I then changed the letters with their ascii(hex?)
>>> BAD change. You should use ISO-8859-1 instead, it's a universal standard
>>> unlike windows-anything.
>>>
>> ISO-8859-1 doesn't work for all European languages. There are other
>> ISO-8859-? variants.
>>
>
> I'm confused. In 2010, coding a website under Java, why
> would anyone use anything other than utf-8, giving
> full Unicode support?

I think the OP tried that and couldn't get it to work.

Surely there is a UTF-8 solution?

--
RGB
From: Jussi Piitulainen on
Jukka Lahtinen writes:
> Lars Enderin <lars.enderin(a)telia.com> writes:
> > Jukka Lahtinen wrote:
> >> BAD change. You should use ISO-8859-1 instead, it's a universal
> >> standard unlike windows-anything.
>
> > ISO-8859-1 doesn't work for all European languages. There are other
> > ISO-8859-? variants.
>
> Sorry, I forgot that, always using iso-8859-1 myself when writing
> Finnish. (Does windows-1250 support some languages not supported in

A man page, however, notes this about iso-8859-1: "it lacks the EURO
symbol and does not fully cover __Finnish__ and French." Emphasis is
mine. I'm not sure what all is missing for Finnish. S-with-caron is
one, or two, and maybe z-with-caron is counted as needed for
Finnish. Those are rare, but the euro symbol is rather important.

That's why we have iso-8859-15 aka latin-9, one of those less-used
iso-8859 variants. And, of course, UTF-8.

> Anyway, my point was that windows-1250 is platform-specific, meant
> only for M$ systems, while iso standards are platform-independent
> and universally standardized.

Yes.
From: Thomas Pornin on
According to Lars Enderin <lars.enderin(a)telia.com>:
> ISO-8859-1 doesn't work for all European languages.

All European languages cannot possibly fit in a mono-byte,
ASCII-compatible charset. Unless you have a restrictive view of Europa,
which does not include, e.g., Greece or Bulgaria. ISO-8859-1 was aiming
at languages from _Western_ Europe.

If you want to support all European scripts (at least all those which
are used by 1 million people or more), then you will need to use a
charset where at least some characters use more than one byte. In that
case, it makes little sense not to use a Unicode-based charset,
preferably UTF-8.


--Thomas Pornin