Re: [whatwg] Hyphenation

<17830.16477.894411.961072@gargle.gargle.HOWL>

Current votes: None.

Also sprach =D8istein E. Andersen:

 > > Prince6 (www.princexml.com) supports these properties:
 > >=20
 > >   hyphenate: none | auto
 > >   hyphenate-dictionary: none | url(...)
 > >   hyphenate-before: <int>
 > >   hyphenate-after: <int>
 > >   hyphenate-lines: none | <int>
 >=20
 > >From http://www.princexml.com/howcome/2006/p6/p6demo2.html:
 >=20
 > > Prince can read the hyphenation format pioneered by TeX and reused=
 by many
 > > other applications. OpenOffice hosts a number of hyphenation dicti=
onaries that
 > > are reusable in Prince6.

 ...
=20
 > This is, however, only one part of TeX's hyphenation system. The nex=
t level is a
 > hyphenation exception dictionary, a list of fully hyphenated words t=
hat would not
 > otherwise be hyphenated correctly.=20

Prince doesn't support exception dictionaries. Is it not possible to
encode exceptions in the hyphenation dictionary?

DSSSL has an 'hyphenation-exceptions' property which takes a list of
strings. I'm unsure if it has been implemented, though.

http://dsssl.netfolder.com/paragraph-flow-object.htm

 > In addition to this, hyphenation can be indicated locally. This is n=
eeded in order to
 > hyphenate words like rec-ord/re-cord and is the only level that deal=
s with
 > spelling changes.

This can be done by supplying your own dictionary through the
'hyphenate-dictionary' property.

 > There are a few additional caveats. For instance, it is not entirely=
 obvious what
 > should be considered to be a `word' or which characters should be al=
lowed in a
 > `word' (given that only `words' can be hyphenated using this kind of=
 algorithms).
 > TeX uses `category codes' to define letters, and Unicode's character=
 classes
 > give a good approximation, but they cannot be redefined to deal with=
 specific
 > issues. In Italian, for instance, dell'opera should be hyphenated de=
ll'o-
 > pera, but opera should not be hyphenated o-pera. (The particular exa=
mple may
 > be wrong, but the principle is correct.) Unless the apostrophe is
 > considered to be a `letter' (a constituent of a `word'), correct pat=
terns do not
 > help, as `dell'opera' will not be considered as one unit during hyph=
enation-point
 > look-up.
 >=20
 > Another example worth mentioning is that Polish and a few other lang=
uages
 > apparently require a hyphenated word like xxx-yyy to be hyphenated x=
xx-
 > -yyy (with an extra hyphen carried over). A truly flexible system wo=
uld allow
 > to specify, e.g., which non-letters to treat as part of words and wh=
ich to give
 > special treatment. (As we all know, TeX hyphenates xxx-yyy as xxx-
 > yyy; in addition, the hyphen prohibits xxx and yyy from being hyphen=
ated,
 > which may or may not be suitable depending on, e.g., column width.)
 >=20
 > How does Prince deal with these issues?

Prince6 does't try to go beyond Tex.

-h&kon
              H=E5kon Wium Lie                          CTO =B0=FEe=AE=AA=

howcome@opera.com                  http://people.opera.com/howcome