diff --git a/batk/btk.css b/batk/btk.css index 68e6c6850..218b86e1d 100755 --- a/batk/btk.css +++ b/batk/btk.css @@ -25,3 +25,20 @@ margin-block-end: 0.5em; line-height: 1; } + + + +.useBlockExamples .charExample .ex { + font-size: 3rem; + line-height: 1.2; + } +.useBlockExamples .charExample.inline .ex { + font-size: 1.4rem; + } + + + + + + + diff --git a/batk/btk.html b/batk/btk.html index 5239659c2..5e3ef6796 100755 --- a/batk/btk.html +++ b/batk/btk.html @@ -58,7 +58,7 @@

Contents

Updated -13 November, 2022 +23 December, 2022

@@ -116,23 +116,26 @@

Usage & history

Basic features

-

The Batak script is an abugida. Consonants carry an inherent vowel which can be modified by appending vowel-signs to the consonant. See the table to the right for a brief overview of features for the modern Batak orthography (the character counts are for a superset of all 5 languages described here). Each language uses its own set of consonants and vowel-signs from this set, and several Unicode code points are specific to a particular language.

-

Balinese text runs left to right in horizontal lines.

+

The Batak script is an abugida. Consonants carry an inherent vowel which can be modified by appending vowel signs to the consonant. See the table to the right for a brief overview of features for the modern Batak orthography (the character counts are for a superset of all 5 languages described here). Each language uses its own set of consonants and vowel signs from this set, and several Unicode code points are specific to a particular language.

-

Words are not separated by spaces, and text segmentation doesn't pay attention to word boundaries.

+

Batak text runs left to right in horizontal lines.

-

The 33 consonant letters in the Unicode Batak block include many duplicates because of variations in the orthography from language to language. Repertoire extensions for 2 non-native sounds are achieved in Mandailing only by applying the tompi diacritic to characters.

+

Words are not separated by spaces, and text segmentation doesn't pay attention to word boundaries.

-

There are no conjuncts or stacks.

-

Syllable-initial clusters appear to be restricted to prenasalised consonants (for which there are 2 dedicated letters) or nasal+consonant, which are written using an unpronounced standalone vowel, and these only occur in Karo.

+

The 33 consonant letters in the Unicode Batak block include many duplicates because of variations in the orthography from language to language. Repertoire extensions for 2 non-native sounds occur in Mandailing only, by applying the tompi diacritic to characters. ❯ consonants

-

Word-final consonant sounds may be represented by 2 final-consonant diacritics. Otherwise, if nothing follows, they are ordinary consonants followed by a visible [U+1BF2 BATAK PANGOLAT] (or [U+1BF3 BATAK PANONGONAN] in Karo and Simalungun).

-

A peculiarity of Batak is that a syllable CVCv (where 'v' represents the vowel-killer) is rendered as CCVv when the vowel is expressed using a vowel-sign.

+

There are no conjuncts or stacks.

-

The Balinese orthography has an inherent vowel, and represents vowels using language-specific selections from the superset of 9 vowel-signs. There are no pre-base vowels or circumgraphs. All vowel-signs are combining marks, and are stored after the base character.

+

Syllable-initial clusters appear to be restricted to prenasalised consonants (for which there are 2 dedicated letters) or nasal+consonant, which are written using an unpronounced standalone vowel, and these only occur in Karo. ❯ onsets

-

Batak has 3 independent vowels (one with 2 alternate shapes) which are optionally mixed with standalone vowels represented by vowel-signs applied to [U+1BC0 BATAK LETTER A] (or [U+1BC1 BATAK LETTER SIMALUNGUN A] in Simalungun).

+

Word-final consonant sounds may be represented by 2 final-consonant diacritics. Otherwise, if nothing follows, they are ordinary consonants followed by a visible [U+1BF2 BATAK PANGOLAT] (or [U+1BF3 BATAK PANONGONAN] in Karo and Simalungun). ❯ finals

+ +

A peculiarity of Batak is that a syllable CVCv (where 'v' represents the vowel-killer) is rendered as CCVv when the vowel is expressed using a vowel sign. ❯ vowel_reorder

+ +

The Batak orthography has an inherent vowel a, and represents vowels using language-specific selections from the superset of 9 vowel signs. There are no pre-base vowels or circumgraphs. All vowel signs are combining marks, and are stored after the base character. ❯ vowels

+ +

Batak has 3 independent vowels (one with 2 alternate shapes) which are optionally mixed with standalone vowels represented by vowel signs applied to [U+1BC0 BATAK LETTER A] (or [U+1BC1 BATAK LETTER SIMALUNGUN A] in Simalungun). ❯ standalone

Batak has very little punctuation, and the punctuation marks that are used generally indicate section boundaries.

@@ -505,130 +508,18 @@

Alphabet

Vowels

-
-

Vowel sounds to characters

- - -

This section maps the vowel sounds of several Batak languages to common graphemes in the Batak orthography, grouped by vowel-sign ( vs ) or standalone ( s ). Click on a grapheme to find other mentions on this page (links appear at the bottom of the page). Click on the character name to see examples and for detailed descriptions of the character(s) shown.

-

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc.

- - - -
-

Plain vowels

- -
-
-
i
-
vs
-
-

[U+1BEA BATAK VOWEL SIGN I] for Karo, Mandailing, Pakpak, and Toba

-

[U+1BEB BATAK VOWEL SIGN KARO I] for Simalungun & as an alternative for Karo 

-
-
-
-
 
-
s
-
-

[U+1BE4 BATAK LETTER I] for all languages

-
-
-
u
-
vs
-
-

[U+1BEE BATAK VOWEL SIGN U] for Mandailing, Pakpak, Simalungun, & Toba

-

[U+1BEC BATAK VOWEL SIGN O] for Karo 

-

[U+1BEF BATAK VOWEL SIGN U FOR SIMALUNGUN SA] for Simalungun, used only with the letter [U+1BD9 BATAK LETTER SIMALUNGUN SA].

-
-
-
-
 
-
s
-
-

[U+1BE5 BATAK LETTER U] for all languages

-
-
-
- - -
-
-
e
-
vs
-
-

[U+1BE9 BATAK VOWEL SIGN EE] for Karo, Mandailing, Pakpak, Simalungun, & Toba

-
-
-
-
o
-
vs
-
-

[U+1BEC BATAK VOWEL SIGN O] for Mandailing, Pakpak, Simalungun, & Toba

-

[U+1BED BATAK VOWEL SIGN KARO O] for Karo

-

[U+1BE8 BATAK VOWEL SIGN PAKPAK E] for Karo 

-
-
-
- - -
-
-
ə
-
vs
-
-

[U+1BE7 BATAK VOWEL SIGN E] for Karo

-

[U+1BE8 BATAK VOWEL SIGN PAKPAK E] for Pakpak 

-
-
-
- -
-
-
a
-
-
-
-

Inherent vowel

-
-
-
-
 
-
s
-
-

[U+1BC0 BATAK LETTER A] for Mandailing, Toba, Karo & Pakpak

-

[U+1BC1 BATAK LETTER SIMALUNGUN A] for Simalungun 

-
-
-
- -
+
+

The Batak orthography has an inherent vowel a, and represents vowels using language-specific selections from the superset of 9 vowel signs. There are no pre-base vowels or circumgraphs. All vowel signs are combining marks, and are stored after the base character.

+

Batak has 3 independent vowels (one with 2 alternate shapes) which are optionally mixed with standalone vowels represented by vowel signs applied to [U+1BC0 BATAK LETTER A] (or [U+1BC1 BATAK LETTER SIMALUNGUN A] in Simalungun).

-
-

Diphthongs and other combinations

-
-
-
ou
-
vs
-
-

[U+1BED BATAK VOWEL SIGN KARO O] - for Simalungun

-
+

For a mapping of sounds to graphemes see vowel_mappings.

-
-
-
- - - - - - -
@@ -639,7 +530,9 @@

Inherent vowel

-

a following a consonant is not written, but is seen as an inherent part of the consonant letter, so ka is written by simply using the consonant letter [U+1BC2 BATAK LETTER HA].

+

a following a consonant is not written, but is seen as an inherent part of the consonant letter, so ka is written by simply using the consonant letter.

+ +

ka [U+1BC2 BATAK LETTER HA]

@@ -651,14 +544,17 @@

Inherent vowel

-

Vowel-signs

+

Vowel signs

+

-

Non-inherent vowel sounds that follow a consonant are represented using vowel-signs, eg. kiː is written ᯂᯪ [U+1BC2 BATAK LETTER HA + U+1BEA BATAK VOWEL SIGN I].

-

An orthography that uses vowel-signs is different from one that uses simple diacritics or letters for vowels in that the vowel-signs may be attached to the syllable as a whole, rather than just applied to the letter of the immediately preceding consonant. See finals.

+

Non-inherent vowel sounds that follow a consonant are represented using vowel signs, eg.

+ +

ᯂᯪ kiː [U+1BC2 BATAK LETTER HA + U+1BEA BATAK VOWEL SIGN I]

-

Batak vowel-signs are all combining characters. All vowel-signs are stored after the base consonant. There are no vowel-signs displayed before the base, and there are no circumgraphs.

-

Four vowel-signs are spacing marks, meaning that they consume horizontal space when added to a base consonant.

+

Batak vowel signs are all combining characters. All vowel signs are stored after the base consonant. There are no vowel signs displayed before the base, and there are no circumgraphs.

+

An orthography that uses vowel signs is different from one that uses simple diacritics or letters for vowels in that the vowel signs may be attached to the syllable onset , rather than just applied to the letter of the immediately preceding consonant. See finals.

+

Four vowel signs are spacing marks, meaning that they consume horizontal space when added to a base consonant.

@@ -700,7 +596,7 @@

Karo

U ligatures

-

The vowel-sign [U+1BEE BATAK VOWEL SIGN U] often ligates with the base character, as can be seen in the following examples:

+

The vowel sign [U+1BEE BATAK VOWEL SIGN U] often ligates with the base character, as can be seen in the following examples:

ᯇᯮ␣ᯅᯮ␣ᯗᯮ␣ᯖᯮ␣ᯑᯮ␣ᯂᯮ␣ᯎᯮ␣ᯐᯮ␣ᯘᯮ␣ᯀᯮ␣ᯔᯮ␣ᯉᯮ␣ᯝᯮ␣ᯋᯮ␣ᯍᯮ␣ᯒᯮ␣ᯞᯮ␣ᯛᯮ
@@ -720,7 +616,7 @@

Standalone vowels

-

Vowel-signs

+

Vowel signs

ᯀᯪ␣ᯀᯫ␣ᯀᯬ␣ᯀᯩ␣ᯀᯧ␣ᯀᯨ␣ᯀᯭ

The normal approach combines a vowel sign with [U+1BC0 BATAK LETTER A] (or [U+1BC1 BATAK LETTER SIMALUNGUN A] in Simalungun).

@@ -738,7 +634,7 @@

Independent vowels

ᯤ␣ᯥ␣ᯀ␣ᯁ
-

Batak has independent vowels for 3 sounds. The use of [U+1BE4 BATAK LETTER I] and [U+1BE5 BATAK LETTER U], rather than the vowel-sign combinations shown above, is optional, and spelling of words may vary even within the same document.ab

+

Batak has independent vowels for 3 sounds. The use of [U+1BE4 BATAK LETTER I] and [U+1BE5 BATAK LETTER U], rather than the vowel sign combinations shown above, is optional, and spelling of words may vary even within the same document.ab

ᯤᯉ ᯥᯑᯉ᯲ @@ -773,12 +669,8 @@

Suppressing the inherent vowel

ᯀᯉᯂ᯳

-

See also finals, which describes some unusual behaviour for final consonants that are preceded by a vowel-sign.

+

See also finals, which describes some unusual behaviour for final consonants that are preceded by a vowel sign.

- - - - @@ -786,235 +678,155 @@

Suppressing the inherent vowel

+
+

Vowel sounds to characters

-
-

Consonants

- - -
-

Consonant sounds to characters

- -

This section maps consonant sounds for languages using the Batak script to common graphemes in the orthography. Click on a grapheme to find other mentions on this page (links appear at the bottom of the page). Click on the character name to see examples and for detailed descriptions of the character(s) shown.

-

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, Sanskrit, etc. If no language is specified, the assignment is valid for all languages covered here.

+

This section maps the vowel sounds of several Batak languages to common graphemes in the Batak orthography, grouped by vowel sign ( vs ) or standalone ( s ). Click on a grapheme to find other mentions on this page (links appear at the bottom of the page). Click on the character name to see examples and for detailed descriptions of the character(s) shown.

+

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc.

+
+

Plain vowels

-
-

Stops

-
+
-
p
-
l
-
-

[U+1BC7 BATAK LETTER PA] for Karo, Mandailing, Pakpak, & Toba

-

[U+1BC8 BATAK LETTER SIMALUNGUN PA] for Simalungun 

+
i
+
vs
+
+

[U+1BEA BATAK VOWEL SIGN I] for Karo, Mandailing, Pakpak, and Toba

+

[U+1BEB BATAK VOWEL SIGN KARO I] for Simalungun & as an alternative for Karo 

-
b
-
l
-
-

[U+1BC5 BATAK LETTER BA] for Mandailing, Pakpak, Toba, & Simalungun

-

[U+1BC6 BATAK LETTER KARO BA] for Karo 

+
 
+
s
+
+

[U+1BE4 BATAK LETTER I] for all languages

+
+
+
u
+
vs
+
+

[U+1BEE BATAK VOWEL SIGN U] for Mandailing, Pakpak, Simalungun, & Toba

+

[U+1BEC BATAK VOWEL SIGN O] for Karo 

+

[U+1BEF BATAK VOWEL SIGN U FOR SIMALUNGUN SA] for Simalungun, used only with the letter [U+1BD9 BATAK LETTER SIMALUNGUN SA].

-
ᵐb
-
l
-
-

[U+1BE3 BATAK LETTER MBA] for Karo

+
 
+
s
+
+

[U+1BE5 BATAK LETTER U] for all languages

+
+ + +
-
t
-
l
-
-

[U+1BD6 BATAK LETTER SOUTHERN TA] for Mandailing, Simalungun, and Toba

-

[U+1BD7 BATAK LETTER NORTHERN TA] for Karo, Pakpak, and also Toba 

+
e
+
vs
+
+

[U+1BE9 BATAK VOWEL SIGN EE] for Karo, Mandailing, Pakpak, Simalungun, & Toba

-
d
-
l
-
-

[U+1BD1 BATAK LETTER DA] for all languages

+
o
+
vs
+
+

[U+1BEC BATAK VOWEL SIGN O] for Mandailing, Pakpak, Simalungun, & Toba

+

[U+1BED BATAK VOWEL SIGN KARO O] for Karo

+

[U+1BE8 BATAK VOWEL SIGN PAKPAK E] for Karo 

+
+ + +
-
ⁿd
-
l
-
-

[U+1BE2 BATAK LETTER NDA] for Karo

+
ə
+
vs
+
+

[U+1BE7 BATAK VOWEL SIGN E] for Karo

+

[U+1BE8 BATAK VOWEL SIGN PAKPAK E] for Pakpak 

+
+ +
-
k
-
l
-
-

[U+1BC2 BATAK LETTER HA] for Pakpak & Karo in both syllable-initial and -final positions, and in syllable-final position only for Toba

-

[U+1BC3 BATAK LETTER SIMALUNGUN HA] for Simalungun (syllable-final only)

-

[U+1BC4 BATAK LETTER MANDAILING HA] for Mandailing in syllable-final position only 

-

ᯄ᯦ [U+1BC4 BATAK LETTER MANDAILING HA + U+1BE6 BATAK SIGN TOMPI] for Mandailing in syllable-initial position using the extended repertoire

+
a
+
-
+
+

Inherent vowel

-
ɡ
-
l
-
-

[U+1BCE BATAK LETTER GA] for Karo, Mandailing, Pakpak, & Toba

-

[U+1BCF BATAK LETTER SIMALUNGUN GA] for Simalungun 

+
 
+
s
+
+

[U+1BC0 BATAK LETTER A] for Mandailing, Toba, Karo & Pakpak

+

[U+1BC1 BATAK LETTER SIMALUNGUN A] for Simalungun 

+
-
-

Affricates

-
-
-
t͡ʃ
-
l
-
-

[U+1BD8 BATAK LETTER SA] for Pakpak 

-

[U+1BE1 BATAK LETTER CA] for Karo

-

[U+1BE0 BATAK LETTER NYA] also for Karo 

-

ᯚ᯦ [U+1BDA BATAK LETTER MANDAILING SA + U+1BE6 BATAK SIGN TOMPI] for Mandailing

-
-
+ + +
+

Diphthongs and other combinations

+
-
d͡ʒ
-
l
-
-

[U+1BD0 BATAK LETTER JA] for all languages

+
ou
+
vs
+
+

[U+1BED BATAK VOWEL SIGN KARO O] + for Simalungun

+
-
-

Fricatives

-
-
-
s
-
l
-
-

[U+1BD8 BATAK LETTER SA] for Toba, Pakpak, and Karo

-

[U+1BDA BATAK LETTER MANDAILING SA] for Mandailing

-

[U+1BD9 BATAK LETTER SIMALUNGUN SA] for Simalungun

-
-
-
-
h
-
l
-
-

[U+1BC4 BATAK LETTER MANDAILING HA] for Mandailing in syllable-initial position

-

[U+1BCC BATAK LETTER SIMALUNGUN WA] for Simalungun in syllable-initial position

-

[U+1BC2 BATAK LETTER HA] for Toba in syllable-initial position

-

  [U+1BC0 BATAK LETTER A] for Pakpak and Karo

-
-
-
-
-
-

Nasals

-
-
-
m
-
l
-
-

[U+1BD4 BATAK LETTER MA] in Karo, Mandailing, Pakpak, & Toba

-

[U+1BD5 BATAK LETTER SIMALUNGUN MA] for Simalungun 

-
-
-
-
n
-
l
-
-

[U+1BC9 BATAK LETTER NA] for all languages

-

[U+1BCA BATAK LETTER MANDAILING NA] as an alternative for Mandailing

-
-
-
-
ɲ
-
l
-
-

[U+1BE0 BATAK LETTER NYA] for Mandailing, Simalungun, & Toba

-
-
-
-
ŋ
-
l
-
-

[U+1BDD BATAK LETTER NGA] for all languages

-
-
-
-
+
+

Consonants

+ -
-

Other

-
-
-
w
-
l
-
-

[U+1BCB BATAK LETTER WA] for Mandailing, Toba, Pakpak, & Karo

-

[U+1BCC BATAK LETTER SIMALUNGUN WA] for Simalungun loan words

-

[U+1BCD BATAK LETTER PAKPAK WA] an alternative shape used for Pakpak & Toba

-
-
-
-
r
-
l
-
-

[U+1BD2 BATAK LETTER RA] for Karo, Mandailing, Pakpak, & Toba

-

[U+1BD3 BATAK LETTER SIMALUNGUN RA] for Simalungun 

-
-
-
-
l
-
l
-
-

[U+1BDE BATAK LETTER LA] for Karo, Mandailing, Pakpak, & Toba

-

[U+1BDF BATAK LETTER SIMALUNGUN LA] for Simalungun 

-
-
-
-
j
-
l
-
-

[U+1BDB BATAK LETTER YA] for Mandailing, Karo, & Pakpak, and in loan words for Toba

-

[U+1BDC BATAK LETTER SIMALUNGUN YA] for Simalungun loan words

-
-
-
-
-
+
+

The 33 consonant letters in the Unicode Batak block include many duplicates because of variations in the orthography from language to language. Repertoire extensions for 2 non-native sounds occur in Mandailing only, by applying the tompi diacritic to characters.

+

There are no conjuncts or stacks.

+

Syllable-initial clusters appear to be restricted to prenasalised consonants (for which there are 2 dedicated letters) or nasal+consonant, which are written using an unpronounced standalone vowel, and these only occur in Karo.

+

Word-final consonant sounds may be represented by 2 final-consonant diacritics. Otherwise, if nothing follows, they are ordinary consonants followed by a visible [U+1BF2 BATAK PANGOLAT] (or [U+1BF3 BATAK PANONGONAN] in Karo and Simalungun).

-
+

A peculiarity of Batak is that a syllable CVCv (where 'v' represents the vowel-killer) is rendered as CCVv when the vowel is expressed using a vowel sign.

+ +

For a mapping of sounds to graphemes see consonant_mappings.

+
@@ -1025,7 +837,7 @@

Basic consonants

-

Stops & affricate

+

Stops

Simalungun
ᯈ␣ᯅ␣ᯖ␣ᯑ␣ᯃ␣ᯏ
@@ -1175,12 +987,12 @@

Final consonants

Vowel reordering

-

Batak has a unique behaviour if a vowel-sign is used between onset and coda consonants.

-

If a vowel-sign is used, the glyphs for the onset and coda consonants are placed side by side, and are followed by the glyphs for the intervening vowel and the vowel-killer. This reordering is produced by the font, and the typed order or order stored in memory remains the same as the spoken order. fig_finals shows an example.

+

Batak has a unique behaviour if a vowel sign is used between onset and coda consonants.

+

If a vowel sign is used, the glyphs for the onset and coda consonants are placed side by side, and are followed by the glyphs for the intervening vowel and the vowel-killer. This reordering is produced by the font, and the typed order or order stored in memory remains the same as the spoken order. fig_finals shows an example.

ᯀᯝᯪᯉ᯲ -
The word a.ŋin ('wind') showing the location of onset and coda consonants and the vowel-sign after they have been rearranged by the font. The in memory or typed order is [a ŋ i n ∅]. +
The word a.ŋin ('wind') showing the location of onset and coda consonants and the vowel sign after they have been rearranged by the font. The in memory or typed order is [a ŋ i n ∅].
@@ -1204,6 +1016,229 @@

Consonant clusters

Batak consonant letters do not interact to create conjuncts.

Where a syllable onset folllows a syllable coda, the lack of vowels is indicated using a vowel killer sign. See novowel and finals.

+ + + + + + +
+

Consonant sounds to characters

+ +

This section maps consonant sounds for languages using the Batak script to common graphemes in the orthography. Click on a grapheme to find other mentions on this page (links appear at the bottom of the page). Click on the character name to see examples and for detailed descriptions of the character(s) shown.

+

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, Sanskrit, etc. If no language is specified, the assignment is valid for all languages covered here.

+ + + + + +
+

Stops

+
+
+
p
+
l
+
+

[U+1BC7 BATAK LETTER PA] for Karo, Mandailing, Pakpak, & Toba

+

[U+1BC8 BATAK LETTER SIMALUNGUN PA] for Simalungun 

+
+
+
+
b
+
l
+
+

[U+1BC5 BATAK LETTER BA] for Mandailing, Pakpak, Toba, & Simalungun

+

[U+1BC6 BATAK LETTER KARO BA] for Karo 

+
+
+
+
ᵐb
+
l
+
+

[U+1BE3 BATAK LETTER MBA] for Karo

+
+
+
+
t
+
l
+
+

[U+1BD6 BATAK LETTER SOUTHERN TA] for Mandailing, Simalungun, and Toba

+

[U+1BD7 BATAK LETTER NORTHERN TA] for Karo, Pakpak, and also Toba 

+
+
+
+
d
+
l
+
+

[U+1BD1 BATAK LETTER DA] for all languages

+
+
+
+
ⁿd
+
l
+
+

[U+1BE2 BATAK LETTER NDA] for Karo

+
+
+
+
k
+
l
+
+

[U+1BC2 BATAK LETTER HA] for Pakpak & Karo in both syllable-initial and -final positions, and in syllable-final position only for Toba

+

[U+1BC3 BATAK LETTER SIMALUNGUN HA] for Simalungun (syllable-final only)

+

[U+1BC4 BATAK LETTER MANDAILING HA] for Mandailing in syllable-final position only 

+

ᯄ᯦ [U+1BC4 BATAK LETTER MANDAILING HA + U+1BE6 BATAK SIGN TOMPI] for Mandailing in syllable-initial position using the extended repertoire

+
+
+
+
ɡ
+
l
+
+

[U+1BCE BATAK LETTER GA] for Karo, Mandailing, Pakpak, & Toba

+

[U+1BCF BATAK LETTER SIMALUNGUN GA] for Simalungun 

+
+
+
+
+ + + + + +
+

Affricates

+
+
+
t͡ʃ
+
l
+
+

[U+1BD8 BATAK LETTER SA] for Pakpak 

+

[U+1BE1 BATAK LETTER CA] for Karo

+

[U+1BE0 BATAK LETTER NYA] also for Karo 

+

ᯚ᯦ [U+1BDA BATAK LETTER MANDAILING SA + U+1BE6 BATAK SIGN TOMPI] for Mandailing

+
+
+
+
d͡ʒ
+
l
+
+

[U+1BD0 BATAK LETTER JA] for all languages

+
+
+
+
+ + + + +
+

Fricatives

+
+
+
s
+
l
+
+

[U+1BD8 BATAK LETTER SA] for Toba, Pakpak, and Karo

+

[U+1BDA BATAK LETTER MANDAILING SA] for Mandailing

+

[U+1BD9 BATAK LETTER SIMALUNGUN SA] for Simalungun

+
+
+
+
h
+
l
+
+

[U+1BC4 BATAK LETTER MANDAILING HA] for Mandailing in syllable-initial position

+

[U+1BCC BATAK LETTER SIMALUNGUN WA] for Simalungun in syllable-initial position

+

[U+1BC2 BATAK LETTER HA] for Toba in syllable-initial position

+

  [U+1BC0 BATAK LETTER A] for Pakpak and Karo

+
+
+
+
+ + + + + +
+

Nasals

+
+
+
m
+
l
+
+

[U+1BD4 BATAK LETTER MA] in Karo, Mandailing, Pakpak, & Toba

+

[U+1BD5 BATAK LETTER SIMALUNGUN MA] for Simalungun 

+
+
+
+
n
+
l
+
+

[U+1BC9 BATAK LETTER NA] for all languages

+

[U+1BCA BATAK LETTER MANDAILING NA] as an alternative for Mandailing

+
+
+
+
ɲ
+
l
+
+

[U+1BE0 BATAK LETTER NYA] for Mandailing, Simalungun, & Toba

+
+
+
+
ŋ
+
l
+
+

[U+1BDD BATAK LETTER NGA] for all languages

+
+
+
+
+ + + + + +
+

Other

+
+
+
w
+
l
+
+

[U+1BCB BATAK LETTER WA] for Mandailing, Toba, Pakpak, & Karo

+

[U+1BCC BATAK LETTER SIMALUNGUN WA] for Simalungun loan words

+

[U+1BCD BATAK LETTER PAKPAK WA] an alternative shape used for Pakpak & Toba

+
+
+
+
r
+
l
+
+

[U+1BD2 BATAK LETTER RA] for Karo, Mandailing, Pakpak, & Toba

+

[U+1BD3 BATAK LETTER SIMALUNGUN RA] for Simalungun 

+
+
+
+
l
+
l
+
+

[U+1BDE BATAK LETTER LA] for Karo, Mandailing, Pakpak, & Toba

+

[U+1BDF BATAK LETTER SIMALUNGUN LA] for Simalungun 

+
+
+
+
j
+
l
+
+

[U+1BDB BATAK LETTER YA] for Mandailing, Karo, & Pakpak, and in loan words for Toba

+

[U+1BDC BATAK LETTER SIMALUNGUN YA] for Simalungun loan words

+
+
+
+
+
@@ -1371,14 +1406,14 @@

Context-based shaping & positioning

Care needs to be taken in certain cases where multiple diacritics are applied to the same base. When both [U+1BE9 BATAK VOWEL SIGN EE] and [U+1BF0 BATAK CONSONANT SIGN NG] occur together they are rendered side by side, rather than one atop the other. The same goes for [U+1BE9 BATAK VOWEL SIGN EE] and [U+1BF1 BATAK CONSONANT SIGN H].

-
+
ᯒᯩᯰ
Multiple combining marks rendered side by side above the base.

Another important aspect of shaping occurs where [U+1BEE BATAK VOWEL SIGN U] ligates with or is positioned differently according to its base. For example:

-
+

ᯖᯮ ᯇᯮ ᯌᯮ ᯞᯮ

Examples of shaping and positioning applied to [U+1BEE BATAK VOWEL SIGN U] when it follows a particular base character.
@@ -1439,7 +1474,7 @@

Basic typographic units

  1. tompi (a kind of nukta)
  2. -
  3. vowel-sign
  4. +
  5. vowel sign
  6. final consonant marks
@@ -1684,13 +1719,13 @@

Line breaking & hyphenation

-

On the other hand, proposals are currently being discussed by the Unicode Script Ad Hoc committee to break after a BCCS except where a vowel-killer appears – in which case, the whole phonetic syllable would be kept together. The latter exception is primarily motivated by the limitations of current rendering systems, which would struggle to achieve the reordering of final-consonant and vowel-sign before a vowel-killer if a line-break intervenes.

+

On the other hand, proposals are currently being discussed by the Unicode Script Ad Hoc committee to break after a BCCS except where a vowel-killer appears – in which case, the whole phonetic syllable would be kept together. The latter exception is primarily motivated by the limitations of current rendering systems, which would struggle to achieve the reordering of final-consonant and vowel sign before a vowel-killer if a line-break intervenes.

-

However, it is not hard to find examples of written Batak where line-breaks can occur before a vowel-sign or a vowel-killer, indicating that line-break opportunities can occur before any spacing glyph. Note that the vowel-sign and vowel-killer are represented in Unicode as combining marks, so we are splitting the BCCS unit here. Note also that the order of characters doesn't change. For examples, see handwritten examples of line-initial vowel-killer, and vowel-sign, and both in this printed text.

+

However, it is not hard to find examples of written Batak where line-breaks can occur before a vowel sign or a vowel-killer, indicating that line-break opportunities can occur before any spacing glyph. Note that the vowel sign and vowel-killer are represented in Unicode as combining marks, so we are splitting the BCCS unit here. Note also that the order of characters doesn't change. For examples, see handwritten examples of line-initial vowel-killer, and vowel sign, and both in this printed text.

@@ -1740,7 +1775,7 @@

Baselines, line height, etc.