Book Pahlavi

Download as pdf or txt
Download as pdf or txt
You are on page 1of 40

L2/18-276

2018-08-26

Preliminary proposal to encode Book Pahlavi in Unicode

Anshuman Pandey
[email protected]

August 26, 2018

1 Introduction

This is a proposal to encode the ‘Book Pahlavi’ script in Unicode. Other proposals for the script have been
submitted previously by different authors:

• 1993: “Unicode Technical Report #3”, Rick McGowan and Joe Becker

• 2007: “Preliminary proposal to encode the Book Pahlavi script in the BMP of the UCS” (L2/07-234),
Michael Everson, Roozbeh Pournader, and Desmond Durkin-Meisterernst

• 2013: “Preliminary proposal to encode the Book Pahlavi script in the Unicode Standard” (L2/13-141),
Roozbeh Pournader

• 2014: “Proposal for Encoding Book Pahlavi in the Unicode Standard” (L2/14-077), Abe Meyers

The present proposal differs from them by offering:

• an encoding that aligns with Unicode principles and the character-glyph model

• a character repertoire based upon semantically distinctive letters, numbers, and signs that can be used
for completely representing the script

• a model that supports the joining structure of the script and variations in the joining behavior of letters

• detailed information on orthography, ligatures, and properties of characters

This document is concerned primarily with presenting an encoding model for Book Pahlavi that provides
for the full encoding of printed texts, as these records are currently used by the Zoroastrian and Parsi com-
munities. I am actively conducting research to develop and expand the model. Towards that end, I request
feedback from experts and users of the script. A comparison of the advantages of my proposed encoding
with previous proposals will be offered in the formal proposal, which is forthcoming. The formal proposal
will also include additional background information and a set of specimens of usage. At present, the figures
provided in the previous proposals should be consulted.

1
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

2 Background

The ‘Book Pahlavi’ script is used for writing the Iranian language known as ‘Middle Persian’ (ISO 639-3:
pal). Originally spoken in southwestern Iran, this language began to flourish during the 3rd century with
the rise of the Sasanian dynasty, which succeeded the Parthian dynasty in 224 . Middle Persian was used
as a prestige language during the Sasanian dynasty, but began to decline after the Arab invasion in 651.

The script is one of three ‘Pahlavi’ writing systems (see table 1). The earliest is known as ‘Inscriptional
Pahlavi’. It is derived from the Parthian script, which evolved from a form of Imperial Aramaic. The
inscriptional Pahlavi script is a non-cursive abjad. The ‘Psalter Pahlavi’ is a full cursive joining abjad.
derived from the inscriptional form. It is attested in the Syriac Psalter, a Christian manuscript consisting of
twelve extant folios, from the c. 5th century . The ‘Book Pahlavi’ is the most well-known of these scripts
and has the largest extant corpus. It developed from the inscriptional type. Of the three, only Book Pahlavi
remains unencoded in Unicode.

The labels ‘inscriptional’ and ‘book’ are scholarly classifications based upon strict assessments of application
of the Pahlavi scripts in the available records. Although described as ‘book’ on account of its usage in
Zorosatrian literature, the script also occurs in inscriptions, coins, seals, and ostraca. From the perspective of
script encoding, the terms ‘inscriptional’, ’psalter’, and ‘book’ refer to the structure of the scripts, particularly
the lapidary nature of the ‘inscriptional’ type and the connected or cursive nature of the ‘psalter’ and ‘book’
forms.

Although common usage of Book Pahlavi declined after the introduction of the Arabic script in the 7th cen-
tury, it was maintained as an important liturgical and literary script. Alongside the Avestan script, Book
Pahlavi continues to possess significance for the Zoroastrian community. The extant literature of Zoroas-
trianism is written in these scripts. Book Pahlavi was adapted for printing in the late 19th century, and
Zoroastrian texts and Middle Persian grammatical studies continue to be printed in India in the script. The
script is also actively studied by scholars, especially of Middle Persian language and linguistics, and the
history and culture of pre-Islamic Iran.

3 Proposed Repertoire

The proposed repertoire for Book Pahlavi contains 29 characters:

• 20 letters
• 2 fixed-form letters
• 2 special ligatures
• 1 word ligature
• 1 particle
• 8 combining signs
• 1 end-of-word mark
• 2 punctuation signs
• 5 numbers

The code chart and names list follows p. 38. The encoded set may differ from traditional and scholarly in-
ventories of the script that occur in manuscript, inscriptional, and printed sources. Such differences naturally

2
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

arise from the requirements for digitally representing a script in plain text and for preserving the semantics
of characters.

Unicode character names are based upon those of ‘Imperial Aramaic’ characters. This convention has been
followed for Unicode encodings for related scripts, eg. Inscriptional Pahlavi and Psalter Pahlavi.

In this document names in italics refer to scholarly names for graphemes while names in small capitals refer
to Unicode characters, eg. 𐮱 is beth and . For sake of brevity, the descriptor
‘ ’ is dropped when refering to Book Pahlavi characters, eg.
may be referred to as . For letters that have been unified as one character, the graphemes may
be referred to using the names of the individual letters, while the character is known using the compound
name. For example, 𐮰 is the character - , but may be referred to as either
aleph or heth in discussion of the individual graphemes. Characters of other scripts are designated by their
full Unicode names. Latin transliteration of Book Pahlavi follows the current scholarly convention, with
Aramaic heterograms given in uppercase letters.

3.1 Letters

The following 20 basic letters are proposed. Details on the joining behavior of letters is given in § 5.2.

Character name Glyph Joining Latin

- 𐮰 dual
ʾ, h, x
𐮱 right b

- - 𐮲 dual g, d, y

𐮳 right d

𐮴 right h

- - - 𐮵 right w, n, , r
ʿ
𐮶 dual z

𐮷 right k

𐮸 right k

𐮹 dual r

𐮺 dual l, r

𐮻 dual l

3
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

𐮼 right l

- 𐮽 dual m, q

𐮾 dual s

𐮿 dual s

𐯀 right p

𐯁 right c

𐯂 dual š

𐯃 right t

3.2 Fixed-form letters

The following two ‘fixed-form’ characters are proposed in order to represent the respective letters when
they occur in cases where their normal joining behavior is suspended (see § 6.2 and § 6.4.2). If the dif-
ferent behaviors described in the aforementioned sections may be produced using existing Unicode control
characters, then these ‘fixed-forms’ letters may be removed from the proposed repertoire.

Character name Glyph Joining Latin

- - 𐯓 dual
ʾ, h, x
- - - 𐯔 dual g, d, y

3.3 Special Ligatures

The following 2 special ligatures are encoded as atomic characters and their character names are based upon
scholarly usage:

Character name Glyph Joining Latin

1 𐯄 non x1

2 𐯅 non x2

4
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

3.4 Word Ligature

The following character is the word for Ahriman, the Zoroastrian antagonist, rotated 180° counter-clockwise.
The orientation carries the metaphor of turning away the negative spirit. It occurs primarily in Pahlavi texts of
the 9th–12th centuries. It is proposed as an atomic character in order to provide a means for its representation
in plain text.

Character name Glyph Joining Latin

𐯆 non
ʾhlmn

3.5 Particle

The following character represents the Aramaic heterogram ZY. It is proposed as an atomic character in order
to provide for its representation in plain text.

Character name Glyph Joining Latin

𐯇 non ZY

3.6 Combining signs

The following 8 combining signs are used for distinguishing different values for letters that have the same
shape:

Character name Glyph

◌𐯈

◌𐯉

◌𐯊

◌𐯋

◌𐯌

◌𐯍

◌𐯎

◌𐯏

5
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

3.7 End of word mark

The following character is used for marking the end of a word. Also known in some scholarly works as
the ‘otiose stroke’, it is used only after letters that do not connect to the left. This character resembles 𐮵
- - - , but it is encoded as a separate character on account of its character semantics. It is a
non-joining character that is used solely for delimiting words.

Character name Glyph Joining Latin

𐯐 non .

3.8 Punctuation

The following two signs of punctuation occur in manuscripts and printed works. They resemble punctuation
already encoded in the Avesta block, ie. 𐬺 + 10B3C and
𐬾 + 10B3E . The difference is that the Book Pahlavi
punctuation are not ‘tiny’ or ‘large’ as the Avestan signs, but are of a ‘medium’ or ‘normal’ size. The below
characters are, therefore, encoded separately in order to accurately represent the proportions of the signs with
surround text.

Character name Glyph

𐯑
𐯒

3.9 Numbers

Character name Glyph Joining Latin

𐯕 right 1

𐯖 right 2

𐯗 right 3

𐯘 right 4

𐯙 right 100

6
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

4 Script Details

4.1 Structure

Book Pahlavi is a cursive joining abjad. It is written from right to left, with lines that advance from top to
bottom.

4.2 Layout considerations

Letters are written on a baseline. The nominal forms of letters are shown below where they occur in relation
to the baseline:

𐯃𐯂𐯁𐯀𐮿𐮾𐮽𐮼𐮻𐮺𐮹𐮸𐮷𐮶𐮵𐮴𐮳𐮲𐮱𐮰
The ‘baseline’ is not readily apparent. It may be established by taking the baselines of the nominal shapes of
the letters - , - - , , , and alternate forms of the latter. The typical
‘head-height’ may be established by the heights of - , , - - , etc. Accord-
ingly, all other letters have features that are either ascending or descending.

4.3 Punctuation

Spaces are commonly used for separating words. The proposed signs of punctuation are used for indicating
text segments of varying length.

4.4 Line-breaking

There are no formal rules for the breaking of words at the end of line. Moreover, the available sources do not
contain text with words broken across lines. It may be assumed that words were not split at line boundaries.
There are no indications of hyphens or other continuation marks. In digital layouts, line-breaks should occur
occur after words.

4.5 Collation

The sort order of the letters follows the encoded order:

𐮰 - < 𐮱 < 𐮲 - - < 𐮳 < 𐮴 <


𐮵 - - - < 𐮷 < 𐮸 < 𐮹 <
𐮺 < 𐮻 < 𐮼 < 𐮽 - <
𐮾 < 𐮿 < 𐯀 < 𐯂 < 𐯃

7
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

5 Joining behavior

5.1 Analysis of ligatures

It is commonly said that Book Pahlavi has numerous ‘standard’ or ‘obligatory’ ligatures. Previous proposals
for encoding the script did not provide a thorough analysis of these ligatures. However, examples of ligatures
are provided in published materials. Such statements and absence of information on ligatures are based upon
a lack of understanding of the joining rules for the script.

To be fair, there is no manuscript or scholarly manual that is readily available that specifies such rules. The
ambiguity of certain sequences of letters further adds to the supposed complexity of ligatures in the script.
Nonetheless, the first step in understanding such ligatures is to analyze the joining behavior of each letter of
Book Pahlavi. This process permits a practical method of analyzing all ligatures in the script.

The word šāhān ‘kings’ (pl. of šāh ‘king’) is written using the following letters:

nun aleph heth aleph shin

𐮵 𐮰 𐮰 𐮰 𐯂
According to the rules of the script, these five letters are not strung along as

𐰉𐯧𐯧𐯧𐱐
But, are rendered according to the rules of the script as:

𐰉𐯧𐯪𐯪𐱕
In the above, the original shapes of the underlying letters are not easily recognizable, with the exception of
the nun, and perhaps the penultimate aleph. For this reason, encoding 𐰉𐯧𐯪𐯪𐱕 into its constituent characters
is difficult. Without knowing the joining behavior of letters, one could conjure up several different ways of
analyzing the cursive properties of the letters.

One method is to segment ligatures into primitive graphical components, as was done for producing metal
types. Such an approach, however, is quite subjective. It provides for numerous dissections of the ligature
into glyphic elements. For example:

8
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

 

 𐰉 𐯵 𐱓 𐱓 𐱓 𐱓 


 


 


 





𐰉 𐲚 𐲣 𐲣 𐰄 𐱓 




 


 




𐰉 𐲚 𐲣 𐲣 𐲣 𐰄 



 


 





𐰉 𐲚 𐰄 𐲤 𐰄 𐲤 𐰄 𐲤 𐰄 



𐰉𐯧𐯪𐯪𐱕  𐰉 𐯵 𐲤 𐰄 𐲤 𐰄 𐲤 𐰄 𐲤 𐰄 
𐮵+𐮰+𐮰+𐮰+𐯂

 


 





𐰉 𐲚 𐰄 𐱓 𐱓 𐱓 




 


 




𐰉 𐲛 𐱓 𐱓 𐱓 



 


 





𐰉 𐲛 𐱓𐱓𐱓 




 

 𐰉 𐯵 𐱓𐱓𐱓𐱓 

There are many other possibilities. Composing Book Pahlavi text using a glyphic model was certainly fea-
sible for metal printing. For that purpose, it was sufficient to graphically reproduce the text of a particular
book or manuscript. But, such an approach is not useful for representation of Book Pahlavi texts in a digital
medium. It is necessary to represent the underlying characters, more than their graphical appearance. Instead
of stringing together a sequence of graphical primitives, it is more valuable from a plain text perspective to
use characters that correspond to letters of the script, as this transmits semantic values and identities, and to
use font technologies to render the ligatures.

As described in § 4.2, Book Pahlavi letters may be considered to be written on a baseline. The

𐯐𐯃𐮵𐯲 𐯨𐯪𐱕 𐯐𐰉𐯧𐰟𐯶𐯦 𐮵 𐯨𐯪𐱕 𐯐𐰉𐯧𐯪𐯪𐱕 𐯐𐱍𐱀𐯦𐱙𐱗𐮵


wištāsp šāhān šāh ud ērān šāh būd
<wšt sp′ š h n′ š h w yl n′ š h bwt′>
ʾ ʾʾ ʾ ʾ ʾ ʾ
Wištāsp was the king of kings and the king of the Iranians.

The joining rules of certain letters specify that the connection to the next letter occurs not at the baseline,
but using a loop that descends basically a full x-height before curving back up to the baseline to join the
next letter. In this regard, it is the responsibility of a given letter to ensure that it joins to the following letter
according to the rules. This should be applied to typography as well.

Based upon these rules, the cursive connections for producing šāhān are as follows:
{ }
𐰉𐯧𐯪𐯪𐱕 𐰉 𐯧 𐯪 𐯪 𐱕 𐮵+𐮰+𐮰+𐮰+𐯂

9
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

5.2 Joining features

Book Pahlavi letters are traditionally divided into two sets: seven dual-joining and seven right-joining letters.
Alternate forms of letters have the same joining properties as the conventional letter. The isolated or nominal
forms of letters are typically identical to their initial forms.

The joining features of the dual-joining letters are shown below:

Xn Xf Xm Xi

- 𐮰 𐯨- 𐯪- , -𐯫- , -𐯧- 𐯩 , -𐯫 , -𐯦
- - 𐮲 𐯷- -𐰅-, -𐯼-, -𐯵- -𐰄 , -𐯻 , -𐯴

𐮶 𐰌- -𐰌- -𐰎, -𐰋

𐮹 𐰝-, 𐰙- -𐰜-, -𐰗- -𐰚, -𐰕

𐮺 𐰭-, 𐰩- -𐰬-, -𐰧- -𐰪, -𐰥

𐮻 𐰱- -𐰰- -𐰯

- 𐮽 𐰻 𐰶 𐰲
𐮾 𐱂- -𐱂- -𐰿

𐮿 𐱊 -𐱉- -𐱈

𐯂 𐱒- -𐱔- -𐱓

10
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

The joining features of the right-joining letters are shown below:

Xn Xf Xi

𐮱 𐯱- 𐮱
𐮳 𐮳- 𐮳
𐮴 𐰇- 𐮴
- - - 𐮵 𐰉- 𐮵
𐮷 𐰑-, 𐰒- 𐮷
𐮸 𐰔- 𐰓
𐮼 𐰮- 𐮼
𐯀 𐱋-, 𐱍-, 𐱌- 𐯀
𐯁 𐱎- 𐯁
𐯃 𐯃| 𐯃

In order to develop a preliminary encoding model for Book Pahlavi, I have analyzed a variety of texts in
order to understand and identify the rules for connections between letters, as well as the contextual forms of
letters in cursive contexts. I provide these details in the next section.

11
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

6 Description of Letters

6.1 aleph, heth

The Book Pahlavi letters aleph and heth have the same shape and joining behavior. For this reason they are
unified into the single character 𐮰 - . This character is a dual-joining letter and has the following
behahior:

Initial -𐯦 before all letters except those below


also before 𐮲 - - in certain cases (see § 6.2)

𐯩 before 𐮰 - ,𐮲 - - ,𐯃

-𐯫 before 𐯀 ,𐯁

Medial -𐯧- before all letters except those below

𐯩- before 𐮰 - ,𐮲 - - ,𐯃

-𐯫- before 𐯀 ,𐯁

Final 𐯨- after all letters

The regular behavior of - is illustrated below. In some words, when - precedes -


- , its regular joining behavior is suspended, and its nominal form is used instead. This behavior
is described in detailed in § 6.2.

< zg>
ʾ azg branch 𐯷𐰌𐯦 𐮰 - ,
𐯷𐰌𐯦 𐮶 ,
𐮲 - -

< pyckyh>
ʾ abēzagīh purity 𐯨𐯾𐮷𐱋𐯻𐱋𐯫 𐮰 - ,
𐯨𐯾𐮷𐱋𐯻𐱋𐯫 𐯀 ,
𐮲 - - ,
𐯁 ,
𐮷 ,
𐮲 - - ,
𐮰 -

< thš>
ʾ ātaxš fire 𐱒𐯦𐱙𐯩 𐮰 - ,
𐱒𐯦𐱙𐯩 𐯃 ,
𐮰 - ,
𐯂

12
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

<b ht>
ʾ baxt destiny 𐱙𐯮𐯩𐯯 𐮱 ,
𐱙𐯮𐯩𐯯 𐮰 - ,
𐮰 - ,
𐯃
<GBRʾ> mard man 𐮰𐮵𐯰𐯴 𐮲 - - ,
𐮰𐮵𐯰𐯴 𐮱 ,
𐮵 - - - ,
𐮰 -

< š dyh>
ʾʾ ašāyīh righteousness 𐯨𐯿𐯸𐯪𐱖𐯦 𐮰 - ,
𐯨𐯿𐯸𐯪𐱖𐯦 𐯂 ,
𐮰 - ,
𐮲 - - ,
𐮲 - - ,
𐮰 -

<g h>
ʾ gāh special place, 𐯨𐯪𐯾 𐮲 - - ,
throne 𐯨𐯪𐯾 𐮰 - ,
𐮰 -

<g h n>
ʾʾ gāhān the Gathas 𐰉𐯧𐯪𐯪𐯾 𐮲 - - ,
𐰉𐯧𐯪𐯪𐯾 𐮰 - ,
𐮰 - ,
𐮰 - ,
𐮵 - - -

<d’h’k’n> dehgān landowner 𐰉𐯦𐰑𐯧𐯪𐯾 𐮲 - - ,


𐰉𐯦𐰑𐯧𐯪𐯾 𐮰 - ,
𐮰 - ,
𐮷 ,
𐮰 - ,
𐮵 - - -

<dhywpt> dahībed lord of the land 𐯃𐯀𐰉𐯸𐯪𐯾 𐮲 - - ,


𐯃𐯀𐰉𐯸𐯪𐯾 𐮰 - ,
𐮲 - - ,
𐮵 - - - ,
𐯀 ,
𐯃
<d t l>
ʾʾ dādār creator 𐰙𐯦𐱙𐯮𐯾 𐮲 - - ,
𐰙𐯦𐱙𐯮𐯾 𐮰 - ,
𐯃 ,
𐮰 - ,
𐮹

13
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

<hm hl>
ʾ hamahl someone of 𐰙𐯧𐯪𐰸𐯦 𐮰 - ,
equal social 𐰙𐯧𐯪𐰸𐯦 𐮰 - ,
standing 𐮽 ,
𐮰 - ,
𐮵 - - -

<z hr>
ʾ zahr poison, venom 𐰙𐯧𐯪𐰏 𐮶 ,
𐰙𐯧𐯪𐰏 𐮰 - ,
𐮰 - ,
𐮵 - - -

<l tyh>
ʾ rādīh generosity 𐯨𐯾𐱙𐯮𐰞 𐮹 ,
𐯨𐯾𐱙𐯮𐰞 𐮰 - ,
𐯃 ,
𐮲 - - ,
𐮰 -

<š h>
ʾ šāh king 𐯨𐯪𐱕 𐯂 ,
𐯨𐯪𐱕 𐮰 - ,
𐮰 -

6.2 ‘fixed-form’ aleph, heth

In some words, when 𐮰 - precedes 𐮲 - - , its regular joining behavior is sus-


pended, and its nominal form is used instead. This behavior is morphological in nature and cannot be pre-
dicted using conventional rules of the script. Instead of using a control charater for modifying the regular
behavior of - , a ‘fixed’ form of the letter is proposed for encoding: 𐯓 - - .
If experts agree that the representations below may be suitably represented using a control character, then
the ‘fixed-form’ letter may be withdrawn.

While the ‘fixed’ - is used before - - in attested records, it may technically


occur before any letter in modern encoded texts. In both of these cases, the following letter is rendered
according to its own joining behavior. In the examples below, representations of both regular and ‘fixed’
- are given for purposes of comparison:

abāyišnīg <’p’dšnyk> pleasing, 𐰒𐰄𐰉𐱑𐯸𐯦𐱋𐯫 𐮰 - ,


attractive 𐰒𐰄𐰉𐱑𐯸𐯦𐱋𐯫 𐯀 ,
𐯓 - ,
𐮲 - - ,
𐯂 ,
𐮵 - - - ,
𐮲 - - ,
𐮷

14
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

ēg <’DYN> then 𐰉𐯸𐯿𐯦 𐯓 - ,


𐰉𐯸𐯿𐯦 𐮲 - - ,
𐮲 - - ,
𐮵 - - -

ēk <’dwk> one 𐮷𐰉𐯸𐯩 𐮰 - ,


𐮷𐰉𐯸𐯩 𐮲 - - ,
𐮵 - - - ,
𐮷
ay <’y> O! (exclam. 𐯺𐯦 𐯓 - ,
part.) 𐯺𐯦 𐮲 - -

ēkānag <’ywk’nk> single, 𐮷𐰉𐯦𐮷𐰉𐯸𐯩 𐮰 - ,


identical 𐮷𐰉𐯦𐮷𐰉𐯸𐯩 𐮲 - - ,
𐮵 - - - ,
𐮷 ,
𐮰 - ,
𐮵 - - - ,
𐮷
kū <’YK> where? 𐰔𐯸𐯦 𐯓 - ,
that, so that 𐰔𐯸𐯦 𐮲 - - ,
𐮸
kas <’YŠ> person, body 𐱒𐯸𐯦 𐯓 - ,
𐱒𐯸𐯦 𐮲 - - ,
𐯂
ēdōn <’ytwn> thus, in this 𐮵𐮵𐱙𐰂𐯦 𐯓 - ,
way 𐮵𐮵𐱙𐰂𐯦 𐮲 - - ,
𐯃 ,
𐮵 - - - ,
𐮵 - - -

ēč <’yc> something 𐱋𐯼𐯦 𐯓 - ,


𐱋𐯼𐯦 𐮲 - - ,
𐯁
ašāyīh <’š’dyh> righteousness 𐯨𐯿𐯸𐯪𐱖𐯦 𐮰 - ,
𐯨𐯿𐯸𐯪𐱖𐯦 𐯂 ,
𐮰 - ,
𐮲 - - ,
𐮲 - - ,
𐮰 -

15
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

Ērānwēz <’yl’nwyc> mythical 𐱋𐯻𐮵𐰉𐯧𐰟𐯸𐯦 𐯓 - ,


homeland 𐮲 - - ,
of the 𐱋𐯻𐮵𐰉𐯧𐰟𐯸𐯦 𐮹 ,
Iranians 𐮰 - ,
𐮵 - - - ,
𐮵 - - - ,
𐮲 - - ,
𐯁
gyān <HY’> soul 𐯨𐯿𐯦 𐯓 - ,
𐯨𐯿𐯦 𐮲 - - ,
𐮰 -

huparistā <hwplst’y> of good 𐯺𐯦𐱙𐱆𐰕𐯀𐰉𐯦 𐮰 - ,


service 𐮵 - - - ,
𐯺𐯦𐱙𐱆𐰕𐯀𐰉𐯦 𐯀 ,
𐮹 ,
𐮾 ,
𐯃 ,
𐯓 - ,
𐮲 - -

way <w’d> bird 𐯺𐯦𐮵 𐮵 - - - ,


𐯺𐯦𐮵 𐯓 - ,
𐮲 - -

rāy <l’d> possessive 𐯺𐯧𐰕 𐮹 ,


postposition 𐯺𐯧𐰕 𐯓 - ,
𐮲 - -

nāyīzag <n’yck> reed, straw, 𐮷𐱋𐯼𐯦𐮵 𐮵 - - - ,


tube 𐮷𐱋𐯼𐯦𐮵 𐯓 - ,
𐮲 - - ,
𐯁 ,
𐮷

6.3 Beth

The letter beth is represented using 𐮱 . It is a right-joining letter. Its joining behavior is:

Final 𐯱 after all dual-joining letters

Letters that follow beth are written after the right descender of 𐮱 and above the horizontal stroke, nested
within the letter. The behavior of is illustrated below.

16
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

<ʾB> pid father 𐯱𐯦 𐮰 - ,


𐯱𐯦 𐮱

<’hlwb> ahlaw righteous 𐮱𐰉𐰗𐯧𐯩 𐮰 - ,


𐮰 - ,
𐮱𐰉𐰗𐯧𐯩 𐮹 ,
𐮵 - - - ,
𐮱
<bg> bay god, majesty 𐮲𐯯 𐮱 ,
𐮲𐯯 𐮲 - -

<bwlnd> buland tall, high 𐮲𐰉𐰕𐮵𐯯 𐮱 ,


𐮵 - - - ,
𐮲𐰉𐰕𐮵𐯯 𐮹 ,
𐮵 - - - ,
𐮲 - -

<bwc> buz goat 𐯁𐮵𐯯 𐮱 ,


𐯁𐮵𐯯 𐮵 - - - ,
𐯁
<hlbwlc> Harburz the mountain 𐱎𐰚𐮵𐯰𐰗𐯦 𐮰 - ,
surrounding 𐮹 ,
the world 𐱎𐰚𐮵𐯰𐰗𐯦 𐮱 ,
𐮵 - - - ,
𐮹 ,
𐯁

When beth occurs more than once in character sequence, the horizontal stroke of each preceding beth is
lowered to accommodate each subsequent occurrence. This behavior results in a nested appearance in which
the horizontal stroke of the left-most beth is nested within the lowered stroke of each preciding beth.

<BB’> dar door, chapter 𐮰𐯯𐯳 𐮱 ,


𐮱 ,
𐮰𐯯𐯳 𐮰 -

6.4 gimel, daleth, yodh

The Book Pahlavi letters gimel, daleth, yodh have the same shape and joining behavior, and are therefore
unified as the single character 𐮲 - - . It is a dual-joining letter, whose regular behavior is
illustrated below.

17
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

Initial -𐯴 before all letters except those below


also before 𐮲 - - in certain cases (see below)

-𐯻 before 𐯀 ,𐮾
also before 𐮲 - - in certain cases (see below)

𐯾 before 𐮰 - ,𐮲 - - ,𐯃

-𐰄 before 𐮷

Medial -𐯵- before all letters except those below


also before 𐮲 - - in certain cases (see § 6.4.2)

-𐯼- before 𐯀 ,𐮾

𐯿 before 𐮰 - ,𐮲 - - ,𐯃

-𐰅- before 𐮷

Final 𐯷- after all letters

<b’pyl’yk> bābēlāyīg Babylonian 𐰒𐰅𐯧𐰟𐯴𐱋𐯫𐯯 𐮱 ,


𐮰 - ,
𐰒𐰅𐯧𐰟𐯴𐱋𐯫𐯯 𐯀 ,
𐮲 - - ,
𐮹 ,
𐮰 - ,
𐮲 - - ,
𐯃
<g’ywmlt> Gayōmard Gayōmard 𐱙𐰢𐰲𐰉𐯸𐯪𐯾 𐮲 - - ,
𐮰 - ,
𐱙𐰢𐰲𐰉𐯸𐯪𐯾 𐮲 - - ,
𐮽 - ,
𐮹 ,
𐯃
<gwlg> gurg wolf 𐯷𐰕𐰉𐯴 𐮲 - - ,
𐯷𐰕𐰉𐯴 𐮵 - - - ,
𐮹 ,
𐮲 - -

<d’m> dām creation 𐰻𐯧𐯾 𐮲 - - ,


𐰻𐯧𐯾 𐮰 - ,
𐮽 -

18
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

<DKYA> pāk pure, clean 𐯨𐯾𐰒𐰄 𐮲 - - ,


𐯨𐯾𐰒𐰄 𐮷 ,
𐮲 - - ,
𐮰 -

<dlygws> driyōš poor 𐯂𐰉𐯸𐯿𐰗𐯴 𐮲 - - ,

𐯂𐰉𐯸𐯿𐰗𐯴 𐮹 ,
𐮲 - - ,
𐮲 - - ,
𐮵 - - - ,
𐯂
<drwyst> drust healthy, sound 𐱙𐱇𐯻𐮵𐰉𐯴 𐮲 - - ,
𐱙𐱇𐯻𐮵𐰉𐯴 𐮵 ,
𐮵 ,
𐮲 - - ,
𐮾 ,
𐯃
<ym> yam Jam 𐰻𐯴 𐮲 - - ,
𐰻𐯴 𐮽 -

<myš> mēš sheep 𐱒𐯸𐰲 𐮽 - ,


𐱒𐯸𐰲 𐮲 - - ,
𐯂
<šyl> šēr lion 𐰙𐯸𐱕 𐯂 ,
𐰙𐯸𐱕 𐮲 - - ,
𐮹

6.4.1 Rendering adjacent sequences

When - - is followed immediately by another instance of the same letter, then its contex-
tual form is determined by that of the second - - . The cases described below are to be
considered regular rendering behaviors for adjacent sequences of this letter.

1. When the second 𐮲 is rendered as 𐯾 — as before 𐮰 - or 𐯃 or another immediately


adjacent - - — then the first 𐮲 is rendered as 𐯻. Compare gēhān and gētīy to gyāg:

<gyh’n> gēhān living beings 𐰉𐯧𐯪𐰀𐯻 𐮲 - - ,


𐰉𐯧𐯪𐰀𐯻 𐮲 - - ,
𐮰 - ,
𐮵 - - -

19
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

<gyw’g> gyāg place 𐰑𐯦𐰉𐯵𐯾 𐮲 - - ,


𐰑𐯦𐰉𐯵𐯾 𐮲 - - ,
𐮵 ,
𐮰 - ,
𐮷
<gyty> gētīy this world 𐮲𐱙𐰃𐯻 𐮲 - - ,
𐮲𐱙𐰃𐯻 𐮲 - - ,
𐯃 ,
𐮲 - -

2. When the second 𐮲 is shaped as 𐰄 — as before 𐮷 — then the first 𐮲 is rendered using its nominal
form:

<nzdyk> nazdīk near 𐰒𐰅𐯵𐰋𐮵 𐮵 - - - ,


𐰒𐰅𐯵𐰋𐮵 𐮶 ,
𐮲 - - ,
𐮲 - - ,
𐮰

6.4.2 ’Fixed-form’ gimel, daleth, yodh

In some words, in an adjacent sequence of - - the first is rendered using its nominal form,
while the second is shaped based upon the following letter. This behavior differs from the representation
of the words gēhān and gētīy, as described above. The exceptional cases require some mechanism for rep-
resenting a form of - - that does not change its shape. This behavior is morphological in
nature and cannot be predicted using conventional rules of the script. Instead of using a control charater for
modifying the regular behavior of - - , a ‘fixed’ form of the letter is proposed for encod-
ing: 𐯔 - - - . If experts agree that the representations below may be suitably
represented using a control character, then the ‘fixed-form’ letter may be withdrawn. The - -
- is to be used for representing the following cases:

1. Exception in the rendering of the sequence < - - , - - , - >.


Compare the rendering of the gēhān, from above, with spazgīh:

<gyh’n> gēhān living beings 𐰉𐯧𐯪𐰀𐯻 𐮲 - - ,


𐰉𐯧𐯪𐰀𐯻 𐮲 - - ,
𐮰 - ,
𐮵 - - -

20
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

<spzgyh> spazgīh slander 𐯨𐯿𐯵𐰋𐱍𐰿 𐮾 ,


𐯨𐯿𐯵𐰋𐱍𐰿 𐯀 ,
𐮶 ,
𐯔 - - -
,
𐮲 - - ,
𐮰 -

2. Exception in the rendering of the sequence < - - , - - >:

<’whrmzd> Ohrmazd Ahura Mazda 𐯷𐯵𐰲𐰉𐯦𐰉𐯦 𐮰 - ,


𐯷𐯵𐰲𐰉𐯦𐰉𐯦 𐮵 - - - ,
𐮰 - ,
𐮵 - - - ,
𐮽 ,
𐯔 - - -

𐮲 - -

The sequence 𐯷𐯵 - - - + - - has commonly been written


and interpreted as 𐮰, a sequence of - . The proposed model aims to provide a means for
encoding the underlying sequence of characters. Rendering the underlying text using a shaping variant
should be handled by substitutions in a font.

6.5 ‘old’ daleth

An archaic form 𐮳 of daleth occurs in historical spellings. This form is inherented from Psalter Pahlavi. It
has a distinctive shape and differs in its joining behavior from daleth. This letter is encoded separately as
.

zrēy <zlyd> sea, ocean 𐰆𐯸𐰞𐰋 𐮶 ,

𐰆𐯸𐰞𐰋 𐮹 ,
𐮲 - - ,
𐮳

6.6 he

The letter 𐮴 is used only in Aramaic heterograms. It often resembles the sequence 𐮴 or 𐰉𐰲 mem + 𐮵 nun
(or waw). But, it is encoded as a separate character because of its semantic value and its treatment as an
atomic unit.

21
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

xwēš <NPŠE> own 𐰇𐱐𐯀𐮵 𐮵 - - - ,


𐰇𐱐𐯀𐮵 𐯀 ,
𐯂 ,
𐮴
ham <HWEm> I am 𐮽𐮴𐰉𐯦 𐮰 - ,
𐮽𐮴𐰉𐯦 𐮵 - - - ,
𐮴 ,
𐮽
abāg <LWTE> together with 𐮴𐯃𐰉𐰕 𐮹 ,
𐮴𐯃𐰉𐰕 𐮵 - - - ,
𐯃 ,
𐮴
sahist <MDMHN-st> seemed 𐱙𐱅𐮵𐰈𐰷𐯸𐰲 𐮽 ,
𐮲 - - ,
𐱙𐱅𐮵𐰈𐰷𐯸𐰲 𐮽 ,
𐮴 ,
𐮵 - - - ,
𐮾 ,
𐯃
čē <MH> what, which? 𐰇𐰲 𐮽 ,
𐰇𐰲 𐮴

6.7 waw, nun, ayin, resh

These four letters have the same shape and joining behavior, and are unified as the single character 𐮵 -
- - .

urwar <’wlwl> plant 𐮹𐰉𐰕𐰉𐯦 𐮰 - ,


𐮵 - - - ,
𐮹𐰉𐰕𐰉𐯦 𐮹 ,
𐮵 - - - ,
𐮹
ādur <’twr> fire 𐮵𐮵𐱙𐯩 𐮰 - ,
𐮵𐮵𐱙𐯩 𐯃 ,
𐮵 - - - ,
𐮵 - - -

bun <bwn> beginning 𐮵𐮵𐯯 𐮱 ,


𐮵𐮵𐯯 𐮵 - - - ,
𐮵 - - -

22
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

gund <gwnd> troop, army 𐮲𐮵𐰉𐯴 𐮲 - - ,


𐮲𐮵𐰉𐯴 𐮵 - - - ,
𐮵 - - - ,
𐮲 - -

wan <wn> tree 𐮵𐮵 𐮵 - - - ,


𐮵𐮵 𐮵 - - -

wināh <wn’s> sin 𐱊𐯦𐮵𐮵 𐮵 - - - ,


𐱊𐯦𐮵𐮵 𐮵 - - - ,
𐮰 - ,
𐮿
rōn <lwn> direction 𐮵𐰉𐰕 𐮹 ,
𐮵 - - - ,
𐮵𐰉𐰕 𐮵 - - -

murw <mwlw> bird 𐰉𐰕𐰉𐰲 𐮽 ,


𐮵 - - - ,
𐰉𐰕𐰉𐰲 𐮲 ,
𐮵 - - -

nūn <K‘N> now (adv.) 𐮵𐮵𐮷 𐮷 ,


𐮵𐮵𐮷 𐮵 - - - ,
𐮵 - - -

kerbag <krpk> good deeds 𐮵𐯀𐮵𐮷 𐮷 ,


𐮵𐯀𐮵𐮷 𐮵 - - - ,
𐯀 ,
𐮵 - - -

6.8 zayin

Initial -𐰋 before all letters except those below

𐰏 before 𐮰 - ,𐮲 - - ,𐯃

Medial -𐰋 before all letters except those below

𐰏 before 𐮰 - ,𐮲 - - ,𐯃

Final 𐰍- after all letters

az <’z> goat 𐰍𐯦 𐮰 - ,
𐰍𐯦 𐮶
23
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

burzāwand <bwlz’wnd> lofty, tall 𐮲𐮵𐮵𐯧𐰐𐰕𐮵𐯲 𐮱 ,


𐮲𐮵𐮵𐯧𐰐𐰕𐮵𐯲 𐮵 - - - ,
𐮹 ,
𐮶 ,
𐮰 - ,
𐮵 - - - ,
𐮵 - - - ,
𐮲 - -

ahīy <KZY> before (adv.) 𐯷𐰋𐮷 𐮷 ,


𐯷𐰋𐮷 𐮶 ,
𐮲 - -

mizd <mzd> fee, reward 𐯷𐰌𐰲 𐮽 ,


𐯷𐰌𐰲 𐮶 ,
𐮲 - -

zarr <ZHBA> gold 𐮰𐯰𐯧𐰏 𐮶 ,


𐮰𐯰𐯧𐰏 𐮰 - ,
𐮱 ,
𐮰 -

zōd <zwt> chief priest 𐯃𐰉𐰋 𐮶 ,


𐯃𐰉𐰋 𐮵 - - - ,
𐯃
zīndag <zywndk> living 𐰒𐰄𐮵𐰉𐯸𐰏 𐮶 ,
𐰒𐰄𐮵𐰉𐯸𐰏 𐮲 - - ,
𐮵 - - - ,
𐮵 - - - ,
𐮲 - - ,
𐯃
zamānag <zm’nk> an appointed 𐮷𐰉𐯧𐰸𐰋 𐮶 ,
time 𐮷𐰉𐯧𐰸𐰋 𐮽 ,
𐮰 - ,
𐮵 - - - ,
𐮷

6.9 kaph

The letter kaph is written using 𐮷 , but it also has an archaic form 𐮸 that occurs in Aramaic heterograms
and historical spellings of words. This latter form is encoded as the separate character on account
of its distinctive shape.

24
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

Final 𐰑 after all dual-joining letters

𐰒 after 𐮲 - - , 𐮲 mem

akanārag <’kn’lk> limitless 𐰑𐰖𐯦𐮵𐰑𐯦 𐮰 - ,


𐰑𐰖𐯦𐮵𐰑𐯦 𐮷 ,
𐮵 - - - ,
𐮰 - ,
𐮹 ,
𐮷
gyāg <gyw’k> place 𐰑𐯦𐰉𐯸𐯾 𐮲 - - ,
𐰑𐯦𐰉𐯸𐯾 𐮲 - - ,
𐮵 - - - ,
𐮰 - ,
𐮷
kerbakkar <krpkkl> someone who 𐰘𐮷𐮷𐯀𐮵𐮷 𐮷 ,
does good
𐰘𐮷𐮷𐯀𐮵𐮷 𐮵 - - - ,
deeds 𐯀 ,
𐮷 ,
𐮷 ,
𐮹
nāyrīg <n’ylyk> adult woman 𐰒𐰅𐰗𐯸𐯦𐮵 𐮵 - - - ,
𐰒𐰅𐰗𐯸𐯦𐮵 𐯓 - ,
𐮲 - - ,
𐮹 ,
𐮲 - - ,
𐮷
ramag <lmk> flock 𐰒𐰷𐰕 𐮹 ,
𐰒𐰷𐰕 𐮽 ,
𐮷

6.10 ‘old’ kaph

ōh <KN> in that manner 𐮵𐰓 𐮸 ,


𐮵𐰓 𐮵 - - -

kū <’YK> that, so that 𐰔𐯸𐯦 𐯓 - ,


𐰔𐯸𐯦 𐮲 - - ,
𐮸

25
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

6.11 lamedh

Although palaeographically derived from Aramaic lamedh, the letter 𐮹 generally represents /r/ in
Book Pahlavi. The letters 𐮻 and 𐮼 represent lamedh in Aramaic heterograms. As they occur concurrently
and are preserved in historically spellings of words, they are encoded as the separate characters
and , respectively. When 𐮹 represents /l/ instead of /r/, it is marked with a small stroke
as 𐮺. This form is encoded as the letter .

artēštār <’ltyšt’l> soldier, 𐰙𐯦𐱙𐱖𐯴𐱙𐰢𐯦 𐮰 - ,


warrior 𐮹 ,
𐰙𐯦𐱙𐱖𐯴𐱙𐰢𐯦
𐯃 ,
𐮲 - - ,
𐯂 ,
𐯃 ,
𐮰 - ,
𐮹
dagr <dgl> long, 𐰝𐯽𐯻 𐮲 - - ,
long-lasting
𐰝𐯽𐯻 𐮲 - - ,
𐮹
didīgar <dtykl> second 𐰘𐰒𐰄𐱙𐰁 𐮲 - - ,
𐰘𐰒𐰄𐱙𐰁 𐯃 ,
𐮲 - - ,
𐮷 ,
𐮹
yal <yal> hero 𐰙𐯴 𐮲 - - ,
𐰙𐯴 𐮹

gōw- <YMLLWN> say, speak 𐮵𐰉𐰗𐰗𐰽𐯴 𐮲 - - ,


𐮵𐰉𐰗𐰗𐰽𐯴 𐮽 ,
𐮹 ,
𐮹 ,
𐮵 - - - ,
𐮵 - - -

framān <plm’n> order, 𐰉𐯧𐰸𐰕𐯀 𐮲 - - ,


command
𐰉𐯧𐰸𐰕𐯀 𐮲 - - ,
𐮹

Alternate forms of lamedh:

26
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

Alaksandar <’lksndl> Alexander 𐰙𐯴𐰉𐱈𐰑𐰧𐯦 𐮰 - ,


𐰙𐯴𐰉𐱈𐰑𐰧𐯦 𐮺 ,
𐮿 ,
𐮵 - - - ,
𐮲 - - ,
𐮻
pas <’HL> after, 𐰮𐯧𐯩 𐮰 - ,
afterwards 𐰮𐯧𐯩 𐮰 - ,
𐮼
𐰱𐯧𐯩 𐮰 - ,
𐮰 - ,
𐰱𐯧𐯩 𐮻
pasīh <’HLyh> rear 𐯨𐯿𐰰𐯧𐯩 𐮰 - ,
𐮰 - ,
𐯨𐯿𐰰𐯧𐯩 𐮻 ,
𐮲 - - ,
𐮰 -

ma <’L> do not 𐰮𐯦 𐮰 - ,
(neg. part.) 𐰮𐯦 𐮼
𐰱𐯦 𐮰 - ,
𐰱𐯦 𐮻
fradāg <MHL> tomorrow 𐰮𐯧𐰲 𐮽 ,
𐰮𐯧𐰲 𐮰 - ,
𐮼
𐰱𐯧𐰲 𐮽 ,
𐰱𐯧𐰲 𐮰 - ,
𐮻
ō <‘L> to (prep.) 𐮼𐮵 𐮵 - - - ,
𐮼𐮵 𐮼
𐮻𐮵 𐮵 - - - ,
𐮻𐮵 𐮻

6.12 mem, qoph

The letters mem and qoph are written using the same shape 𐮽. They have the same joining behavior. The
letter qoph rarely occurs, and only in Aramaic heterograms.

27
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

ka <’MT> when, if 𐱙𐰹𐯦 𐮰 - ,


𐱙𐰹𐯦 𐮽 ,
𐯃
āsmān <’sm’n> sky 𐰉𐯧𐰸𐱉𐯦 𐮰 - ,
𐰉𐯧𐰸𐱉𐯦 𐮿 ,
𐮽 ,
𐮰 - ,
𐮵 - - -

būm <bwm> land 𐮽𐮵𐯯 𐮱 ,


𐮽𐮵𐯯 𐮵 - - - ,
𐮽
hamēmāl <hmym’l> opponent (in 𐰙𐯧𐰸𐯸𐰸𐯦 𐮰 - ,
war and law) 𐮽 ,
𐰙𐯧𐰸𐯸𐰸𐯦 𐮲 - - ,
𐮽 ,
𐮰 - ,
𐮹
garmīh <glmyh> heat 𐯨𐯿𐰽𐰖𐯴 𐮲 - - ,
𐯨𐯿𐰽𐰖𐯴 𐮹 ,
𐮽 ,
𐮲 - - ,
𐮰 -

may <HML’> wine 𐯨𐰠𐰷𐯦 𐮲 - - ,


𐯨𐰠𐰷𐯦 𐮰 - ,
𐮽 ,
𐮷
ǰāmag <y’mk> garment, 𐰒𐰷𐯧𐯾 𐮲 - - ,
coat 𐰒𐰷𐯧𐯾 𐮰 - ,
𐮽 ,
𐮷
māh <m’h> moon 𐯨𐯪𐰴 𐮽 ,
𐯨𐯪𐰴 𐮰 - ,
𐯨𐯪𐰲 𐮰 -

mehmān <m’hm’n> guest, 𐰉𐯧𐰸𐯧𐯪𐰴 𐮽 ,


intimate 𐰉𐯧𐰸𐯧𐯪𐰴 𐮰 - ,
𐮰 - ,
𐮽 ,
𐮰 - ,
𐮵 - - -

28
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

mardōm <mltwm> people 𐮽𐮵𐱙𐰢𐰲 𐮽 ,


𐮽𐮵𐱙𐰢𐰲 𐮹 ,
𐯃 ,
𐮵 - - - ,
𐮽
mihr <mtr> Mithra; love 𐮵𐱙𐰵 𐮽 ,
𐮵𐱙𐰵 𐯃 ,
𐮵 - - -

mihr <mtr> Mithra; love 𐮵𐯃𐰲 𐮽 ,


𐮵𐯃𐰲 𐯃 ,
𐮵 - - -

wārān <MTLA> rain 𐯨𐰞𐱙𐰵 𐮽 ,


𐯨𐰞𐱙𐰴 𐯃 ,
𐮹 ,
𐮰 -

abar <QDM> on (prep.) 𐰻𐯸𐰴 𐮽 ,


𐰻𐯸𐰴 𐮲 - - ,
𐮽
abar <QDM> on (prep.) 𐰻𐯸𐰲 𐮽 ,
𐰻𐯸𐰲 𐮲 - - ,
𐮽
ānōh <TMH> there 𐰇𐰲𐯃 𐯃 ,
𐰇𐰲𐯃 𐮽 ,
𐮴

slopes southwest from the baseline.

sahist <MDMH-st> seemed 𐰈𐰷𐯸𐰲 𐮽 ,


𐮲 - - ,
𐰈𐰷𐯸𐰲 𐮽 ,
𐮴

6.13 samekh

The samekh is written using the two distinctive forms 𐮾 and 𐮿. These forms are not glyphic variants, but
may occur concurrently in a text, and also within a word. The 𐮾 is encoded as , while 𐮿 is encoded
as .

29
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

asar <’sr> eternal, end- 𐰝𐱀𐯦 𐮰 - ,


less 𐰝𐱀𐯦 𐮾 ,
𐮹
hunsand <hwnsnd> content 𐮲𐰉𐱈𐰉𐯦 𐮰 - ,
𐮲𐰉𐱈𐰉𐯦 𐮵 - - - ,
𐮿 ,
𐮵 - - - ,
𐮲 - -

rāh <l’s> road, path 𐱊𐯧𐰞 𐮹 ,


𐱊𐯧𐰞 𐮰 - ,
𐮿
nask <nsk> book of the 𐰒𐰿𐮵 𐮵 - - - ,
Avesta 𐰒𐰿𐮵 𐮾 ,
𐮷
saxt <sht> hard, firm 𐱙𐯮𐱃 𐮾 ,
𐱙𐯮𐱃 𐮰 - ,
𐯃
spāsdār <sp’sd’l> grateful 𐰙𐯧𐯿𐱉𐯦𐱍𐰿 𐮾 ,
𐰙𐯧𐯿𐱉𐯦𐱍𐰿 𐯀 ,
𐮰 - ,
𐮿 ,
𐮲 - - ,
𐮰 - ,
𐮹
sālār <srd’l> leader, chief, 𐰙𐯧𐯾𐰉𐱈 𐮿 ,
governor
𐰙𐯧𐯾𐰉𐱈 𐮵 - - - ,
𐮲 - - ,
𐮹
pāygōs <p’tkws> district 𐮿𐮵𐮷𐱋𐯫𐯀 𐯀 ,
𐮿𐮵𐮷𐱋𐯫𐯀 𐮰 - ,
𐯃 ,
𐮷 ,
𐮵 - - - ,
𐮿

30
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

6.14 pe

ābādīh <ʾptyh> wealth, 𐯨𐯾𐯃𐱋𐯫 𐮲 - - ,


prosperity 𐯨𐯾𐯃𐱋𐯫 𐮵 - - - ,
𐮾 ,
𐯀 ,
𐮵 - - - ,
𐮲 - -

gōspand <gwspnd> domestic 𐮲𐮵𐱍𐰿𐰉𐯴 𐮲 - - ,


animal 𐮲𐮵𐱍𐰿𐰉𐯴 𐮵 - - - ,
𐮾 ,
𐯀 ,
𐮵 - - - ,
𐮲 - -

xōb <hwp> good 𐯀𐰉𐯦 𐮰 - ,


𐯀𐰉𐯦 𐮵 - - - ,
𐯀
pōlābd <pwl’pt> steel 𐯃𐱋𐯬𐰞𐮵𐯀 𐯀 ,
𐯃𐱋𐯬𐰞𐮵𐯀 𐮵 - - - ,
𐮹 ,
𐮰 - ,
𐯀 ,
𐯃
rēbāh <lyp’s> rhubarb 𐱊𐯦𐱋𐯼𐰕 𐮹 ,
𐱊𐯦𐱋𐯼𐰕 𐮲 - - ,
𐯀 ,
𐮰 - ,
𐮾
rabihwintar <lpytpyntl> southern 𐮹𐯃𐰉𐯴𐯀𐱙𐰁𐱌𐰕 𐮹 ,
𐮹𐯃𐰉𐯴𐯀𐱙𐰁𐱌𐰕 𐯀 ,
𐮲 - - ,
𐯃 ,
𐯀 ,
𐮲 - - ,
𐮵 - - - ,
𐯃 ,
𐮹
paydāg <pytʾk> apparent, 𐰑𐯦𐱙𐰁𐯀 𐯀 ,
evident 𐰑𐯦𐱙𐰁𐯀 𐮲 - - ,
𐯃 ,
𐮰 - ,
𐮷

31
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

6.15 sadhe

ačārag <’c’lk> helpless 𐮷𐰉𐯦𐱋𐯫 𐮰 - ,


𐮷𐰉𐯦𐱋𐯫 𐮵 - - - ,
𐮲 - - ,
𐮹 ,
𐯁
handarz <hndlc> advice 𐱋𐰛𐯴𐰉𐯦 𐮰 - ,
𐱋𐰛𐯴𐰉𐯦 𐮵 - - - ,
𐮲 - - ,
𐮹 ,
𐯁
nēmrōz <nymlwc> noon 𐯁𐰉𐰗𐰽𐯴𐮵 𐮵 - - - ,
𐯁𐰉𐰗𐰽𐯴𐮵 𐮲 - - ,
𐮽 ,
𐮹 ,
𐮵 - - - ,
𐯁
pērōzgar <pylwcgl> victorious 𐰙𐯴𐯁𐰉𐰗𐯴𐯀 𐯀 ,
𐮲 - - ,
𐰙𐯴𐯁𐰉𐰗𐯴𐯀 𐮹 ,
𐮵 - - - ,
𐯁 ,
𐮲 - - ,
𐮹 ,

čarb <clp> amenable 𐱌𐰕𐯁 𐯁 ,

𐱌𐰕𐯁 𐮹 ,
𐯀
sang <CCA> stone 𐮰𐯁𐯁 𐯁 ,
𐮰𐯁𐯁 𐯁 ,
𐮰 -

32
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

6.16 shin

ōšībām <’wšyb’m> dawn 𐰺𐯦𐯰𐯸𐱕𐰉𐯦 𐮰 - ,


𐰺𐯦𐯰𐯸𐱕𐰉𐯦 𐮵 - - - ,
𐯂 ,
𐮲 - - ,
𐮱 ,
𐮰 - ,
𐮽
āštīh <’štyh> peace 𐯨𐯾𐱙𐱘𐯦 𐮰 - ,
𐯨𐯾𐱙𐱘𐯦 𐯂 ,
𐯃 ,
𐮲 - - ,
𐮰 -

xwarišn <hwlšn> food 𐰉𐱑𐰕𐰉𐯦 𐮰 - ,


𐰉𐱑𐰕𐰉𐯦 𐮵 - - - ,
𐮹 ,
𐯂 ,
𐮵 - - -

Warkaš <wlkš> the world 𐯂𐰑𐰕𐮵 𐮵 - - - ,


ocean 𐯂𐰑𐰕𐮵 𐮹 ,
𐮷 ,
𐯂
mēš <myš> sheep 𐱒𐯸𐰲 𐮽 - ,
𐮲 - - ,
𐯂
šāhān <š’h’n> kings 𐰉𐯧𐯪𐯪𐱕 𐯂 ,
𐰉𐯧𐯪𐯪𐱕 𐮰 - ,
𐮰 - ,
𐮰 - ,
𐮵 - - - ,

dēwān <ŠDYA’n> bad gods 𐰉𐯧𐯪𐯿𐯸𐱕 𐯂 ,


𐰉𐯧𐯪𐯿𐯸𐱕 𐮲 - - ,
𐮲 - - ,
𐮰 - ,
𐮰 - ,
𐮵 - - - ,

33
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

weh <ŠPYL> good, better 𐰙𐯴𐱌𐱐 𐯂 ,


𐰙𐯴𐱌𐱐 𐯀 ,
𐮲 - - ,
𐮹
Kašmīr <kšmyl> Kashmir 𐰙𐯸𐰸𐱐𐮷 𐮷 ,
𐰙𐯸𐰸𐱐𐮷 𐯂 ,
𐮽 ,
𐮲 - - ,
𐮹
tuxšāg <twhš’k> diligent 𐰑𐯧𐱖𐯦𐮵𐯃 𐯃 ,
𐰑𐯧𐱖𐯦𐮵𐯃 𐮵 - - - ,
𐮰 - ,
𐯂 ,
𐮰 - ,
𐮷

6.17 taw

dād <d’t> law 𐱙𐯮𐯾 𐮲 - - ,


𐱙𐯮𐯾 𐮰 - ,
𐯃
zamestān <dmst’n> winter 𐰉𐯦𐱋𐱀𐰶𐯴 𐮲 - - ,
𐰉𐯦𐱋𐱀𐰶𐯴 𐮽 ,
𐮾 ,
𐯃 ,
𐮰 - ,
𐮵 - - -

zōd <zwt> chief priest 𐯃𐰉𐰋 𐮶 ,


𐯃𐰉𐰋 𐮵 - - - ,
𐯃 ,

Zarduxšt <zltwhšt> Zarathustra 𐱙𐱘𐯦𐮵𐱙𐰢𐰋 𐮶 ,


𐱙𐱘𐯦𐮵𐱙𐰢𐰋 𐮹 ,
𐯃 ,
𐮵 - - - ,
𐮰 - ,
𐯂 ,
𐯃

34
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

pahikār <ptk’l> strife 𐰙𐯦𐮷𐯃𐯀 𐯀 ,


𐰙𐯦𐮷𐯃𐯀 𐯃 ,
𐮷 ,
𐮰 - ,
𐮹
tan <tn> body 𐮵𐯃 𐯃 ,
𐮵𐯃 𐮵 - - -

6.18 x1, x2

andar <BYN> in (prep.) 𐯄 𐯄 1


𐯄

35
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

7 Description of numbers

7.1 Primary units

1 ēk one 𐯕 𐯕
2 dō two 𐯖 𐯖
3 sē three 𐯗 𐯗
4 čahār four 𐯘 𐯘
5 panǰ five 𐯖𐱠 𐯗 ,
𐯖
6 šaš six 𐯗𐱠 𐯗 ,
𐯗
7 haft seven 𐯗𐱡 𐯗 ,
𐯗
8 hašt eight 𐯘𐱡 𐯗 ,
𐯗
9 nō nine 𐯗𐱠𐱠 𐯗 ,
𐯗 ,
𐯗

36
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

8 Character Properties

8.1 Core data: UnicodeData.txt

10BB0;BOOK PAHLAVI LETTER ALEPH-HETH;Lo;0;R;;;;;N;;;;;


10BB1;BOOK PAHLAVI LETTER BETH;Lo;0;R;;;;;N;;;;;
10BB2;BOOK PAHLAVI LETTER GIMEL-DALETH-YODH;Lo;0;R;;;;;N;;;;;
10BB3;BOOK PAHLAVI LETTER OLD DALETH;Lo;0;R;;;;;N;;;;;
10BB4;BOOK PAHLAVI LETTER HE;Lo;0;R;;;;;N;;;;;
10BB5;BOOK PAHLAVI LETTER WAW-NUN-AYIN-RESH;Lo;0;R;;;;;N;;;;;
10BB6;BOOK PAHLAVI LETTER ZAYIN;Lo;0;R;;;;;N;;;;;
10BB7;BOOK PAHLAVI LETTER KAPH;Lo;0;R;;;;;N;;;;;
10BB8;BOOK PAHLAVI LETTER OLD KAPH;Lo;0;R;;;;;N;;;;;
10BB9;BOOK PAHLAVI LETTER LAMEDH;Lo;0;R;;;;;N;;;;;
10BBA;BOOK PAHLAVI LETTER STROKED LAMEDH;Lo;0;R;;;;;N;;;;;
10BBB;BOOK PAHLAVI LETTER HOOKED LAMEDH;Lo;0;R;;;;;N;;;;;
10BBC;BOOK PAHLAVI LETTER OLD LAMEDH;Lo;0;R;;;;;N;;;;;
10BBD;BOOK PAHLAVI LETTER MEM-QOPH;Lo;0;R;;;;;N;;;;;
10BBE;BOOK PAHLAVI LETTER SAMEKH;Lo;0;R;;;;;N;;;;;
10BBF;BOOK PAHLAVI LETTER ALTERNATE SAMEKH;Lo;0;R;;;;;N;;;;;
10BC0;BOOK PAHLAVI LETTER PE;Lo;0;R;;;;;N;;;;;
10BC1;BOOK PAHLAVI LETTER SADHE;Lo;0;R;;;;;N;;;;;
10BC2;BOOK PAHLAVI LETTER SHIN;Lo;0;R;;;;;N;;;;;
10BC3;BOOK PAHLAVI LETTER TAW;Lo;0;R;;;;;N;;;;;
10BC4;BOOK PAHLAVI LETTER LIGATURE X1;Lo;0;R;;;;;N;;;;;
10BC5;BOOK PAHLAVI LETTER LIGATURE X2;Lo;0;R;;;;;N;;;;;
10BC6;BOOK PAHLAVI LETTER LIGATURE TURNED AHRIMAN;Lo;0;R;;;;;N;;;;;
10BC7;BOOK PAHLAVI COMBINING DOT ABOVE;Mn;230;NSM;;;;;N;;;;;
10BC8;BOOK PAHLAVI COMBINING DOT BELOW;Mn;220;NSM;;;;;N;;;;;
10BC9;BOOK PAHLAVI COMBINING TWO DOTS ABOVE;Mn;230;NSM;;;;;N;;;;;
10BCA;BOOK PAHLAVI COMBINING TWO DOTS BELOW;Mn;220;NSM;;;;;N;;;;;
10BCB;BOOK PAHLAVI COMBINING THREE DOTS ABOVE;Mn;230;NSM;;;;;N;;;;;
10BCC;BOOK PAHLAVI COMBINING THREE DOTS BELOW;Mn;220;NSM;;;;;N;;;;;
10BCD;BOOK PAHLAVI COMBINING HAT ABOVE;Mn;230;NSM;;;;;N;;;;;
10BCE;BOOK PAHLAVI COMBINING HAT BELOW;Mn;220;NSM;;;;;N;;;;;
10BCF;BOOK PAHLAVI END OF WORD MARK;Po;0;AL;;;;;N;;;;;
10BD0;BOOK PAHLAVI PUNCTUATION THREE DOTS;Po;0;AL;;;;;N;;;;;
10BD1;BOOK PAHLAVI PUNCTUATION THREE CIRCLES;Po;0;AL;;;;;N;;;;;
10BD2;BOOK PAHLAVI LETTER FIXED-FORM ALEPH-HETH;Lo;0;R;;;;;N;;;;;
10BD3;BOOK PAHLAVI LETTER FIXED-FORM GIMEL-DALETH-YODH;Lo;0;R;;;;;N;;;;;
10BD4;BOOK PAHLAVI NUMBER ONE;No;0;R;;;;1;N;;;;;
10BD5;BOOK PAHLAVI NUMBER TWO;No;0;R;;;;2;N;;;;;
10BD6;BOOK PAHLAVI NUMBER THREE;No;0;R;;;;3;N;;;;;
10BD7;BOOK PAHLAVI NUMBER FOUR;No;0;R;;;;4;N;;;;;
10BD8;BOOK PAHLAVI NUMBER ONE HUNDRED;No;0;R;;;;100;N;;;;;

8.2 Linebreak data: LineBreak.txt

10BB0..10BC6;AL # Lo [23] BOOK PAHLAVI LETTER ALEPH..


BOOK PAHLAVI LIGATURE TURNED AHRIMAN
10BC7..10BCE;AL # Cm [8] BOOK PAHLAVI COMBINING DOT ABOVE..
BOOK PAHLAVI COMBINING HAT BELOW
10BCF..10BD1;AL # Po [3] BOOK PAHLAVI END OF WORD MARK..
BOOK PAHLAVI PUNCTUATION THREE CIRCLES
10BD2..10BD3;AL # Lo [2] BOOK PAHLAVI LETTER FIXED-FORM ALEPH-HETH..
BOOK PAHLAVI LETTER FIXED-FORM GIMEL-DALETH-YODH
10BD4..10BD8;AL # No [5] BOOK PAHLAVI NUMBER ONE..
BOOK PAHLAVI NUMBER ONE HUNDRED

37
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

9 Acknowledgments

I would like to thank Roozbeh Pournader for sharing his materials on the Book Pahlavi script and for moti-
vating me to carry on the effort to develop an encoding for the script.

This project has been made possible in part by funding from the Adopt-A-Character program of the Unicode
Consortium, and has been supervised by Deborah Anderson and Rick McGowan.

It was also made possible in part by a grant from the U.S. National Endowment for the Humanities, which
funded the Universal Scripts Project (part of the Script Encoding Initiative at UC Berkeley). Any views,
findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of
the National Endowment of the Humanities.

38
10BB0 Book Pahlavi 10BDF

10BB 10BC 10BD

0 𐮰 𐯀 𐯐
10BB0 10BC0 10BD0

1 𐮱 𐯁 𐯑
10BB1 10BC1 10BD1

2 𐮲 𐯂 𐯒
10BB2 10BC2 10BD2

3 𐮳 𐯃 𐯓
10BB3 10BC3 10BD3

4 𐮴 𐯄 𐯔
10BB4 10BC4 10BD4

5 𐮵 𐯅 𐯕
10BB5 10BC5 10BD5

6 𐮶 𐯆 𐯖
10BB6 10BC6 10BD6

7 𐮷 𐯇 𐯗
10BB7 10BC7 10BD7

8 𐮸 $𐯈 𐯘
10BB8 10BC8 10BD8

9 𐮹 $𐯉 𐯙
10BB9 10BC9 10BD9

A 𐮺 $𐯊
10BBA 10BCA

B 𐮻 $𐯋
10BBB 10BCB

C 𐮼 $𐯌
10BBC 10BCC

D 𐮽 $𐯍
10BBD 10BCD

E 𐮾 $𐯎
10BBE 10BCE

F 𐮿 $𐯏
10BBF 10BCF

Printed using UniBook™


(http://www.unicode.org/unibook/)
Preliminary proposal to encode Book Pahlavi in Unicode Anshuman Pandey

Book Psalter Inscriptional Inscriptional Imperial


Pahlavi Pahlvai Pahlavi Parthian Aramaic

aleph 𐮰 𐮀 𐭠 𐭀 𐡀
beth 𐮱 𐮁 𐭡 𐭁 𐡁
gimel 𐮲 𐮂 𐭢 𐭂 𐡂
daleth (𐮲), 𐮳 𐮃 𐭣 𐭃 𐡃
he 𐮴 𐮄 𐭤 𐭄 𐡄
waw 𐮵 𐮅 𐭥 𐭅 𐡅
zayin 𐮶 𐮆 𐭦 𐭆 𐡆
heth (𐮰 ) 𐮇 𐭧 𐭇 𐡇
teth — — 𐭨 𐭈 𐡈
yodh (𐮲 ) 𐮈 𐭩 𐭉 𐡉
kaph 𐮷, 𐮸 𐮉 𐭪 𐭊 𐡊
lamedh 𐮹, 𐮺 , 𐮻 , 𐮼 𐮊 𐭫 𐭋 𐡋
mem 𐮽 𐮋 𐭬 𐭌 𐡌
nun (𐮵 ) 𐮌 𐭭 𐭍 𐡍
samekh 𐮾, 𐮿 𐮍 𐭮 𐭎 𐡎
ayin (𐮵 ) — (𐭥) 𐭏 𐡏
pe 𐯀 𐮎 𐭯 𐭐 𐡐
sadhe 𐯁 𐮏 𐭰 𐭑 𐡑
qoph (𐮽) — (𐭬) 𐭒 𐡒
resh (𐮵 ) (𐭥) 𐭓 𐡓
shin 𐯂 𐮐 𐭱 𐭔 𐡔
taw 𐯃 𐮑 𐭲 𐭕 𐡕

Table 1: Comparison of the Pahlavi scripts with Parthian and Aramaic. Parenthesis indicate that a
letter has been unified with another in the respective encoding. In Inscriptional Pahlavi, ayin and
resh are unified with waw, and qoph with mem.

40

You might also like