Script
- class Script
The PangoScript
enumeration identifies different writing
systems.
The values correspond to the names as defined in the Unicode standard. See Unicode Standard Annex 24: Script names
Note that this enumeration is deprecated and will not be updated to include values
in newer versions of the Unicode standard. Applications should use the
UnicodeScript
enumeration instead,
whose values are interchangeable with PangoScript
.
Methods
- class Script
- for_unichar(ch: str) Script
Looks up the script for a particular character.
The script of a character is defined by Unicode Standard Annex 24: Script names.
No check is made for
ch
being a valid Unicode character; if you pass in invalid character, the result is undefined.Note that while the return type of this function is declared as
PangoScript
, as of Pango 1.18, this function simply returns the return value ofunichar_get_script
. Callers must be prepared to handle unknown values.Added in version 1.4.
Deprecated since version 1.44.: Use
unichar_get_script()
- Parameters:
ch – a Unicode character
- get_sample_language(script: Script) Language | None
Finds a language tag that is reasonably representative of
script
.The language will usually be the most widely spoken or used language written in that script: for instance, the sample language for
CYRILLIC
is ru (Russian), the sample language forARABIC
is ar.For some scripts, no sample language will be returned because there is no language that is sufficiently representative. The best example of this is
HAN
, where various different variants of written Chinese, Japanese, and Korean all use significantly different sets of Han characters and forms of shared characters. No sample language can be provided for many historical scripts as well.As of 1.18, this function checks the environment variables
PANGO_LANGUAGE
andLANGUAGE
(checked in that order) first. If one of them is set, it is parsed as a list of language tags separated by colons or other separators. This function will return the first language in the parsed list that Pango believes may usescript
for writing. This last predicate is tested usingincludes_script
. This can be used to control Pango’s font selection for non-primary languages. For example, aPANGO_LANGUAGE
enviroment variable set to “en:fa” makes Pango choose fonts suitable for Persian (fa) instead of Arabic (ar) when a segment of Arabic text is found in an otherwise non-Arabic text. The same trick can be used to choose a default language forHAN
when setting context language is not feasible.Added in version 1.4.
- Parameters:
script – a
PangoScript
Fields
- class Script
- AHOM
Ahom. Since: 1.40
- ANATOLIAN_HIEROGLYPHS
Anatolian Hieroglyphs. Since: 1.40
- ARABIC
Arabic
- ARMENIAN
Armenian
- BALINESE
Balinese. Since 1.14
- BASSA_VAH
Bassa. Since: 1.40
- BATAK
Batak. Since 1.32
- BENGALI
Bengali
- BOPOMOFO
Bopomofo
- BRAHMI
Brahmi. Since 1.32
- BRAILLE
Braille
- BUGINESE
Buginese. Since 1.10
- BUHID
Buhid
- CANADIAN_ABORIGINAL
Canadian Aboriginal
- CARIAN
Carian. Since 1.20.1
- CAUCASIAN_ALBANIAN
Caucasian Albanian. Since: 1.40
- CHAKMA
Chakma. Since: 1.32
- CHAM
Cham. Since 1.20.1
- CHEROKEE
Cherokee
- COMMON
A character used by multiple different scripts
- COPTIC
Coptic
- CUNEIFORM
Cuneiform. Since 1.14
- CYPRIOT
Cypriot
- CYRILLIC
Cyrillic
- DESERET
Deseret
- DEVANAGARI
Devanagari
- DUPLOYAN
Duployan. Since: 1.40
- ELBASAN
Elbasan. Since: 1.40
- ETHIOPIC
Ethiopic
- GEORGIAN
Georgian
- GLAGOLITIC
Glagolitic. Since 1.10
- GOTHIC
Gothic
- GRANTHA
Grantha. Since: 1.40
- GREEK
Greek
- GUJARATI
Gujarati
- GURMUKHI
Gurmukhi
- HAN
Han
- HANGUL
Hangul
- HANUNOO
Hanunoo
- HATRAN
Hatran. Since: 1.40
- HEBREW
Hebrew
- HIRAGANA
Hiragana
- INHERITED
A mark glyph that takes its script from the base glyph to which it is attached
- INVALID_CODE
A value never returned from pango_script_for_unichar()
- KANNADA
Kannada
- KATAKANA
Katakana
- KAYAH_LI
Kayah Li. Since 1.20.1
- KHAROSHTHI
Kharoshthi. Since 1.10
- KHMER
Khmer
- KHOJKI
Kjohki. Since: 1.40
- KHUDAWADI
Khudawadi, Sindhi. Since: 1.40
- LAO
Lao
- LATIN
Latin
- LEPCHA
Lepcha. Since 1.20.1
- LIMBU
Limbu
- LINEAR_A
Linear A. Since: 1.40
- LINEAR_B
Linear B
- LYCIAN
Lycian. Since 1.20.1
- LYDIAN
Lydian. Since 1.20.1
- MAHAJANI
Mahajani. Since: 1.40
- MALAYALAM
Malayalam
- MANDAIC
Mandaic. Since 1.32
- MANICHAEAN
Manichaean. Since: 1.40
- MENDE_KIKAKUI
Mende Kikakui. Since: 1.40
- MEROITIC_CURSIVE
Meroitic Cursive. Since: 1.32
- MEROITIC_HIEROGLYPHS
Meroitic Hieroglyphs. Since: 1.32
- MIAO
Miao. Since: 1.32
- MODI
Modi. Since: 1.40
- MONGOLIAN
Mongolian
- MRO
Return a type’s method resolution order.
- MULTANI
Multani. Since: 1.40
- MYANMAR
Myanmar
- NABATAEAN
Nabataean. Since: 1.40
- NEW_TAI_LUE
New Tai Lue. Since 1.10
- NKO
N’Ko. Since 1.14
- OGHAM
Ogham
- OLD_HUNGARIAN
Old Hungarian. Since: 1.40
- OLD_ITALIC
Old Italic
- OLD_NORTH_ARABIAN
Old North Arabian. Since: 1.40
- OLD_PERMIC
Old Permic. Since: 1.40
- OLD_PERSIAN
Old Persian. Since 1.10
- OL_CHIKI
Ol Chiki. Since 1.20.1
- ORIYA
Oriya
- OSMANYA
Osmanya
- PAHAWH_HMONG
Pahawh Hmong. Since: 1.40
- PALMYRENE
Palmyrene. Since: 1.40
- PAU_CIN_HAU
Pau Cin Hau. Since: 1.40
- PHAGS_PA
Phags-pa. Since 1.14
- PHOENICIAN
Phoenician. Since 1.14
- PSALTER_PAHLAVI
Psalter Pahlavi. Since: 1.40
- REJANG
Rejang. Since 1.20.1
- RUNIC
Runic
- SAURASHTRA
Saurashtra. Since 1.20.1
- SHARADA
Sharada. Since: 1.32
- SHAVIAN
Shavian
- SIDDHAM
Siddham. Since: 1.40
- SIGNWRITING
Signwriting. Since: 1.40
- SINHALA
Sinhala
- SORA_SOMPENG
Sora Sompeng. Since: 1.32
- SUNDANESE
Sundanese. Since 1.20.1
- SYLOTI_NAGRI
Syloti Nagri. Since 1.10
- SYRIAC
Syriac
- TAGALOG
Tagalog
- TAGBANWA
Tagbanwa
- TAI_LE
Tai Le
- TAKRI
Takri. Since: 1.32
- TAMIL
Tamil
- TELUGU
Telugu
- THAANA
Thaana
- THAI
Thai
- TIBETAN
Tibetan
- TIFINAGH
Tifinagh. Since 1.10
- TIRHUTA
Tirhuta. Since: 1.40
- UGARITIC
Ugaritic
- UNKNOWN
An unassigned code point. Since 1.14
- VAI
Vai. Since 1.20.1
- WARANG_CITI
Warang Citi. Since: 1.40
- YI
Yi