Jump to content

Sindhi transliteration

From Wikipedia, the free encyclopedia
(Redirected from Sindhi Transliteration)

Sindhi is a language broadly spoken by the people of the historical Sindh region in the Indian subcontinent. Modern Sindhi is written in an extended Perso-Arabic script in Sindh province of Pakistan[1] and (formally) in extended-Devanagari by Sindhis in partitioned India.[2] Historically, Sindhi was written in various forms of Landa scripts and various other Indic scripts.[3]

Sindhi Transliteration is essential to convert between Arabic and Devanagari so that speakers of both the countries can read the text of each other.[4] In modern day, Sindhi script colloquially just refers to the Perso-Arabic script since majority of Sindhis are from Pakistan. It is also important to note that the Sindhi script is not same as the Urdu-Shahmukhi script,[5] hence one cannot use script conversions like Hindi-Urdu Transliteration.

Technically, a direct one-to-one mapping or rule-based script conversion is not possible between Pakistani and Indian Sindhi, majorly since Devanagari is an abugida script and Arabic-Sindhi is an abjad script, and also other constraints like multiple similar characters from Perso-Arabic which map onto a single character in Devanagari.[6] Hence it is preferred to use dictionary-based or machine learning-based transliteration between the Sindhi scripts.[7] For colloquial usage in the digital space where writing Sindhi in Latin script is prevalent, Romanisation of Sindhi is used.[8]

In addition to Sindhi, there have been attempts to design Indo-Pakistani transliteration systems for digraphic languages like Punjabi (written in Gurmukhi in East Punjab and Shahmukhi in West Punjab), Saraiki (written in an extended-Shahmukhi script in Saraikistan and unofficially in Sindhi-Devanagari script in India) and Kashmiri (written in extended Perso-Arabic by Kashmiri Muslims and extended-Devanagari by Kashmiri Hindus).[9][10] [11]

Consonants

[edit]

The following table provides an approximate one-to-one mapping for modern Sindhi consonants,[12] especially for computational purposes (lossless script conversion). Note that this direct script conversion will not yield correct spellings, but rather a readable text for both the readers.[13]

Sindhi Consonants
Sindhi Abjad Urdu-Shahmukhi[14] Roman Devanagari[15]
ڪ ک k
ک کھ kh
ق ق q क़
خ خ k͟h ख़
گ گ g
غ غ g͟h ग़
ڳ ڳ
گھ گھ gh
چ چ c
ڇ چھ ch
ج ج j
جھ جھ jh
ڄ ڄ
ز ز z ज़
ذ ذ z ज़
ض ض z ज़
ظ ظ z ज़
ژ ژ zh झ़
ٽ ٹ
ٺ ٹھ ṭh
ڏ ݙ
ڊ ڈ
ڍ ڈھ ḍh
ڙ ڑ ड़
ڙھ ڑھ ṛh ढ़
ت ت t
ٿ تھ th
ط ط t
د د d
ڌ دھ dh
ن ن n
ڻ ݨ
ڱ ن٘
ڃ ݩ ñ
پ پ p
ڦ پھ ph
ف ف f फ़
ب ب b
ٻ ٻ ॿ
ڀ بھ bh
م م m
ي ی y
ر ر r
ل ل l
ݪ لؕ
و و v
ش ش sh
ݜ سؕ ṣh
س س s
ص ص s
ث ث s
ه ہ h
ح ح h
ۃ ۃ h
ع ع ʿ ʿ

Single-letter word ligatures

[edit]
Sindhi Urdu Roman Devanagari
۽ اَیں a͠i ऐं
۾ میں mẽ में

Numerals

[edit]
Usage Numeral System
Pakistan Arabic numerals ۰ ۱ ۲ ۳ ۴ ۵ ٦ ۷ ۸ ۹
International Hindu-Arabic 0 1 2 3 4 5 6 7 8 9
India Modern Devanagari

Punctuations and symbols

[edit]
Script Period Question Mark Comma Semi-colon Slash Percent End of verse
Perso-Arabic ۔ ؟ ، ؛ ؍ ٪ ۝
Modern Devanagari ? , ; / %

References

[edit]
  1. ^ "Sindhi gains national status". The Express Tribune. 2016-10-03. Retrieved 2021-07-02.
  2. ^ "Languages Included in the Eighth Schedule of the Indian Constution [sic]". Department of Official Language, Ministry of Home Affairs. Retrieved 2018-04-09.
  3. ^ Lakhani, Rakesh (2013-06-06). "Forgotten Sindhi Script – Waranki". Indigenous Sindhis. Retrieved 2021-07-02.
  4. ^ "Transcending barriers: Software to break down the wall within the Sindhi language". The Express Tribune. 2014-09-18. Retrieved 2021-07-02.
  5. ^ "The Imposition Of Urdu". NAWAIWAQT GROUP OF NEWSPAPERS. September 10, 2015. Retrieved 12 September 2015.
  6. ^ Leghari, Mehwish; U Rahman, Mutee (2015-10-30). "Towards Transliteration between Sindhi Scripts Using Roman Script". Linguistics and Literature Review. 1 (2): 95–104. doi:10.32350/llr.12.03. SSRN 3820433.
  7. ^ Lehal, Gurpreet Singh; Saini, Tejinder Singh (December 2014). "Sangam: A Perso-Arabic to Indic Script Machine Transliteration Model": 232–239. {{cite journal}}: Cite journal requires |journal= (help)
  8. ^ Sodhar, Irum Naz; Jalbani, Akhtar Hussain; Channa, Muhammad Ibrahim; Hakro, Dil Nawaz (2021-04-01). "Romanized Sindhi Rules for Text Communication". Mehran University Research Journal of Engineering and Technology. 40 (2): 298–304. Bibcode:2021MURJE..40..298S. doi:10.22581/muet1982.2102.04. ISSN 2413-7219.
  9. ^ "Perso-Arabic To Indic Script Transliteration". sangam.learnpunjabi.org. Retrieved 2021-04-07.
  10. ^ "Saraiki - Devanagari Machine Transliteration System - SDMTS". www.sanlp.org. Retrieved 2021-08-09.
  11. ^ Lawaye, Aadil; Kak, Aadil; Mehdi, Nali (January 2010). "Building a Cross Script Kashmiri Converter: Issues and Solutions". Proceedings of Oriental COCOSDA.
  12. ^ NC, Gokul (2021-05-07), GokulNC/Indic-PersoArabic-Script-Converter, retrieved 2021-05-28
  13. ^ Pedersen, Thomas T. "Sindhi: Transliteration of Non-Roman Scripts" (PDF). Transliteration of Non-Roman Scripts.
  14. ^ Carefully note the differences. List also includes 4 implosive consonants from Saraiki alphabet (ڳ, ڄ, ݙ, ٻ)
  15. ^ All the consonants with nuqta sign (dots below) denote adaptation for PersoArabic consonants