Sindhi transliteration
Sindhi is a language broadly spoken by the people of the historical Sindh region in the Indian subcontinent. Modern Sindhi is written in an extended Perso-Arabic script in Sindh province of Pakistan[1] and (formally) in extended-Devanagari by Sindhis in partitioned India.[2] Historically, Sindhi was written in various forms of Landa scripts and various other Indic scripts.[3]
Sindhi Transliteration is essential to convert between Arabic and Devanagari so that speakers of both the countries can read the text of each other.[4] In modern day, Sindhi script colloquially just refers to the Perso-Arabic script since majority of Sindhis are from Pakistan. It is also important to note that the Sindhi script is not same as the Urdu-Shahmukhi script,[5] hence one cannot use script conversions like Hindi-Urdu Transliteration.
Technically, a direct one-to-one mapping or rule-based script conversion is not possible between Pakistani and Indian Sindhi, majorly since Devanagari is an abugida script and Arabic-Sindhi is an abjad script, and also other constraints like multiple similar characters from Perso-Arabic which map onto a single character in Devanagari.[6] Hence it is preferred to use dictionary-based or machine learning-based transliteration between the Sindhi scripts.[7] For colloquial usage in the digital space where writing Sindhi in Latin script is prevalent, Romanisation of Sindhi is used.[8]
In addition to Sindhi, there have been attempts to design Indo-Pakistani transliteration systems for digraphic languages like Punjabi (written in Gurmukhi in East Punjab and Shahmukhi in West Punjab), Saraiki (written in an extended-Shahmukhi script in Saraikistan and unofficially in Sindhi-Devanagari script in India) and Kashmiri (written in extended Perso-Arabic by Kashmiri Muslims and extended-Devanagari by Kashmiri Hindus).[9][10] [11]
Consonants
[edit]The following table provides an approximate one-to-one mapping for modern Sindhi consonants,[12] especially for computational purposes (lossless script conversion). Note that this direct script conversion will not yield correct spellings, but rather a readable text for both the readers.[13]
Sindhi Abjad | Urdu-Shahmukhi[14] | Roman | Devanagari[15] |
---|---|---|---|
ڪ | ک | k | क |
ک | کھ | kh | ख |
ق | ق | q | क़ |
خ | خ | k͟h | ख़ |
گ | گ | g | ग |
غ | غ | g͟h | ग़ |
ڳ | ڳ | g̤ | ॻ |
گھ | گھ | gh | घ |
چ | چ | c | च |
ڇ | چھ | ch | छ |
ج | ج | j | ज |
جھ | جھ | jh | झ |
ڄ | ڄ | j̈ | ॼ |
ز | ز | z | ज़ |
ذ | ذ | z | ज़ |
ض | ض | z | ज़ |
ظ | ظ | z | ज़ |
ژ | ژ | zh | झ़ |
ٽ | ٹ | ṭ | ट |
ٺ | ٹھ | ṭh | ठ |
ڏ | ݙ | d̤ | ॾ |
ڊ | ڈ | ḍ | ड |
ڍ | ڈھ | ḍh | ढ |
ڙ | ڑ | ṛ | ड़ |
ڙھ | ڑھ | ṛh | ढ़ |
ت | ت | t | त |
ٿ | تھ | th | थ |
ط | ط | t | त |
د | د | d | द |
ڌ | دھ | dh | ध |
ن | ن | n | न |
ڻ | ݨ | ṇ | ण |
ڱ | ن٘ | ṅ | ङ |
ڃ | ݩ | ñ | ञ |
پ | پ | p | प |
ڦ | پھ | ph | फ |
ف | ف | f | फ़ |
ب | ب | b | ब |
ٻ | ٻ | ḇ | ॿ |
ڀ | بھ | bh | भ |
م | م | m | म |
ي | ی | y | य |
ر | ر | r | र |
ل | ل | l | ल |
ݪ | لؕ | ḷ | ळ |
و | و | v | व |
ش | ش | sh | श |
ݜ | سؕ | ṣh | ष |
س | س | s | स |
ص | ص | s | स |
ث | ث | s | स |
ه | ہ | h | ह |
ح | ح | h | ह |
ۃ | ۃ | h | ह |
ع | ع | ʿ | ʿ |
Single-letter word ligatures
[edit]Sindhi | Urdu | Roman | Devanagari |
---|---|---|---|
۽ | اَیں | a͠i | ऐं |
۾ | میں | mẽ | में |
Numerals
[edit]Usage | Numeral System | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Pakistan | Arabic numerals | ۰ | ۱ | ۲ | ۳ | ۴ | ۵ | ٦ | ۷ | ۸ | ۹ |
International | Hindu-Arabic | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
India | Modern Devanagari | ० | १ | २ | ३ | ४ | ५ | ६ | ७ | ८ | ९ |
Punctuations and symbols
[edit]Script | Period | Question Mark | Comma | Semi-colon | Slash | Percent | End of verse |
---|---|---|---|---|---|---|---|
Perso-Arabic | ۔ | ؟ | ، | ؛ | ؍ | ٪ | |
Modern Devanagari | । | ? | , | ; | / | % | ॥ |
References
[edit]- ^ "Sindhi gains national status". The Express Tribune. 2016-10-03. Retrieved 2021-07-02.
- ^ "Languages Included in the Eighth Schedule of the Indian Constution [sic]". Department of Official Language, Ministry of Home Affairs. Retrieved 2018-04-09.
- ^ Lakhani, Rakesh (2013-06-06). "Forgotten Sindhi Script – Waranki". Indigenous Sindhis. Retrieved 2021-07-02.
- ^ "Transcending barriers: Software to break down the wall within the Sindhi language". The Express Tribune. 2014-09-18. Retrieved 2021-07-02.
- ^ "The Imposition Of Urdu". NAWAIWAQT GROUP OF NEWSPAPERS. September 10, 2015. Retrieved 12 September 2015.
- ^ Leghari, Mehwish; U Rahman, Mutee (2015-10-30). "Towards Transliteration between Sindhi Scripts Using Roman Script". Linguistics and Literature Review. 1 (2): 95–104. doi:10.32350/llr.12.03. SSRN 3820433.
- ^ Lehal, Gurpreet Singh; Saini, Tejinder Singh (December 2014). "Sangam: A Perso-Arabic to Indic Script Machine Transliteration Model": 232–239.
{{cite journal}}
: Cite journal requires|journal=
(help) - ^ Sodhar, Irum Naz; Jalbani, Akhtar Hussain; Channa, Muhammad Ibrahim; Hakro, Dil Nawaz (2021-04-01). "Romanized Sindhi Rules for Text Communication". Mehran University Research Journal of Engineering and Technology. 40 (2): 298–304. Bibcode:2021MURJE..40..298S. doi:10.22581/muet1982.2102.04. ISSN 2413-7219.
- ^ "Perso-Arabic To Indic Script Transliteration". sangam.learnpunjabi.org. Retrieved 2021-04-07.
- ^ "Saraiki - Devanagari Machine Transliteration System - SDMTS". www.sanlp.org. Retrieved 2021-08-09.
- ^ Lawaye, Aadil; Kak, Aadil; Mehdi, Nali (January 2010). "Building a Cross Script Kashmiri Converter: Issues and Solutions". Proceedings of Oriental COCOSDA.
- ^ NC, Gokul (2021-05-07), GokulNC/Indic-PersoArabic-Script-Converter, retrieved 2021-05-28
- ^ Pedersen, Thomas T. "Sindhi: Transliteration of Non-Roman Scripts" (PDF). Transliteration of Non-Roman Scripts.
- ^ Carefully note the differences. List also includes 4 implosive consonants from Saraiki alphabet (ڳ, ڄ, ݙ, ٻ)
- ^ All the consonants with nuqta sign (dots below) denote adaptation for PersoArabic consonants