Sindhi transliteration
Sindhi is a language broadly spoken by the people of the historical Sindh region in the Indian subcontinent. Modern Sindhi is written in an extended Perso-Arabic script in Sindh province of Pakistan[1] and (formally) in extended-Devanagari by Sindhis in partitioned India.[2] Historically, Sindhi was written in various forms of Landa scripts and various other Indic scripts.[3]
Sindhi Transliteration is essential to convert between Arabic and Devanagari so that speakers of both the countries can read the text of each other.[4] In modern day, Sindhi script colloquially just refers to the Perso-Arabic script since majority of Sindhis are from Pakistan. It is also important to note that the Sindhi script is not same as the Urdu-Shahmukhi script,[5] hence one cannot use script conversions like Hindi-Urdu Transliteration.
Technically, a direct one-to-one mapping or rule-based script conversion is not possible between Pakistani and Indian Sindhi, majorly since Devanagari is an abugida script and Arabic-Sindhi is an abjad script, and also other constraints like multiple similar characters from Perso-Arabic which map onto a single character in Devanagari.[6] Hence it is preferred to use dictionary-based or machine learning-based transliteration between the Sindhi scripts.[7] For colloquial usage in the digital space where writing Sindhi in Latin script is prevalent, Romanisation of Sindhi is used.[8]
In addition to Sindhi, there have been attempts to design Indo-Pakistani transliteration systems for digraphic languages like Punjabi (written in Gurmukhi in East Punjab and Shahmukhi in West Punjab), Saraiki (written in an extended-Shahmukhi script in Saraikistan and unofficially in Sindhi-Devanagari script in India) and Kashmiri (written in extended Perso-Arabic by Kashmiri Muslims and extended-Devanagari by Kashmiri Hindus).[9][10] [11]
Consonants
The following table provides an approximate one-to-one mapping for modern Sindhi consonants,[12] especially for computational purposes (lossless script conversion). Note that this direct script conversion will not yield correct spellings, but rather a readable text for both the readers.[13]
| Sindhi Abjad | Urdu-Shahmukhi[14] | Roman | Devanagari[15] | 
|---|---|---|---|
| ڪ | ک | k | क | 
| ک | کھ | kh | ख | 
| ق | ق | q | क़ | 
| خ | خ | k͟h | ख़ | 
| گ | گ | g | ग | 
| غ | غ | g͟h | ग़ | 
| ڳ | ڳ | g̤ | ॻ | 
| گھ | گھ | gh | घ | 
| چ | چ | c | च | 
| ڇ | چھ | ch | छ | 
| ج | ج | j | ज | 
| جھ | جھ | jh | झ | 
| ڄ | ڄ | j̈ | ॼ | 
| ز | ز | z | ज़ | 
| ذ | ذ | z | ज़ | 
| ض | ض | z | ज़ | 
| ظ | ظ | z | ज़ | 
| ژ | ژ | zh | झ़ | 
| ٽ | ٹ | ṭ | ट | 
| ٺ | ٹھ | ṭh | ठ | 
| ڏ | ݙ | d̤ | ॾ | 
| ڊ | ڈ | ḍ | ड | 
| ڍ | ڈھ | ḍh | ढ | 
| ڙ | ڑ | ṛ | ड़ | 
| ڙھ | ڑھ | ṛh | ढ़ | 
| ت | ت | t | त | 
| ٿ | تھ | th | थ | 
| ط | ط | t | त | 
| د | د | d | द | 
| ڌ | دھ | dh | ध | 
| ن | ن | n | न | 
| ڻ | ݨ | ṇ | ण | 
| ڱ | ن٘ | ṅ | ङ | 
| ڃ | ݩ | ñ | ञ | 
| پ | پ | p | प | 
| ڦ | پھ | ph | फ | 
| ف | ف | f | फ़ | 
| ب | ب | b | ब | 
| ٻ | ٻ | ḇ | ॿ | 
| ڀ | بھ | bh | भ | 
| م | م | m | म | 
| ي | ی | y | य | 
| ر | ر | r | र | 
| ل | ل | l | ल | 
| ݪ | لؕ | ḷ | ळ | 
| و | و | v | व | 
| ش | ش | sh | श | 
| ݜ | سؕ | ṣh | ष | 
| س | س | s | स | 
| ص | ص | s | स | 
| ث | ث | s | स | 
| ه | ہ | h | ह | 
| ح | ح | h | ह | 
| ۃ | ۃ | h | ह | 
| ع | ع | ʿ | ʿ | 
Single-letter word ligatures
| Sindhi | Urdu | Roman | Devanagari | 
|---|---|---|---|
| ۽ | اَیں | a͠i | ऐं | 
| ۾ | میں | mẽ | में | 
Numerals
| Usage | Numeral System | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Pakistan | Arabic numerals | ۰ | ۱ | ۲ | ۳ | ۴ | ۵ | ٦ | ۷ | ۸ | ۹ | 
| International | Hindu-Arabic | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 
| India | Modern Devanagari | ० | १ | २ | ३ | ४ | ५ | ६ | ७ | ८ | ९ | 
Punctuations and symbols
| Script | Period | Question Mark | Comma | Semi-colon | Slash | Percent | End of verse | 
|---|---|---|---|---|---|---|---|
| Perso-Arabic | ۔ | ؟ | ، | ؛ | ؍ | ٪ |  | 
| Modern Devanagari | । | ? | , | ; | / | % | ॥ | 
References
- ^ "Sindhi gains national status". The Express Tribune. 2016-10-03. Retrieved 2021-07-02.
- ^ "Languages Included in the Eighth Schedule of the Indian Constution [sic]". Department of Official Language, Ministry of Home Affairs. Retrieved 2018-04-09.
- ^ Lakhani, Rakesh (2013-06-06). "Forgotten Sindhi Script – Waranki". Indigenous Sindhis. Retrieved 2021-07-02.
- ^ "Transcending barriers: Software to break down the wall within the Sindhi language". The Express Tribune. 2014-09-18. Retrieved 2021-07-02.
- ^ "The Imposition Of Urdu". NAWAIWAQT GROUP OF NEWSPAPERS. September 10, 2015. Retrieved 12 September 2015.
- ^ Leghari, Mehwish; U Rahman, Mutee (2015-10-30). "Towards Transliteration between Sindhi Scripts Using Roman Script". Linguistics and Literature Review. 1 (2): 95–104. doi:10.32350/llr.12.03. SSRN 3820433.
- ^ Lehal, Gurpreet Singh; Saini, Tejinder Singh (December 2014). "Sangam: A Perso-Arabic to Indic Script Machine Transliteration Model": 232–239. {{cite journal}}: Cite journal requires|journal=(help)
- ^ Sodhar, Irum Naz; Jalbani, Akhtar Hussain; Channa, Muhammad Ibrahim; Hakro, Dil Nawaz (2021-04-01). "Romanized Sindhi Rules for Text Communication". Mehran University Research Journal of Engineering and Technology. 40 (2): 298–304. Bibcode:2021MURJE..40..298S. doi:10.22581/muet1982.2102.04. ISSN 2413-7219.
- ^ "Perso-Arabic To Indic Script Transliteration". sangam.learnpunjabi.org. Retrieved 2021-04-07.
- ^ "Saraiki - Devanagari Machine Transliteration System - SDMTS". www.sanlp.org. Retrieved 2021-08-09.
- ^ Lawaye, Aadil; Kak, Aadil; Mehdi, Nali (January 2010). "Building a Cross Script Kashmiri Converter: Issues and Solutions". Proceedings of Oriental COCOSDA.
- ^ NC, Gokul (2021-05-07), GokulNC/Indic-PersoArabic-Script-Converter, retrieved 2021-05-28
- ^ Pedersen, Thomas T. "Sindhi: Transliteration of Non-Roman Scripts" (PDF). Transliteration of Non-Roman Scripts.
- ^ Carefully note the differences. List also includes 4 implosive consonants from Saraiki alphabet (ڳ, ڄ, ݙ, ٻ)
- ^ All the consonants with nuqta sign (dots below) denote adaptation for PersoArabic consonants