Wp/mjx/Unicode

< Wp‎ | mjx
Wp > mjx > Unicode

Unicode, official lekate Unicode standard, hoyoḱ kana Unicode Consortium darai te cacalao akan mit́ṭạṅ ol encoding manok, jahã do jeget reaḱ sanam muṛut́ ol hora te ol reaḱ beohar goṛo lạgit́ benao huy akana. 15.1 anaḱ manok version re 1,49,813 akhor ko menaḱa ar 162 goṭaṅ lipi reaḱ pạnạrsi lekate lạy sodor akada jahã do ayma lekan, sãohed, secet́ ar kạri anaḱ satam re beoharoḱa.

El, cikhnạ ar eṭaḱ cinhạ sãote ayma menela el, manok mudre midoḱ kana ona kodo eṭaḱ eṭaḱ ol hora lạgit́ bhegar te baṅ ko manaw akada. Unicode sasay goṭaṅ emoji encode leda, jahã reaḱ calawan utnạo sãote manok reaḱ hĩs lekate Consortium darai te cacalao lena. Ona begor hõ, Unicode reaḱ ạḍi ḍher ãgoc khạtir japan bahre re emoji reaḱ hoṛ kusi ḍheroḱ re enem menaḱa. Unicode mucạt́ re 11,00,000 khon hõ jạsti akhor encode korao daṛeyaḱa.

Unicode do internet re ayma gan ol e encode lạgit́ beoharoḱa, jahã mudre ayma web pej menaḱa ar unokto reaḱ software utnạo re satam an Unicode goṛo mit́ṭaṅ menela bupujhạw re benao akana.

Etohoṕ ar utnạo edit

Unicode etohoṕ lekate ona okto hạbić benao sanam ṭesṭ encoding re menaḱ simạ ko pasnãw reaḱ uyhạr te benao huy lena. Sanam encoding ajaḱ poriman re beohar lạgit́ cetan re ṭẽhaṭ tahẽ kana, menkhan eṭaḱ ko são sa-budis reaḱ jahan goṭabuṭạ onumạn baṅ tahẽ kana. Goṭa lekate, bachnaw barya encoding mit́ sãote beohar okto jaoge sanamaḱ kạmi baṅ korao ganoḱ kan tahẽ, jahã mudre mit́ṭạṅ re encode akan ol do eṭaḱ darai te jobra el lekate bichnạw huyuḱa. Jạstikay encoding sumuṅ mid-bar script mudre ontor bebhar reaḱ goṛo lạgit́ benao huy lena- jaoge mit́ṭaṅ onkan script ar leṭin akhor ko mudre—ayma script mudre baṅ, ar mit́ lekate beohar huyuḱ kan sanam script são do baṅ.

Joto khon mucạt́ thar re, Unicode sanam akhor ko lạgit́ code poyenṭ ńutum te mit́ṭaṅ bhegar nombor emaḱa. Ńelan uduḱ sodor reaḱ ayma satam - akar, goṛhon ar so̱yli sãote- do so̱pṭo̱war reaḱ katha lekate korao ho̱yo̱ḱa, jeleka o̱yeṕ brawjar se ạṛạ prosesor. Enkhan, logon hatConsortium lạgit́ te onumạn lekate, nõwa muṛut́ moḍel reaḱ solheyaḱ tet́ okto são sãote kichu jạsti pasnao akana ar manok reaḱ utnạo okto re ayma kạmi ạri chạṛ em huy akana.

Po̱ylo̱ 256 code poyenṭ do ISO/IEC sabadok e ńel leda, jahã reaḱ jos do huyuḱ kana pochim yuro̱piyo script re ol ol reaḱ bonodol do huḍiń korConsortium. Ayma birạdạli encoding darai te benao kan bhegar tet́ ko doho lạgid, onate jahan tottho baṅ hạn kate ona ko ar Unicode mudre bonodol reaḱ chạṛ em lạgid, ayma akhor ko, ńelan ar uyhạran kạmi banar re, eṭaḱ ko são pray mit́ leka ge rehõ, bhegar code poyenṭ em huy lena.

Unicode bulḍo̱g sirpạ do Unicode reaḱ utnạo re eseran mente mone huyuḱ kan hoṛ ko lạgit́ em huyuḱa, jahãy ko mudre menaḱ kowa totsuwo̱ ko̱bayesi, tamas milo̱, rujbe purnaḍar, ken lunḍ ar Michael Everson.

Nagam edit

Unicode etohoṕ 1980 gel serma re jero̱ks reaḱ jero̱ks curit code manok (xccs) são selet́ mit́ gadel hoṛ ṭhen khon ńam ganoḱa. 1987 serma re, jero̱ks kạmiyạ jo̱ bekar, epel ren kạmiyạ li ko̱lins ar mark ḍebhis sãote, mit́ṭaṅ jegetạri curit seṭ benConsortium reaḱ beoharan tolas ko ehoṕ leda. Peṭar phenuyik ar ḍebh opsṭaḍ aḱ bạṛti tetet́ te, bekar do 1988 serma reaḱ Agosṭ re mit́ṭạṅ "jeget́ jakat/sãge pạrsi te ol curit encoding hora" lạgit́ mit́ṭaṅ tạlika reaḱ katha uchạn leday, jahã do onumạn lekate Unicode ńutum te uprum lena. Uni bichnạw leday, "'Unicode' ńutum do mit́ṭaṅ bhenegar, midag, jegetạri encoding reaḱ katha bujhạw oco lạgit́ benao akana.

Unicode 88 ńutuman nõwa nothi re bekar 16 biṭ beohar kate mit́ṭaṅ skim reaḱ rup uduḱ sodor leday:

Unicode reaḱ jos do mit́ṭaṅ kạmi an, batawaḱ jeget́ ok encoding reaḱ lạkti ko ńel lạgid. IUnicode do sadharon lekate "e̱es.si.ay.ay reaḱ pasnConsortium hoṛmo" lekate bornon huy daṛeyaḱa jahã do jeget reaḱ sanam jiyet́ pạrsi reaḱ curit ko selet́ lạgit́ 16 biṭ hạbić pasnao huy akana. Mit́ṭaṅ bhage lekate benConsortium akan ḍijayin re, nõwa kạmi lạgit́ joto curit re 16 biṭ ge nowa pro̱po̱j lạgit́ lek mana.

Nõwa benao reaḱ goṭabuṭạ do nõwa uyhạr cetan ṭehaṭ kate Consortium ho̱y lena je 'nahag' beohar re menaḱ akhor sumuṅ ar curit ko lạgit́ encoding reaḱ lạkti huyuḱa.

Unicode do maṛaṅ reaḱ maren jinis ko rukhiyạ khon tayom daram lạgit́ beohar reaḱ katha goṭa lạgit́ cetan re sores (lahanti) e em akada. Unicode reaḱ jos do po̱ylo̱ okte re nahaḱ ol re uchạn akan curit ko (jeleka 1988 serma re jeget́ re uchạn sanam khobor sakam ar patham reaḱ sakam re), jaha kowaḱ el do baṅ rehõ 214 = 16,384 khon ạḍi latar. Nõwa nahaḱ beoharan akhor ko begor, eṭaḱ sanamaḱ ge be-colti se baṅ bebharoḱ lekate bornon huy daṛeyaḱa, sadharon lekate beoharan Unicode reaḱ pablik tạlkạ benao khon be-sãota beohar rejisṭar lạgit́ nõwa ko arhõ bhageya.

1989 serma reaḱ etohoṕ sed, Unicode kạmi gadel do pasnao kate meṭapho̱r ren ken hislar ar meik karnagan, risarc libriri grup ren keren simit-iyo̱simura ar jawan elipranḍ ar Sun Microsystems ren gelen rayiṭ selet́ lena. 1990 re, Microsoft ren misel suyinarḍ ar esmas phrayṭaḱ ar nạes.ṭi ren rik mekgo̱wan hõ nõwa gadel re selet́ lena. 1990 sal reaḱ mucạt́ sed, nahaḱ manok ko mep ruwạṛ reaḱ jạsti kạmi ge mucạt́ lena ar Unicode reaḱ mit́ṭạṅ mucạt́ re biḍạw ḍraphoṭ benao lena.

1991 serma reaḱ 3 januyạri re California re Unicode Consortium selet́ lena, ar Unicode standard reaḱ po̱ylo̱ rada do ona okṭo̱bor re uchạn lena. Dosar hĩs, jahã nit han ayḍiwo̱graph selet́ akana, 1992 serma reaḱ jun cãdo̱ re uchạn lena.

1996 serma re, Unicode 2.0 re mit́ṭaṅ surrogate akhor hora etohoṕ lena, jahãte Unicode do 16 biṭ hạbić ge simạ eset́ baṅ tahẽ lena. Nõwa do Unicode codespes do gel lakh goṭaṅ khon hõ ḍher code poyenṭ re bạṛti lena, jahã do ayma nagam anaḱ lipi, jeleka misor reaḱ hayaro̱gliph ar sasay goṭaṅ kom beoharoḱ kan se be-colti akhor ko encoding reaḱ chạṛ e em leda, jahã do nõwa manok re selet́ lạgit́ baṅ ko uyhạr akan tahẽ. Nõwa curit mudre menaḱa ayma kom beoharoḱ kan si.je.ke akhor ko.

2992 serma re uchạn akan Microsoft reaḱ ṭru ṭayip bises tet́ reaḱ 1.0 anaḱ bornon re ńutum reaḱ ṭebil re peṭram ayḍi lạgit́ 'Unicode' renaḱ bodol te 'epel Unicode "ńutum beohar huy lena.

Unicode Consortium edit

Unicode Consortium do mit́ṭạṅ be-phayda a semlet́ kana, jahã do Unicode reaḱ utnạo lạgit́e kạmiya., Apple, Facebook, Google, IBM, maykro̱so̱poṭ,neṭphliks ar es.e.pi.es.i sãote ol-bebenao manok re jahã lekan sana menaḱa, ona jạsti gan muṛut́ kompiyuṭar so̱pṭo̱war ar harḍo̱war ko̱mpani ko pursᱹ-puri ko selet́ akana.

Serma tayom serma ayma disom baṅkhan sorkaraḱ semlet́ Unicode Consortium ren rạsiyạ ko huy akana. Nahaḱ sumuṅ enḍo̱menṭ dhorom kạmi montroć (o̱man) bho̱ṭ reaḱ daṛe sãote mit́ṭaṅ goṭabuṭạ rạsiyạ kanay.

Kanso̱rṭiyam reaḱ maraṅ jos huyuḱ kana nahaḱ akhor encoding scheme ko mucạt́ dhạbić Unicode ar ona reaḱ manok Unicode Transformation Format (UTF) scheme ko são bodol korao, oje do nahaḱ skim ko mudre ạḍi gan skim ko akar ar daw re simit ar sãge pạrsi ko tala re baṅ kạmi daṛe kana.

Menaḱa script ko edit

Ayma nahaḱ application Unicode re ayma script reaḱ mit́ṭạṅ jạsti uposoṭ em daṛeyaḱa, jahã do openoffice.org application khon nõwa screenshot talate uduḱ sodor akana. Unicode re nahaḱ beoharoḱ kan jạsti gan muṛut́ ol hora ko selet́ akada.

Kichu nahaḱ benao ciki ko jahã nit hạbić Unicode re selet́ baṅ huy akana (jeleka, Tengwar) se jahã do dinạm din beohar reaḱ kom ojete Unicode re selet́ lạgit́ baṅ ko hameṭ akada (jeleka, Klingon), be-sorkari menkhan maraṅ lekate beoharoḱ kan be-sorkari beohar ṭoṭha code assignment sãote conscript Unicode rejisṭri re selet́ akana.

Nõwa begor hõ mit́ṭạṅ talma jug Unicode font etohoṕ menaḱa jaha do talma jug reaḱ muṛut́ leṭin curit ko cetan met́-lutur menaḱa. Nõwa poramos ko reaḱ kichu hĩs maṛaṅ khon ge Unicode re selet́ huy akana.

Script encoding etohoṕ edit

Skripṭ encoding etohob, brekle reaḱ yunibhesṭarṭi oph kelpharniya re ḍebo̱ra enḍarson darai te cacalaw mit́ṭaṅ pantha 2002 sạhit re thapon lena jahã reaḱ jos tahẽ kana nõwa hạbić mandare encode baṅ huy akan script lạgit́ poramos kạwḍi emoḱ. Nahaḱ serma kore nõwa porjikol nõwa mandare emon selet́ reaḱ mit́ṭaṅ muṛut́ pheḍat huy akana.

Nawa uchạn ko edit

Unicode konso̱rṭiyom do serma re mit́ dhaw arbaṅ serma re bar dhaw Unicode standard reaḱ nawa barson e bebhara. 16.0 bherson do 2024 serma re parsal huyuḱa. Nõwa bherson re 6 goṭaṅ nawa ciki (to̱dhri, sunuwar, guruṅ khema, kiraṭ ray, garay ar ol onol), san ar mon ciki lạgit́ arhõ bormis elkha ko, hegel kompyuṭiṅ lạgit́ arhõ eṭaḱ cinhạ ko ar 6 goṭaṅ nawa emoji ko selet́ menaḱa.