UTF-8: character encoding

Unicode supports practically every character set in use today. The best form for the Unicode character set is the UTF-8 encoding. It offers compatibility with ASCII, resistance to data corruption, efficiency and ease of processing. But first things first.

Encoding forms

Computers do not handle numbers as abstract mathematical objects but as combinations of fixed-size units of storage and processing, such as bytes and 32-bit words. A character encoding has to take this into account when it defines how character numbers are represented.

In computer systems, integers are stored in memory units of 8 bits (1 byte), 16 bits or 32 bits. Each Unicode encoding form defines which sequence of memory units represents the integer that corresponds to a given character. The standard provides three Unicode encoding forms, built on 8-bit, 16-bit and 32-bit units. Accordingly, they are called UTF-8, UTF-16 and UTF-32. The name UTF stands for Unicode Transformation Format. Each of the three encoding forms is an equally valid way of representing Unicode characters, and each has advantages in different areas of application.

Any of these encodings can represent every Unicode character. They are therefore fully interoperable for solutions that, for whatever reason, use different encoding forms. Text in any one of them can be converted to either of the other two without loss of data.
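The lossless conversion described above can be checked with a short Python sketch. The sample string is an assumption chosen for illustration, and the little-endian codec names are used only so that Python does not prepend a byte order mark.

```python
# Round-trip one string through the three Unicode encoding forms.
text = "Aloha, мир, 世界, 😀"  # sample text (an assumption, not from the article)

utf8 = text.encode("utf-8")
utf16 = text.encode("utf-16-le")   # explicit byte order, so no BOM is added
utf32 = text.encode("utf-32-le")

# Convert UTF-8 -> UTF-16 -> UTF-32 -> UTF-8 and compare with the original bytes.
via_utf16 = utf8.decode("utf-8").encode("utf-16-le")
via_utf32 = via_utf16.decode("utf-16-le").encode("utf-32-le")
back_to_utf8 = via_utf32.decode("utf-32-le").encode("utf-8")

print(back_to_utf8 == utf8)               # the round trip loses nothing
print(len(utf8), len(utf16), len(utf32))  # same text, different sizes in bytes
```

The three byte strings differ in length, but each decodes to exactly the same sequence of code points.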

The non-overlap principle

Each Unicode encoding form is designed with non-overlap in mind. By contrast, Windows-932, for example, builds characters from one or two code bytes. The length of a sequence depends on its first byte, so the values of the lead byte of a two-byte sequence and of a single byte do not overlap. However, the values of a single byte and of a trail byte can coincide. This means that when searching for the character D (hexadecimal code 44), you may mistakenly find it as the second half of the two-byte sequence for the Cyrillic character "Д" (hexadecimal code 84 44). To decide which interpretation is correct, the program has to look at the preceding bytes.

The situation gets worse if lead and trail bytes can also coincide. Resolving the ambiguity then requires scanning backward until the start of the text or an unambiguous code sequence is reached. This is not only inefficient but also fragile, because a single corrupted byte is enough to make the whole text unreadable.

The Unicode transformation formats avoid this problem because the values of lead, trail and single storage units never coincide. As a result, all Unicode encodings are suitable for searching and comparison and never produce a wrong result because different parts of a character code happen to match. Adhering to the non-overlap principle is what distinguishes these encoding forms from the other multibyte encodings used for East Asian text.
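The false match described above can be reproduced with Python's built-in cp932 codec. This is only an illustrative sketch; the variable names are arbitrary.

```python
# Byte-level search in Windows-932 (cp932) versus UTF-8.
needle = "D".encode("cp932")      # the single byte 0x44
haystack = "Д".encode("cp932")    # the two bytes 0x84 0x44

print(haystack.hex())             # hex dump of the Cyrillic letter in cp932
print(needle in haystack)         # the trail byte is mistaken for the letter D

# The same search over UTF-8 bytes cannot hit the middle of a character,
# because bytes below 0x80 only ever stand for ASCII characters.
utf8_haystack = "Д".encode("utf-8")
print("D".encode("utf-8") in utf8_haystack)
```

In cp932 the search reports a hit even though the text contains no Latin D; in UTF-8 the same search finds nothing.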

Another aspect of non-overlap in the Unicode encodings is that every character has clearly defined boundaries. This removes the need to scan an unbounded number of preceding characters. This property is sometimes called self-synchronization. Corrupting one code unit damages only one character, while the surrounding characters stay intact. In the 8-bit encoding form, if a pointer lands on a byte of the form 10xxxxxx (in binary), at most three backward steps are needed to find the start of the character.
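A minimal sketch of that backward step in Python, assuming the input is well-formed UTF-8. The function name char_start is hypothetical.

```python
def char_start(data: bytes, i: int) -> int:
    """Return the index of the first byte of the character that contains data[i]."""
    steps = 0
    # Continuation bytes have the bit pattern 10xxxxxx; a well-formed
    # character contains at most three of them, so three steps suffice.
    while i > 0 and (data[i] & 0xC0) == 0x80 and steps < 3:
        i -= 1
        steps += 1
    return i


data = "a€😀".encode("utf-8")   # 1 + 3 + 4 bytes
print(char_start(data, 3))      # index inside the 3-byte euro sign
print(char_start(data, 7))      # last byte of the 4-byte emoji
```

No state from earlier in the text is needed: the bit pattern of each byte alone tells whether it starts a character.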

Consistency

The Unicode Consortium fully supports all three encoding forms. It is important not to set UTF-8 against Unicode, because all the transformation formats are equally legitimate implementations of the character encoding forms of the Unicode standard.

Byte orientation

To represent a character in UTF-32 you need one 32-bit code unit whose value equals the Unicode code point. UTF-16 uses one or two 16-bit units. UTF-8 uses from one to four bytes.
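The number of code units per character can be counted directly. The sketch below is illustrative; code_units is a hypothetical helper, and the sample characters are assumptions.

```python
def code_units(ch: str):
    """Return how many UTF-8, UTF-16 and UTF-32 code units one character needs."""
    utf8_units = len(ch.encode("utf-8"))            # 8-bit units
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 16-bit units
    utf32_units = len(ch.encode("utf-32-le")) // 4  # 32-bit units
    return utf8_units, utf16_units, utf32_units


for ch in ["A", "é", "世", "😀"]:
    print(f"U+{ord(ch):04X}", code_units(ch))
```

Only characters above U+FFFF need a second UTF-16 unit, while UTF-32 always needs exactly one unit.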

The UTF-8 encoding was designed for compatibility with byte-oriented, ASCII-based systems. Most existing software and practice in information technology has long relied on representing characters as a sequence of bytes. Many protocols depend heavily on the invariance of ASCII-based encodings and either use or avoid particular bytes that may have control functions associated with them. A simple way to adapt Unicode to such situations is to use an 8-bit encoding in which every Unicode character that corresponds to an ASCII character or control code is represented by the same byte value. The UTF-8 encoding was designed for exactly this purpose.

Variable length

UTF-8 is a variable-length encoding made up of 8-bit storage units whose high-order bits indicate which part of the sequence each byte belongs to. One range of values is reserved for the first unit of a code sequence and another for the units that follow. This is what guarantees that the encoding does not overlap.

ASCII

The UTF-8 encoding fully preserves the ASCII codes (0x00-0x7F). This means that the Unicode characters U+0000-U+007F are converted to the single bytes 0x00-0x7F in UTF-8 and are therefore indistinguishable from ASCII. In addition, to avoid ambiguity, the values 0x00-0x7F are never used in any other byte of a Unicode character's representation. Non-ideographic characters beyond ASCII, in the range U+0080-U+07FF, are encoded as two-byte sequences. Code points in the range U+0800-U+FFFF are represented by three bytes, and supplementary characters with codes above U+FFFF require four bytes.
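The ranges above translate into a small encoder. The following Python sketch is for illustration only (in practice str.encode does this); the name encode_utf8 is hypothetical.

```python
def encode_utf8(cp: int) -> bytes:
    """Encode one Unicode code point as UTF-8, following the ranges above."""
    if cp < 0 or cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if cp < 0x80:                       # U+0000-U+007F: one byte, same as ASCII
        return bytes([cp])
    if cp < 0x800:                      # U+0080-U+07FF: two bytes
        return bytes([0xC0 | (cp >> 6),
                      0x80 | (cp & 0x3F)])
    if cp < 0x10000:                    # U+0800-U+FFFF: three bytes
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    return bytes([0xF0 | (cp >> 18),    # U+10000-U+10FFFF: four bytes
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


for cp in (0x41, 0xE9, 0x20AC, 0x1F600):
    print(f"U+{cp:04X}", encode_utf8(cp).hex())
```

Every byte after the first has the form 10xxxxxx, so no byte in the range 0x00-0x7F ever appears inside a multi-byte sequence.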

Scope of application

UTF-8 is generally the preferred encoding form for HTML and similar protocols.

XML became the first standard with full support for the UTF-8 encoding. Standardization bodies recommend it as well. The problem of supporting non-ASCII characters in URLs was solved by moving to UTF-8: the W3C and the IETF engineering groups agreed that all URLs should be encoded exclusively in UTF-8.

Compatibility with ASCII makes the transition to new software easier. Most text editors work with UTF-8, including JEdit, Emacs, BBEdit, Eclipse and Notepad in the Windows operating system. No other Unicode encoding form can boast such broad tool support.

An advantage of the encoding is that it consists of a sequence of bytes. UTF-8 strings are easy to work with in C and other programming languages. It is the only encoding form that requires neither a byte order mark (BOM) nor an encoding declaration in XML.

Self-synchronization

In an environment that processes characters in 8-bit units, UTF-8 has the following advantages over other multibyte encodings:

  • The first byte of a code sequence carries information about its length. This makes forward searching more efficient.
  • Finding the start of a character is easy, because the lead byte is restricted to a fixed range of values.
  • Byte values do not overlap.
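The first point in the list can be sketched in Python as follows. The function names are hypothetical, and the code assumes well-formed UTF-8 input; it does not validate continuation bytes.

```python
def sequence_length(lead: int) -> int:
    """Return the length of a UTF-8 sequence, given only its first byte."""
    if lead < 0x80:             # 0xxxxxxx: single ASCII byte
        return 1
    if 0xC2 <= lead <= 0xDF:    # 110xxxxx: two-byte sequence
        return 2
    if 0xE0 <= lead <= 0xEF:    # 1110xxxx: three-byte sequence
        return 3
    if 0xF0 <= lead <= 0xF4:    # 11110xxx: four-byte sequence
        return 4
    raise ValueError("not a valid UTF-8 lead byte")


def split_characters(data: bytes) -> list:
    """Walk forward through well-formed UTF-8, one whole character per step."""
    chars = []
    i = 0
    while i < len(data):
        n = sequence_length(data[i])
        chars.append(data[i:i + n])
        i += n
    return chars


print(split_characters("aé€😀".encode("utf-8")))
```

A forward scan never has to inspect continuation bytes to know where the next character begins, which is why the lead byte alone is enough.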

Comparing the advantages

The UTF-8 encoding is compact. However, when it is used to encode East Asian text (Chinese, Japanese and Korean, which use Chinese ideographs), it needs 3-byte sequences. UTF-8 is also slightly behind the other encoding forms in processing speed. A binary sort of UTF-8 strings gives the same order as a binary sort of Unicode code points.
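Both claims, the size cost for ideographs and the sort order, can be observed with a short Python sketch. The sample strings are assumptions; the UTF-16 comparison is added only as a contrast and is not taken from the article.

```python
# Size: ideographs take 3 bytes each in UTF-8 but 2 bytes each in UTF-16.
cjk = "世界"
print(len(cjk.encode("utf-8")), len(cjk.encode("utf-16-le")))

# Order: sorting by raw UTF-8 bytes matches sorting by code point.
words = ["z", "é", "世", "\uff21", "😀"]   # U+007A, U+00E9, U+4E16, U+FF21, U+1F600
by_code_point = sorted(words)              # Python compares str by code point
by_utf8 = sorted(words, key=lambda s: s.encode("utf-8"))
by_utf16 = sorted(words, key=lambda s: s.encode("utf-16-be"))

print(by_utf8 == by_code_point)
print(by_utf16 == by_code_point)   # surrogate pairs sort below U+FF21 in UTF-16
```

This is why byte-wise comparison functions can order UTF-8 strings by code point without decoding them.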

Encoding scheme

An encoding scheme consists of a character encoding form plus a byte-by-byte serialization of its code units. To identify the encoding scheme, the Unicode standard provides for the use of a byte order mark (BOM).

When a BOM is used in UTF-8, the function of the mark is limited to indicating which encoding form is in use. The problem of determining byte order does not arise in UTF-8, because its code unit is a single byte. Using a BOM with this encoding form is neither required nor recommended. A BOM can appear in text converted from encodings that use a byte order mark, or it can serve as a UTF-8 signature. It is the 3-byte sequence EF BB BF (hexadecimal).
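A minimal Python sketch of detecting and removing the UTF-8 signature, using the standard codecs module. The helper name strip_bom is hypothetical.

```python
import codecs

print(codecs.BOM_UTF8.hex())   # the three signature bytes in hexadecimal


def strip_bom(data: bytes) -> bytes:
    """Remove a leading UTF-8 signature if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


with_bom = codecs.BOM_UTF8 + "Aloha".encode("utf-8")

# The plain "utf-8" codec keeps the mark as the character U+FEFF,
# while "utf-8-sig" drops it during decoding.
print(repr(with_bom.decode("utf-8")))
print(repr(with_bom.decode("utf-8-sig")))
print(strip_bom(with_bom))
```

Decoding with "utf-8-sig" is a convenient choice when the input may or may not carry the mark, since it accepts both.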

How to set the UTF-8 encoding

On an HTML page, the UTF-8 encoding is declared with the following code:

HTML

<meta http-equiv="Content-Type" content="text/html; charset=utf-8">

In PHP, the UTF-8 encoding is set with the header() function at the very start of the file, right after the error reporting level is set:

PHP

error_reporting(-1);

header('Content-Type: text/html; charset=UTF-8');

For connections to MySQL databases, the UTF-8 encoding is set as follows:

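PHP

// The mysql_* extension was removed in PHP 7.0; with mysqli, call mysqli_set_charset($link, 'utf8mb4'), where $link is your connection.
mysql_set_charset('utf8');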

In CSS files, the UTF-8 encoding is declared like this:

@charset "utf-8";

When saving files of any type, choose UTF-8 without a BOM, because otherwise the site may not work. To do this in Dreamweaver, choose the menu item "Modify - Page Properties - Title/Encoding" and change the encoding to UTF-8. Then reload the page, clear the "Include Unicode Signature (BOM)" check box and apply the changes. If any text on the page or in the database was entered in a different encoding, it has to be re-entered or re-encoded. When working with regular expressions, be sure to use the UTF-8 modifier (u in PHP's PCRE functions).

You can also save a file in UTF-8 in Windows Notepad. After choosing "File - Save As...", set the required encoding and save the file as UTF-8.

In the Notepad++ text editor, if the encoding is something other than UTF-8, convert the file and save it as UTF-8 using the menu item "Convert to UTF-8 without BOM".
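The same two tasks, saving without a BOM and re-encoding legacy text, can also be scripted. The Python sketch below is illustrative: the function names, the file name page.html and the cp1251 source encoding are all assumptions.

```python
import os
import tempfile

BOM = b"\xef\xbb\xbf"


def save_utf8_without_bom(path, text):
    """Write text as UTF-8; the "utf-8" codec never prepends a BOM."""
    with open(path, "w", encoding="utf-8", newline="") as f:
        f.write(text)


def reencode_to_utf8(raw, source_encoding):
    """Convert bytes stored in another encoding to UTF-8 without a BOM."""
    return raw.decode(source_encoding).encode("utf-8")


# Hypothetical file in a temporary directory, used only for the demonstration.
path = os.path.join(tempfile.mkdtemp(), "page.html")
save_utf8_without_bom(path, "<p>Привет, 世界</p>\n")

with open(path, "rb") as f:
    raw = f.read()
print(raw.startswith(BOM))      # check whether a signature was written

legacy = "Привет".encode("cp1251")   # text saved in a legacy code page
print(reencode_to_utf8(legacy, "cp1251").decode("utf-8"))
```

Opening the file with encoding="utf-8-sig" instead would write the signature, which is the behavior the article advises against.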

No alternative

In an age of globalization, when political and linguistic boundaries are fading, character sets tied to a single nationality are of little use. Unicode is the only character set that supports all localizations. UTF-8 is an example of a correct implementation of Unicode, one that:

  • Supports a wide range of tools, including compatibility with the ASCII encoding;
  • Is resistant to data corruption;
  • Is simple and efficient to process;
  • Does not depend on the platform.

With the arrival of UTF-8, debates about which encoding or character set is better have become pointless.

Copyright © 2018 haw.birmiss.com.