To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????ZoBF 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011010011011110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5a6f4246
SJIS-WIN 筌l????循??筌l????醫??ZoBF 11100010101000111000001010001100001111110011111100111111001111111000111101111010001111110011111111100010101000111000001010001100001111110011111100111111001111111110011111001110001111110011111101011010011011110100001001000110 e2a3828c3f3f3f3f8f7a3f3fe2a3828c3f3f3f3fe7ce3f3f5a6f4246
EUC-JP 筌l????循??筌l????醫??ZoBF 11100100101001011010001111101100001111110011111100111111001111111011110111011011001111110011111111100100101001011010001111101100001111110011111100111111001111111110111011010000001111110011111101011010011011110100001001000110 e4a5a3ec3f3f3f3fbddb3f3fe4a5a3ec3f3f3f3feed03f3f5a6f4246
UTF-8 筌l쥙李됵㏊循뚣룒筌l쥙李됵㎖醫륁돸ZoBF 11100111101011011000110011101111101111011000110011101100101001011001100111101111101001111010000111101011100100001011010111100011100011111000101011100101101111101010101011101011100110101010001111101011101000111001001011100111101011011000110011101111101111011000110011101100101001011001100111101111101001111010000111101011100100001011010111100011100011101001011011101001100001101010101111101011101001011000000111101011100011111011100001011010011011110100001001000110 e7ad8cefbd8ceca599efa7a1eb90b5e38f8ae5beaaeb9aa3eba392e7ad8cefbd8ceca599efa7a1eb90b5e38e96e986abeba581eb8fb85a6f4246
UHC 筌l쥙李됵㏊循뚣룒筌l쥙李됵㎖醫륁돸ZoBF 11101111101001111010001111101100101000101000111011101100101100001000100111101111101001111011010111100010111000001000110011100011100011111000111111101111101001111010001111101100101000101000111011101100101100001000100111101111101001111010001011101100101000101000111111101100100010011011101101011010011011110100001001000110 efa7a3eca28eecb089efa7b5e2e08ce38f8fefa7a3eca28eecb089efa7a2eca28fec89bb5a6f4246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)