To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 縞捧?紗?峰?淡 10001110110010001001010111111001001111111000111011010001001111111001010111110100001111111001001001010111 8ec895f93f8ed13f95f43f9257
EUC-JP 縞捧?紗?峰?淡 10111100110010101100101011111011001111111011110011010011001111111100101011110110001111111100001110111000 bccacafb3fbcd33fcaf63fc3b8
UTF-8 縞捧렠紗렪峰렍淡 111001111011100010011110111001101000110110100111111010111010000010100000111001111011010010010111111010111010000010101010111001011011001110110000111010111010000010001101111001101011011110100001 e7b89ee68da7eba0a0e7b497eba0aae5b3b0eba08de6b7a1
UHC 縞捧렠紗렪峰렍淡 11111011110101101101110011101001100011101011000111011110111010011000111010111000110111001110100010001110101000111101001110111111 fbd6dce98eb1dee98eb8dce88ea3d3bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)