Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 樗?蒸?檍碇???異?樗?蒸?檍碇???異?^ 100100101001010000111111100011111111011000111111100111101111100010010010111101000011111100111111001111111000100011011001001111111001001010010100001111111000111111110110001111111001111011111000100100101111010000111111001111110011111110001000110110010011111101011110 92943f8ff63f9ef892f43f3f3f88d93f92943f8ff63f9ef892f43f3f3f88d93f5e
EUC-JP 樗?蒸?檍碇???異?樗?蒸?檍碇???異?^ 110000111111010000111111101111101111100000111111110111001111101011000100111101100011111100111111001111111011000011011011001111111100001111110100001111111011111011111000001111111101110011111010110001001111011000111111001111110011111110110000110110110011111101011110 c3f43fbef83fdcfac4f63f3f3fb0db3fc3f43fbef83fdcfac4f63f3f3fb0db3f5e
UTF-8 樗렍蒸얏檍碇잦렮렧異렡樗렍蒸얏檍碇잦렮렧異렡^ 11100110101010001001011111101011101000001000110111101000100100101011100011101100100101101000111111100110101010101000110111100111101000101000011111101100100111101010011011101011101000001010111011101011101000001010011111100111100101011011000011101011101000001010000111100110101010001001011111101011101000001000110111101000100100101011100011101100100101101000111111100110101010101000110111100111101000101000011111101100100111101010011011101011101000001010111011101011101000001010011111100111100101011011000011101011101000001010000101011110 e6a897eba08de892b8ec968fe6aa8de7a287ec9ea6eba0aeeba0a7e795b0eba0a1e6a897eba08de892b8ec968fe6aa8de7a287ec9ea6eba0aeeba0a7e795b0eba0a15e
UHC 樗렍蒸얏檍碇잦렮렧異렡樗렍蒸얏檍碇잦렮렧異렡^ 111011101100000010001110101000111111000111111010101111101110011011100101111001011110111111101101110000001110011010001110101110111000111010110110111011001011011010001110101100101110111011000000100011101010001111110001111110101011111011100110111001011110010111101111111011011100000011100110100011101011101110001110101101101110110010110110100011101011001001011110 eec08ea3f1fabee6e5e5efedc0e68ebb8eb6ecb68eb2eec08ea3f1fabee6e5e5efedc0e68ebb8eb6ecb68eb25e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
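
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_to_bits` and the `CODECS` mapping are hypothetical, and the mapping from the charset labels used here to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the table.

```python
from __future__ import annotations

# Assumed mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text: str, codec: str) -> tuple[str, str]:
    """Return (binary string, hexadecimal string) for text in the given codec.

    Unencodable characters are replaced with '?' rather than raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "樗^"  # hypothetical input: one CJK character plus an ASCII character
    for label, codec in CODECS.items():
        bits, hexa = encode_to_bits(sample, codec)
        print(label, bits, hexa)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.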