To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 央る?壓??額??? 1000100110011011100000101110100100111111100110101101100000111111001111111000101001111010001111110011111100111111 899b82e93f9ad83f3f8a7a3f3f3f
EUC-JP 央る?壓??額??? 1011000111111011101001001110101100111111110101001101101000111111001111111011001111011011001111110011111100111111 b1fba4eb3fd4da3f3fb3db3f3f3f
UTF-8 央る젻壓꾨쑙額계뇾溜 111001011010010010101110111000111000001010001011111011001010000010111011111001011010001110010011111010101011111010101000111011001001000110011001111010011010000110001101111010101011001110000100111010111000011110111110111011111010011110001011 e5a4aee3828beca0bbe5a393eabea8ec9199e9a18deab384eb87beefa78b
UHC 央る젻壓꾨쑙額계뇾溜 1110010011100111101010101110101110100000101011101110010011100010100001001110101110011100101110001110010011111110101100001110100010000111100111111110101011111110 e4e7aaeba0aee4e284eb9cb8e4feb0e8879feafe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)