To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 猷??癲?猷??癲?B 100101110101000100111111001111111110000110011111001111111001011101010001001111110011111111100001100111110011111101000010 97513f3fe19f3f97513f3fe19f3f42
EUC-JP 猷??癲?猷??癲?B 110011011011001000111111001111111110001010100001001111111100110110110010001111110011111111100010101000010011111101000010 cdb23f3fe2a13fcdb23f3fe2a13f42
UTF-8 猷뜻넀癲뇁猷뜻넀癲뇁B 11100111100011001011011111101011100111001011101111101011100001001000000011100111100110011011001011101011100001111000000111100111100011001011011111101011100111001011101111101011100001001000000011100111100110011011001011101011100001111000000101000010 e78cb7eb9cbbeb8480e799b2eb8781e78cb7eb9cbbeb8480e799b2eb878142
UHC 猷뜻넀癲뇁猷뜻넀癲뇁B 111010111010001110110110111001101000011010010000111011111010011010000111011010011110101110100011101101101110011010000110100100001110111110100110100001110110100101000010 eba3b6e68690efa68769eba3b6e68690efa6876942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)