To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???L???LB 001111110011111100111111010011000011111100111111001111110100110001000010 3f3f3f4c3f3f3f4c42
SJIS-WIN 亨℡?L亨℡?LB 10001011100111001000011110000100001111110100110010001011100111001000011110000100001111110100110001000010 8b9c87843f4c8b9c87843f4c42
EUC-JP 亨??L亨??LB 1011010111111100001111110011111101001100101101011111110000111111001111110100110001000010 b5fc3f3f4cb5fc3f3f4c42
UTF-8 亨℡윿L亨℡윿LB 111001001011101010101000111000101000010010100001111011001001110010111111010011001110010010111010101010001110001010000100101000011110110010011100101111110100110001000010 e4baa8e284a1ec9cbf4ce4baa8e284a1ec9cbf4c42
UHC 亨℡윿L亨℡윿LB 111110101111101110100010111001011001111110110111010011001111101011111011101000101110010110011111101101110100110001000010 fafba2e59fb74cfafba2e59fb74c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)