To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????釉??B 0011111100111111001111110011111100111111001111111110011111010110001111110011111101000010 3f3f3f3f3f3fe7d63f3f42
EUC-JP 獒?????釉??B 10001111110010111011101100111111001111110011111100111111001111111110111011011000001111110011111101000010 8fcbbb3f3f3f3f3feed83f3f42
UTF-8 獒앸툨略김짔釉잌껑B 11100111100011011001001011101100100101011011100011101101100010001010100011101111101001011011011011101010101110011000000011101100101001111001010011101001100001111000100111101100100111101000110011101010101110111001000101000010 e78d92ec95b8ed88a8efa5b6eab980eca794e98789ec9e8ceabb9142
UHC 獒앸툨略김짔釉잌껑B 11101000101000111001110111101011101110001001111111100101101100101011000111101000101000111001110111101011101110001001111111100101101100101011000101000010 e8a39debb89fe5b2b1e8a39debb89fe5b2b142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)