What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 午ユ?午ユ?B 1000110011011111100000111000011000111111100011001101111110000011100001100011111101000010 8cdf83863f8cdf83863f42
EUC-JP 午ユ?午ユ?B 1011100011100001101001011110011000111111101110001110000110100101111001100011111101000010 b8e1a5e63fb8e1a5e63f42
UTF-8 午ユ뼬午ユ뼬B 11100101100011011000100011100011100000111010011011101011101111001010110011100101100011011000100011100011100000111010011011101011101111001010110001000010 e58d88e383a6ebbcace58d88e383a6ebbcac42
UHC 午ユ뼬午ユ뼬B 11100111111011011010101111100110100101101010111111100111111011011010101111100110100101101010111101000010 e7edabe696afe7edabe696af42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
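
The table above can be approximated offline with a short script. This is a minimal sketch, not the converter's own implementation: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, the helper name encode_row is hypothetical, and the sample string is inferred from the UTF-8 row. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the "?" entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with codec and return (raw bytes, bit string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return data, bits, data.hex()


if __name__ == "__main__":
    sample = "午ユ뼬午ユ뼬B"  # input inferred from the UTF-8 row of the table
    for label, codec in CODECS.items():
        data, bits, hex_string = encode_row(sample, codec)
        # Decode again to show what survives the round trip in this charset.
        shown = data.decode(codec, errors="replace")
        print(label, shown, bits, hex_string)
```

Each printed line follows the same column order as the table: charset, character as it survives in that charset, binary bit string, hexadecimal bit string.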