To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 甯治崧爾?貳鮓 11111010101010001000111010100001111110101010111110001110101000100011111111100110110001101110100110110110 faa88ea1faaf8ea23fe6c6e9b6
EUC-JP 甯治崧爾嵪貳鮓 1000111111001101101010101011110010100011100011111011101111001010101111001010010010001111101110111101110111101100110010001111001010111000 8fcdaabca38fbbcabca48fbbddecc8f2b8
UTF-8 甯治崧爾嵪貳鮓 111001111001010010101111111001101011001010111011111001011011010010100111111001111000100010111110111001011011010110101010111010001011001010110011111010011010111010010011 e794afe6b2bbe5b4a7e788bee5b5aae8b2b3e9ae93
UHC ?治崧爾?貳? 0011111111110110101111011110001011111110111011001011001100111111111011001100001100111111 3ff6bde2feecb33fecc33f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)