To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 驛??悠?驛??悠?B 111010011000001100111111001111111001011101001001001111111110100110000011001111110011111110010111010010010011111101000010 e9833f3f97493fe9833f3f97493f42
EUC-JP 驛??悠?驛??悠?B 111100011110001100111111001111111100110110101010001111111111000111100011001111110011111111001101101010100011111101000010 f1e33f3fcdaa3ff1e33f3fcdaa3f42
UTF-8 驛노쵇悠퀆驛노쵇悠퀆B 11101001101010011001101111101011100001011011100011101100101101011000011111100110100000101010000011101101100000001000011011101001101010011001101111101011100001011011100011101100101101011000011111100110100000101010000011101101100000001000011001000010 e9a99beb85b8ecb587e682a0ed8086e9a99beb85b8ecb587e682a0ed808642
UHC 驛노쵇悠퀆驛노쵇悠퀆B 111001101011111010110011111010111010110010001001111010101110110110110011011101101110011010111110101100111110101110101100100010011110101011101101101100110111011001000010 e6beb3ebac89eaedb376e6beb3ebac89eaedb37642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)