To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ??虞???藕甲 0011111100111111100010111111000100111111001111110011111111100101010110001000110101100010 3f3f8bf13f3f3fe5588d62
EUC-JP 鋌?虞?鋌?藕甲 100011111110010010111011001111111011011011110011001111111000111111100100101110110011111111101001101110011011100111000011 8fe4bb3fb6f33f8fe4bb3fe9b9b9c3
UTF-8 鋌렫虞렕鋌렫藕甲 111010011000101110001100111010111010000010101011111010001001100110011110111010111010000010010101111010011000101110001100111010111010000010101011111010001001011110010101111001111001010010110010 e98b8ceba0abe8999eeba095e98b8ceba0abe89795e794b2
UHC 鋌렫虞렕鋌렫藕甲 11101111111110111000111010111001111010011110010110001110101010101110111111111011100011101011100111101001111001001100101110100011 effb8eb9e9e58eaaeffb8eb9e9e4cba3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)