To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 梧??搖?ⅱ褥 | 1000110011100110001111110011111110011101100010100011111111111010010000011110010111110001 | 8ce63f3f9d8a3ffa41e5f1
EUC-JP | 梧??搖??褥 | 10111000111010000011111100111111110110011110101000111111001111111110101011110011 | b8e83f3fd9ea3f3feaf3
UTF-8 | 梧귨쉠搖얏ⅱ褥 | 111001101010001010100111111010101011011110101000111011001000100110100000111001101001000010010110111011001001011010001111111000101000010110110001111010001010010010100101 | e6a2a7eab7a8ec89a0e69096ec968fe285b1e8a4a5
UHC | 梧귨쉠搖얏ⅱ褥 | 1110011111111100100000101110111110111101101010101110100011110100101111101110011010100101101000101110100110110011 | e7fc82efbdaae8f4bee6a5a2e9b3

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
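
A conversion like the one shown in the table can be sketched in Python. This is a minimal illustration, not the code behind this page: the names CHARSETS, encode_row and convert are hypothetical, and the mapping from the charset labels to Python codec names (in particular cp932 for SJIS-WIN and cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_digits) in convert("梧").items():
        print(name, bits, hex_digits)
```

For example, encode_row("梧", "utf-8") yields the hexadecimal string e6a2a7, the same three bytes that begin the UTF-8 row above.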