To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 撓?????五?? 1001110110011010001111110011111100111111001111110011111110001100110111000011111100111111 9d9a3f3f3f3f3f8cdc3f3f
EUC-JP 撓??縯??五?? 11011001111110100011111100111111100011111101010011001011001111110011111110111000110111100011111100111111 d9fa3f3f8fd4cb3f3fb8de3f3f
UTF-8 撓붹볜縯롨옾五볞벵 111001101001001010010011111010111011011010111001111010111011001110011100111001111011100010101111111010111010000110101000111011001001100010111110111001001011101010010100111010111011001110011110111010111011001010110101 e69293ebb6b9ebb39ce7b8afeba1a8ec98bee4ba94ebb39eebb2b5
UHC 撓붹볜縯롨옾五볞벵 111010001111010110010100111001101011101010110111111001101110000010001110111010001001111010110011111001111110100110010011111001001011101010101100 e8f594e6bab7e6e08ee89eb3e7e993e4baac

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
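
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper names `encode_row` and `encode_table` are hypothetical, and the codec names are Python's nearest equivalents (`cp932` for SJIS-WIN, `cp949` for UHC), which may differ from this page's converter for some characters. It assumes that a character a charset cannot represent is replaced by `?` (byte 3f), which is consistent with the runs of 3f in the rows above.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Python lists "uhc" as an alias of cp949
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("五").items():
        print(label, bits, hexa)
```

For the single character 五, this yields e4ba94 in UTF-8, 8cdc in SJIS-WIN, and b8de in EUC-JP, the same byte sequences that appear for that character in the rows above.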