To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 撓???繹???如 100111011001101000111111001111110011111111100011100010000011111100111111001111111001010001000000 9d9a3f3f3fe3883f3f3f9440
EUC-JP 撓???繹???如 110110011111101000111111001111110011111111100101111010000011111100111111001111111100011110100001 d9fa3f3f3fe5e83f3f3fc7a1
UTF-8 撓눸쇨텥繹☏룸쾷如 111001101001001010010011111010111000100010111000111011001000011110101000111011011000010110100101111001111011100110111001111000101001100010001111111010111010001110111000111011001011111010110111111001011010011010000010 e69293eb88b8ec87a8ed85a5e7b9b9e2988feba3b8ecbeb7e5a682
UHC 撓눸쇨텥繹☏룸쾷如 111010001111010110000111110011101011110011101010101101101001101011100110101110101010001011001110101101111110101110110010100011011110010111111101 e8f587cebceab69ae6baa2ceb7ebb28de5fd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)