Which bit string does each character set encode a character (or characters) to?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????里??釜 0011111100111111001111110011111100111111100101111010001000111111001111111000101010011000 3f3f3f3f3f97a23f3f8a98
EUC-JP ?????里??釜 0011111100111111001111110011111100111111110011101010010000111111001111111011001111111000 3f3f3f3f3fcea43f3fb3f8
UTF-8 앍렢션솬뤓里낵퓬釜 111011001001010110001101111010111010000010100010111011001000010110011000111011001000011010101100111010111010010010010011111010011000011110001100111010111000001010110101111011011001001110101100111010011000011110011100 ec958deba0a2ec8598ec86aceba493e9878ceb82b5ed93ace9879c
UHC 앍렢션솬뤓里낵퓬釜 101111101100110010001110101100111011110011000111101111001101111110001111110000111101011111101100101100111011110011000111101111001101110110111100 becc8eb3bcc7bcdf8fc3d7ecb3bcc7bcddbc

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hex 3f).
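
The table rows above can be reproduced approximately with a short Python sketch. This is not the converter's actual implementation, which is not shown on this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond to the charsets listed, and the helper name `encode_row` is hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one codec."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec)  # what survives the round trip
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        shown, bits, hexed = encode_row("里釜", codec)
        print(label, shown, bits, hexed)
```

Using `errors="replace"` mirrors the "?" entries in the table: a character outside the charset's repertoire is encoded as the single byte 3f instead of raising an error.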