To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??怨?? 0011111100111111001111111110001010000110001111110011111110001001100001010011111100111111 3f3f3fe2863f3f89853f3f
EUC-JP ???竊??怨?? 0011111100111111001111111110001111100110001111110011111110110001111001010011111100111111 3f3f3fe3e63f3fb1e53f3f
UTF-8 嶺뚭낮竊뚥빳怨⑹꽑 111011111010011010101011111010111001101010101101111010111000001010101110111001111010101110001010111010111001101010100101111010111011100110110011111001101000000010101000111000101001000110111001111010101011110110010001 efa6abeb9aadeb82aee7ab8aeb9aa5ebb9b3e680a8e291b9eabd91
UHC 嶺뚭낮竊뚥빳怨⑹꽑 111001111010110110001100111010101011001110110111111011111011110010001100111001001011101110100101111010101011001110101001111011001000010010100000 e7ad8ceab3b7efbc8ce4bba5eab3a9ec84a0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)