What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 賊?∃??賊?∃??B 100100011010111100111111100000011100111000111111001111111001000110101111001111111000000111001110001111110011111101000010 91af3f81ce3f3f91af3f81ce3f3f42
EUC-JP 賊?∃??賊?∃??B 110000101011000100111111101000101101000000111111001111111100001010110001001111111010001011010000001111110011111101000010 c2b13fa2d03f3fc2b13fa2d03f3f42
UTF-8 賊꿱∃롊렚賊꿱∃롊렚B 11101000101100111000101011101010101111111011000111100010100010001000001111101011101000011000101011101011101000001001101011101000101100111000101011101010101111111011000111100010100010001000001111101011101000011000101011101011101000001001101001000010 e8b38aeabfb1e28883eba18aeba09ae8b38aeabfb1e28883eba18aeba09a42
UHC 賊꿱∃롊렚賊꿱∃롊렚B 111011101110010010110010111010001010001010100100100011101101000010001110101011011110111011100100101100101110100010100010101001001000111011010000100011101010110101000010 eee4b2e8a2a48ed08eadeee4b2e8a2a48ed08ead42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
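
A table like the one above can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. It assumes that Python's standard codecs `cp932`, `euc_jp`, and `cp949` correspond to the SJIS-WIN, EUC-JP, and UHC labels, and that characters a charset cannot represent are replaced with `?`. The names `CODECS`, `encode_row`, and `convert` are hypothetical.

```python
# Table labels mapped to Python codec names.
# Assumption: cp932 ~ SJIS-WIN, euc_jp ~ EUC-JP, cp949 ~ UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent become "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode the same text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hex_string) in convert("賊B").items():
        print(label, shown, bits, hex_string)
```

With `errors="replace"`, an unencodable character is emitted as the single byte `3f`, which is consistent with the runs of `3f` in the ISO-8859-1, SJIS-WIN, and EUC-JP rows above. Whether the converter applies exactly the same replacement rule is an assumption.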