To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 潁〓???????雅???????????? 10011111111100011000000110101100001111110011111100111111001111110011111100111111001111111000100111101011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 9ff181ac3f3f3f3f3f3f3f89eb3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP 潁〓???????雅???????????? 11011110111100111010001010101110001111110011111100111111001111110011111100111111001111111011001011101101001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 def3a2ae3f3f3f3f3f3f3fb2ed3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 潁〓젙遼김콊溜싲젚雅먮젙料ㅺ퀎隸욂틦溜뺞뿭溜 111001101011110110000001111000111000000010010011111011001010000010011001111011111010011110000011111010101011100110000000111011001011110110001010111011111010011110001011111011001000101110110010111011001010000010011010111010011001101110000101111010111010100010101110111011001010000010011001111011111010011010111110111000111000010110111010111011011000000010001110111011111010011010111000111011001001101010000010111011011000101110100110111011111010011110001011111010111011101010011110111010111011111110101101111011111010011110001011 e6bd81e38093eca099efa783eab980ecbd8aefa78bec8bb2eca09ae99b85eba8aeeca099efa6bee385baed808eefa6b8ec9a82ed8ba6efa78bebba9eebbfadefa78b
UHC 潁〓젙遼김콊溜싲젚雅먮젙料ㅺ퀎隸욂틦溜뺞뿭溜 1110011110111000101000011110101110100000100101011110100110101100101100011110100010110001100001101110101011111110100110101110101110100000100101101110010010111010100100001110101110100000100101011110100011110111101001001110101010110011100001001110011111100110100111101110010010111010100100001110101011111110100101011110011010010111101011011110101011111110 e7b8a1eba095e9acb1e8b186eafe9aeba096e4ba90eba095e8f7a4eab384e7e69ee4ba90eafe95e697adeafe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)