To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??諭????き???宜??恂ル?娃 1110000110011111001111110011111110010111010000000011111100111111001111110011111110000010101010110011111100111111001111111000101101011000001111110011111110011100100101101000001110001011001111111000100010100001 e19f3f3f97403f3f3f3f82ab3f3f3f8b583f3f9c96838b3f88a1
EUC-JP 癲??諭??洹?き???宜??恂ル?娃 11100010101000010011111100111111110011011010000100111111001111111000111111000111101110100011111110100100101011010011111100111111001111111011010110111001001111110011111111010111111101101010010111101011001111111011000010100011 e2a13f3fcda13f3f8fc7ba3fa4ad3f3f3fb5b93f3fd7f6a5eb3fb0a3
UTF-8 癲섍퉭諭㏘풌洹욌き廬믩챷宜ㅿ쫳恂ル쭕娃 111001111001100110110010111011001000010010001101111011011000100110101101111010001010101110101101111000111000111110011000111011011001001010001100111001101011010010111001111011001001101010001100111000111000000110001101111011111010011010000010111010111010111110101001111011001011000110110111111001011010111010011100111000111000010110111111111011001010101110110011111001101000000110000010111000111000001110101011111011001010110110010101111001011010100010000011 e799b2ec848ded89ade8abade38f98ed928ce6b4b9ec9a8ce3818defa682ebafa9ecb1b7e5ae9ce385bfecabb3e68182e383abecad95e5a883
UHC 癲섍퉭諭㏘풌洹욌き廬믩챷宜ㅿ쫳恂ル쭕娃 1110111110100110100110001110101010111001100001011110101110110001101000101110010010111110100100011110101010110111100111101110101110101010101011011110010111111110100100101110101110101010100001001110101111110001101001001110111110100110100010111110001011100001101010111110101110100111100011011110100011011111 efa698eab985ebb1a2e4be91eab79eebaaade5fe92ebaa84ebf1a4efa68be2e1abeba78de8df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)