To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??泣??惟??癲??違??湲??豫 1110001110100000001111110011111110001011100000110011111100111111100010001101001000111111001111111110000110011111001111110011111110001000111000010011111100111111100111111101000100111111001111111001100010101100 e3a03f3f8b833f3f88d23f3fe19f3f3f88e13f3f9fd13f3f98ac
EUC-JP 罌??泣??惟??癲??違??湲??豫 1110011010100010001111110011111110110101111000110011111100111111101100001101010000111111001111111110001010100001001111110011111110110000111000110011111100111111110111101101001100111111001111111101000010101110 e6a23f3fb5e33f3fb0d43f3fe2a13f3fb0e33f3fded33f3fd0ae
UTF-8 罌삘댙泣ㅵ컜惟듭뒳癲욍꺇違욕럳湲브섭豫 111001111011110110001100111011001000001010011000111010111000110010011001111001101011001110100011111000111000010110110101111011001011101110011100111001101000001110011111111010111001001110101101111010111001001010110011111001111001100110110010111011001001101010001101111010101011101010000111111010011000000110010101111011001001101010010101111010111001111110110011111001101011100110110010111010111011100010001100111011001000010010101101111010001011000110101011 e7bd8cec8298eb8c99e6b3a3e385b5ecbb9ce6839feb93adeb92b3e799b2ec9a8deaba87e98195ec9a95eb9fb3e6b9b2ebb88cec84ade8b1ab
UHC 罌삘댙泣ㅵ컜惟듭뒳癲욍꺇違욕럳湲브섭豫 1110010110100010101110111110001010001000101111011110101111101000101001001110010110110000100001111110101011101110101101011110110010001010101011001110111110100110101111111110001110000011101011101110101011011110101111111110010110001110100100111110101010111000101110101110101010111100101101111110011111100011 e5a2bbe288bdebe8a4e5b087eaeeb5ec8aacefa6bfe383aeeadebfe58e93eab8baeabcb7e7e3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
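
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation. It assumes the charset labels map to Python's `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs, and that characters a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the runs of `3f` in the rows above. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("あ").items():
        print(label, bits, hexa)
```

For example, `encode_row("あ", "iso-8859-1")` yields the hex string `3f`, because ISO-8859-1 has no code point for that character, while `encode_row("あ", "utf-8")` yields `e38182`.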