What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥?????儀??馭??宥??節??鈺??循 1001101010001011001111110011111100111111001111110011111110001011010101100011111100111111111010010110011000111111001111111001011101000111001111110011111110010000110111110011111100111111111110111100010000111111001111111000111101111010 9a8b3f3f3f3f3f8b563f3fe9663f3f97473f3f90df3f3ffbc43f3f8f7a
EUC-JP 嚥??瑗??儀??馭??宥??節??鈺??循 1101001111101011001111110011111110001111110011001100000000111111001111111011010110110111001111110011111111110001110001110011111100111111110011011010100000111111001111111100000011100001001111110011111110001111111000111101010100111111001111111011110111011011 d3eb3f3f8fccc03f3fb5b73f3ff1c73f3fcda83f3fc0e13f3f8fe3d53f3fbddb
UTF-8 嚥싳쉸瑗룡끽儀묆걶馭귙룊宥욆돳節뗭뒩鈺곕끇循 111001011001101010100101111011001000101110110011111011001000100110111000111001111001000110010111111010111010001110100001111010111000000110111101111001011000010010000000111010111010110010000110111010101011000110110110111010011010011010101101111010101011011110011001111010111010001110001010111001011010111010100101111011001001101010000110111010111000111110110011111001111010111110000000111010111001011110101101111010111001001010101001111010011000100010111010111010101011001110010101111010111000000110000111111001011011111010101010 e59aa5ec8bb3ec89b8e79197eba3a1eb81bde58480ebac86eab1b6e9a6adeab799eba38ae5aea5ec9a86eb8fb3e7af80eb97adeb92a9e988baeab395eb8187e5beaa
UHC 嚥싳쉸瑗룡끽儀묆걶馭귙룊宥욆돳節뗭뒩鈺곕끇循 1110011010111111100110101110110010011010100011101110101010111100101101111110011010110011101000111110101111110000100100011110001110000001100111001110010111011111100000101110001110001111100010011110101011101001100111101110100010001001101101101110111110111101100010111110110010001010101000111110100010101101101100001110101110000101101110111110001011100000 e6bf9aec9a8eeabcb7e6b3a3ebf091e3819ce5df82e38f89eae99ee889b6efbd8bec8aa3e8adb0eb85bbe2e0

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
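
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The helper name `bit_strings` is hypothetical, and the mapping from the table's labels to Python codec names (ISO-8859-1 as `latin-1`, SJIS-WIN as `cp932`, UHC as `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the table above, although the tool's exact replacement rules may differ.

```python
def bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes '?' (0x3f) for characters the charset cannot represent
    raw = text.encode(charset, errors="replace")
    # Render every byte as exactly eight binary digits, most significant bit first
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # Assumed Python codec names for the charsets listed in the table
    for charset in ("latin-1", "cp932", "euc_jp", "utf-8", "cp949"):
        binary, hexadecimal = bit_strings("あ", charset)
        print(charset, binary, hexadecimal)
```

For example, the hiragana "あ" encodes to the bytes `82a0` in CP932, `a4a2` in EUC-JP, and `e38182` in UTF-8, while ISO-8859-1 cannot represent it and yields the replacement byte `3f`.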