To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 誤??諭??怨??嚥〓?萸??醫?????? 1000110011101011001111110011111110010111010000000011111100111111100010011000010100111111001111111001101010001011100000011010110000111111111001001100111000111111001111111110011111001110001111110011111100111111001111110011111100111111 8ceb3f3f97403f3f89853f3f9a8b81ac3fe4ce3f3fe7ce3f3f3f3f3f3f
EUC-JP 誤??諭??怨??嚥〓?萸??醫??孼??? 10111000111011010011111100111111110011011010000100111111001111111011000111100101001111110011111111010011111010111010001010101110001111111110100011010000001111110011111111101110110100000011111100111111100011111011101011000011001111110011111100111111 b8ed3f3fcda13f3fb1e53f3fd3eba2ae3fe8d03f3feed03f3f8fbac33f3f3f
UTF-8 誤곸룆諭뜻젔怨삘뵹嚥〓끉萸루럦醫롫닰孼뽏듬퓱 111010001010101010100100111010101011001110111000111010111010001110000110111010001010101110101101111010111001110010111011111011001010000010010100111001101000000010101000111011001000001010011000111010111011010110111001111001011001101010100101111000111000000010010011111010111000000110001001111010001001000010111000111010111010001110101000111010111001111110100110111010011000011010101011111010111010000110101011111010111000101110110000111001011010110110111100111010111011110110001111111010111001001110101100111011011001001110110001 e8aaa4eab3b8eba386e8abadeb9cbbeca094e680a8ec8298ebb5b9e59aa5e38093eb8189e890b8eba3a8eb9fa6e986abeba1abeb8bb0e5adbcebbd8feb93aced93b1
UHC 誤곸룆諭뜻젔怨삘뵹嚥〓끉萸루럦醫롫닰孼뽏듬퓱 1110100010100110100000011110110010001111100001011110101110110001101101101110011010100000100100101110101010110011101110111110001010010100101101111110011010111111101000011110101110000101101111001110101110101101101101111110011110001110100010011110110010100010100011101110101110001000101001101110010111101101100101101100111010110101111010111011111110010111 e8a681ec8f85ebb1b6e6a092eab3bbe294b7e6bfa1eb85bcebadb7e78e89eca28eeb88a6e5ed96ceb5ebbf97

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)