To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄??韋?ぜ乙m?域??宥??柔ル?沃??愉 100101101110111100111111001111111110100011101000001111111000001010111010100010011011001110000010100011010011111110001000111001100011111100111111100101110100011100111111001111111000111101011111100000111000101100111111100101111000000000111111001111111001011011111001 96ef3f3fe8e83f82ba89b3828d3f88e63f3f97473f3f8f5f838b3f97803f3f96f9
EUC-JP 厄??韋?ぜ乙m?域??宥??柔ル?沃??愉 110011001111000100111111001111111111000011101010001111111010010010111100101100101011010110100011111011010011111110110000111010000011111100111111110011011010100000111111001111111011110111000000101001011110101100111111110011011110000000111111001111111100110011111011 ccf13f3ff0ea3fa4bcb2b5a3ed3fb0e83f3fcda83f3fbdc0a5eb3fcde03f3fccfb
UTF-8 厄닿낮韋귟ぜ乙m넿域㏃뼦宥묊춯柔ル쿋沃쇈룤愉 111001011000111010000100111010111000101110111111111010111000001010101110111010011001111110001011111010101011011110011111111000111000000110011100111001001011100110011001111011111011110110001101111010111000010010111111111001011001111110011111111000111000111110000011111010111011110010100110111001011010111010100101111010111010110010001010111011001011011010101111111001101001111110010100111000111000001110101011111011001011111110001011111001101011001010000011111011001000011110001000111010111010001110100100111001101000010010001001 e58e84eb8bbfeb82aee99f8beab79fe3819ce4b999efbd8deb84bfe59f9fe38f83ebbca6e5aea5ebac8aecb6afe69f94e383abecbf8be6b283ec8788eba3a4e68489
UHC 厄닿낮韋귟ぜ乙m넿域㏃뼦宥묊춯柔ル쿋沃쇈룤愉 1110010011111000101101001110101010110011101101111110101011011111100000101110100010101010101111001110101111100000101000111110110110000110101110011110011010110100101001111110110010010110101010011110101011101001100100011110011110101101100011001110101011110101101010111110101110110010101000001110100010101010101111001110001110001111100111011110101011110000 e4f8b4eab3b7eadf82e8aabcebe0a3ed86b9e6b4a7ec96a9eae991e7ad8ceaf5abebb2a0e8aabce38f9deaf0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)