Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 雲????酊?盜檍◇雲????酊?盜檍●^ 10001001010111110011111100111111001111110011111111100111110000100011111110011111010110001001111011111000100000011001111010001001010111110011111100111111001111110011111111100111110000100011111110011111010110001001111011111000100000011001110001011110 895f3f3f3f3fe7c23f9f589ef8819e895f3f3f3f3fe7c23f9f589ef8819c5e
EUC-JP 雲????酊?盜檍◇雲????酊?盜檍●^ 10110001110000000011111100111111001111110011111111101110110001000011111111011101101110011101110011111010101000011111111010110001110000000011111100111111001111110011111111101110110001000011111111011101101110011101110011111010101000011111110001011110 b1c03f3f3f3feec43fddb9dcfaa1feb1c03f3f3f3feec43fddb9dcfaa1fc5e
UTF-8 雲띨렋찌깻酊ㅽ盜檍◇雲띨렋찌깻酊ㅽ盜檍●^ 11101001100110111011001011101011100111011010100011101011101000001000101111101100101100001000110011101010101110011011101111101001100001011000101011100011100001011011110111100111100110111001110011100110101010101000110111100010100101111000011111101001100110111011001011101011100111011010100011101011101000001000101111101100101100001000110011101010101110011011101111101001100001011000101011100011100001011011110111100111100110111001110011100110101010101000110111100010100101111000111101011110 e99bb2eb9da8eba08becb08ceab9bbe9858ae385bde79b9ce6aa8de29787e99bb2eb9da8eba08becb08ceab9bbe9858ae385bde79b9ce6aa8de2978f5e
UHC 雲띨렋찌깻酊ㅽ盜檍◇雲띨렋찌깻酊ㅽ盜檍●^ 1110101010100011101101101110111010001110101000101100001011101110101100101010001011101111111110001010010011101101110101001010100011100101111001011010000111011110111010101010001110110110111011101000111010100010110000101110111010110010101000101110111111111000101001001110110111010100101010001110010111100101101000011101110001011110 eaa3b6ee8ea2c2eeb2a2eff8a4edd4a8e5e5a1deeaa3b6ee8ea2c2eeb2a2eff8a4edd4a8e5e5a1dc5e

SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
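
A table like the one above can be approximated offline with a short Python sketch. This is not the page's own implementation. The `CODECS` mapping and the `encode_table` helper are hypothetical names introduced here, and the choice of Python codec for each charset label (for example `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` runs in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, displayed text, binary string, hex string) rows."""
    rows = []
    for label, codec in CODECS.items():
        # Unencodable characters become b"?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Each byte is written as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decoding again shows what actually survived the round trip.
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexa in encode_table("雲^"):
        print(label, shown, bits, hexa)
```

For the input `雲^`, the UTF-8 row comes out as `e99bb25e`, and the ISO-8859-1 row as `3f5e`, because `雲` has no Latin-1 representation.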