To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??鍮?ぜ應??熬?????柔ル?沃??? 100010010101000100111111001111111110100001001010001111111000001010111010100111001110010000111111001111111110000010010010001111110011111100111111001111110011111110001111010111111000001110001011001111111001011110000000001111110011111100111111 89513f3fe84a3f82ba9ce43f3fe0923f3f3f3f3f8f5f838b3f97803f3f3f
EUC-JP 渦??鍮?ぜ應??熬??嫄??柔ル?沃??彛 10110001101100100011111100111111111011111010101100111111101001001011110011011000111001100011111100111111110111111111001000111111001111111000111110111010101000010011111100111111101111011100000010100101111010110011111111001101111000000011111100111111100011111011110011111010 b1b23f3fefab3fa4bcd8e63f3fdff23f3f8fbaa13f3fbdc0a5eb3fcde03f3f8fbcfa
UTF-8 渦기뫁鍮뽬ぜ應뀁쭇熬곥룗嫄밭춯柔ル굜沃쇰냲彛 111001101011100010100110111010101011100010110000111010111010101110000001111010011000110110101110111010111011110110101100111000111000000110011100111001101000011110001001111010111000000010000001111011001010110110000111111001111000011010101100111010101011001110100101111010111010001110010111111001011010101110000100111010111011000010101101111011001011011010101111111001101001111110010100111000111000001110101011111010101011010110011100111001101011001010000011111011001000011110110000111010111000001110110010111001011011110110011011 e6b8a6eab8b0ebab81e98daeebbdace3819ce68789eb8081ecad87e786aceab3a5eba397e5ab84ebb0adecb6afe69f94e383abeab59ce6b283ec87b0eb83b2e5bd9b
UHC 渦기뫁鍮뽬ぜ應뀁쭇熬곥룗嫄밭춯柔ル굜沃쇰냲彛 1110100010111110101100011110001010010001101001011110101110111001100101101110100010101010101111001110101111101011101100101110110010100111100000111110100010100010100000011110001110001111100100111110101010110001101110011110011110101101100011001110101011110101101010111110101110000010100001001110100010101010101111001110101110000110100000101110110010101101 e8beb1e291a5ebb996e8aabcebebb2eca783e8a281e38f93eab1b9e7ad8ceaf5abeb8284e8aabceb8682ecad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)