To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???違??宋??歪??而??柔??佯??? 00111111001111110011111110001000111000010011111100111111100100010111011000111111001111111001100001100011001111110011111110001110101001110011111100111111100011110101111100111111001111111001100011010001001111110011111100111111 3f3f3f88e13f3f91763f3f98633f3f8ea73f3f8f5f3f3f98d13f3f3f
EUC-JP ???違??宋??歪??而??柔??佯??彛 001111110011111100111111101100001110001100111111001111111100000111010111001111110011111111001111110001000011111100111111101111001010100100111111001111111011110111000000001111110011111111010000110100110011111100111111100011111011110011111010 3f3f3fb0e33f3fc1d73f3fcfc43f3fbca93f3fbdc03f3fd0d33f3f8fbcfa
UTF-8 囹덈씞違욥춳宋먮돎歪묅뫁而숂춯柔고맂佯얜툦彛 111011111010011010101001111010111000110110001000111011001001010010011110111010011000000110010101111011001001101010100101111011001011011010110011111001011010111010001011111010111010100010101110111010111000111110001110111001101010110110101010111010111010110010000101111010111010101110000001111010001000000010001100111011001000100010000010111011001011011010101111111001101001111110010100111010101011001110100000111010111010011110000010111001001011110110101111111011001001011010011100111011011000100010100110111001011011110110011011 efa6a9eb8d88ec949ee98195ec9aa5ecb6b3e5ae8beba8aeeb8f8ee6adaaebac85ebab81e8808cec8882ecb6afe69f94eab3a0eba782e4bdafec969ced88a6e5bd9b
UHC 囹덈씞違욥춳宋먮돎歪묅뫁而숂춯柔고맂佯얜툦彛 1110011110101010100010001110101110011101101100101110101011011110101111111110100110101101100011111110000111100100100100001110101110110101101110101110100011100000100100011110001010010001101001011110110010111011100110011110011110101101100011001110101011110101101100001110110110010000100111001110010110111010101111101110101110111000100111011110110010101101 e7aa88eb9db2eadebfe9ad8fe1e490ebb5bae8e091e291a5ecbb99e7ad8ceaf5b0ed909ce5babeebb89decad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)