What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???意??幽??繹??違??恂る?筌?? 00111111001111110011111110001000110100110011111100111111100101110100100000111111001111111110001110001000001111110011111110001000111000010011111100111111100111001001011010000010111010010011111111100010101000110011111100111111 3f3f3f88d33f3f97483f3fe3883f3f88e13f3f9c9682e93fe2a33f3f
EUC-JP ???意??幽??繹??違??恂る?筌?? 00111111001111110011111110110000110101010011111100111111110011011010100100111111001111111110010111101000001111110011111110110000111000110011111100111111110101111111011010100100111010110011111111100100101001010011111100111111 3f3f3fb0d53f3fcda93f3fe5e83f3fb0e33f3fd7f6a4eb3fe4a53f3f
UTF-8 嶺뚮슢意덁펺幽곹맩繹먮겧違듿톹恂る궞筌됱톯 111011111010011010101011111010111001101010101110111011001000101010100010111001101000010010001111111010111000110110000001111011011000111010111010111001011011100110111101111010101011001110111001111010111010011110101001111001111011100110111001111010111010100010101110111010101011001010100111111010011000000110010101111010111001001110111111111011011000011010111001111001101000000110000010111000111000001010001011111010101011011010011110111001111010110110001100111010111001000010110001111011011000011010101111 efa6abeb9aaeec8aa2e6848feb8d81ed8ebae5b9bdeab3b9eba7a9e7b9b9eba8aeeab2a7e98195eb93bfed86b9e68182e3828beab69ee7ad8ceb90b1ed86af
UHC 嶺뚮슢意덁펺幽곹맩繹먮겧違듿톹恂る궞筌됱톯 111001111010110110001100111010111001101010101110111010111111001010001000111001001011110010001010111010101110101110000001111011011001000010110001111001101011101010010000111010111000000110111001111010101101111010001010111001011011011110001101111000101110000110101010111010111000001010110001111011111010011110001001111011001011011110000111 e7ad8ceb9aaeebf288e4bc8aeaeb81ed90b1e6ba90eb81b9eade8ae5b78de2e1aaeb82b1efa789ecb787

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
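
The conversion in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and so is the choice to substitute "?" (0x3f) for characters a charset cannot represent. The names CODECS, to_bit_strings, and convert are hypothetical.

```python
# Charset label -> Python codec name. This mapping is an assumption,
# not something the page itself documents.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with "?" (0x3f),
    which is assumed to match the question marks in the table.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("意").items():
        print(label, binary, hexadecimal)
```

For the single character 意, this sketch gives e6848f in UTF-8, 88d3 in cp932, and b0d5 in EUC-JP, which are byte sequences that also appear in the corresponding rows of the table.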