What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 臧?????移縫??臧?????移縫??^ 111001000110100000111111001111110011111100111111001111111000100011011010100101100100010000111111001111111110010001101000001111110011111100111111001111110011111110001000110110101001011001000100001111110011111101011110 e4683f3f3f3f3f88da96443f3fe4683f3f3f3f3f88da96443f3f5e
EUC-JP 臧?????移縫??臧?????移縫??^ 111001111100100100111111001111110011111100111111001111111011000011011100110010111010010100111111001111111110011111001001001111110011111100111111001111110011111110110000110111001100101110100101001111110011111101011110 e7c93f3f3f3f3fb0dccba53f3fe7c93f3f3f3f3fb0dccba53f3f5e
UTF-8 臧얩렗몇렍렠移縫렍춈臧얩렗몇렍렠移縫렍쵱^ 11101000100001111010011111101100100101101010100111101011101000001001011111101011101010101000011111101011101000001000110111101011101000001010000011100111101001111011101111100111101110001010101111101011101000001000110111101100101101101000100011101000100001111010011111101100100101101010100111101011101000001001011111101011101010101000011111101011101000001000110111101011101000001010000011100111101001111011101111100111101110001010101111101011101000001000110111101100101101011011000101011110 e887a7ec96a9eba097ebaa87eba08deba0a0e7a7bbe7b8abeba08decb688e887a7ec96a9eba097ebaa87eba08deba0a0e7a7bbe7b8abeba08decb5b15e
UHC 臧얩렗몇렍렠移縫렍춈臧얩렗몇렍렠移縫렍쵱^ 1110110111110101101111101110110110001110101011001011100011101110100011101010001110001110101100011110110010111001110111001110111010001110101000111100001111011110111011011111010110111110111011011000111010101100101110001110111010001110101000111000111010110001111011001011100111011100111011101000111010100011110000111101110001011110 edf5beed8eacb8ee8ea38eb1ecb9dcee8ea3c3deedf5beed8eacb8ee8ea38eb1ecb9dcee8ea3c3dc5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
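
The conversion in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `encode_table` are assumptions chosen for illustration, and Python's codecs may differ from the page's in a few edge-case mappings. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what produces the `3f` bytes in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These are assumptions; the page's actual conversion library is unknown.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (byte 0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ^").items():
        print(label, bits, hex_string)
```

Note that the trailing `^` encodes to the single byte `5e` in every charset, since all five are ASCII-compatible for that character; this matches the final byte of each row in the table.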