To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊?????????????????艾 001111110011111100111111111000101000011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111110010010001000 3f3f3fe2863f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fe488
EUC-JP ???竊?????嫄????????靷??艾 00111111001111110011111111100011111001100011111100111111001111110011111100111111100011111011101010100001001111110011111100111111001111110011111100111111001111110011111110001111111001111011110100111111001111111110011111101000 3f3f3fe3e63f3f3f3f3f8fbaa13f3f3f3f3f3f3f3f8fe7bd3f3fe7e8
UTF-8 列룸씈竊붿콌列룸쑑嫄뽨쉴琉듦묘列룸씈靷욏씘艾 111011111010011010011100111010111010001110111000111011001001010010001000111001111010101110001010111010111011011010111111111011001011110110001100111011111010011010011100111010111010001110111000111011001001000110010001111001011010101110000100111010111011110110101000111011001000100110110100111011111010011110001100111010111001001110100110111010111010110010011000111011111010011010011100111010111010001110111000111011001001010010001000111010011001110110110111111011001001101010001111111011001001010010011000111010001000100110111110 efa69ceba3b8ec9488e7ab8aebb6bfecbd8cefa69ceba3b8ec9191e5ab84ebbda8ec89b4efa78ceb93a6ebac98efa69ceba3b8ec9488e99db7ec9a8fec9498e889be
UHC 列룸씈竊붿콌列룸쑑嫄뽨쉴琉듦묘列룸씈靷욏씘艾 1110011011101010101101111110101110011101101000001110111110111100100101001110110010110001100010001110011011101010101101111110101110011100101100001110101010110001100101101110010010111101101011111110101110100100101101011110101010111001101001101110011011101010101101111110101110011101101000001110110011100110100111101110110110011101101011011110010011110101 e6eab7eb9da0efbc94ecb188e6eab7eb9cb0eab196e4bdafeba4b5eab9a6e6eab7eb9da0ece69eed9dade4f5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)