To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????儒?????肉ε?揄?????援 001111110011111100111111001111110011111100111111100011101111001000111111001111110011111100111111001111111001001111110111100000111100001100111111100111011000100100111111001111110011111100111111001111111000100110000111 3f3f3f3f3f3f8ef23f3f3f3f3f93f783c33f9d893f3f3f3f3f8987
EUC-JP ???嫄??儒?????肉ε?揄?????援 0011111100111111001111111000111110111010101000010011111100111111101111001111010000111111001111110011111100111111001111111100011011111001101001101100010100111111110110011110100100111111001111110011111100111111001111111011000111100111 3f3f3f8fbaa13f3fbcf43f3f3f3f3fc6f9a6c53fd9e93f3f3f3f3fb1e7
UTF-8 列룸뱪嫄뽫빊儒덇콢列룔깺肉ε푻揄쇰떭列룸뱢援 1110111110100110100111001110101110100011101110001110101110110001101010101110010110101011100001001110101110111101101010111110101110111001100010101110010110000100100100101110101110001101100001111110110010111101101000101110111110100110100111001110101110100011100101001110101010111001101110101110100010000010100010011100111010110101111011011001000110111011111001101000111110000100111011001000011110110000111010111001011010101101111011111010011010011100111010111010001110111000111010111011000110100010111001101000111110110100 efa69ceba3b8ebb1aae5ab84ebbdabebb98ae58492eb8d87ecbda2efa69ceba394eab9bae88289ceb5ed91bbe68f84ec87b0eb96adefa69ceba3b8ebb1a2e68fb4
UHC 列룸뱪嫄뽫빊儒덇콢列룔깺肉ε푻揄쇰떭列룸뱢援 1110011011101010101101111110101110010011100100001110101010110001100101101110011110010101101100001110101011100011100010001110101010110001100110101110011011101010101101111110001110000011101001101110101110111111101001011110010110111110100001111110101011110001101111001110101110001011101111011110011011101010101101111110101110010011100010001110101010110101 e6eab7eb9390eab196e795b0eae388eab19ae6eab7e383a6ebbfa5e5be87eaf1bceb8bbde6eab7eb9388eab5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)