To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 襠セ蜀撰ス・譁懶セ帶收蒿撰ス・襁懶セ竸 11100101111110111011111011100101100001101001000011101111101111011010010111100110100101101001110011101111101111101001101111100110100111011011111011100100111001001001000011101111101111011010010111100101111101001001110011101111101111101001100101011110 e5fbbee58690efbda5e6969cefbe9be69dbee4e490efbda5e5f49cefbe995e
EUC-JP 襠セ蜀撰ス・譁懶セ帶收蒿撰ス・襁懶セ竸 1110101011111101100011101011111011101001111001101100000011110001100011101011110110001110101001011110101111110110110110001111000110001110101111101101011011101000110110101100000011101000111001101100000011110001100011101011110110001110101001011110101011110110110110001111000110001110101111101101000110111111 eafd8ebee9e6c0f18ebd8ea5ebf6d8f18ebed6e8dac0e8e6c0f18ebd8ea5eaf6d8f18ebed1bf
UTF-8 襠セ蜀撰ス・譁懶セ帶收蒿撰ス・襁懶セ竸 111010001010010110100000111011111011110110111110111010001001110010000000111001101001001010110000111011111011110110111101111011111011110110100101111010001010110110000001111001101000011110110110111011111011110110111110111001011011100010110110111001101001010010110110111010001001001010111111111001101001001010110000111011111011110110111101111011111011110110100101111010001010010110000001111001101000011110110110111011111011110110111110111001111010101110111000 e8a5a0efbdbee89c80e692b0efbdbdefbda5e8ad81e687b6efbdbee5b8b6e694b6e892bfe692b0efbdbdefbda5e8a581e687b6efbdbee7abb8
UHC ??蜀撰??譁懶?帶收蒿撰??襁懶?? 0011111100111111111101011011100111110011101111000011111100111111111111001010011011010100111110110011111111010011111000011110001010100101111110111101101011110011101111000011111100111111110010111011101011010100111110110011111100111111 3f3ff5b9f3bc3f3ffca6d4fb3fd3e1e2a5fbdaf3bc3f3fcbbad4fb3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)