To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 陞ッ繧崎у蟲オ蟄コ陞ッ繧崎у蟲オ蟄ク^ 11101000100111101010111111100011100000101000110111101000100001001000010111100101101100111011010111100101101011011011101011101000100111101010111111100011100000101000110111101000100001001000010111100101101100111011010111100101101011011011100001011110 e89eafe3828de88485e5b3b5e5adbae89eafe3828de88485e5b3b5e5adb85e
EUC-JP 陞ッ繧崎у蟲オ蟄コ陞ッ繧崎у蟲オ蟄ク^ 11101111111111101000111010101111111001011110001010111010111010101010011111100101111010101011010110001110101101011110101010101111100011101011101011101111111111101000111010101111111001011110001010111010111010101010011111100101111010101011010110001110101101011110101010101111100011101011100001011110 effe8eafe5e2baeaa7e5eab58eb5eaaf8ebaeffe8eafe5e2baeaa7e5eab58eb5eaaf8eb85e
UTF-8 陞ッ繧崎у蟲オ蟄コ陞ッ繧崎у蟲オ蟄ク^ 1110100110011001100111101110111110111101101011111110011110111001101001111110010110110100100011101101000110000011111010001001111110110010111011111011110110110101111010001001111110000100111011111011110110111010111010011001100110011110111011111011110110101111111001111011100110100111111001011011010010001110110100011000001111101000100111111011001011101111101111011011010111101000100111111000010011101111101111011011100001011110 e9999eefbdafe7b9a7e5b48ed183e89fb2efbdb5e89f84efbdbae9999eefbdafe7b9a7e5b48ed183e89fb2efbdb5e89f84efbdb85e
UHC 陞??崎у蟲?蟄?陞??崎у蟲?蟄?^ 1110001110110011001111110011111111010000111110001010110011100101111101011111100100111111111101101101111000111111111000111011001100111111001111111101000011111000101011001110010111110101111110010011111111110110110111100011111101011110 e3b33f3fd0f8ace5f5f93ff6de3fe3b33f3fd0f8ace5f5f93ff6de3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)