Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 諸?廓?絲??廓?釜諸?廓?絲??廓?釜^ 10001111100101000011111110001010011001100011111111100011010011100011111100111111100010100110011000111111100010101001100010001111100101000011111110001010011001100011111111100011010011100011111100111111100010100110011000111111100010101001100001011110 8f943f8a663fe34e3f3f8a663f8a988f943f8a663fe34e3f3f8a663f8a985e
EUC-JP 諸?廓?絲??廓?釜諸?廓?絲??廓?釜^ 10111101111101000011111110110011110001110011111111100101101011110011111100111111101100111100011100111111101100111111100010111101111101000011111110110011110001110011111111100101101011110011111100111111101100111100011100111111101100111111100001011110 bdf43fb3c73fe5af3f3fb3c73fb3f8bdf43fb3c73fe5af3f3fb3c73fb3f85e
UTF-8 諸계廓롊絲렠계廓롊釜諸계廓롊絲렠계廓롊釜^ 11101000101010111011100011101010101100111000010011100101101110111001001111101011101000011000101011100111101101011011001011101011101000001010000011101010101100111000010011100101101110111001001111101011101000011000101011101001100001111001110011101000101010111011100011101010101100111000010011100101101110111001001111101011101000011000101011100111101101011011001011101011101000001010000011101010101100111000010011100101101110111001001111101011101000011000101011101001100001111001110001011110 e8abb8eab384e5bb93eba18ae7b5b2eba0a0eab384e5bb93eba18ae9879ce8abb8eab384e5bb93eba18ae7b5b2eba0a0eab384e5bb93eba18ae9879c5e
UHC 諸계廓롊絲렠계廓롊釜諸계廓롊絲렠계廓롊釜^ 1111000010110011101100001110100011001110101010011000111011010000110111101110101010001110101100011011000011101000110011101010100110001110110100001101110110111100111100001011001110110000111010001100111010101001100011101101000011011110111010101000111010110001101100001110100011001110101010011000111011010000110111011011110001011110 f0b3b0e8cea98ed0deea8eb1b0e8cea98ed0ddbcf0b3b0e8cea98ed0deea8eb1b0e8cea98ed0ddbc5e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
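
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper names encode_row and encode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the 3f bytes in the ISO-8859-1, SJIS-WIN, and EUC-JP rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns every unmappable character into b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for label, (bits, hex_string) in encode_table("諸^").items():
        print(label, bits, hex_string)
```

For an ASCII character such as "^", every charset listed here yields the single byte 5e (binary 01011110), which is why each row of the table ends in 5e.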