To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 造????市?市??造????市?市??^ 100100011010001000111111001111110011111100111111100011100111001100111111100011100111001100111111001111111001000110100010001111110011111100111111001111111000111001110011001111111000111001110011001111110011111101011110 91a23f3f3f3f8e733f8e733f3f91a23f3f3f3f8e733f8e733f3f5e
EUC-JP 造??紈?市?市??造??紈?市?市??^ 11000010101001000011111100111111100011111101001111001101001111111011101111010100001111111011101111010100001111110011111111000010101001000011111100111111100011111101001111001101001111111011101111010100001111111011101111010100001111110011111101011110 c2a43f3f8fd3cd3fbbd43fbbd43f3fc2a43f3f8fd3cd3fbbd43fbbd43f3f5e
UTF-8 造섦뤗紈쟈市킃市렊롚造섦뤗紈쟈市킃市렊롘^ 11101001100000001010000011101100100001001010011011101011101001001001011111100111101101001000100011101100100111111000100011100101101110001000001011101101100000101000001111100101101110001000001011101011101000001000101011101011101000011001101011101001100000001010000011101100100001001010011011101011101001001001011111100111101101001000100011101100100111111000100011100101101110001000001011101101100000101000001111100101101110001000001011101011101000001000101011101011101000011001100001011110 e980a0ec84a6eba497e7b488ec9f88e5b882ed8283e5b882eba08aeba19ae980a0ec84a6eba497e7b488ec9f88e5b882ed8283e5b882eba08aeba1985e
UHC 造섦뤗紈쟈市킃市렊롚造섦뤗紈쟈市킃市렊롘^ 1111000011100011101111001011010010001111110001111111110010111100110000001111000011100011101111001011010010001111111000111011110010001110101000011000111011011110111100001110001110111100101101001000111111000111111111001011110011000000111100001110001110111100101101001000111111100011101111001000111010100001100011101101110001011110 f0e3bcb48fc7fcbcc0f0e3bcb48fe3bc8ea18edef0e3bcb48fc7fcbcc0f0e3bcb48fe3bc8ea18edc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)