To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 陷夲スッ闕オ貊皮舞陷夲スッ闕オ貊皮舞B 11101000100111001001101011101111101111011010111111101000100011011011010111100110101110111001010011100111100101011001000111101000100111001001101011101111101111011010111111101000100011011011010111100110101110111001010011100111100101011001000101000010 e89c9aefbdafe88db5e6bb94e79591e89c9aefbdafe88db5e6bb94e7959142
EUC-JP 陷夲スッ闕オ貊皮舞陷夲スッ闕オ貊皮舞B 11101111111111001101010011110001100011101011110110001110101011111110111111101101100011101011010111101100101111011100100011101001110010011111000111101111111111001101010011110001100011101011110110001110101011111110111111101101100011101011010111101100101111011100100011101001110010011111000101000010 effcd4f18ebd8eafefed8eb5ecbdc8e9c9f1effcd4f18ebd8eafefed8eb5ecbdc8e9c9f142
UTF-8 陷夲スッ闕オ貊皮舞陷夲スッ闕オ貊皮舞B 11101001100110011011011111100101101001001011001011101111101111011011110111101111101111011010111111101001100101111001010111101111101111011011010111101000101100101000101011100111100110101010111011101000100010001001111011101001100110011011011111100101101001001011001011101111101111011011110111101111101111011010111111101001100101111001010111101111101111011011010111101000101100101000101011100111100110101010111011101000100010001001111001000010 e999b7e5a4b2efbdbdefbdafe99795efbdb5e8b28ae79aaee8889ee999b7e5a4b2efbdbdefbdafe99795efbdb5e8b28ae79aaee8889e42
UHC 陷???闕?貊皮舞陷???闕?貊皮舞B 1111100111101000001111110011111100111111110011111111010000111111110110001110011111111001101010111101100111110001111110011110100000111111001111110011111111001111111101000011111111011000111001111111100110101011110110011111000101000010 f9e83f3f3fcff43fd8e7f9abd9f1f9e83f3f3fcff43fd8e7f9abd9f142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)