To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 陷台ケ滂スー貊捺Β陷台ケ滂スー貊捺Β^ 11101000100111001001000111100100101110011001111111101111101111011011000011100110101110111001001111100110100000111010000011101000100111001001000111100100101110011001111111101111101111011011000011100110101110111001001111100110100000111010000001011110 e89c91e4b99fefbdb0e6bb93e683a0e89c91e4b99fefbdb0e6bb93e683a05e
EUC-JP 陷台ケ滂スー貊捺Β陷台ケ滂スー貊捺Β^ 11101111111111001100001011100110100011101011100111011110111100011000111010111101100011101011000011101100101111011100011011101000101001101010001011101111111111001100001011100110100011101011100111011110111100011000111010111101100011101011000011101100101111011100011011101000101001101010001001011110 effcc2e68eb9def18ebd8eb0ecbdc6e8a6a2effcc2e68eb9def18ebd8eb0ecbdc6e8a6a25e
UTF-8 陷台ケ滂スー貊捺Β陷台ケ滂スー貊捺Β^ 1110100110011001101101111110010110001111101100001110111110111101101110011110011010111011100000101110111110111101101111011110111110111101101100001110100010110010100010101110011010001101101110101100111010010010111010011001100110110111111001011000111110110000111011111011110110111001111001101011101110000010111011111011110110111101111011111011110110110000111010001011001010001010111001101000110110111010110011101001001001011110 e999b7e58fb0efbdb9e6bb82efbdbdefbdb0e8b28ae68dbace92e999b7e58fb0efbdb9e6bb82efbdbdefbdb0e8b28ae68dbace925e
UHC 陷台?滂??貊捺Β陷台?滂??貊捺Β^ 11111001111010001111011110111011001111111101101110110101001111110011111111011000111001111101000111110100101001011100001011111001111010001111011110111011001111111101101110110101001111110011111111011000111001111101000111110100101001011100001001011110 f9e8f7bb3fdbb53f3fd8e7d1f4a5c2f9e8f7bb3fdbb53f3fd8e7d1f4a5c25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)