To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 陞ゆク頑刄陞ょ沁縺枝陞ゆク頑刄陞ょ沁縺施^ 111010001001111010000010111001001011100010001010111001101001100110000011111010001001111010000010111001011001111110001110111000111000000110001110011111011110100010011110100000101110010010111000100010101110011010011001100000111110100010011110100000101110010110011111100011101110001110000001100011100111101101011110 e89e82e4b88ae69983e89e82e59f8ee3818e7de89e82e4b88ae69983e89e82e59f8ee3818e7b5e
EUC-JP 陞ゆク頑刄陞ょ沁縺枝陞ゆク頑刄陞ょ沁縺施^ 1110111111111110101001001110011010001110101110001011010011101000110100011110001111101111111111101010010011100111110111011110111011100101111000011011101111011110111011111111111010100100111001101000111010111000101101001110100011010001111000111110111111111110101001001110011111011101111011101110010111100001101110111101110001011110 effea4e68eb8b4e8d1e3effea4e7ddeee5e1bbdeeffea4e68eb8b4e8d1e3effea4e7ddeee5e1bbdc5e
UTF-8 陞ゆク頑刄陞ょ沁縺枝陞ゆク頑刄陞ょ沁縺施^ 11101001100110011001111011100011100000101000011011101111101111011011100011101001101000001001000111100101100010001000010011101001100110011001111011100011100000101000011111100110101100101000000111100111101110001011101011100110100111101001110111101001100110011001111011100011100000101000011011101111101111011011100011101001101000001001000111100101100010001000010011101001100110011001111011100011100000101000011111100110101100101000000111100111101110001011101011100110100101101011110101011110 e9999ee38286efbdb8e9a091e58884e9999ee38287e6b281e7b8bae69e9de9999ee38286efbdb8e9a091e58884e9999ee38287e6b281e7b8bae696bd5e
UHC 陞ゆ?頑?陞ょ沁?枝陞ゆ?頑?陞ょ沁?施^ 1110001110110011101010101110011000111111111010001101011100111111111000111011001110101010111001111110001111111110001111111111001010101011111000111011001110101010111001100011111111101000110101110011111111100011101100111010101011100111111000111111111000111111111000111011111101011110 e3b3aae63fe8d73fe3b3aae7e3fe3ff2abe3b3aae63fe8d73fe3b3aae7e3fe3fe3bf5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)