To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 冶⑤?揖??楡??冶⑤????遺??冶⑤?泣 1001011011101000100001110100010000111111100101110100101100111111001111111001111010111110001111110011111110010110111010001000011101000100001111110011111100111111001111111000100011100010001111110011111110010110111010001000011101000100001111111000101110000011 96e887443f974b3f3f9ebe3f3f96e887443f3f3f3f88e23f3f96e887443f8b83
EUC-JP 冶??揖??楡??冶?????遺??冶??泣 1100110011101010001111110011111111001101101011000011111100111111110111001100000000111111001111111100110011101010001111110011111100111111001111110011111110110000111001000011111100111111110011001110101000111111001111111011010111100011 ccea3f3fcdac3f3fdcc03f3fccea3f3f3f3f3fb0e43f3fccea3f3fb5e3
UTF-8 冶⑤슦揖ⓨ㎤楡곗돣冶⑤슣李볩㎖遺우퐟冶⑤슦泣 111001011000011010110110111000101001000110100100111011001000101010100110111001101000111110010110111000101001001110101000111000111000111010100100111001101010010110100001111010101011001110010111111010111000111110100011111001011000011010110110111000101001000110100100111011001000101010100011111011111010011110100001111010111011001110101001111000111000111010010110111010011000000110111010111011001001101010110000111011011001000010011111111001011000011010110110111000101001000110100100111011001000101010100110111001101011001110100011 e586b6e291a4ec8aa6e68f96e293a8e38ea4e6a5a1eab397eb8fa3e586b6e291a4ec8aa3efa7a1ebb3a9e38e96e981baec9ab0ed909fe586b6e291a4ec8aa6e6b3a3
UHC 冶⑤슦揖ⓨ㎤楡곗돣冶⑤슣李볩㎖遺우퐟冶⑤슦泣 1110010110100111101010001110101110011010101100001110101111100111101010001110010110100111101010001110101011111000101100001110110010001001101010001110010110100111101010001110101110011010101011111110110010110000100100111110111110100111101000101110101110110110101111111110110010111101100010001110010110100111101010001110101110011010101100001110101111101000 e5a7a8eb9ab0ebe7a8e5a7a8eaf8b0ec89a8e5a7a8eb9aafecb093efa7a2ebb6bfecbd88e5a7a8eb9ab0ebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)