To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 伍??泣???▲?檍??逾ょ?濡〓?鈺 10001100110111100011111100111111100010111000001100111111001111110011111110000001101000110011111110011110111110000011111100111111111001111010010110000010111001010011111110010100010001111000000110101100001111111111101111000100 8cde3f3f8b833f3f3f81a33f9ef83f3fe7a582e53f944781ac3ffbc4
EUC-JP 伍??泣??飡▲?檍??逾ょ?濡〓?鈺 10111000111000000011111100111111101101011110001100111111001111111000111111101000110010001010001010100101001111111101110011111010001111110011111111101110101001111010010011100111001111111100011110101000101000101010111000111111100011111110001111010101 b8e03f3fb5e33f3f8fe8c8a2a53fdcfa3f3feea7a4e73fc7a8a2ae3f8fe3d5
UTF-8 伍밸씮泣쒏끽飡▲뀋檍됱뜦逾ょ춯濡〓쳛鈺 111001001011110010001101111010111011000010111000111011001001010010101110111001101011001110100011111011001001001010001111111010111000000110111101111010011010001110100001111000101001011010110010111010111000000010001011111001101010101010001101111010111001000010110001111010111001110010100110111010011000000010111110111000111000001010000111111011001011011010101111111001101011111110100001111000111000000010010011111011001011001110011011111010011000100010111010 e4bc8debb0b8ec94aee6b3a3ec928feb81bde9a3a1e296b2eb808be6aa8deb90b1eb9ca6e980bee38287ecb6afe6bfa1e38093ecb39be988ba
UHC 伍밸씮泣쒏끽飡▲뀋檍됱뜦逾ょ춯濡〓쳛鈺 1110011111101010101110011110101110011101101111111110101111101000100111001110011010110011101000111110000111100010101000011110001110000101100001111110010111100101100010011110110010001101101010011110101110110101101010101110011110101101100011001110101110100001101000011110101110101011100000011110100010101101 e7eab9eb9dbfebe89ce6b3a3e1e2a1e38587e5e589ec8da9ebb5aae7ad8ceba1a1ebab81e8ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)