To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??H?????????H???????? 001111110011111101001000001111110011111100111111001111110011111100111111001111110011111100111111010010000011111100111111001111110011111100111111001111110011111100111111 3f3f483f3f3f3f3f3f3f3f3f483f3f3f3f3f3f3f3f
SJIS-WIN 障?H虞?弔?寃???彬H虞?弔?衣??甲 100011111110000100111111010010001000101111110001001111111001001010100010001111111001101110000011001111110011111100111111100101010110101001001000100010111111000100111111100100101010001000111111100010001101111100111111001111111000110101100010 8fe13f488bf13f92a23f9b833f3f3f956a488bf13f92a23f88df3f3f8d62
EUC-JP 障?H虞?弔?寃???彬H虞?弔?衣??甲 101111101110001100111111010010001011011011110011001111111100010010100100001111111101010111100011001111110011111100111111110010011100101101001000101101101111001100111111110001001010010000111111101100001110000100111111001111111011100111000011 bee33f48b6f33fc4a43fd5e33f3f3fc9cb48b6f33fc4a43fb0e13f3fb9c3
UTF-8 障렚H虞렧弔렲寃당렟닻彬H虞렧弔렲衣쯔렋甲 1110100110011010100111001110101110100000100110100100100011101000100110011001111011101011101000001010011111100101101111001001010011101011101000001011001011100101101011111000001111101011100010111011100111101011101000001001111111101011100010111011101111100101101111011010110001001000111010001001100110011110111010111010000010100111111001011011110010010100111010111010000010110010111010001010000110100011111011001010111110010100111010111010000010001011111001111001010010110010 e99a9ceba09a48e8999eeba0a7e5bc94eba0b2e5af83eb8bb9eba09feb8bbbe5bdac48e8999eeba0a7e5bc94eba0b2e8a1a3ecaf94eba08be794b2
UHC 障렚H虞렧弔렲寃당렟닻彬H虞렧弔렲衣쯔렋甲 11101110101000011000111010101101010010001110100111100101100011101011011011110000110000001000111010111111111010101011001010110100111001111000111010110000101101001110100111011110101011110100100011101001111001011000111010110110111100001100000010001110101111111110101111111101110000101110101010001110101000101100101110100011 eea18ead48e9e58eb6f0c08ebfeab2b4e78eb0b4e9deaf48e9e58eb6f0c08ebfebfdc2ea8ea2cba3

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
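
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the names `CHARSETS` and `encode_table` are hypothetical, the mapping of SJIS-WIN to Python's `cp932` codec and of UHC to `cp949` is an assumption about equivalent codecs, and replacing unencodable characters with `?` (byte 3f) is assumed because that byte appears in the table wherever a character is shown as `?`.

```python
# Sketch: encode a string in several charsets and show the resulting
# bytes as a binary bit string and as hexadecimal.
# Table label -> Python codec name (the codec choices are assumptions).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return a list of (label, bit_string, hex_string) tuples for text."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters that the
        # target charset cannot represent.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hexa in encode_table("甲"):
    print(label, bits, hexa)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.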