What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN | 嗚??蔭??怨??雅 | 1001101001101010001111110011111110001000111111000011111100111111100010011000010100111111001111111000100111101011 | 9a6a3f3f88fc3f3f89853f3f89eb
EUC-JP | 嗚??蔭??怨??雅 | 1101001111001011001111110011111110110000111111100011111100111111101100011110010100111111001111111011001011101101 | d3cb3f3fb0fe3f3fb1e53f3fb2ed
UTF-8 | 嗚삠굦蔭쇘윢怨뺣솀雅 | 111001011001011110011010111011001000001010100000111010101011010110100110111010001001010010101101111011001000011110011000111011001001110010100010111001101000000010101000111010111011101010100011111011001000011010000000111010011001101110000101 | e5979aec82a0eab5a6e894adec8798ec9ca2e680a8ebbaa3ec8680e99b85
UHC | 嗚삠굦蔭쇘윢怨뺣솀雅 | 1110011111110000101110111110001110000010100011001110101111100011101111001110011110011111101000111110101010110011100101011110101110011001100001011110010010111010 | e7f0bbe3828cebe3bce79fa3eab395eb9985e4ba

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
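
A conversion like the one in the table can be approximated in Python. This is only a minimal sketch, not the code behind this page: the function name `encode_table` is hypothetical, and the codec names `cp932`, `euc_jp`, and `cp949` are assumed here to stand in for SJIS-Win, EUC-JP, and UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f seen in the table.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    Hypothetical helper for illustration. Unencodable characters become
    "?" (0x3f) because of errors="replace".
    """
    rows = []
    for name in charsets:
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_table("雅"):
        print(name, bits, hexstr)
```

For the single character 雅, the UTF-8, CP932, and EUC-JP results should agree with the last bytes of the corresponding rows in the table (e99b85, 89eb, and b2ed).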