To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ??ゴ??ケ◇? 0011111100111111100000110101001100111111001111111000001101010000100000011001111000111111 3f3f83533f3f8350819e3f
EUC-JP ??ゴ??ケ◇? 0011111100111111101001011011010000111111001111111010010110110001101000011111111000111111 3f3fa5b43f3fa5b1a1fe3f
UTF-8 룴횕ゴ룵핊ケ◇룶 111010111010001110110100111011011001101010010101111000111000001010110100111010111010001110110101111011011001010110001010111000111000001010110001111000101001011110000111111010111010001110110110 eba3b4ed9a95e382b4eba3b5ed958ae382b1e29787eba3b6
UHC 룴횕ゴ룵핊ケ◇룶 10001111101010011100001110001111101010111011010010001111101010101100000010001111101010111011000110100001110111101000111110101011 8fa9c38fabb48faac08fabb1a1de8fab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)