To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????nB 001111110011111100111111001111110011111100111111001111110110111001000010 3f3f3f3f3f3f3f6e42
SJIS-WIN ?牙??モ?仇nB 001111111000100111100101001111110011111110000011100000100011111110001011011101110110111001000010 3f89e53f3f83823f8b776e42
EUC-JP ?牙??モ?仇nB 001111111011001011100111001111110011111110100101111000100011111110110101110110000110111001000010 3fb2e73f3fa5e23fb5d86e42
UTF-8 룶牙룴횕モ룶仇nB 1110101110100011101101101110011110001001100110011110101110100011101101001110110110011010100101011110001110000011101000101110101110100011101101101110010010111011100001110110111001000010 eba3b6e78999eba3b4ed9a95e383a2eba3b6e4bb876e42
UHC 룶牙룴횕モ룶仇nB 10001111101010111110010010110011100011111010100111000011100011111010101111100010100011111010101111001110111110110110111001000010 8fabe4b38fa9c38fabe28fabcefb6e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)