To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 珈珈鞦珈哈獺怐 1110000011011011111000001101101111101000111000011110000011011011100110011111101111100000110110101001110010000001 e0dbe0dbe8e1e0db99fbe0da9c81
EUC-JP 珈珈鞦珈哈獺怐 1110000011011101111000001101110111110000111000111110000011011101110100101111110111100000110111001101011111100001 e0dde0ddf0e3e0ddd2fde0dcd7e1
UTF-8 珈珈鞦珈哈獺怐 111001111000111110001000111001111000111110001000111010011001111010100110111001111000111110001000111001011001001110001000111001111000110110111010111001101000000010010000 e78f88e78f88e99ea6e78f88e59388e78dbae68090
UHC ????哈獺? 001111110011111100111111001111111111100111101011110100111011011100111111 3f3f3f3ff9ebd3b73f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)