To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????\ 00111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f5c
SJIS-WIN ?藻額?藻額\ 0011111110010001100101001000101001111010001111111001000110010100100010100111101001011100 3f91948a7a3f91948a7a5c
EUC-JP ?藻額?藻額\ 0011111111000001111101001011001111011011001111111100000111110100101100111101101101011100 3fc1f4b3db3fc1f4b3db5c
UTF-8 왬藻額왬藻額\ 11101100100110011010110011101000100101111011101111101001101000011000110111101100100110011010110011101000100101111011101111101001101000011000110101011100 ec99ace897bbe9a18dec99ace897bbe9a18d5c
UHC 왬藻額왬藻額\ 10111111110110011111000011011101111001001111111010111111110110011111000011011101111001001111111001011100 bfd9f0dde4febfd9f0dde4fe5c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)