What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????B | 00111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f42 |
| SJIS-WIN | ?業業?業業B | 0011111110001011110001101000101111000110001111111000101111000110100010111100011001000010 | 3f8bc68bc63f8bc68bc642 |
| EUC-JP | ?業業?業業B | 0011111110110110110010001011011011001000001111111011011011001000101101101100100001000010 | 3fb6c8b6c83fb6c8b6c842 |
| UTF-8 | 멛業業멛業業B | 11101011101010011001101111100110101001011010110111100110101001011010110111101011101010011001101111100110101001011010110111100110101001011010110101000010 | eba99be6a5ade6a5adeba99be6a5ade6a5ad42 |
| UHC | 멛業業멛業業B | 10010001010011001110010111110110111001011111011010010001010011001110010111110110111001011111011001000010 | 914ce5f6e5f6914ce5f6e5f642 |

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
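
A conversion of this kind can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page: the helper name `encode_row` is hypothetical, and the mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary string, hex string) for text in codec."""
    # Unmappable characters are replaced with b"?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


for name, codec in CODECS.items():
    shown, bits, hexstr = encode_row("業B", codec)
    print(name, bits, hexstr)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.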