To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????~ 00111111001111110011111100111111001111110011111100111111001111110011111101111110 3f3f3f3f3f3f3f3f3f7e
SJIS-WIN ?????????~ 00111111001111110011111100111111001111110011111100111111001111110011111101111110 3f3f3f3f3f3f3f3f3f7e
EUC-JP ?????????~ 00111111001111110011111100111111001111110011111100111111001111110011111101111110 3f3f3f3f3f3f3f3f3f7e
UTF-8 셍롙롘셔샬셍셈셔샬~ 11101100100001011000110111101011101000011001100111101011101000011001100011101100100001011001010011101100100000111010110011101100100001011000110111101100100001011000100011101100100001011001010011101100100000111010110001111110 ec858deba199eba198ec8594ec83acec858dec8588ec8594ec83ac7e
UHC 셍롙롘셔샬셍셈셔샬~ 10111100110001001000111011011101100011101101110010111100110001011011110010100011101111001100010010111100110000001011110011000101101111001010001101111110 bcc48edd8edcbcc5bca3bcc4bcc0bcc5bca37e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)