What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???????誌???憺????????? 0011111100111111001111110011111100111111001111110011111110001110100011110011111100111111001111111001110011101001001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f8e8f3f3f3f9ce93f3f3f3f3f3f3f3f3f
EUC-JP ???????誌???憺????????? 0011111100111111001111110011111100111111001111110011111110111011111011110011111100111111001111111101100011101011001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3fbbef3f3f3fd8eb3f3f3f3f3f3f3f3f3f
UTF-8 셈섞렯렣셈섞뤵誌읊칿펿憺읊칿펿낵읊칿펿캬읊 111011001000010110001000111011001000010010011110111010111010000010101111111010111010000010100011111011001000010110001000111011001000010010011110111010111010010010110101111010001010101010001100111011001001110110001010111011001011100110111111111011011000111010111111111001101000011010111010111011001001110110001010111011001011100110111111111011011000111010111111111010111000001010110101111011001001110110001010111011001011100110111111111011011000111010111111111011001011101010101100111011001001110110001010 ec8588ec849eeba0afeba0a3ec8588ec849eeba4b5e8aa8cec9d8aecb9bfed8ebfe686baec9d8aecb9bfed8ebfeb82b5ec9d8aecb9bfed8ebfecbaacec9d8a
UHC 셈섞렯렣셈섞뤵誌읊칿펿憺읊칿펿낵읊칿펿캬읊 101111001100000010111100101011111000111010111100100011101011010010111100110000001011110010101111100011111110001111110010101111001100000010111100101011111000111010111100100011101101001110111100110000001011110010101111100011101011110010001110101100111011110011000000101111001010111110001110101111001000111011000100101111001100000010111100 bcc0bcaf8ebc8eb4bcc0bcaf8fe3f2bcc0bcaf8ebc8ed3bcc0bcaf8ebc8eb3bcc0bcaf8ebc8ec4bcc0bc

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
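
The conversion in the table can be reproduced offline. The following is a minimal sketch in Python, not this page's own code: the names `CODECS`, `encode_to_bits`, and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption that may differ from this page's converter in a few edge-case characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as implemented by Python
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_to_bits(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("誌").items():
        print(name, binary, hexadecimal)
```

For the single character 誌, this sketch should give 3f for ISO-8859-1, 8e8f for SJIS-WIN, bbef for EUC-JP, and e8aa8c for UTF-8, which matches the byte values shown for that character in the table.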