To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?泣??膺??艶k?猷?????亦 111000011001111110000011100010110011111110001011100000110011111100111111111001000101111000111111001111111000100110010000100000101000101100111111100101110101000100111111001111110011111100111111001111111001011010010010 e19f838b3f8b833f3fe45e3f3f8990828b3f97513f3f3f3f3f9692
EUC-JP 癲ル?泣??膺??艶kı猷??洹??亦 11100010101000011010010111101011001111111011010111100011001111110011111111100111101111110011111100111111101100011111000010100011111010111000111110101001110001011100110110110010001111110011111110001111110001111011101000111111001111111100101111110010 e2a1a5eb3fb5e33f3fe7bf3f3fb1f0a3eb8fa9c5cdb23f3f8fc7ba3f3fcbf2
UTF-8 癲ル슢泣€꼷膺용연艶kı猷뗧솻洹섎툩亦 1110011110011001101100101110001110000011101010111110110010001010101000101110011010110011101000111110001010000010101011001110101010111100101101111110100010000110101110101110110010011010101010011110110010010111101100001110100010001001101101101110111110111101100010111100010010110001111001111000110010110111111010111001011110100111111011001000011010111011111001101011010010111001111011001000010010001110111011011000100010101001111001001011101010100110 e799b2e383abec8aa2e6b3a3e282aceabcb7e886baec9aa9ec97b0e889b6efbd8bc4b1e78cb7eb97a7ec86bbe6b4b9ec848eed88a9e4baa6
UHC 癲ル슢泣€꼷膺용연艶kı猷뗧솻洹섎툩亦 1110111110100110101010111110101110011010101011101110101111101000101000101110011010000100100011111110101111101100101111111110101110111111101011001110011011111101101000111110101110101001101001011110101110100011100010111110011110011001101100001110101010110111100110001110101110111000101000001110011010110010 efa6abeb9aaeebe8a2e6848febecbfebbface6fda3eba9a5eba38be799b0eab798ebb8a0e6b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)