What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???諭ら?矣??艶l???┏遺??魚 001111110011111100111111100101110100000010000010111001110011111111100001111000010011111100111111100010011001000010000010100011000011111100111111001111111000010010101100100010001110001000111111001111111000101110011011 3f3f3f974082e73fe1e13f3f8990828c3f3f3f84ac88e23f3f8b9b
EUC-JP ???諭ら?矣??艶l???┏遺??魚 001111110011111100111111110011011010000110100100111010010011111111100010111000110011111100111111101100011111000010100011111011000011111100111111001111111010100010101110101100001110010000111111001111111011010111111011 3f3f3fcda1a4e93fe2e33f3fb1f0a3ec3f3f3fa8aeb0e43f3fb5fb
UTF-8 閱묐갭諭ら걬矣ㅻ룥艶l꼶溜쀦┏遺얜꺊魚 111010011001011010110001111010111010110010010000111010101011000010101101111010001010101110101101111000111000001010001001111010101011000110101100111001111001111110100011111000111000010110111011111010111010001110100101111010001000100110110110111011111011110110001100111010101011110010110110111011111010011110001011111011001000000010100110111000101001010010001111111010011000000110111010111011001001011010011100111010101011101010001010111010011010110110011010 e996b1ebac90eab0ade8abade38289eab1ace79fa3e385bbeba3a5e889b6efbd8ceabcb6efa78bec80a6e2948fe981baec969ceaba8ae9ad9a
UHC 閱묐갭諭ら걬矣ㅻ룥艶l꼶溜쀦┏遺얜꺊魚 1110011011110011100100011110101110110000101110001110101110110001101010101110100110000001100101011110101111111000101001001110101110001111100111101110011011111101101000111110110010000100100011101110101011111110100101111110011010100110101011101110101110110110101111101110101110000011101100011110010111100000 e6f391ebb0b8ebb1aae98195ebf8a4eb8f9ee6fda3ec848eeafe97e6a6aeebb6beeb83b1e5e0

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
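
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper name `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which matches the runs of `3f` seen in the table above.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

For example, the hiragana character あ (U+3042) encodes to `e38182` in UTF-8, `82a0` in SJIS-Win, and `a4a2` in EUC-JP, while ISO-8859-1 cannot represent it and yields `3f`.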