What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???依щ?淫?┌淫????????癲l?^ 0011111100111111001111111000100011001011100001001000101100111111100010001111101000111111100001001010000110001000111110100011111100111111001111110011111100111111001111110011111100111111111000011001111110000010100011000011111101011110 3f3f3f88cb848b3f88fa3f84a188fa3f3f3f3f3f3f3f3fe19f828c3f5e
EUC-JP ???依щ?淫?┌淫????????癲l?^ 0011111100111111001111111011000011001101101001111110101100111111101100001111110000111111101010001010001110110000111111000011111100111111001111110011111100111111001111110011111100111111111000101010000110100011111011000011111101011110 3f3f3fb0cda7eb3fb0fc3fa8a3b0fc3f3f3f3f3f3f3f3fe2a1a3ec3f5e
UTF-8 琉싩춱依щ젗淫앾┌淫쒕쩀嶺띿맟溜볦꽍癲l꽌^ 111011111010011110001100111011001000101110101001111011001011011010110001111001001011111010011101110100011000100111101100101000001001011111100110101101111010101111101100100101011011111011100010100101001000110011100110101101111010101111101100100100101001010111101100101010011000000011101111101001101010101111101011100111011011111111101011101001111001111111101111101001111000101111101011101100111010011011101010101111011000110111100111100110011011001011101111101111011000110011101010101111011000110001011110 efa78cec8ba9ecb6b1e4be9dd189eca097e6b7abec95bee2948ce6b7abec9295eca980efa6abeb9dbfeba79fefa78bebb3a6eabd8de799b2efbd8ceabd8c5e
UHC 琉싩춱依щ젗淫앾┌淫쒕쩀嶺띿맟溜볦꽍癲l꽌^ 11101011101001001001101011100111101011011000110111101011111011101010110011101011101000001001001111101011111000101001110111101111101001101010001111101011111000101001110011101011101001001001101011100111101011011000110111101100100100001010110011101010111111101001001111101100100001001001110111101111101001101010001111101100100001001001110001011110 eba49ae7ad8debeeaceba093ebe29defa6a3ebe29ceba49ae7ad8dec90aceafe93ec849defa6a3ec849c5e
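The conversion shown in the table above can be roughly reproduced offline. The following is a minimal sketch, not the page's actual implementation: the function names `to_bit_strings` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters a charset cannot represent are replaced by `?` (0x3f), which appears to match the behaviour seen in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("依^").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.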

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).