To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[}?????????[{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101101101111101001111110011111100111111001111110011111100111111001111110011111100111111010110110111101101011110 3f3f3f3f3f3f3f3f3f5b7d3f3f3f3f3f3f3f3f3f5b7b5e
SJIS-WIN 猥??澳??映ф?[}猥??澳??映ф?[{^ 11100000110011100011111100111111111000000101001100111111001111111000100101100110100001001000011000111111010110110111110111100000110011100011111100111111111000000101001100111111001111111000100101100110100001001000011000111111010110110111101101011110 e0ce3f3fe0533f3f896684863f5b7de0ce3f3fe0533f3f896684863f5b7b5e
EUC-JP 猥??澳??映ф?[}猥??澳??映ф?[{^ 11100000110100000011111100111111110111111011010000111111001111111011000111000111101001111110011000111111010110110111110111100000110100000011111100111111110111111011010000111111001111111011000111000111101001111110011000111111010110110111101101011110 e0d03f3fdfb43f3fb1c7a7e63f5b7de0d03f3fdfb43f3fb1c7a7e63f5b7b5e
UTF-8 猥울쉽澳뺧숯映ф돹[}猥울쉽澳뺧숯映ф돹[{^ 111001111000110010100101111011001001101010111000111011001000100110111101111001101011111010110011111010111011101010100111111011001000100010101111111001101001100010100000110100011000010011101011100011111011100101011011011111011110011110001100101001011110110010011010101110001110110010001001101111011110011010111110101100111110101110111010101001111110110010001000101011111110011010011000101000001101000110000100111010111000111110111001010110110111101101011110 e78ca5ec9ab8ec89bde6beb3ebbaa7ec88afe698a0d184eb8fb95b7de78ca5ec9ab8ec89bde6beb3ebbaa7ec88afe698a0d184eb8fb95b7b5e
UHC 猥울쉽澳뺧숯映ф돹[}猥울쉽澳뺧숯映ф돹[{^ 1110100011100101101111111110111110111101101100011110011111111110100101011110111110111101101000011110011110110001101011001110011010001001101111000101101101111101111010001110010110111111111011111011110110110001111001111111111010010101111011111011110110100001111001111011000110101100111001101000100110111100010110110111101101011110 e8e5bfefbdb1e7fe95efbda1e7b1ace689bc5b7de8e5bfefbdb1e7fe95efbda1e7b1ace689bc5b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)