What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?l?誼??釉??????l?誼??釉?????B 00111111100000101000110000111111100010110110001000111111001111111110011111010110001111110011111100111111001111110011111100111111100000101000110000111111100010110110001000111111001111111110011111010110001111110011111100111111001111110011111101000010 3f828c3f8b623f3fe7d63f3f3f3f3f3f828c3f8b623f3fe7d63f3f3f3f3f42
EUC-JP 渶l?誼??釉?????渶l?誼??釉?????B 1000111111000111111011011010001111101100001111111011010111000011001111110011111111101110110110000011111100111111001111110011111100111111100011111100011111101101101000111110110000111111101101011100001100111111001111111110111011011000001111110011111100111111001111110011111101000010 8fc7eda3ec3fb5c33f3feed83f3f3f3f3f8fc7eda3ec3fb5c33f3feed83f3f3f3f3f42
UTF-8 渶l쉶誼욕껙釉붾솴閭잛츪渶l쉶誼욕껙釉붾솴閭잛츪B 11100110101110001011011011101111101111011000110011101100100010011011011011101000101010101011110011101100100110101001010111101010101110111001100111101001100001111000100111101011101101101011111011101100100001101011010011101111101001101000011011101100100111101001101111101100101110001010101011100110101110001011011011101111101111011000110011101100100010011011011011101000101010101011110011101100100110101001010111101010101110111001100111101001100001111000100111101011101101101011111011101100100001101011010011101111101001101000011011101100100111101001101111101100101110001010101001000010 e6b8b6efbd8cec89b6e8aabcec9a95eabb99e98789ebb6beec86b4efa686ec9e9becb8aae6b8b6efbd8cec89b6e8aabcec9a95eabb99e98789ebb6beec86b4efa686ec9e9becb8aa42
UHC 渶l쉶誼욕껙釉붾솴閭잛츪渶l쉶誼욕껙釉붾솴閭잛츪B 11100111101101111010001111101100100110101000110011101011111111101011111111100101101100101011001111101011101110001001010011101011100110011010100111100110101011011001111111101100101011101001111111100111101101111010001111101100100110101000110011101011111111101011111111100101101100101011001111101011101110001001010011101011100110011010100111100110101011011001111111101100101011101001111101000010 e7b7a3ec9a8cebfebfe5b2b3ebb894eb99a9e6ad9fecae9fe7b7a3ec9a8cebfebfe5b2b3ebb894eb99a9e6ad9fecae9f42

SJIS-win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
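
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The `CHARSETS` mapping and the function names `to_bit_strings` and `convert` are hypothetical, and treating Python's `cp932` and `cp949` codecs as equivalents of SJIS-win and UHC is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the runs of `3f` in the table.

```python
# Python codec names assumed to correspond to the charsets in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",   # assumption: CP932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset and collect the bit strings."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("B").items():
        print(name, binary, hexadecimal)
```

For the input `B`, every charset yields `01000010` and `42`, which matches the final byte of each row in the table above.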