To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 遏」鬧冑遏」鬧冉N}遏」鬧冑遏」鬧冉N{^ 111001111001111110100011111010011010011110011001011010001110011110011111101000111110100110100111100110010110011001001110011111011110011110011111101000111110100110100111100110010110100011100111100111111010001111101001101001111001100101100110010011100111101101011110 e79fa3e9a79968e79fa3e9a799664e7de79fa3e9a79968e79fa3e9a799664e7b5e
EUC-JP 遏」鬧冑遏」鬧冉N}遏」鬧冑遏」鬧冉N{^ 11101110101000011000111010100011111100101010100111010001110010011110111010100001100011101010001111110010101010011101000111000111010011100111110111101110101000011000111010100011111100101010100111010001110010011110111010100001100011101010001111110010101010011101000111000111010011100111101101011110 eea18ea3f2a9d1c9eea18ea3f2a9d1c74e7deea18ea3f2a9d1c9eea18ea3f2a9d1c74e7b5e
UTF-8 遏」鬧冑遏」鬧冉N}遏」鬧冑遏」鬧冉N{^ 1110100110000001100011111110111110111101101000111110100110101100101001111110010110000110100100011110100110000001100011111110111110111101101000111110100110101100101001111110010110000110100010010100111001111101111010011000000110001111111011111011110110100011111010011010110010100111111001011000011010010001111010011000000110001111111011111011110110100011111010011010110010100111111001011000011010001001010011100111101101011110 e9818fefbda3e9aca7e58691e9818fefbda3e9aca7e586894e7de9818fefbda3e9aca7e58691e9818fefbda3e9aca7e586894e7b5e
UHC ??鬧???鬧?N}??鬧???鬧?N{^ 00111111001111111101011110100010001111110011111100111111110101111010001000111111010011100111110100111111001111111101011110100010001111110011111100111111110101111010001000111111010011100111101101011110 3f3fd7a23f3f3fd7a23f4e7d3f3fd7a23f3f3fd7a23f4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)