Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 鄒ゥ鬧冑鄒ゥ鬧冉N}鄒ゥ鬧冑鄒ゥ鬧冉N{^ 111001111011111010101001111010011010011110011001011010001110011110111110101010011110100110100111100110010110011001001110011111011110011110111110101010011110100110100111100110010110100011100111101111101010100111101001101001111001100101100110010011100111101101011110 e7bea9e9a79968e7bea9e9a799664e7de7bea9e9a79968e7bea9e9a799664e7b5e
EUC-JP 鄒ゥ鬧冑鄒ゥ鬧冉N}鄒ゥ鬧冑鄒ゥ鬧冉N{^ 11101110110000001000111010101001111100101010100111010001110010011110111011000000100011101010100111110010101010011101000111000111010011100111110111101110110000001000111010101001111100101010100111010001110010011110111011000000100011101010100111110010101010011101000111000111010011100111101101011110 eec08ea9f2a9d1c9eec08ea9f2a9d1c74e7deec08ea9f2a9d1c9eec08ea9f2a9d1c74e7b5e
UTF-8 鄒ゥ鬧冑鄒ゥ鬧冉N}鄒ゥ鬧冑鄒ゥ鬧冉N{^ 1110100110000100100100101110111110111101101010011110100110101100101001111110010110000110100100011110100110000100100100101110111110111101101010011110100110101100101001111110010110000110100010010100111001111101111010011000010010010010111011111011110110101001111010011010110010100111111001011000011010010001111010011000010010010010111011111011110110101001111010011010110010100111111001011000011010001001010011100111101101011110 e98492efbda9e9aca7e58691e98492efbda9e9aca7e586894e7de98492efbda9e9aca7e58691e98492efbda9e9aca7e586894e7b5e
UHC 鄒?鬧?鄒?鬧?N}鄒?鬧?鄒?鬧?N{^ 1111010111011011001111111101011110100010001111111111010111011011001111111101011110100010001111110100111001111101111101011101101100111111110101111010001000111111111101011101101100111111110101111010001000111111010011100111101101011110 f5db3fd7a23ff5db3fd7a23f4e7df5db3fd7a23ff5db3fd7a23f4e7b5e

SJIS-Win, EUC-JP: Classic Japanese character encodings, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
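
The conversion shown in the table can be approximated with a short Python sketch. It is not the page's own implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The page's converter may use slightly different conversion tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset of CODECS, keyed by the table label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("N{^").items():
        print(label, bits, hexa)
```

For example, `encode_row("N{^", "iso-8859-1")` yields the hex string `4e7b5e`, which matches the trailing bytes of every row in the table, since these ASCII characters are encoded identically in all five charsets.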