To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???c}???c{^ 0011111100111111001111110110001101111101001111110011111100111111011000110111101101011110 3f3f3f637d3f3f3f637b5e
SJIS-WIN 辱i?c}辱i?c{^ 100100000100101010000010100010010011111101100011011111011001000001001010100000101000100100111111011000110111101101011110 904a82893f637d904a82893f637b5e
EUC-JP 辱i?c}辱i?c{^ 101111111010101110100011111010010011111101100011011111011011111110101011101000111110100100111111011000110111101101011110 bfaba3e93f637dbfaba3e93f637b5e
UTF-8 辱i뮈c}辱i뮈c{^ 1110100010111110101100011110111110111101100010011110101110101110100010000110001101111101111010001011111010110001111011111011110110001001111010111010111010001000011000110111101101011110 e8beb1efbd89ebae88637de8beb1efbd89ebae88637b5e
UHC 辱i뮈c}辱i뮈c{^ 1110100110110100101000111110100110111001101111110110001101111101111010011011010010100011111010011011100110111111011000110111101101011110 e9b4a3e9b9bf637de9b4a3e9b9bf637b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)