To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???m}???m{^ | 0011111100111111001111110110110101111101001111110011111100111111011011010111101101011110 | 3f3f3f6d7d3f3f3f6d7b5e
SJIS-WIN | 魄情おm}魄情おm{^ | 1110100110101110100011111110111010000010101010000110110101111101111010011010111010001111111011101000001010101000011011010111101101011110 | e9ae8fee82a86d7de9ae8fee82a86d7b5e
EUC-JP | 魄情おm}魄情おm{^ | 1111001010110000101111101111000010100100101010100110110101111101111100101011000010111110111100001010010010101010011011010111101101011110 | f2b0bef0a4aa6d7df2b0bef0a4aa6d7b5e
UTF-8 | 魄情おm}魄情おm{^ | 1110100110101101100001001110011010000011100001011110001110000001100010100110110101111101111010011010110110000100111001101000001110000101111000111000000110001010011011010111101101011110 | e9ad84e68385e3818a6d7de9ad84e68385e3818a6d7b5e
UHC | 魄情おm}魄情おm{^ | 1101101111011110111011111101011110101010101010100110110101111101110110111101111011101111110101111010101010101010011011010111101101011110 | dbdeefd7aaaa6d7ddbdeefd7aaaa6d7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
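
A conversion like the one tabulated above can be approximated in Python. This is only a minimal sketch, not the page's actual implementation: the function name to_bit_strings and the CODECS mapping are hypothetical, and it assumes that Python's cp932 and cp949 codecs correspond to the SJIS-WIN and UHC labels. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are assumptions about the closest codecs.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "魄情おm}魄情おm{^"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Padding every byte to eight digits keeps the binary column aligned with the hexadecimal one, so each pair of hex digits corresponds to one group of eight bits.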