To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蜈ょ?娃hⅹ???^ 111001011000010110000010111001010011111110001000101000011000001010001000111110100100100100111111001111110011111101011110 e58582e53f88a18288fa493f3f3f5e
EUC-JP 蜈ょ?娃h?縯??^ 11101001111001011010010011100111001111111011000010100011101000111110100000111111100011111101010011001011001111110011111101011110 e9e5a4e73fb0a3a3e83f8fd4cb3f3f5e
UTF-8 蜈ょ굤娃hⅹ縯딃겂^ 11101000100111001000100011100011100000101000011111101010101101011010010011100101101010001000001111101111101111011000100011100010100001011011100111100111101110001010111111101011100101001000001111101010101100101000001001011110 e89c88e38287eab5a4e5a883efbd88e285b9e7b8afeb9483eab2825e
UHC 蜈ょ굤娃hⅹ縯딃겂^ 11101000101001011010101011100111100000101000101011101000110111111010001111101000101001011010101011100110111000001000101011101001100000011010001101011110 e8a5aae7828ae8dfa3e8a5aae6e08ae981a35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)