To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 弔?虞?弔?鬱頭??弔?弔?虞?弔?鬱頭??弔?^ 10010010101000100011111110001011111100010011111110010010101000100011111110011111010101001001001110101010001111110011111110010010101000100011111110010010101000100011111110001011111100010011111110010010101000100011111110011111010101001001001110101010001111110011111110010010101000100011111101011110 92a23f8bf13f92a23f9f5493aa3f3f92a23f92a23f8bf13f92a23f9f5493aa3f3f92a23f5e
EUC-JP 弔?虞?弔?鬱頭??弔?弔?虞?弔?鬱頭??弔?^ 11000100101001000011111110110110111100110011111111000100101001000011111111011101101101011100011010101100001111110011111111000100101001000011111111000100101001000011111110110110111100110011111111000100101001000011111111011101101101011100011010101100001111110011111111000100101001000011111101011110 c4a43fb6f33fc4a43fddb5c6ac3f3fc4a43fc4a43fb6f33fc4a43fddb5c6ac3f3fc4a43f5e
UTF-8 弔렟虞렧弔렟鬱頭렖렕弔렟弔렟虞렧弔렟鬱頭렖렕弔렟^ 11100101101111001001010011101011101000001001111111101000100110011001111011101011101000001010011111100101101111001001010011101011101000001001111111101001101011001011000111101001101000001010110111101011101000001001011011101011101000001001010111100101101111001001010011101011101000001001111111100101101111001001010011101011101000001001111111101000100110011001111011101011101000001010011111100101101111001001010011101011101000001001111111101001101011001011000111101001101000001010110111101011101000001001011011101011101000001001010111100101101111001001010011101011101000001001111101011110 e5bc94eba09fe8999eeba0a7e5bc94eba09fe9acb1e9a0adeba096eba095e5bc94eba09fe5bc94eba09fe8999eeba0a7e5bc94eba09fe9acb1e9a0adeba096eba095e5bc94eba09f5e
UHC 弔렟虞렧弔렟鬱頭렖렕弔렟弔렟虞렧弔렟鬱頭렖렕弔렟^ 11110000110000001000111010110000111010011110010110001110101101101111000011000000100011101011000011101010101001101101010011101001100011101010101110001110101010101111000011000000100011101011000011110000110000001000111010110000111010011110010110001110101101101111000011000000100011101011000011101010101001101101010011101001100011101010101110001110101010101111000011000000100011101011000001011110 f0c08eb0e9e58eb6f0c08eb0eaa6d4e98eab8eaaf0c08eb0f0c08eb0e9e58eb6f0c08eb0eaa6d4e98eab8eaaf0c08eb05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)