To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?〃愿??儒??? 1110000110011111001111111000000101010110100111001100001100111111001111111000111011110010001111110011111100111111 e19f3f81569cc33f3f8ef23f3f3f
EUC-JP 癲?〃愿?ˇ儒??? 11100010101000010011111110100001101101111101100011000101001111111000111110100010101100001011110011110100001111110011111100111111 e2a13fa1b7d8c53f8fa2b0bcf43f3f3f
UTF-8 癲뗫〃愿득ˇ儒묐렩劣 1110011110011001101100101110101110010111101010111110001110000000100000111110011010000100101111111110101110010011100111011100101110000111111001011000010010010010111010111010110010010000111010111010000010101001111011111010011010011101 e799b2eb97abe38083e684bfeb939dcb87e58492ebac90eba0a9efa69d
UHC 癲뗫〃愿득ˇ儒묐렩劣 1110111110100110100010111110101110100001101010001110101010110100101101011110011010100010101001111110101011100011100100011110101110001110101101111110011011101011 efa68beba1a8eab4b5e6a2a7eae391eb8eb7e6eb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)