To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?℡?釗?Б秧??^ 0011111110000111100001000011111111111011101110110011111110000100010000011110001001011110001111110011111101011110 3f87843ffbbb3f8441e25e3f3f5e
EUC-JP ???釗?Б秧??^ 0011111100111111001111111000111111100011101001100011111110100111101000101110001110111111001111110011111101011110 3f3f3f8fe3a63fa7a2e3bf3f3f5e
UTF-8 遼℡뜔釗좈Б秧쀢뵠^ 111011111010011110000011111000101000010010100001111010111001110010010100111010011000011110010111111011001010001010001000110100001001000111100111101001111010011111101100100000001010001011101011101101011010000001011110 efa783e284a1eb9c94e98797eca288d091e7a7a7ec80a2ebb5a05e
UHC 遼℡뜔釗좈Б秧쀢뵠^ 11101001101011001010001011100101100011011001011111100001111100101010000011101001101011001010001011100100111010111001011111100010100101001010000001011110 e9aca2e58d97e1f2a0e9aca2e4eb97e294a05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)