What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲????癲????B 11100001100111110011111100111111001111110011111111100001100111110011111100111111001111110011111101000010 e19f3f3f3f3fe19f3f3f3f3f42
EUC-JP 癲????癲????B 11100010101000010011111100111111001111110011111111100010101000010011111100111111001111110011111101000010 e2a13f3f3f3fe2a13f3f3f3f42
UTF-8 癲잞퐞紐쬲癲잞퐞紐쬲B 11100111100110011011001011101100100111101001111011101101100100001001111011101111101001111000111111101100101011001011001011100111100110011011001011101100100111101001111011101101100100001001111011101111101001111000111111101100101011001011001001000010 e799b2ec9e9eed909eefa78fecacb2e799b2ec9e9eed909eefa78fecacb242
UHC 癲잞퐞紐쬲癲잞퐞紐쬲B 111011111010011010011111111011111011110110000111111010111010101010100111011010011110111110100110100111111110111110111101100001111110101110101010101001110110100101000010 efa69fefbd87ebaaa769efa69fefbd87ebaaa76942

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
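
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper name `encode_table` and the codec names are assumptions: Python's `cp932` stands in for SJIS-WIN and `cp949` for UHC. It also assumes that characters a charset cannot represent are replaced with "?" (byte 3f), which matches the 3f runs in the rows above.

```python
# Hypothetical helper: maps the labels used in the table to Python codec names.
# The codec choices are assumptions, not taken from the tool itself.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あB"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.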