To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 俉??域?俉??域?B 111110100110000100111111001111111000100011100110001111111111101001100001001111110011111110001000111001100011111101000010 fa613f3f88e63ffa613f3f88e63f42
EUC-JP 俉??域?俉??域?B 1000111110110001101110110011111100111111101100001110100000111111100011111011000110111011001111110011111110110000111010000011111101000010 8fb1bb3f3fb0e83f8fb1bb3f3fb0e83f42
UTF-8 俉쇔콉域쾧俉쇔콉域쾧B 11100100101111111000100111101100100001111001010011101100101111011000100111100101100111111001111111101100101111101010011111100100101111111000100111101100100001111001010011101100101111011000100111100101100111111001111111101100101111101010011101000010 e4bf89ec8794ecbd89e59f9fecbea7e4bf89ec8794ecbd89e59f9fecbea742
UHC 俉쇔콉域쾧俉쇔콉域쾧B 111001111110101110111100111001011011000110000101111001101011010010110010011110011110011111101011101111001110010110110001100001011110011010110100101100100111100101000010 e7ebbce5b185e6b4b279e7ebbce5b185e6b4b27942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)