To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 省蜴ホ室省蠑ア省 10001111110010001110010110001110110011101111001011000000100011101011101010001111110010001110010110111100101100011000111111001000 8fc8e58ecef2c08eba8fc8e5bcb18fc8
EUC-JP 省蜴ホ?室省蠑ア省 1011111011001010111010011110111010001110110011100011111110111100101111001011111011001010111010101011111010001110101100011011111011001010 becae9ee8ece3fbcbcbecaeabe8eb1beca
UTF-8 省蜴ホ室省蠑ア省 111001111001110010000001111010001001110010110100111011111011111010001110111011101000011110110111111001011010111010100100111001111001110010000001111010001010000010010001111011111011110110110001111001111001110010000001 e79c81e89cb4efbe8eee87b7e5aea4e79c81e8a091efbdb1e79c81
UHC 省???室省??省 11100000111111010011111100111111001111111110001111111000111000001111110100111111001111111110000011111101 e0fd3f3f3fe3f8e0fd3f3fe0fd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)