What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | 省ｱ而爵柀ｱ而杓 | 1000111111001000101100011000111010100111100011101101110111111010111001001011000110001110101001111000111011011011 | 8fc8b18ea78eddfae4b18ea78edb |
| EUC-JP | 省ｱ而爵柀ｱ而杓 | 1011111011001010100011101011000110111100101010011011110011011111100011111100001110111001100011101011000110111100101010011011110011011101 | beca8eb1bca9bcdf8fc3b98eb1bca9bcdd |
| UTF-8 | 省ｱ而爵柀ｱ而杓 | 111001111001110010000001111011111011110110110001111010001000000010001100111001111000100010110101111001101001111110000000111011111011110110110001111010001000000010001100111001101001110110010011 | e79c81efbdb1e8808ce788b5e69f80efbdb1e8808ce69d93 |
| UHC | 省?而爵??而杓 | 11100000111111010011111111101100101110111110110111001001001111110011111111101100101110111111100011110101 | e0fd3fecbbedc93f3fecbbf8f5 |

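SJIS-WIN, EUC-JP: Legacy character sets mainly used for Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP). A "?" in the Character column (byte 3f) marks a character that the target charset cannot represent.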
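
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the converter's own implementation. The function name `encode_all` and the mapping from table labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions made for illustration. Python's codec tables may differ from the tool's for some characters. Unencodable characters are replaced with "?" (byte 3f), mirroring the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes b"?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("省").items():
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.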