To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 倭??汚??齬↑?? 1001100001100000001111110011111110001001100110000011111100111111111010101001011110000001101010100011111100111111 98603f3f89983f3fea9781aa3f3f
EUC-JP 倭??汚??齬↑?縯 11001111110000010011111100111111101100011111100000111111001111111111001111110111101000101010110000111111100011111101010011001011 cfc13f3fb1f83f3ff3f7a2ac3f8fd4cb
UTF-8 倭듸쉐汚띄떁齬↑퓴縯 111001011000000010101101111010111001001110111000111011001000100110010000111001101011000110011010111010111001110110000100111010111001011010000001111010011011110110101100111000101000011010010001111011011001001110110100111001111011100010101111 e580adeb93b8ec8990e6b19aeb9d84eb9681e9bdace28691ed93b4e7b8af
UHC 倭듸쉐汚띄떁齬↑퓴縯 1110100011011110101101011110111110111101101001101110011111111101101101101110011110001011100101111110010111100001101000011110100010111111100110101110011011100000 e8deb5efbda6e7fdb6e78b97e5e1a1e8bf9ae6e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)