What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 霑「莉呎i霑「莉呎iB 11101000101111111010001011100100101110111001100111100110100000101000100111101000101111111010001011100100101110111001100111100110100000101000100101000010 e8bfa2e4bb99e68289e8bfa2e4bb99e6828942
EUC-JP 霑「莉呎i霑「莉呎iB 111100001100000110001110101000101110100010111101110100101110100010100011111010011111000011000001100011101010001011101000101111011101001011101000101000111110100101000010 f0c18ea2e8bdd2e8a3e9f0c18ea2e8bdd2e8a3e942
UTF-8 霑「莉呎i霑「莉呎iB 11101001100111001001000111101111101111011010001011101000100011101000100111100101100100011000111011101111101111011000100111101001100111001001000111101111101111011010001011101000100011101000100111100101100100011000111011101111101111011000100101000010 e99c91efbda2e88e89e5918eefbd89e99c91efbda2e88e89e5918eefbd8942
UHC 霑?莉?i霑?莉?iB 1110111111000101001111111101011111101001001111111010001111101001111011111100010100111111110101111110100100111111101000111110100101000010 efc53fd7e93fa3e9efc53fd7e93fa3e942

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
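
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the helper name encode_table. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("B"):
        print(label, bits, hex_string)
```

For the ASCII letter "B", every charset listed here yields the same single byte, 01000010 (hex 42), which matches the final byte of each row in the table.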