To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 菴?????蟻?B 1110010010111101001111110011111100111111001111110011111110001011011000010011111101000010 e4bd3f3f3f3f3f8b613f42
EUC-JP 菴?????蟻?B 1110100010111111001111110011111100111111001111110011111110110101110000100011111101000010 e8bf3f3f3f3f3fb5c23f42
UTF-8 菴꿔굥梨욘룚蟻퍂B 11101000100011111011010011101010101111111001010011101010101101011010010111101111101001111010001011101100100110101001100011101011101000111001101011101000100111111011101111101101100011011000001001000010 e88fb4eabf94eab5a5efa7a2ec9a98eba39ae89fbbed8d8242
UHC 菴꿔굥梨욘룚蟻퍂B 1110010011100000101100101110001110000010100010111110110010110001101111111110011010001111100101101110101111111100101110110111010101000010 e4e0b2e3828becb1bfe68f96ebfcbb7542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)