To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?b?鎰?┷猷⑥??b?鎰?┷猷⑥?^ 0011111110000010100000100011111111101000010011000011111110000100101110001001011101010001100001110100010100111111001111111000001010000010001111111110100001001100001111111000010010111000100101110101000110000111010001010011111101011110 3f82823fe84c3f84b8975187453f3f82823fe84c3f84b8975187453f5e
EUC-JP 渶b?鎰?┷猷??渶b?鎰?┷猷??^ 10001111110001111110110110100011111000100011111111101111101011010011111110101000101110101100110110110010001111110011111110001111110001111110110110100011111000100011111111101111101011010011111110101000101110101100110110110010001111110011111101011110 8fc7eda3e23fefad3fa8bacdb23f3f8fc7eda3e23fefad3fa8bacdb23f3f5e
UTF-8 渶b뫅鎰뺧┷猷⑥끼渶b뫅鎰뺧┷猷⑥쾳^ 11100110101110001011011011101111101111011000001011101011101010111000010111101001100011101011000011101011101110101010011111100010100101001011011111100111100011001011011111100010100100011010010111101011100000011011110011100110101110001011011011101111101111011000001011101011101010111000010111101001100011101011000011101011101110101010011111100010100101001011011111100111100011001011011111100010100100011010010111101100101111101011001101011110 e6b8b6efbd82ebab85e98eb0ebbaa7e294b7e78cb7e291a5eb81bce6b8b6efbd82ebab85e98eb0ebbaa7e294b7e78cb7e291a5ecbeb35e
UHC 渶b뫅鎰뺧┷猷⑥끼渶b뫅鎰뺧┷猷⑥쾳^ 11100111101101111010001111100010100100011010100011101100111100001001010111101111101001101011101011101011101000111010100011101100101100111010001011100111101101111010001111100010100100011010100011101100111100001001010111101111101001101011101011101011101000111010100011101100101100101000100101011110 e7b7a3e291a8ecf095efa6baeba3a8ecb3a2e7b7a3e291a8ecf095efa6baeba3a8ecb2895e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)