What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 遯壱圷粤主ェ嶢褄 111001111010101010001000111010111001101010101000111000101110001110001110111001011010101010011011110100001110010111101011 e7aa88eb9aa8e2e38ee5aa9bd0e5eb
EUC-JP 遯壱圷粤主ェ嶢褄 11101110101011001011000011101101110101001010101011100100111001011011110011100111100011101010101011010110110100101110101011101101 eeacb0edd4aae4e5bce78eaad6d2eaed
UTF-8 遯壱圷粤主ェ嶢褄 111010011000000110101111111001011010001110110001111001011001110010110111111001111011001010100100111001001011100010111011111011111011110110101010111001011011011010100010111010001010010010000100 e981afe5a3b1e59cb7e7b2a4e4b8bbefbdaae5b6a2e8a484
UHC 遯???主?嶢? 1101010011101110001111110011111100111111111100011010101100111111111010001111001000111111 d4ee3f3f3ff1ab3fe8f23f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
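
For readers who want to reproduce this kind of table offline, here is a minimal Python sketch. It is not this page's implementation, which is unknown. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and so is replacing unencodable characters with `?` (0x3f), which is what the ISO-8859-1 and UHC rows above appear to do. The names `CHARSETS` and `encode_table` are hypothetical. Python's conversion tables may differ from the page's for some characters, so the output is not guaranteed to match byte for byte.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

With the input `あ`, the UTF-8 row is the three bytes `e38182`, while the ISO-8859-1 row falls back to a single `3f` because Latin-1 has no hiragana.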