What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????^ | 001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f5e
SJIS-WIN | ?然槽埇?然槽埇^ | 001111111001000101010010100100011000010111111010100110100011111110010001010100101001000110000101111110101001101001011110 | 3f91529185fa9a3f91529185fa9a5e
EUC-JP | ?然槽埇?然槽埇^ | 0011111111000001101100111100000111100101100011111011011111100111001111111100000110110011110000011110010110001111101101111110011101011110 | 3fc1b3c1e58fb7e73fc1b3c1e58fb7e75e
UTF-8 | 뤗然槽埇뤗然槽埇^ | 11101011101001001001011111100111100001001011011011100110101001111011110111100101100111111000011111101011101001001001011111100111100001001011011011100110101001111011110111100101100111111000011101011110 | eba497e784b6e6a7bde59f87eba497e784b6e6a7bde59f875e
UHC | 뤗然槽埇뤗然槽埇^ | 1000111111000111111001101101010011110000110010111110100110111001100011111100011111100110110101001111000011001011111010011011100101011110 | 8fc7e6d4f0cbe9b98fc7e6d4f0cbe9b95e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
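
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper name encode_row is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unmappable characters into "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Each byte is rendered as exactly eight binary digits.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hex_string = encode_row("然^", codec)
        print(label, bits, hex_string)
```

Python's codec tables may differ from this page's in edge cases (vendor extension characters, for instance), so individual byte values are not guaranteed to match for every input.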