To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 贈絅??贈絅??^ 10010001101000011110001101000100001111110011111110010001101000011110001101000100001111110011111101011110 91a1e3443f3f91a1e3443f3f5e
EUC-JP 贈絅??贈絅??^ 11000010101000111110010110100101001111110011111111000010101000111110010110100101001111110011111101011110 c2a3e5a53f3fc2a3e5a53f3f5e
UTF-8 贈絅렯슨贈絅렯슨^ 11101000101101001000100011100111101101011000010111101011101000001010111111101100100010101010100011101000101101001000100011100111101101011000010111101011101000001010111111101100100010101010100001011110 e8b488e7b585eba0afec8aa8e8b488e7b585eba0afec8aa85e
UHC 贈絅렯슨贈絅렯슨^ 1111000111111100110011001110011110001110101111001011110110111100111100011111110011001100111001111000111010111100101111011011110001011110 f1fccce78ebcbdbcf1fccce78ebcbdbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)