To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 受〓+???猷?4徇 10001110111100111000000110101100100000010111101100111111001111110011111110010111010100010011111110000010010100111001110001101101 8ef381ac817b3f3f3f97513f82539c6d
EUC-JP 受〓+???猷?4徇 10111100111101011010001010101110101000011101110000111111001111110011111111001101101100100011111110100011101101001101011111001110 bcf5a2aea1dc3f3f3fcdb23fa3b4d7ce
UTF-8 受〓+若녈궊猷띠4徇 111001011000111110010111111000111000000010010011111011111011110010001011111011111010010110110100111010111000010110001000111010101011011010001010111001111000110010110111111010111001110110100000111011111011110010010100111001011011111010000111 e58f97e38093efbc8befa5b4eb8588eab68ae78cb7eb9da0efbc94e5be87
UHC 受〓+若녈궊猷띠4徇 1110000111110100101000011110101110100011101010111110010110101110101100111110001110000010101000011110101110100011101101101110110010100011101101001110001011011111 e1f4a1eba3abe5aeb3e382a1eba3b6eca3b4e2df

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
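
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the table's charset names to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the helper names `encode_row` and `convert` are made up for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table; individual mappings may still differ slightly from the tool's.

```python
# Assumed mapping from the table's charset names to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexed) in convert("あ").items():
        print(name, bits, hexed)
```

Each byte is printed as eight bits, so the binary column is always four times as long as the hexadecimal column.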