To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??意??酉??艶l?猷??濡ル?魚 11100001100111110011111100111111100010001101001100111111001111111001001111010001001111110011111110001001100100001000001010001100001111111001011101010001001111110011111110010100010001111000001110001011001111111000101110011011 e19f3f3f88d33f3f93d13f3f8990828c3f97513f3f9447838b3f8b9b
EUC-JP 癲??意??酉??艶l?猷??濡ル?魚 11100010101000010011111100111111101100001101010100111111001111111100011011010011001111110011111110110001111100001010001111101100001111111100110110110010001111110011111111000111101010001010010111101011001111111011010111111011 e2a13f3fb0d53f3fc6d33f3fb1f0a3ec3fcdb23f3fc7a8a5eb3fb5fb
UTF-8 癲덈챶意덃룚酉몄맋艶l뮆猷녺춯濡ル렲魚 111001111001100110110010111010111000110110001000111011001011000110110110111001101000010010001111111010111000110110000011111010111010001110011010111010011000010110001001111010111010101010000100111010111010011110001011111010001000100110110110111011111011110110001100111010111010111010000110111001111000110010110111111010111000010110111010111011001011011010101111111001101011111110100001111000111000001110101011111010111010000010110010111010011010110110011010 e799b2eb8d88ecb1b6e6848feb8d83eba39ae98589ebaa84eba78be889b6efbd8cebae86e78cb7eb85baecb6afe6bfa1e383abeba0b2e9ad9a
UHC 癲덈챶意덃룚酉몄맋艶l뮆猷녺춯濡ル렲魚 1110111110100110100010001110101110101010100000111110101111110010100010001110011010001111100101101110101110110111101110001110110010010000101000111110011011111101101000111110110010010010100101011110101110100011100001101110011110101101100011001110101110100001101010111110101110001110101111111110010111100000 efa688ebaa83ebf288e68f96ebb7b8ec90a3e6fda3ec9295eba386e7ad8ceba1abeb8ebfe5e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)