Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??意??釉??艶l?依③?蟻??堯 11100001100111110011111100111111100010001101001100111111001111111110011111010110001111110011111110001001100100001000001010001100001111111000100011001011100001110100001000111111100010110110000100111111001111111110101010011111 e19f3f3f88d33f3fe7d63f3f8990828c3f88cb87423f8b613f3fea9f
EUC-JP 癲??意??釉??艶l?依??蟻??堯 111000101010000100111111001111111011000011010101001111110011111111101110110110000011111100111111101100011111000010100011111011000011111110110000110011010011111100111111101101011100001000111111001111111111010010100001 e2a13f3fb0d53f3feed83f3fb1f0a3ec3fb0cd3f3fb5c23f3ff4a1
UTF-8 癲덈챶意덃룚釉앹춻艶l뮆依③쉬蟻숇걗堯 111001111001100110110010111010111000110110001000111011001011000110110110111001101000010010001111111010111000110110000011111010111010001110011010111010011000011110001001111011001001010110111001111011001011011010111011111010001000100110110110111011111011110110001100111010111010111010000110111001001011111010011101111000101001000110100010111011001000100110101100111010001001111110111011111011001000100010000111111010101011000110010111111001011010000010101111 e799b2eb8d88ecb1b6e6848feb8d83eba39ae98789ec95b9ecb6bbe889b6efbd8cebae86e4be9de291a2ec89ace89fbbec8887eab197e5a0af
UHC 癲덈챶意덃룚釉앹춻艶l뮆依③쉬蟻숇걗堯 1110111110100110100010001110101110101010100000111110101111110010100010001110011010001111100101101110101110111000100111011110110010101101100101111110011011111101101000111110110010010010100101011110101111101110101010001110100110111101101011001110101111111100100110011110101110000001100000101110100011101011 efa688ebaa83ebf288e68f96ebb89decad97e6fda3ec9295ebeea8e9bdacebfc99eb8182e8eb

SJIS-Win, EUC-JP: classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
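
The conversion shown in the table can be approximated offline with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function name `bit_strings` and the `CODECS` mapping are hypothetical, and it assumes that Python's `cp932` and `cp949` codecs are close equivalents of SJIS-WIN and UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of `3f` in the table.

```python
# Map the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(CODECS[charset], errors="replace")
    # Render each byte as 8 binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for name in CODECS:
        binary, hexadecimal = bit_strings("あ", name)
        print(name, binary, hexadecimal)
```

For example, `bit_strings("あ", "UTF-8")` yields the three bytes `e38182`, while the same character in ISO-8859-1 falls back to a single `3f`.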