To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鳶??韋??溢??儒??繹??B 100100111100111000111111001111111110100011101000001111110011111110001000111011000011111100111111100011101111001000111111001111111110001110001000001111110011111101000010 93ce3f3fe8e83f3f88ec3f3f8ef23f3fe3883f3f42
EUC-JP 鳶??韋??溢??儒??繹??B 110001101101000000111111001111111111000011101010001111110011111110110000111011100011111100111111101111001111010000111111001111111110010111101000001111110011111101000010 c6d03f3ff0ea3f3fb0ee3f3fbcf43f3fe5e83f3f42
UTF-8 鳶롫끏韋귝틦溢믥땸儒용걗繹먰릳B 11101001101100111011011011101011101000011010101111101011100000011000111111101001100111111000101111101010101101111001110111101101100010111010011011100110101110101010001011101011101011111010010111101011100101011011100011100101100001001001001011101100100110101010100111101010101100011001011111100111101110011011100111101011101010001011000011101011101001101011001101000010 e9b3b6eba1abeb818fe99f8beab79ded8ba6e6baa2ebafa5eb95b8e58492ec9aa9eab197e7b9b9eba8b0eba6b342
UHC 鳶롫끏韋귝틦溢믥땸儒용걗繹먰릳B 11100110111010011000111011101011100001011011111111101010110111111000001011100110101110101001000011101100111011101001001011100111100010111000111011101010111000111011111111101011100000011000001011100110101110101001000011101101100100001001001001000010 e6e98eeb85bfeadf82e6ba90ecee92e78b8eeae3bfeb8182e6ba90ed909242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)