To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 竪属尊竪足揃奪属存竪属尊竪足揃奪属存B 10010010010001111001000110101110100100011011100010010010010001111001000110101011100100011011010110010010010001001001000110101110100100011011011010010010010001111001000110101110100100011011100010010010010001111001000110101011100100011011010110010010010001001001000110101110100100011011011001000010 924791ae91b8924791ab91b5924491ae91b6924791ae91b8924791ab91b5924491ae91b642
EUC-JP 竪属尊竪足揃奪属存竪属尊竪足揃奪属存B 11000011101010001100001010110000110000101011101011000011101010001100001010101101110000101011011111000011101001011100001010110000110000101011100011000011101010001100001010110000110000101011101011000011101010001100001010101101110000101011011111000011101001011100001010110000110000101011100001000010 c3a8c2b0c2bac3a8c2adc2b7c3a5c2b0c2b8c3a8c2b0c2bac3a8c2adc2b7c3a5c2b0c2b842
UTF-8 竪属尊竪足揃奪属存竪属尊竪足揃奪属存B 11100111101010111010101011100101101100011001111011100101101100001000101011100111101010111010101011101000101101101011001111100110100011111000001111100101101001011010101011100101101100011001111011100101101011011001100011100111101010111010101011100101101100011001111011100101101100001000101011100111101010111010101011101000101101101011001111100110100011111000001111100101101001011010101011100101101100011001111011100101101011011001100001000010 e7abaae5b19ee5b08ae7abaae8b6b3e68f83e5a5aae5b19ee5ad98e7abaae5b19ee5b08ae7abaae8b6b3e68f83e5a5aae5b19ee5ad9842
UHC 竪?尊竪足?奪?存竪?尊竪足?奪?存B 11100010101101010011111111110000111011101110001010110101111100001110101100111111111101111010110000111111111100001110110111100010101101010011111111110000111011101110001010110101111100001110101100111111111101111010110000111111111100001110110101000010 e2b53ff0eee2b5f0eb3ff7ac3ff0ede2b53ff0eee2b5f0eb3ff7ac3ff0ed42

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
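
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own code. The codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumed equivalents from Python's standard library, and the helper name encode_in_charsets is hypothetical. Characters a charset cannot represent are replaced with "?" (0x3f), which mirrors the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The original tool may use slightly different conversion tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary bit string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("竪B").items():
        print(label, bits, hex_string)
```

For the input "竪B" the hex column should agree with the leading and trailing bytes of the corresponding table rows, for example e7abaa42 for UTF-8 and 924742 for SJIS-WIN.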