To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 竪即揃奪造足巽則造竪即揃奪造足巽則造B 10010010010001111001000110100110100100011011010110010010010001001001000110100010100100011010101110010010010001101001000110100101100100011010001010010010010001111001000110100110100100011011010110010010010001001001000110100010100100011010101110010010010001101001000110100101100100011010001001000010 924791a691b5924491a291ab924691a591a2924791a691b5924491a291ab924691a591a242
EUC-JP 竪即揃奪造足巽則造竪即揃奪造足巽則造B 11000011101010001100001010101000110000101011011111000011101001011100001010100100110000101010110111000011101001111100001010100111110000101010010011000011101010001100001010101000110000101011011111000011101001011100001010100100110000101010110111000011101001111100001010100111110000101010010001000010 c3a8c2a8c2b7c3a5c2a4c2adc3a7c2a7c2a4c3a8c2a8c2b7c3a5c2a4c2adc3a7c2a7c2a442
UTF-8 竪即揃奪造足巽則造竪即揃奪造足巽則造B 11100111101010111010101011100101100011011011001111100110100011111000001111100101101001011010101011101001100000001010000011101000101101101011001111100101101101111011110111100101100010011000011111101001100000001010000011100111101010111010101011100101100011011011001111100110100011111000001111100101101001011010101011101001100000001010000011101000101101101011001111100101101101111011110111100101100010011000011111101001100000001010000001000010 e7abaae58db3e68f83e5a5aae980a0e8b6b3e5b7bde58987e980a0e7abaae58db3e68f83e5a5aae980a0e8b6b3e5b7bde58987e980a042
UHC 竪??奪造足巽則造竪??奪造足巽則造B 111000101011010100111111001111111111011110101100111100001110001111110000111010111110000111011110111101101100111011110000111000111110001010110101001111110011111111110111101011001111000011100011111100001110101111100001110111101111011011001110111100001110001101000010 e2b53f3ff7acf0e3f0ebe1def6cef0e3e2b53f3ff7acf0e3f0ebe1def6cef0e342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)