To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 丈ソ柴濵宍シャ上贖 100011111110010010111111100011101100010011111011010011011000111010110011101111001010110010001111111000111110011011011100 8fe4bf8ec4fb4d8eb3bcac8fe3e6dc
EUC-JP 丈ソ柴濵宍シャ上贖 10111110111001101000111010111111101111001100011010001111110010011010011010111100101101011000111010111100100011101010110010111110111001011110110011011110 bee68ebfbcc68fc9a6bcb58ebc8eacbee5ecde
UTF-8 丈ソ柴濵宍シャ上贖 111001001011100010001000111011111011110110111111111001101001111110110100111001101011111110110101111001011010111010001101111011111011110110111100111011111011110110101100111001001011100010001010111010001011010010010110 e4b888efbdbfe69fb4e6bfb5e5ae8defbdbcefbdace4b88ae8b496
UHC 丈?柴????上贖 11101101110110110011111111100011110000110011111100111111001111110011111111011111101111101110000111011011 eddb3fe3c33f3f3f3fdfbee1db

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)