To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 劑?m雍???m雍? 100110011001110100111111100000101000110111101000101101000011111100111111001111111000001010001101111010001011010000111111 999d3f828de8b43f3f3f828de8b43f
EUC-JP 劑?m雍???m雍? 110100011111110100111111101000111110110111110000101101100011111100111111001111111010001111101101111100001011011000111111 d1fd3fa3edf0b63f3f3fa3edf0b63f
UTF-8 劑屢m雍꿸렍屢m雍굻 111001011000101010010001111011111010010110001011111011111011110110001101111010011001101110001101111010101011111110111000111010111010000010001101111011111010010110001011111011111011110110001101111010011001101110001101111010101011010110111011 e58a91efa58befbd8de99b8deabfb8eba08defa58befbd8de99b8deab5bb
UHC 劑屢m雍꿸렍屢m雍굻 1111000010100101110100101110010110100011111011011110100010111100101100101110101010001110101000111101001011100101101000111110110111101000101111001011000110111111 f0a5d2e5a3ede8bcb2ea8ea3d2e5a3ede8bcb1bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)