To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????罐??? 0011111100111111001111110011111100111111001111111110001110100011001111110011111100111111 3f3f3f3f3f3fe3a33f3f3f
EUC-JP ???沅??罐??? 00111111001111110011111110001111110001101110100100111111001111111110011010100101001111110011111100111111 3f3f3f8fc6e93f3fe6a53f3f3f
UTF-8 樂끸뫁沅뤸껸罐易곩뭣 111011111010011010111111111010111000000110111000111010111010101110000001111001101011001010000101111010111010010010111000111010101011101110111000111001111011110110010000111011111010011110100000111010101011001110101001111010111010110110100011 efa6bfeb81b8ebab81e6b285eba4b8eabbb8e7bd90efa7a0eab3a9ebada3
UHC 樂끸뫁沅뤸껸罐易곩뭣 1110100011111001100001011110001010010001101001011110101010110110100011111110011010110010101110011100111010111000111011001010111110000001111001011011100110111101 e8f985e291a5eab68fe6b2b9ceb8ecaf81e5b9bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)