To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????寃??汚 001111110011111100111111001111110011111100111111100110111000001100111111001111111000100110011000 3f3f3f3f3f3f9b833f3f8998
EUC-JP ???沅??寃??汚 0011111100111111001111111000111111000110111010010011111100111111110101011110001100111111001111111011000111111000 3f3f3f8fc6e93f3fd5e33f3fb1f8
UTF-8 樂띿슦沅좄굢寃쎈쿅汚 111011111010011010111111111010111001110110111111111011001000101010100110111001101011001010000101111011001010001010000100111010101011010110100010111001011010111110000011111011001000111010001000111011001011111110000101111001101011000110011010 efa6bfeb9dbfec8aa6e6b285eca284eab5a2e5af83ec8e88ecbf85e6b19a
UHC 樂띿슦沅좄굢寃쎈쿅汚 1110100011111001100011011110110010011010101100001110101010110110101000001110100010000010100010011110101010110010101111011110101110110010100110101110011111111101 e8f98dec9ab0eab6a0e88289eab2bdebb29ae7fd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)