To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????喩??? 0011111100111111001111110011111100111111001111111001101001100111001111110011111100111111 3f3f3f3f3f3f9a673f3f3f
EUC-JP ??????喩??孼 00111111001111110011111100111111001111110011111111010011110010000011111100111111100011111011101011000011 3f3f3f3f3f3fd3c83f3f8fbac3
UTF-8 閱묐끂留붷칰喩붾꺑孼 111010011001011010110001111010111010110010010000111010111000000110000010111011111010011110001101111010111011011010110111111011001011100110110000111001011001011010101001111010111011011010111110111010101011101010010001111001011010110110111100 e996b1ebac90eb8182efa78debb6b7ecb9b0e596a9ebb6beeaba91e5adbc
UHC 閱묐끂留붷칰喩붾꺑孼 1110011011110011100100011110101110000101101110001110101110100111100101001110010110101111100000111110101011100111100101001110101110000011101101111110010111101101 e6f391eb85b8eba794e5af83eae794eb83b7e5ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)