To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 癌??湲????? 1000101011100000001111110011111110011111110100010011111100111111001111110011111100111111 8ae03f3f9fd13f3f3f3f3f
EUC-JP 癌??湲????? 1011010011100010001111110011111111011110110100110011111100111111001111110011111100111111 b4e23f3fded33f3f3f3f3f
UTF-8 癌뺤옺湲멱뿈礪껊쭏 111001111001100110001100111010111011101010100100111011001001100010111010111001101011100110110010111010111010100110110001111010111011111110001000111011111010011010000101111010101011101110001010111011001010110110001111 e7998cebbaa4ec98bae6b9b2eba9b1ebbf88efa685eabb8aecad8f
UHC 癌뺤옺湲멱뿈礪껊쭏 111001001101111110010101111011001001111010110000111010101011100010111000111010001001011110001111111001101010011110000011111010111010011110001000 e4df95ec9eb0eab8b8e8978fe6a783eba788

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)