To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 第???虞?楮? 1001000111100110001111110011111100111111100010111111000100111111100111101011100000111111 91e63f3f3f8bf13f9eb83f
EUC-JP 第???虞?楮? 1100001011101000001111110011111100111111101101101111001100111111110111001011101000111111 c2e83f3f3fb6f33fdcba3f
UTF-8 第얩렍렖虞렓楮렡 111001111010110010101100111011001001011010101001111010111010000010001101111010111010000010010110111010001001100110011110111010111010000010010011111001101010010110101110111010111010000010100001 e7acacec96a9eba08deba096e8999eeba093e6a5aeeba0a1
UHC 第얩렍렖虞렓楮렡 11110000101011111011111011101101100011101010001110001110101010111110100111100101100011101010100011101110101111111000111010110010 f0afbeed8ea38eabe9e58ea8eebf8eb2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)