What bit string is a character (or a short string) encoded as in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | ??????褥 | 0011111100111111001111110011111100111111001111111110010111110001 | 3f3f3f3f3f3fe5f1
EUC-JP | 旿??縕??褥 | 100011111100000111110100001111110011111110001111110101001100001000111111001111111110101011110011 | 8fc1f43f3f8fd4c23f3feaf3
UTF-8 | 旿딉슁縕됧퉬褥 | 111001101001011110111111111010111001010010001001111011001000101010000001111001111011100010010101111010111001000010100111111011011000100110101100111010001010010010100101 | e697bfeb9489ec8a81e7b895eb90a7ed89ace8a4a5
UHC | 旿딉슁縕됧퉬褥 | 1110011111111010100010101110111110111101101100111110100010110010100010011110010110111001100001001110100110110011 | e7fa8aefbdb3e8b289e5b984e9b3

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
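
The conversion in the table above can be roughly reproduced offline. The following is a minimal Python sketch, not the page's actual implementation. The names CHARSETS, to_bit_strings, and convert are hypothetical, and the mapping from the page's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption. Characters a charset cannot represent are replaced with "?" (0x3f), as in the table. Python's conversion tables may differ from the page's for rare characters, so results are not guaranteed to match exactly.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.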