What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ???欲??? 0011111100111111001111111001011101111110001111110011111100111111 3f3f3f977e3f3f3f
EUC-JP 獒??欲??? 10001111110010111011101100111111001111111100110111011111001111110011111100111111 8fcbbb3f3fcddf3f3f3f
UTF-8 獒듸풛欲곤쉥輦 111001111000110110010010111010111001001110111000111011011001001010011011111001101010110010110010111010101011001110100100111011001000100110100101111011111010011010011000 e78d92eb93b8ed929be6acb2eab3a4ec89a5efa698
UHC 獒듸풛欲곤쉥輦 1110100010100011101101011110111110111110100111101110100110110000101100001110111110111101101010111110011011100100 e8a3b5efbe9ee9b0b0efbdabe6e4

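SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (hex 3f) in the Character column marks a character that the charset cannot represent.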
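
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names are Python's own, and mapping them to the labels above (cp932 for SJIS-WIN, cp949 for UHC) is an assumption, so the output for some characters may differ from the table. The helper names encode_to_bits and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (3f).

```python
# Page label -> Python codec name. This mapping is an assumption:
# cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("欲").items():
        print(label, binary, hexadecimal)
```

For the single character 欲, this yields e6acb2 in UTF-8, 977e in cp932, and cddf in EUC-JP, which agrees with the byte sequences that appear for that character in the table rows above.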