Which bit string is a character (or a short run of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 窈??矣?.? 11100010011101110011111100111111111000011110000100111111100000010100010000111111 e2773f3fe1e13f81443f
EUC-JP 窈??矣?.? 11100011110110000011111100111111111000101110001100111111101000011010010100111111 e3d83f3fe2e33fa1a53f
UTF-8 窈뚮쓣矣㏝.溜 111001111010101010001000111010111001101010101110111011001001001110100011111001111001111110100011111000111000111110011101111011111011110010001110111011111010011110001011 e7aa88eb9aaeec93a3e79fa3e38f9defbc8eefa78b
UHC 窈뚮쓣矣㏝.溜 1110100110100001100011001110101110011101100001001110101111111000101001111110100110100011101011101110101011111110 e9a18ceb9d84ebf8a7e9a3aeeafe

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
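
The same kind of table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper name `encode_table` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions, and Python's codecs may differ from the converter's in edge cases. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` seen in the table above.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper for illustration only.
    """
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.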