To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 渉寀ム曺渼ムシ 1000111111000010111110101010011111010001111110101101111011111011010010011101000110111100 8fc2faa7d1fadefb49d1bc
EUC-JP 渉寀ム曺渼ムシ 1011111011000100100011111011101011011011100011101101000110001111110000101110110110001111110001111111000010001110110100011000111010111100 bec48fbadb8ed18fc2ed8fc7f08ed18ebc
UTF-8 渉寀ム曺渼ムシ 111001101011100010001001111001011010111110000000111011111011111010010001111001101001101110111010111001101011100010111100111011111011111010010001111011111011110110111100 e6b889e5af80efbe91e69bbae6b8bcefbe91efbdbc
UHC ?寀?曺渼?? 00111111111100111111001000111111111100001100011011011010101101000011111100111111 3ff3f23ff0c6dab43f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)