What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???塢??悅?? 0011111100111111001111111001101011000111001111110011111111111010101111010011111100111111 3f3f3f9ac73f3ffabd3f3f
EUC-JP ???塢????? 00111111001111110011111111010100110010010011111100111111001111110011111100111111 3f3f3fd4c93f3f3f3f3f
UTF-8 閱ㅷ룑塢곭븻悅롨뫗 111010011001011010110001111000111000010110110111111010111010001110010001111001011010000110100010111010101011001110101101111010111011100010111011111001101000001010000101111010111010000110101000111010111010101110010111 e996b1e385b7eba391e5a1a2eab3adebb8bbe68285eba1a8ebab97
UHC 閱ㅷ룑塢곭븻悅롨뫗 111001101111001110100100111001111000111110001110111001111111000110000001111001111001010110100100111001101110110110001110111010001001000110111001 e6f3a4e78f8ee7f181e795a4e6ed8ee891b9

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f) in that charset's row.
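
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's actual implementation: the function name encode_to_bits and the CHARSETS mapping are hypothetical, and the Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumed to be close equivalents whose mappings may differ from the tool's for a few characters. Unencodable characters are replaced with "?" (byte 3f), which mirrors the table rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "あ"  # hypothetical input; any short string works
    for label, codec in CHARSETS.items():
        bits, hexa = encode_to_bits(sample, codec)
        print(label, bits, hexa)
```

Each byte is printed as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.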