To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 鈺??????宜 11111011110001000011111100111111001111110011111100111111001111111000101101011000 fbc43f3f3f3f3f3f8b58
EUC-JP 鈺??沅???宜 10001111111000111101010100111111001111111000111111000110111010010011111100111111001111111011010110111001 8fe3d53f3f8fc6e93f3f3fb5b9
UTF-8 鈺쎌렲沅녺퉯짰宜 111010011000100010111010111011001000111010001100111010111010000010110010111001101011001010000101111010111000010110111010111011011000100110101111111011001010011110110000111001011010111010011100 e988baec8e8ceba0b2e6b285eb85baed89afeca7b0e5ae9c
UHC 鈺쎌렲沅녺퉯짰宜 11101000101011011011110111101100100011101011111111101010101101101000011011100111101110011000011111000010101011101110101111110001 e8adbdec8ebfeab686e7b987c2aeebf1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)