To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??意??苡〓? 11101001111100100011111100111111100010001101001100111111001111111110010010001111100000011010110000111111 e9f23f3f88d33f3fe48f81ac3f
EUC-JP 鶯??意??苡〓? 11110010111101000011111100111111101100001101010100111111001111111110011111101111101000101010111000111111 f2f43f3fb0d53f3fe7efa2ae3f
UTF-8 鶯방찇意뚨맘苡〓쪕 111010011011011010101111111010111011000010101001111011001011000010000111111001101000010010001111111010111001101010101000111010111010011110011000111010001000101110100001111000111000000010010011111011001010101010010101 e9b6afebb0a9ecb087e6848feb9aa8eba798e88ba1e38093ecaa95
UHC 鶯방찇意뚨맘苡〓쪕 111001011010001110111001111001101010100110001011111010111111001010001100111001111011100010111110111011001011111010100001111010111010010110001111 e5a3b9e6a98bebf28ce7b8beecbea1eba58f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)