To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 遙??約↑?艶?? 11101010101000010011111100111111100101101111000110000001101010100011111110001001100100000011111100111111 eaa13f3f96f181aa3f89903f3f
EUC-JP 遙??約↑?艶?? 11110100101000110011111100111111110011001111001110100010101011000011111110110001111100000011111100111111 f4a33f3fccf3a2ac3fb1f03f3f
UTF-8 遙뤹뼯約↑츘艶⒵빊 111010011000000110011001111010111010010010111001111010111011110010101111111001111011010010000100111000101000011010010001111011001011100010011000111010001000100110110110111000101001001010110101111010111011100110001010 e98199eba4b9ebbcafe7b484e28691ecb898e889b6e292b5ebb98a
UHC 遙뤹뼯約↑츘艶⒵빊 111010011010101110001111111001111001011010110010111001011011001110100001111010001010111010010010111001101111110110101001111001101001010110110000 e9ab8fe796b2e5b3a1e8ae92e6fda9e695b0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)