To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?f?^}Y?f?^}bE 00111111011001100011111101011110011111010101100100111111011001100011111101011110011111010110001001000101 3f663f5e7d593f663f5e7d6245
SJIS-WIN 西f西^}Y西f西^}bE 1001000010111100011001101001000010111100010111100111110101011001100100001011110001100110100100001011110001011110011111010110001001000101 90bc6690bc5e7d5990bc6690bc5e7d6245
EUC-JP 西f西^}Y西f西^}bE 1100000010111110011001101100000010111110010111100111110101011001110000001011111001100110110000001011111001011110011111010110001001000101 c0be66c0be5e7d59c0be66c0be5e7d6245
UTF-8 西f西^}Y西f西^}bE 111010001010010110111111011001101110100010100101101111110101111001111101010110011110100010100101101111110110011011101000101001011011111101011110011111010110001001000101 e8a5bf66e8a5bf5e7d59e8a5bf66e8a5bf5e7d6245
UHC 西f西^}Y西f西^}bE 1110000010100100011001101110000010100100010111100111110101011001111000001010010001100110111000001010010001011110011111010110001001000101 e0a466e0a45e7d59e0a466e0a45e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)