To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?f?^}Y?f?^}bE 00111111011001100011111101011110011111010101100100111111011001100011111101011110011111010110001001000101 3f663f5e7d593f663f5e7d6245
SJIS-WIN 什f什^}Y什f什^}bE 1000111101011001011001101000111101011001010111100111110101011001100011110101100101100110100011110101100101011110011111010110001001000101 8f59668f595e7d598f59668f595e7d6245
EUC-JP 什f什^}Y什f什^}bE 1011110110111010011001101011110110111010010111100111110101011001101111011011101001100110101111011011101001011110011111010110001001000101 bdba66bdba5e7d59bdba66bdba5e7d6245
UTF-8 什f什^}Y什f什^}bE 111001001011101110000000011001101110010010111011100000000101111001111101010110011110010010111011100000000110011011100100101110111000000001011110011111010110001001000101 e4bb8066e4bb805e7d59e4bb8066e4bb805e7d6245
UHC 什f什^}Y什f什^}bE 1110010010100111011001101110010010100111010111100111110101011001111001001010011101100110111001001010011101011110011111010110001001000101 e4a766e4a75e7d59e4a766e4a75e7d6245

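SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f), as in the ISO-8859-1 row.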
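
The conversion in the table above can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation: the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Unencodable characters are replaced with "?", which matches the ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# These pairings are an illustration, not taken from the converter itself.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (hex string, binary string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    hex_str = raw.hex()
    # Each byte becomes exactly eight binary digits, most significant first.
    bin_str = "".join(f"{byte:08b}" for byte in raw)
    return hex_str, bin_str


def encode_table(text):
    """Encode text in every charset of CODECS, keyed by charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    sample = "什f什^}Y什f什^}bE"
    for label, (hex_str, bin_str) in encode_table(sample).items():
        print(label, bin_str, hex_str)
```

Because each byte is printed as eight binary digits, the binary column is always eight times the byte count, and twice as many hex digits as bytes appear in the hexadecimal column.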