To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?h?fN}?h?fN{^ 00111111011010000011111101100110010011100111110100111111011010000011111101100110010011100111101101011110 3f683f664e7d3f683f664e7b5e
SJIS-WIN 嘔h嘔fN}嘔h嘔fN{^ 1001101001110001011010001001101001110001011001100100111001111101100110100111000101101000100110100111000101100110010011100111101101011110 9a71689a71664e7d9a71689a71664e7b5e
EUC-JP 嘔h嘔fN}嘔h嘔fN{^ 1101001111010010011010001101001111010010011001100100111001111101110100111101001001101000110100111101001001100110010011100111101101011110 d3d268d3d2664e7dd3d268d3d2664e7b5e
UTF-8 嘔h嘔fN}嘔h嘔fN{^ 111001011001100010010100011010001110010110011000100101000110011001001110011111011110010110011000100101000110100011100101100110001001010001100110010011100111101101011110 e5989468e59894664e7de5989468e59894664e7b5e
UHC 嘔h嘔fN}嘔h嘔fN{^ 1100111110100101011010001100111110100101011001100100111001111101110011111010010101101000110011111010010101100110010011100111101101011110 cfa568cfa5664e7dcfa568cfa5664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)