Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????h??? 00111111001111110011111100111111001111110011111101101000001111110011111100111111 3f3f3f3f3f3f683f3f3f
SJIS-WIN 豼喜甑豼頑Γh豼喜甑 11100110101111111000101011101100100011011001100111100110101111111000101011100110100000111010000101101000111001101011111110001010111011001000110110011001 e6bf8aec8d99e6bf8ae683a168e6bf8aec8d99
EUC-JP 豼喜甑豼頑Γh豼喜甑 11101100110000011011010011101110101110011111100111101100110000011011010011101000101001101010001101101000111011001100000110110100111011101011100111111001 ecc1b4eeb9f9ecc1b4e8a6a368ecc1b4eeb9f9
UTF-8 豼喜甑豼頑Γh豼喜甑 111010001011000110111100111001011001011010011100111001111001010010010001111010001011000110111100111010011010000010010001110011101001001101101000111010001011000110111100111001011001011010011100111001111001010010010001 e8b1bce5969ce79491e8b1bce9a091ce9368e8b1bce5969ce79491
UHC ?喜甑?頑Γh?喜甑 00111111111111011110110011110001111101110011111111101000110101111010010111000011011010000011111111111101111011001111000111110111 3ffdecf1f73fe8d7a5c3683ffdecf1f7

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
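
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation (which is not shown here). The codec names are Python's own, and mapping SJIS-WIN to `cp932` and UHC to `cp949` is an assumption. The function names `encode_bits` and `table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which resembles the ISO-8859-1 row above, although the page may substitute differently for other charsets.

```python
# Page label -> Python codec name (this mapping is an assumption).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def table(text):
    """Encode text in every charset listed above."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in table("h").items():
        print(label, bits, hexa)
```

Each byte is rendered as eight bits, so the binary column is always four times as long as the hexadecimal column.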