What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 咀?儲捧??儲謗 10011001111100000011111110010110110101111001010111111001001111110011111110010110110101111110011010001110 99f03f96d795f93f3f96d7e68e
EUC-JP 咀?儲捧??儲謗 11010010111100100011111111001100110110011100101011111011001111110011111111001100110110011110101111101110 d2f23fccd9cafb3f3fccd9ebee
UTF-8 咀렦儲捧렪렦儲謗 111001011001001010000000111010111010000010100110111001011000010010110010111001101000110110100111111010111010000010101010111010111010000010100110111001011000010010110010111010001010110010010111 e59280eba0a6e584b2e68da7eba0aaeba0a6e584b2e8ac97
UHC 咀렦儲捧렪렦儲謗 11101110101110101000111010110101111011101011100111011100111010011000111010111000100011101011010111101110101110011101101110111111 eeba8eb5eeb9dce98eb88eb5eeb9dbbf

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
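
A conversion like the one in the table can be approximated with a short Python sketch. This is not the converter's actual implementation. The codec names are Python's, and pairing them with the table's labels is an assumption (SJIS-WIN as cp932, UHC as cp949). The helper names to_bits and encode_row are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the "?" entries shown in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Return the bytes as one continuous string of 0s and 1s, 8 bits per byte."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Encode text with codec and return (displayed text, binary string, hex string).

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip through this charset.
    shown = data.decode(codec, errors="replace")
    return shown, to_bits(data), data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        shown, bits, hexa = encode_row("咀", codec)
        print(label, shown, bits, hexa)
```

For example, encode_row("咀", "utf-8") yields the hexadecimal string e59280, which matches the first three bytes of the UTF-8 row above. The exact bytes for the other charsets depend on each codec's mapping tables and may differ in details from the tool's output.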