To which bit string is a character (or several characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ???援?ぜ矣? 0011111100111111001111111000100110000111001111111000001010111010111000011110000100111111 3f3f3f89873f82bae1e13f
EUC-JP ???援?ぜ矣? 0011111100111111001111111011000111100111001111111010010010111100111000101110001100111111 3f3f3fb1e73fa4bce2e33f
UTF-8 歷띰퐢援앲ぜ矣뺢 111011111010011010001100111010111001110110110000111011011001000010100010111001101000111110110100111011001001010110110010111000111000000110011100111001111001111110100011111010111011101010100010 efa68ceb9db0ed90a2e68fb4ec95b2e3819ce79fa3ebbaa2
UHC 歷띰퐢援앲ぜ矣뺢 11100110101110001011011011101111101111011000101111101010101101011001110111101000101010101011110011101011111110001001010111101010 e6b8b6efbd8beab59de8aabcebf895ea

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (hexadecimal 3f) in the table above.
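
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (3f), which matches the behaviour visible in the table.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# The codec choices are assumptions, not taken from the original tool.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into b"?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("ぜ").items():
        print(label, bits, hex_string)
```

For the single character "ぜ", this sketch should give e3819c for UTF-8, 82ba for SJIS-WIN, and a4bc for EUC-JP, the same byte sequences that appear for that character in the table rows.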