To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 余??泣???⑥???????諭??潁??揖 10010111010111010011111100111111100010111000001100111111001111110011111110000111010001010011111100111111001111110011111100111111001111110011111110010111010000000011111100111111100111111111000100111111001111111001011101001011 975d3f3f8b833f3f3f87453f3f3f3f3f3f3f97403f3f9ff13f3f974b
EUC-JP 余??泣??洹????????諭??潁??揖 1100110110111110001111110011111110110101111000110011111100111111100011111100011110111010001111110011111100111111001111110011111100111111001111110011111111001101101000010011111100111111110111101111001100111111001111111100110110101100 cdbe3f3fb5e33f3f8fc7ba3f3f3f3f3f3f3f3fcda13f3fdef33f3fcdac
UTF-8 余쒕콈泣섊툞洹⑥돩嶺쎄쐽理쎿썫諭꾩뒛潁뺢랬揖 111001001011110110011001111011001001001010010101111011001011110110001000111001101011001110100011111011001000010010001010111011011000100010011110111001101011010010111001111000101001000110100101111010111000111110101001111011111010011010101011111011001000111010000100111011001001000010111101111011111010011110100100111011001000111010111111111011001000110110101011111010001010101110101101111010101011111010101001111010111001001010011011111001101011110110000001111010111011101010100010111010111001111010101100111001101000111110010110 e4bd99ec9295ecbd88e6b3a3ec848aed889ee6b4b9e291a5eb8fa9efa6abec8e84ec90bdefa7a4ec8ebfec8dabe8abadeabea9eb929be6bd81ebbaa2eb9eace68f96
UHC 余쒕콈泣섊툞洹⑥돩嶺쎄쐽理쎿썫諭꾩뒛潁뺢랬揖 1110010111111001100111001110101110110001100001001110101111101000100110001110011110111000100101011110101010110111101010001110110010001001101011001110011110101101101111011110101010111110101000111110110010110101100110111110011010011011100111001110101110110001100001001110110010001010100110001110011110111000100101011110101010110111101010001110101111100111 e5f99cebb184ebe898e7b895eab7a8ec89ace7adbdeabea3ecb59be69b9cebb184ec8a98e7b895eab7a8ebe7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)