To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 蠢??????迂?}蠢??????迂?{^ 11100101101111110011111100111111001111110011111100111111001111111000100101001001001111110111110111100101101111110011111100111111001111110011111100111111001111111000100101001001001111110111101101011110 e5bf3f3f3f3f3f3f89493f7de5bf3f3f3f3f3f3f89493f7b5e
EUC-JP 蠢??????迂?}蠢??????迂?{^ 11101010110000010011111100111111001111110011111100111111001111111011000110101010001111110111110111101010110000010011111100111111001111110011111100111111001111111011000110101010001111110111101101011110 eac13f3f3f3f3f3fb1aa3f7deac13f3f3f3f3f3fb1aa3f7b5e
UTF-8 蠢렋履머븀렢렦迂렋}蠢렋履머븀렢렦迂렋{^ 111010001010000010100010111010111010000010001011111011111010011110011111111010111010100010111000111010111011100010000000111010111010000010100010111010111010000010100110111010001011111110000010111010111010000010001011011111011110100010100000101000101110101110100000100010111110111110100111100111111110101110101000101110001110101110111000100000001110101110100000101000101110101110100000101001101110100010111111100000101110101110100000100010110111101101011110 e8a0a2eba08befa79feba8b8ebb880eba0a2eba0a6e8bf82eba08b7de8a0a2eba08befa79feba8b8ebb880eba0a2eba0a6e8bf82eba08b7b5e
UHC 蠢렋履머븀렢렦迂렋}蠢렋履머븀렢렦迂렋{^ 111100011110001110001110101000101110110010101010101110001101001110111010111001111000111010110011100011101011010111101001111001101000111010100010011111011111000111100011100011101010001011101100101010101011100011010011101110101110011110001110101100111000111010110101111010011110011010001110101000100111101101011110 f1e38ea2ecaab8d3bae78eb38eb5e9e68ea27df1e38ea2ecaab8d3bae78eb38eb5e9e68ea27b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)