To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??援θぜ宋??艶k?誼?Ⅶ儒??沃?? 1110000110011111001111110011111110001001100001111000001111000110100000101011101010010001011101100011111100111111100010011001000010000010100010110011111110001011011000100011111110000111010110101000111011110010001111110011111110010111100000000011111100111111 e19f3f3f898783c682ba91763f3f8990828b3f8b623f875a8ef23f3f97803f3f
EUC-JP 癲??援θぜ宋??艶k?誼??儒??沃?? 11100010101000010011111100111111101100011110011110100110110010001010010010111100110000011101011100111111001111111011000111110000101000111110101100111111101101011100001100111111001111111011110011110100001111110011111111001101111000000011111100111111 e2a13f3fb1e7a6c8a4bcc1d73f3fb1f0a3eb3fb5c33f3fbcf43f3fcde03f3f
UTF-8 癲앷풝援θぜ宋볦㉭艶k슣誼쏉Ⅶ儒멤닪沃쇱쾿 1110011110011001101100101110110010010101101101111110110110010010100111011110011010001111101101001100111010111000111000111000000110011100111001011010111010001011111010111011001110100110111000111000100110101101111010001000100110110110111011111011110110001011111011001000101010100011111010001010101010111100111011001000111110001001111000101000010110100110111001011000010010010010111010111010100110100100111010111000101110101010111001101011001010000011111011001000011110110001111011001011111010111111 e799b2ec95b7ed929de68fb4ceb8e3819ce5ae8bebb3a6e389ade889b6efbd8bec8aa3e8aabcec8f89e285a6e58492eba9a4eb8baae6b283ec87b1ecbebf
UHC 癲앷풝援θぜ宋볦㉭艶k슣誼쏉Ⅶ儒멤닪沃쇱쾿 111011111010011010011101111010101011111010100000111010101011010110100101111010001010101010111100111000011110010010010011111011001010100010111110111001101111110110100011111010111001101010101111111010111111111010011011111011111010010110110110111010101110001110111000111000101000100010100101111010001010101010111100111011001011001010010101 efa69deabea0eab5a5e8aabce1e493eca8bee6fda3eb9aafebfe9befa5b6eae3b8e288a5e8aabcecb295

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)