What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??援θぜ宋??域??誼??怨??沃?? 111000011001111100111111001111111000100110000111100000111100011010000010101110101001000101110110001111110011111110001000111001100011111100111111100010110110001000111111001111111000100110000101001111110011111110010111100000000011111100111111 e19f3f3f898783c682ba91763f3f88e63f3f8b623f3f89853f3f97803f3f
EUC-JP 癲??援θぜ宋??域??誼??怨??沃?? 111000101010000100111111001111111011000111100111101001101100100010100100101111001100000111010111001111110011111110110000111010000011111100111111101101011100001100111111001111111011000111100101001111110011111111001101111000000011111100111111 e2a13f3fb1e7a6c8a4bcc1d73f3fb0e83f3fb5c33f3fb1e53f3fcde03f3f
UTF-8 癲앷풝援θぜ宋믩뮅域㏓벡誼잞쫫怨뚯뫊沃쇱걖 1110011110011001101100101110110010010101101101111110110110010010100111011110011010001111101101001100111010111000111000111000000110011100111001011010111010001011111010111010111110101001111010111010111010000101111001011001111110011111111000111000111110010011111010111011001010100001111010001010101010111100111011001001111010011110111011001010101110101011111001101000000010101000111010111001101010101111111010111010101110001010111001101011001010000011111011001000011110110001111010101011000110010110 e799b2ec95b7ed929de68fb4ceb8e3819ce5ae8bebafa9ebae85e59f9fe38f93ebb2a1e8aabcec9e9eecababe680a8eb9aafebab8ae6b283ec87b1eab196
UHC 癲앷풝援θぜ宋믩뮅域㏓벡誼잞쫫怨뚯뫊沃쇱걖 111011111010011010011101111010101011111010100000111010101011010110100101111010001010101010111100111000011110010010010010111010111001001010010100111001101011010010100111111010111011101010100100111010111111111010011111111011111010011010000100111010101011001110001100111011001001000110101100111010001010101010111100111011001000000110000001 efa69deabea0eab5a5e8aabce1e492eb9294e6b4a7ebbaa4ebfe9fefa684eab38cec91ace8aabcec8181

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
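
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_bits` and the `CHARSETS` mapping are made up for illustration, and the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes visible in the table.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
# cp932 and cp949 are assumed to be close equivalents of SJIS-Win and UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text: str, codec: str) -> tuple[str, str]:
    """Encode text with the given codec and return (binary string, hex string).

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexa = encode_bits(sample, codec)
        print(label, binary, hexa)
```

For example, `encode_bits("あ", "utf-8")` yields the hexadecimal string `e38182`, while the same character in ISO-8859-1 comes out as `3f` because that charset has no Japanese characters.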