To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???誼??恂ワ?沃??愉????Ъ??? 001111110011111100111111100010110110001000111111001111111001110010010110100000111000111100111111100101111000000000111111001111111001011011111001001111110011111100111111001111111000010001011011001111110011111100111111 3f3f3f8b623f3f9c96838f3f97803f3f96f93f3f3f3f845b3f3f3f
EUC-JP ???誼??恂ワ?沃??愉??洧?Ъ??? 0011111100111111001111111011010111000011001111110011111111010111111101101010010111101111001111111100110111100000001111110011111111001100111110110011111100111111100011111100011110110100001111111010011110111100001111110011111100111111 3f3f3fb5c33f3fd7f6a5ef3fcde03f3fccfb3f3f8fc7b43fa7bc3f3f3f
UTF-8 列룸씈誼끾씭恂ワ폋沃쇰뙼愉꾤뙴洧붾Ъ呂얠쾿 1110111110100110100111001110101110100011101110001110110010010100100010001110100010101010101111001110101110000001101111101110110010010100101011011110011010000001100000101110001110000011101011111110110110001111100010111110011010110010100000111110110010000111101100001110101110011001101111001110011010000100100010011110101010111110101001001110101110011001101101001110011010110100101001111110101110110110101111101101000010101010111011111010011010000000111011001001011010100000111011001011111010111111 efa69ceba3b8ec9488e8aabceb81beec94ade68182e383afed8f8be6b283ec87b0eb99bce68489eabea4eb99b4e6b4a7ebb6bed0aaefa680ec96a0ecbebf
UHC 列룸씈誼끾씭恂ワ폋沃쇰뙼愉꾤뙴洧붾Ъ呂얠쾿 111001101110101010110111111010111001110110100000111010111111111010000101111001101001110110111110111000101110000110101011111011111011110010010110111010001010101010111100111010111000110010111111111010101111000010000100111001111000110010110111111010101111101110010100111010111010110010111100111001011111101110111110111011001011001010010101 e6eab7eb9da0ebfe85e69dbee2e1abefbc96e8aabceb8cbfeaf084e78cb7eafb94ebacbce5fbbeecb295

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)