To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥〓∥誼??揄??嚥〓∥誼??揄??^ 1001101010001011100000011010110010000001011000011000101101100010001111110011111110011101100010010011111100111111100110101000101110000001101011001000000101100001100010110110001000111111001111111001110110001001001111110011111101011110 9a8b81ac81618b623f3f9d893f3f9a8b81ac81618b623f3f9d893f3f5e
EUC-JP 嚥〓‖誼??揄??嚥〓‖誼??揄??^ 1101001111101011101000101010111010100001110000101011010111000011001111110011111111011001111010010011111100111111110100111110101110100010101011101010000111000010101101011100001100111111001111111101100111101001001111110011111101011110 d3eba2aea1c2b5c33f3fd9e93f3fd3eba2aea1c2b5c33f3fd9e93f3f5e
UTF-8 嚥〓∥誼놂쭓揄욍돧嚥〓∥誼놂쭓揄용뤁^ 11100101100110101010010111100011100000001001001111100010100010001010010111101000101010101011110011101011100001101000001011101100101011011001001111100110100011111000010011101100100110101000110111101011100011111010011111100101100110101010010111100011100000001001001111100010100010001010010111101000101010101011110011101011100001101000001011101100101011011001001111100110100011111000010011101100100110101010100111101011101001001000000101011110 e59aa5e38093e288a5e8aabceb8682ecad93e68f84ec9a8deb8fa7e59aa5e38093e288a5e8aabceb8682ecad93e68f84ec9aa9eba4815e
UHC 嚥〓∥誼놂쭓揄욍돧嚥〓∥誼놂쭓揄용뤁^ 11100110101111111010000111101011101000011010101111101011111111101011001111101111101001111000101111101010111100011011111111100011100010011010101111100110101111111010000111101011101000011010101111101011111111101011001111101111101001111000101111101010111100011011111111101011100011111011001001011110 e6bfa1eba1abebfeb3efa78beaf1bfe389abe6bfa1eba1abebfeb3efa78beaf1bfeb8fb25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)