To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥〓∥誼??攸??嚥〓∥誼??攸??^ 1001101010001011100000011010110010000001011000011000101101100010001111110011111110011101101111110011111100111111100110101000101110000001101011001000000101100001100010110110001000111111001111111001110110111111001111110011111101011110 9a8b81ac81618b623f3f9dbf3f3f9a8b81ac81618b623f3f9dbf3f3f5e
EUC-JP 嚥〓‖誼??攸??嚥〓‖誼??攸??^ 1101001111101011101000101010111010100001110000101011010111000011001111110011111111011010110000010011111100111111110100111110101110100010101011101010000111000010101101011100001100111111001111111101101011000001001111110011111101011110 d3eba2aea1c2b5c33f3fdac13f3fd3eba2aea1c2b5c33f3fdac13f3f5e
UTF-8 嚥〓∥誼놂쭓攸낆죶嚥〓∥誼놂쭓攸낆죶^ 11100101100110101010010111100011100000001001001111100010100010001010010111101000101010101011110011101011100001101000001011101100101011011001001111100110100101001011100011101011100000101000011011101100101000111011011011100101100110101010010111100011100000001001001111100010100010001010010111101000101010101011110011101011100001101000001011101100101011011001001111100110100101001011100011101011100000101000011011101100101000111011011001011110 e59aa5e38093e288a5e8aabceb8682ecad93e694b8eb8286eca3b6e59aa5e38093e288a5e8aabceb8682ecad93e694b8eb8286eca3b65e
UHC 嚥〓∥誼놂쭓攸낆죶嚥〓∥誼놂쭓攸낆죶^ 11100110101111111010000111101011101000011010101111101011111111101011001111101111101001111000101111101010111100101000010111101100101000011001000011100110101111111010000111101011101000011010101111101011111111101011001111101111101001111000101111101010111100101000010111101100101000011001000001011110 e6bfa1eba1abebfeb3efa78beaf285eca190e6bfa1eba1abebfeb3efa78beaf285eca1905e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)