To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????瑤??有??魏??霓??援??飮?? 00111111001111110011111100111111111010101010001000111111001111111001011101001100001111110011111111101001101100000011111100111111111010001011110100111111001111111000100110000111001111110011111110011111010110100011111100111111 3f3f3f3feaa23f3f974c3f3fe9b03f3fe8bd3f3f89873f3f9f5a3f3f
EUC-JP ????瑤??有??魏??霓??援??飮?? 00111111001111110011111100111111111101001010010000111111001111111100110110101101001111110011111111110010101100100011111100111111111100001011111100111111001111111011000111100111001111110011111111011101101110110011111100111111 3f3f3f3ff4a43f3fcdad3f3ff2b23f3ff0bf3f3fb1e73f3fddbb3f3f
UTF-8 嶺싸살벞瑤뗫끆有잏윢魏낃쉑霓띰퐢援앶솻飮곸굦 111011111010011010101011111011001000101110111000111011001000001010110100111010111011001010011110111001111001000110100100111010111001011110101011111010111000000110000110111001101001110010001001111011001001111010001111111011001001110010100010111010011010110110001111111010111000001010000011111011001000100110010001111010011001110010010011111010111001110110110000111011011001000010100010111001101000111110110100111011001001010110110110111011001000011010111011111010011010001110101110111010101011001110111000111010101011010110100110 efa6abec8bb8ec82b4ebb29ee791a4eb97abeb8186e69c89ec9e8fec9ca2e9ad8feb8283ec8991e99c93eb9db0ed90a2e68fb4ec95b6ec86bbe9a3aeeab3b8eab5a6
UHC 嶺싸살벞瑤뗫끆有잏윢魏낃쉑霓띰퐢援앶솻飮곸굦 1110011110101101101111011100111010111011111011001001001110111001111010001111110110001011111010111000010110111010111010101111001110011111111001111001111110100011111010101110000010000101111010101011110110100111111001111110011110110110111011111011110110001011111010101011010110011101111010011001100110110000111010111110011010000001111011001000001010001100 e7adbdcebbec93b9e8fd8beb85baeaf39fe79fa3eae085eabda7e7e7b6efbd8beab59de999b0ebe681ec828c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)