To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 源?衣篩?縡?鷹??蒡源?衣篩?縡?鷹??蒡^ 1000110010111001001111111000100011011111111000101011111100111111111000110111000100111111100100011110100100111111001111111110010011101110100011001011100100111111100010001101111111100010101111110011111111100011011100010011111110010001111010010011111100111111111001001110111001011110 8cb93f88dfe2bf3fe3713f91e93f3fe4ee8cb93f88dfe2bf3fe3713f91e93f3fe4ee5e
EUC-JP 源?衣篩?縡?鷹??蒡源?衣篩?縡?鷹??蒡^ 1011100010111011001111111011000011100001111001001100000100111111111001011101001000111111110000101110101100111111001111111110100011110000101110001011101100111111101100001110000111100100110000010011111111100101110100100011111111000010111010110011111100111111111010001111000001011110 b8bb3fb0e1e4c13fe5d23fc2eb3f3fe8f0b8bb3fb0e1e4c13fe5d23fc2eb3f3fe8f05e
UTF-8 源렰衣篩백縡렕鷹꿰렠蒡源렰衣篩백縡렕鷹꿰렠蒡^ 11100110101110101001000011101011101000001011000011101000101000011010001111100111101011111010100111101011101100001011000111100111101110001010000111101011101000001001010111101001101101111011100111101010101111111011000011101011101000001010000011101000100100101010000111100110101110101001000011101011101000001011000011101000101000011010001111100111101011111010100111101011101100001011000111100111101110001010000111101011101000001001010111101001101101111011100111101010101111111011000011101011101000001010000011101000100100101010000101011110 e6ba90eba0b0e8a1a3e7afa9ebb0b1e7b8a1eba095e9b7b9eabfb0eba0a0e892a1e6ba90eba0b0e8a1a3e7afa9ebb0b1e7b8a1eba095e9b7b9eabfb0eba0a0e892a15e
UHC 源렰衣篩백縡렕鷹꿰렠蒡源렰衣篩백縡렕鷹꿰렠蒡^ 111010101011100110001110101111011110101111111101110111101110100010111001111010011110111010101101100011101010101011101011111011011011001011100111100011101011000111011011101111001110101010111001100011101011110111101011111111011101111011101000101110011110100111101110101011011000111010101010111010111110110110110010111001111000111010110001110110111011110001011110 eab98ebdebfddee8b9e9eead8eaaebedb2e78eb1dbbceab98ebdebfddee8b9e9eead8eaaebedb2e78eb1dbbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)