To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???踰????????靭?㎡???如??悠 001111110011111100111111111001101111101000111111001111110011111100111111001111110011111100111111001111111001000001111000001111111000011101110101001111110011111100111111100101000100000000111111001111111001011101001001 3f3f3fe6fa3f3f3f3f3f3f3f3f90783f87753f3f3f94403f3f9749
EUC-JP ???踰????????靭??洹??如??悠 00111111001111110011111111101100111111000011111100111111001111110011111100111111001111110011111100111111101111111101100100111111001111111000111111000111101110100011111100111111110001111010000100111111001111111100110110101010 3f3f3fecfc3f3f3f3f3f3f3f3fbfd93f3f8fc7ba3f3fc7a13f3fcdaa
UTF-8 麗몃쓷踰경룄紐꾨뭼麗몃쓷靭뚳㎡洹앹뒇如붽퀣悠 111011111010011010001000111010111010101010000011111011001001001110110111111010001011100010110000111010101011001010111101111010111010001110000100111011111010011110001111111010101011111010101000111010111010110110111100111011111010011010001000111010111010101010000011111011001001001110110111111010011001110110101101111010111001101010110011111000111000111010100001111001101011010010111001111011001001010110111001111010111001001010000111111001011010011010000010111010111011011010111101111011011000000010100011111001101000001010100000 efa688ebaa83ec93b7e8b8b0eab2bdeba384efa78feabea8ebadbcefa688ebaa83ec93b7e99dadeb9ab3e38ea1e6b4b9ec95b9eb9287e5a682ebb6bded80a3e682a0
UHC 麗몃쓷踰경룄紐꾨뭼麗몃쓷靭뚳㎡洹앹뒇如붽퀣悠 1110011010110000101110001110101110011101100101001110101110110010101100001110011010001111100001001110101110101010100001001110101110010010100010111110011010110000101110001110101110011101100101001110110011100101100011001110111110100111101100111110101010110111100111011110110010001010100001011110010111111101100101001110101010110011100101111110101011101101 e6b0b8eb9d94ebb2b0e68f84ebaa84eb928be6b0b8eb9d94ece58cefa7b3eab79dec8a85e5fd94eab397eaed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)