To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 橈?????釗?????節??窈??汚??^ 100111101111010000111111001111110011111100111111001111111111101110111011001111110011111100111111001111110011111110010000110111110011111100111111111000100111011100111111001111111000100110011000001111110011111101011110 9ef43f3f3f3f3ffbbb3f3f3f3f3f90df3f3fe2773f3f89983f3f5e
EUC-JP 橈?????釗?????節??窈??汚??^ 11011100111101100011111100111111001111110011111100111111100011111110001110100110001111110011111100111111001111110011111111000000111000010011111100111111111000111101100000111111001111111011000111111000001111110011111101011110 dcf63f3f3f3f3f8fe3a63f3f3f3f3fc0e13f3fe3d83f3fb1f83f3f5e
UTF-8 橈롳슴樂됮줁釗녘웺寧좈쐩節욥썖窈섃뵵汚녽죳^ 11100110101010011000100011101011101000011011001111101100100010101011010011101111101001101011111111101011100100001010111011101100101001001000000111101001100001111001011111101011100001011001100011101100100110111011101011101111101001101010101011101100101000101000100011101100100100001010100111100111101011111000000011101100100110101010010111101100100011011001011011100111101010101000100011101100100001001000001111101011101101011011010111100110101100011001101011101011100001011011110111101100101000111011001101011110 e6a988eba1b3ec8ab4efa6bfeb90aeeca481e98797eb8598ec9bbaefa6aaeca288ec90a9e7af80ec9aa5ec8d96e7aa88ec8483ebb5b5e6b19aeb85bdeca3b35e
UHC 橈롳슴樂됮줁釗녘웺寧좈쐩節욥썖窈섃뵵汚녽죳^ 11101000111110101000111011101111101111011011111111101000111110011000100111101001101000011001100011100001111100101011001111101000100111111000011011100111101011001010000011101001100111001000111011101111101111011011111111101001100110111000100111101001101000011001100011100010100101001011001111100111111111011000011011101001101000011000111001011110 e8fa8eefbdbfe8f989e9a198e1f2b3e89f86e7aca0e99c8eefbdbfe99b89e9a198e294b3e7fd86e9a18e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)