To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[}?????????[{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101101101111101001111110011111100111111001111110011111100111111001111110011111100111111010110110111101101011110 3f3f3f3f3f3f3f3f3f5b7d3f3f3f3f3f3f3f3f3f5b7b5e
SJIS-WIN 藥??竊??諛??[}藥??竊??諛??[{^ 1110010101011010001111110011111111100010100001100011111100111111111001101000011100111111001111110101101101111101111001010101101000111111001111111110001010000110001111110011111111100110100001110011111100111111010110110111101101011110 e55a3f3fe2863f3fe6873f3f5b7de55a3f3fe2863f3fe6873f3f5b7b5e
EUC-JP 藥??竊??諛??[}藥??竊??諛??[{^ 1110100110111011001111110011111111100011111001100011111100111111111010111110011100111111001111110101101101111101111010011011101100111111001111111110001111100110001111110011111111101011111001110011111100111111010110110111101101011110 e9bb3f3fe3e63f3febe73f3f5b7de9bb3f3fe3e63f3febe73f3f5b7b5e
UTF-8 藥쎌늾竊뽬쯁諛깅빒[}藥쎌늾竊뽬쯁諛깅빒[{^ 1110100010010111101001011110110010001110100011001110101110001010101111101110011110101011100010101110101110111101101011001110110010101111100000011110100010101011100110111110101010111001100001011110101110111001100100100101101101111101111010001001011110100101111011001000111010001100111010111000101010111110111001111010101110001010111010111011110110101100111011001010111110000001111010001010101110011011111010101011100110000101111010111011100110010010010110110111101101011110 e897a5ec8e8ceb8abee7ab8aebbdacecaf81e8ab9beab985ebb9925b7de897a5ec8e8ceb8abee7ab8aebbdacecaf81e8ab9beab985ebb9925b7b5e
UHC 藥쎌늾竊뽬쯁諛깅빒[}藥쎌늾竊뽬쯁諛깅빒[{^ 1110010110110111101111011110110010001000100001111110111110111100100101101110100010101000100111011110101110110000101100011110101110010101101101100101101101111101111001011011011110111101111011001000100010000111111011111011110010010110111010001010100010011101111010111011000010110001111010111001010110110110010110110111101101011110 e5b7bdec8887efbc96e8a89debb0b1eb95b65b7de5b7bdec8887efbc96e8a89debb0b1eb95b65b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)