What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 乙??縡紗???趾?乙??縡紗???趾?^ 1000100110110011001111110011111111100011011100011000111011010001001111110011111100111111111001101110010000111111100010011011001100111111001111111110001101110001100011101101000100111111001111110011111111100110111001000011111101011110 89b33f3fe3718ed13f3f3fe6e43f89b33f3fe3718ed13f3f3fe6e43f5e
EUC-JP 乙??縡紗???趾?乙??縡紗???趾?^ 1011001010110101001111110011111111100101110100101011110011010011001111110011111100111111111011001110011000111111101100101011010100111111001111111110010111010010101111001101001100111111001111110011111111101100111001100011111101011110 b2b53f3fe5d2bcd33f3f3fece63fb2b53f3fe5d2bcd33f3f3fece63f5e
UTF-8 乙재횐縡紗歷狀렢趾쌨乙재횐縡紗歷狀렢趾쌤^ 11100100101110011001100111101100100111101010110011101101100110101001000011100111101110001010000111100111101101001001011111100110101011011011011111101111101001111011101011101011101000001010001011101000101101101011111011101100100011001010100011100100101110011001100111101100100111101010110011101101100110101001000011100111101110001010000111100111101101001001011111100110101011011011011111101111101001111011101011101011101000001010001011101000101101101011111011101100100011001010010001011110 e4b999ec9eaced9a90e7b8a1e7b497e6adb7efa7baeba0a2e8b6beec8ca8e4b999ec9eaced9a90e7b8a1e7b497e6adb7efa7baeba0a2e8b6beec8ca45e
UHC 乙재횐縡紗歷狀렢趾쌨乙재횐縡紗歷狀렢趾쌤^ 1110101111100000110000001110011111001000101110101110111010101101110111101110100111010101111101101110110111101110100011101011001111110010101111111011110111011110111010111110000011000000111001111100100010111010111011101010110111011110111010011101010111110110111011011110111010001110101100111111001010111111101111011101110001011110 ebe0c0e7c8baeeaddee9d5f6edee8eb3f2bfbddeebe0c0e7c8baeeaddee9d5f6edee8eb3f2bfbddc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
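
The conversion in the table can be approximated offline with a short Python sketch. This is not the tool's own implementation. The codec names are assumptions about which Python codecs correspond to the charset labels (cp932 for SJIS-WIN, cp949 for UHC), and the helper names to_bit_strings and convert are hypothetical. Characters a charset cannot represent are replaced with "?" (3f), which matches the ISO-8859-1 row above. Edge-case mappings may differ from those of the tool.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: to_bit_strings(text, codec)
            for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("乙^").items():
        print(label, binary, hexadecimal)
```

Calling to_bit_strings("^", "utf-8") returns ("01011110", "5e"), the same trailing byte that ends every row of the table.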