To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???梧?????曜?????梧?????曜??^ 0011111100111111001111111000110011100110001111110011111100111111001111110011111110010111011010100011111100111111001111110011111100111111100011001110011000111111001111110011111100111111001111111001011101101010001111110011111101011110 3f3f3f8ce63f3f3f3f3f976a3f3f3f3f3f8ce63f3f3f3f3f976a3f3f5e
EUC-JP 縕??梧??縕??曜??縕??梧??縕??曜??^ 10001111110101001100001000111111001111111011100011101000001111110011111110001111110101001100001000111111001111111100110111001011001111110011111110001111110101001100001000111111001111111011100011101000001111110011111110001111110101001100001000111111001111111100110111001011001111110011111101011110 8fd4c23f3fb8e83f3f8fd4c23f3fcdcb3f3f8fd4c23f3fb8e83f3f8fd4c23f3fcdcb3f3f5e
UTF-8 縕됵슴梧놂숴縕됵슴曜경렦縕됵슴梧놂숴縕됵슴曜경럦^ 11100111101110001001010111101011100100001011010111101100100010101011010011100110101000101010011111101011100001101000001011101100100010001011010011100111101110001001010111101011100100001011010111101100100010101011010011100110100110111001110011101010101100101011110111101011101000001010011011100111101110001001010111101011100100001011010111101100100010101011010011100110101000101010011111101011100001101000001011101100100010001011010011100111101110001001010111101011100100001011010111101100100010101011010011100110100110111001110011101010101100101011110111101011100111111010011001011110 e7b895eb90b5ec8ab4e6a2a7eb8682ec88b4e7b895eb90b5ec8ab4e69b9ceab2bdeba0a6e7b895eb90b5ec8ab4e6a2a7eb8682ec88b4e7b895eb90b5ec8ab4e69b9ceab2bdeb9fa65e
UHC 縕됵슴梧놂숴縕됵슴曜경렦縕됵슴梧놂숴縕됵슴曜경럦^ 11101000101100101000100111101111101111011011111111100111111111001011001111101111101111011010010011101000101100101000100111101111101111011011111111101000111110001011000011100110100011101011010111101000101100101000100111101111101111011011111111100111111111001011001111101111101111011010010011101000101100101000100111101111101111011011111111101000111110001011000011100110100011101000100101011110 e8b289efbdbfe7fcb3efbda4e8b289efbdbfe8f8b0e68eb5e8b289efbdbfe7fcb3efbda4e8b289efbdbfe8f8b0e68e895e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)