Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 曜?????嗽??窈??節?????鈺?? 1001011101101010001111110011111100111111001111110011111110011010011101010011111100111111111000100111011100111111001111111001000011011111001111110011111100111111001111110011111111111011110001000011111100111111 976a3f3f3f3f3f9a753f3fe2773f3f90df3f3f3f3f3ffbc43f3f
EUC-JP 曜??縕??嗽??窈??節?????鈺?? 1100110111001011001111110011111110001111110101001100001000111111001111111101001111010110001111110011111111100011110110000011111100111111110000001110000100111111001111110011111100111111001111111000111111100011110101010011111100111111 cdcb3f3f8fd4c23f3fd3d63f3fe3d83f3fc0e13f3f3f3f3f8fe3d53f3f
UTF-8 曜뱄슘縕됧퉫嗽뤺춾窈뚳쉼節룩쾳亮꺿뿈鈺싨쪡 111001101001101110011100111010111011000110000100111011001000101010011000111001111011100010010101111010111001000010100111111011011000100110101011111001011001011110111101111010111010010010111010111011001011011010111110111001111010101010001000111010111001101010110011111011001000100110111100111001111010111110000000111010111010001110101001111011001011111010110011111011111010010110110111111010101011101010111111111010111011111110001000111010011000100010111010111011001000101110101000111011001010101010100001 e69b9cebb184ec8a98e7b895eb90a7ed89abe597bdeba4baecb6bee7aa88eb9ab3ec89bce7af80eba3a9ecbeb3efa5b7eababfebbf88e988baec8ba8ecaaa1
UHC 曜뱄슘縕됧퉫嗽뤺춾窈뚳쉼節룩쾳亮꺿뿈鈺싨쪡 111010001111100010111001111011111011110110110111111010001011001010001001111001011011100110000011111000011111010110001111111010001010110110011010111010011010000110001100111011111011110110110000111011111011110110110111111010001011001010001001111001011011100110000011111000101001011110001111111010001010110110011010111001101010010110011010 e8f8b9efbdb7e8b289e5b983e1f58fe8ad9ae9a18cefbdb0efbdb7e8b289e5b983e2978fe8ad9ae6a59a

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
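
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) is an assumption, and the function name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the page does, judging by the runs of `3f` in the output above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, as noted on the page
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("曜").items():
        print(label, bits, hex_string)
```

For the single character 曜, this yields `e69b9c` for UTF-8, `976a` for SJIS-WIN, `cdcb` for EUC-JP, and `3f` for ISO-8859-1, which agrees with the leading bytes of the corresponding rows in the table.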