What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 瑤??與??窈??暗∽?繹??若??撓 111010101010001000111111001111111110010001101111001111110011111111100010011101110011111100111111100010001100001110000001111001000011111111100011100010000011111100111111100011101110000100111111001111111001110110011010 eaa23f3fe46f3f3fe2773f3f88c381e43fe3883f3f8ee13f3f9d9a
EUC-JP 瑤??與??窈??暗∽?繹??若??撓 111101001010010000111111001111111110011111010000001111110011111111100011110110000011111100111111101100001100010110100010111001100011111111100101111010000011111100111111101111001110001100111111001111111101100111111010 f4a43f3fe7d03f3fe3d83f3fb0c5a2e63fe5e83f3fbce33f3fd9fa
UTF-8 瑤덆죱與쀩웼窈뚳쉭暗∽쉥繹뤄쉰若롨예撓 111001111001000110100100111010111000110110000110111011001010001110110001111010001000100010000111111011001000000010101001111011001001101110111100111001111010101010001000111010111001101010110011111011001000100110101101111001101001101010010111111000101000100010111101111011001000100110100101111001111011100110111001111010111010010010000100111011001000100110110000111010001000101110100101111010111010000110101000111011001001100010001000111001101001001010010011 e791a4eb8d86eca3b1e88887ec80a9ec9bbce7aa88eb9ab3ec89ade69a97e288bdec89a5e7b9b9eba484ec89b0e88ba5eba1a8ec9888e69293
UHC 瑤덆죱與쀩웼窈뚳쉭暗∽쉥繹뤄쉰若롨예撓 1110100011111101100010001110100110100001100011001110011010101000100101111110100110011111100010001110100110100001100011001110111110111101101011011110010011011110101000011110111110111101101010111110011010111010101101111110111110111101101011101110010110110100100011101110100010111111101110011110100011110101 e8fd88e9a18ce6a897e99f88e9a18cefbdade4dea1efbdabe6bab7efbdaee5b48ee8bfb9e8f5

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
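
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_table` is hypothetical, the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed equivalents of the charset labels above, and unencodable characters are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for each charset.

    Characters that a charset cannot represent are replaced with '?'
    (byte 0x3f), mirroring the '3f' runs seen in the table.
    """
    rows = []
    for name in charsets:
        raw = text.encode(name, errors="replace")   # bytes in the target charset
        bits = "".join(f"{b:08b}" for b in raw)     # 8 bits per byte, zero-padded
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexed in encode_table("あ"):
        print(name, bits, hexed)
```

Because each byte is always rendered as exactly eight bits, the binary column is always eight times as long as the number of bytes, and four times as long as the hexadecimal column.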