To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蝴・縲畷沃莠護サ厭蝴・縲畷沃莠護サ閲^ 111001011001101010100101111000111000000010010011111010111001011110000000111001001011101010001100111011001011101110001001011111011110010110011010101001011110001110000000100100111110101110010111100000001110010010111010100011001110110010111011100010010111101101011110 e59aa5e38093eb9780e4ba8cecbb897de59aa5e38093eb9780e4ba8cecbb897b5e
EUC-JP 蝴・縲畷沃莠護サ厭蝴・縲畷沃莠護サ閲^ 11101001111110101000111010100101111001011110000011000110111011011100110111100000111010001011110010111000111011101000111010111011101100011101111011101001111110101000111010100101111001011110000011000110111011011100110111100000111010001011110010111000111011101000111010111011101100011101110001011110 e9fa8ea5e5e0c6edcde0e8bcb8ee8ebbb1dee9fa8ea5e5e0c6edcde0e8bcb8ee8ebbb1dc5e
UTF-8 蝴・縲畷沃莠護サ厭蝴・縲畷沃莠護サ閲^ 11101000100111011011010011101111101111011010010111100111101110001011001011100111100101011011011111100110101100101000001111101000100011101010000011101000101011011011011111101111101111011011101111100101100011101010110111101000100111011011010011101111101111011010010111100111101110001011001011100111100101011011011111100110101100101000001111101000100011101010000011101000101011011011011111101111101111011011101111101001100101101011001001011110 e89db4efbda5e7b8b2e795b7e6b283e88ea0e8adb7efbdbbe58eade89db4efbda5e7b8b2e795b7e6b283e88ea0e8adb7efbdbbe996b25e
UHC 蝴???沃?護?厭蝴???沃?護??^ 1111101111011101001111110011111100111111111010001010101000111111111110111101111000111111111001101111010011111011110111010011111100111111001111111110100010101010001111111111101111011110001111110011111101011110 fbdd3f3f3fe8aa3ffbde3fe6f4fbdd3f3f3fe8aa3ffbde3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)