To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN 鐔種従鐔種臭鐔種従鐔種愁鐔種従鐔種臭鐔種従鐔種衆h 11101000010111001000111011101101100011110101110111101000010111001000111011101101100011110100110011101000010111001000111011101101100011110101110111101000010111001000111011101101100011110100010011101000010111001000111011101101100011110101110111101000010111001000111011101101100011110100110011101000010111001000111011101101100011110101110111101000010111001000111011101101100011110100111101101000 e85c8eed8f5de85c8eed8f4ce85c8eed8f5de85c8eed8f44e85c8eed8f5de85c8eed8f4ce85c8eed8f5de85c8eed8f4f68
EUC-JP 鐔種従鐔種臭鐔種従鐔種愁鐔種従鐔種臭鐔種従鐔種衆h 11101111101111011011110011101111101111011011111011101111101111011011110011101111101111011010110111101111101111011011110011101111101111011011111011101111101111011011110011101111101111011010010111101111101111011011110011101111101111011011111011101111101111011011110011101111101111011010110111101111101111011011110011101111101111011011111011101111101111011011110011101111101111011011000001101000 efbdbcefbdbeefbdbcefbdadefbdbcefbdbeefbdbcefbda5efbdbcefbdbeefbdbcefbdadefbdbcefbdbeefbdbcefbdb068
UTF-8 鐔種従鐔種臭鐔種従鐔種愁鐔種従鐔種臭鐔種従鐔種衆h 11101001100100001001010011100111101010001010111011100101101111101001001111101001100100001001010011100111101010001010111011101000100001111010110111101001100100001001010011100111101010001010111011100101101111101001001111101001100100001001010011100111101010001010111011100110100001001000000111101001100100001001010011100111101010001010111011100101101111101001001111101001100100001001010011100111101010001010111011101000100001111010110111101001100100001001010011100111101010001010111011100101101111101001001111101001100100001001010011100111101010001010111011101000101000011000011001101000 e99094e7a8aee5be93e99094e7a8aee887ade99094e7a8aee5be93e99094e7a8aee68481e99094e7a8aee5be93e99094e7a8aee887ade99094e7a8aee5be93e99094e7a8aee8a18668
UHC ?種??種臭?種??種愁?種??種臭?種??種衆h 00111111111100001111101000111111001111111111000011111010111101101010101100111111111100001111101000111111001111111111000011111010111000011111111000111111111100001111101000111111001111111111000011111010111101101010101100111111111100001111101000111111001111111111000011111010111100011110101101101000 3ff0fa3f3ff0faf6ab3ff0fa3f3ff0fae1fe3ff0fa3f3ff0faf6ab3ff0fa3f3ff0faf1eb68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)