To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??蹂〓?域??寃??節?????泣 0011111100111111001111111000101110000011001111110011111111100110111110001000000110101100001111111000100011100110001111110011111110011011100000110011111100111111100100001101111100111111001111110011111100111111001111111000101110000011 3f3f3f8b833f3fe6f881ac3f88e63f3f9b833f3f90df3f3f3f3f3f8b83
EUC-JP ???泣??蹂〓?域??寃??節?????泣 0011111100111111001111111011010111100011001111110011111111101100111110101010001010101110001111111011000011101000001111110011111111010101111000110011111100111111110000001110000100111111001111110011111100111111001111111011010111100011 3f3f3fb5e33f3fecfaa2ae3fb0e83f3fd5e33f3fc0e13f3f3f3f3fb5e3
UTF-8 捻꿔꺂泣볢땻蹂〓븶域뱀빖寃김돳節뗭탡列룸뿥泣 111011111010011010100100111010101011111110010100111010101011101010000010111001101011001110100011111010111011001110100010111010111001010110111011111010001011100110000010111000111000000010010011111010111011100010110110111001011001111110011111111010111011000110000000111010111011100110010110111001011010111110000011111010101011100110000000111010111000111110110011111001111010111110000000111010111001011110101101111011011000001110100001111011111010011010011100111010111010001110111000111010111011111110100101111001101011001110100011 efa6a4eabf94eaba82e6b3a3ebb3a2eb95bbe8b982e38093ebb8b6e59f9febb180ebb996e5af83eab980eb8fb3e7af80eb97aded83a1efa69ceba3b8ebbfa5e6b3a3
UHC 捻꿔꺂泣볢땻蹂〓븶域뱀빖寃김돳節뗭탡列룸뿥泣 1110011011110111101100101110001110000011101010111110101111101000100100111110100010001011100100011110101110110011101000011110101110010101100111111110011010110100101110011110110010010101101110001110101010110010101100011110100010001001101101101110111110111101100010111110110010110101100001001110011011101010101101111110101110010111101001011110101111101000 e6f7b2e383abebe893e88b91ebb3a1eb959fe6b4b9ec95b8eab2b1e889b6efbd8becb584e6eab7eb97a5ebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)