To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????M@??????????M@B 00111111001111110011111100111111001111110011111100111111001111110011111100111111010011010100000000111111001111110011111100111111001111110011111100111111001111110011111100111111010011010100000001000010 3f3f3f3f3f3f3f3f3f3f4d403f3f3f3f3f3f3f3f3f3f4d4042
SJIS-WIN 嗚??泣?嗚??泣?M@嗚??泣?嗚??泣?M@B 100110100110101000111111001111111000101110000011001111111001101001101010001111110011111110001011100000110011111101001101010000001001101001101010001111110011111110001011100000110011111110011010011010100011111100111111100010111000001100111111010011010100000001000010 9a6a3f3f8b833f9a6a3f3f8b833f4d409a6a3f3f8b833f9a6a3f3f8b833f4d4042
EUC-JP 嗚??泣?嗚??泣?M@嗚??泣?嗚??泣?M@B 110100111100101100111111001111111011010111100011001111111101001111001011001111110011111110110101111000110011111101001101010000001101001111001011001111110011111110110101111000110011111111010011110010110011111100111111101101011110001100111111010011010100000001000010 d3cb3f3fb5e33fd3cb3f3fb5e33f4d40d3cb3f3fb5e33fd3cb3f3fb5e33f4d4042
UTF-8 嗚삳챿泣갎嗚삳챿泣갲M@嗚삳챿泣갎嗚삳챿泣갲M@B 1110010110010111100110101110110010000010101100111110110010110001101111111110011010110011101000111110101010110000100011101110010110010111100110101110110010000010101100111110110010110001101111111110011010110011101000111110101010110000101100100100110101000000111001011001011110011010111011001000001010110011111011001011000110111111111001101011001110100011111010101011000010001110111001011001011110011010111011001000001010110011111011001011000110111111111001101011001110100011111010101011000010110010010011010100000001000010 e5979aec82b3ecb1bfe6b3a3eab08ee5979aec82b3ecb1bfe6b3a3eab0b24d40e5979aec82b3ecb1bfe6b3a3eab08ee5979aec82b3ecb1bfe6b3a3eab0b24d4042
UHC 嗚삳챿泣갎嗚삳챿泣갲M@嗚삳챿泣갎嗚삳챿泣갲M@B 111001111111000010111011111010111010101010001100111010111110100010000001010010001110011111110000101110111110101110101010100011001110101111101000100000010101100001001101010000001110011111110000101110111110101110101010100011001110101111101000100000010100100011100111111100001011101111101011101010101000110011101011111010001000000101011000010011010100000001000010 e7f0bbebaa8cebe88148e7f0bbebaa8cebe881584d40e7f0bbebaa8cebe88148e7f0bbebaa8cebe881584d4042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)