To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????M@?????????M@B 0011111100111111001111110011111100111111001111110011111100111111001111110100110101000000001111110011111100111111001111110011111100111111001111110011111100111111010011010100000001000010 3f3f3f3f3f3f3f3f3f4d403f3f3f3f3f3f3f3f3f4d4042
SJIS-WIN 鵝??厭??魚??M@鵝??厭??魚??M@B 1110101001000000001111110011111110001001011111010011111100111111100010111001101100111111001111110100110101000000111010100100000000111111001111111000100101111101001111110011111110001011100110110011111100111111010011010100000001000010 ea403f3f897d3f3f8b9b3f3f4d40ea403f3f897d3f3f8b9b3f3f4d4042
EUC-JP 鵝??厭??魚??M@鵝??厭??魚??M@B 1111001110100001001111110011111110110001110111100011111100111111101101011111101100111111001111110100110101000000111100111010000100111111001111111011000111011110001111110011111110110101111110110011111100111111010011010100000001000010 f3a13f3fb1de3f3fb5fb3f3f4d40f3a13f3fb1de3f3fb5fb3f3f4d4042
UTF-8 鵝얜젶厭묐젒魚좉킀M@鵝얜젶厭묐젒魚좉킀M@B 1110100110110101100111011110110010010110100111001110110010100000101101101110010110001110101011011110101110101100100100001110110010100000100100101110100110101101100110101110110010100010100010011110110110000010100000000100110101000000111010011011010110011101111011001001011010011100111011001010000010110110111001011000111010101101111010111010110010010000111011001010000010010010111010011010110110011010111011001010001010001001111011011000001010000000010011010100000001000010 e9b59dec969ceca0b6e58eadebac90eca092e9ad9aeca289ed82804d40e9b59dec969ceca0b6e58eadebac90eca092e9ad9aeca289ed82804d4042
UHC 鵝얜젶厭묐젒魚좉킀M@鵝얜젶厭묐젒魚좉킀M@B 1110010010111101101111101110101110100000101010101110011011110100100100011110101110100000100100011110010111100000101000001110101010110100100011010100110101000000111001001011110110111110111010111010000010101010111001101111010010010001111010111010000010010001111001011110000010100000111010101011010010001101010011010100000001000010 e4bdbeeba0aae6f491eba091e5e0a0eab48d4d40e4bdbeeba0aae6f491eba091e5e0a0eab48d4d4042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)