To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??F^h??F^fN}??F^h??F^fN{^ 00111111001111110100011001011110011010000011111100111111010001100101111001100110010011100111110100111111001111110100011001011110011010000011111100111111010001100101111001100110010011100111101101011110 3f3f465e683f3f465e664e7d3f3f465e683f3f465e664e7b5e
SJIS-WIN 惡搖F^h惡搖F^fN}惡搖F^h惡搖F^fN{^ 100111001010011010011101100010100100011001011110011010001001110010100110100111011000101001000110010111100110011001001110011111011001110010100110100111011000101001000110010111100110100010011100101001101001110110001010010001100101111001100110010011100111101101011110 9ca69d8a465e689ca69d8a465e664e7d9ca69d8a465e689ca69d8a465e664e7b5e
EUC-JP 惡搖F^h惡搖F^fN}惡搖F^h惡搖F^fN{^ 110110001010100011011001111010100100011001011110011010001101100010101000110110011110101001000110010111100110011001001110011111011101100010101000110110011110101001000110010111100110100011011000101010001101100111101010010001100101111001100110010011100111101101011110 d8a8d9ea465e68d8a8d9ea465e664e7dd8a8d9ea465e68d8a8d9ea465e664e7b5e
UTF-8 惡搖F^h惡搖F^fN}惡搖F^h惡搖F^fN{^ 1110011010000011101000011110011010010000100101100100011001011110011010001110011010000011101000011110011010010000100101100100011001011110011001100100111001111101111001101000001110100001111001101001000010010110010001100101111001101000111001101000001110100001111001101001000010010110010001100101111001100110010011100111101101011110 e683a1e69096465e68e683a1e69096465e664e7de683a1e69096465e68e683a1e69096465e664e7b5e
UHC 惡搖F^h惡搖F^fN}惡搖F^h惡搖F^fN{^ 111001001100001011101000111101000100011001011110011010001110010011000010111010001111010001000110010111100110011001001110011111011110010011000010111010001111010001000110010111100110100011100100110000101110100011110100010001100101111001100110010011100111101101011110 e4c2e8f4465e68e4c2e8f4465e664e7de4c2e8f4465e68e4c2e8f4465e664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)