To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 𬎰ñáØ珸á𬎰ðáØçì¸ 111100001010110010001110101100001111000111100001110110001110011110001111101110001110000111110000101011001000111010110000111100001110000111011000111001111110110010111000 f0ac8eb0f1e1d8e78fb8e1f0ac8eb0f0e1d8e7ecb8
SJIS-WIN ?¬?°????????¬?°?????? 00111111100000011100101000111111100000011000101100111111001111110011111100111111001111110011111100111111001111111000000111001010001111111000000110001011001111110011111100111111001111110011111100111111 3f81ca3f818b3f3f3f3f3f3f3f3f81ca3f818b3f3f3f3f3f3f
EUC-JP ð¬?°ñáØç?¸áð¬?°ðáØçì¸ 1000111110101001110000111010001011001100001111111010000111101011100011111010101111010000100011111010101110100001100011111010100110101100100011111010101110101110001111111000111110100010101100011000111110101011101000011000111110101001110000111010001011001100001111111010000111101011100011111010100111000011100011111010101110100001100011111010100110101100100011111010101110101110100011111010101111000000100011111010001010110001 8fa9c3a2cc3fa1eb8fabd08faba18fa9ac8fabae3f8fa2b18faba18fa9c3a2cc3fa1eb8fa9c38faba18fa9ac8fabae8fabc08fa2b1
UTF-8 𬎰ñáØ珸á𬎰ðáØçì¸ 110000111011000011000010101011001100001010001110110000101011000011000011101100011100001110100001110000111001100011000011101001111100001010001111110000101011100011000011101000011100001110110000110000101010110011000010100011101100001010110000110000111011000011000011101000011100001110011000110000111010011111000011101011001100001010111000 c3b0c2acc28ec2b0c3b1c3a1c398c3a7c28fc2b8c3a1c3b0c2acc28ec2b0c3b0c3a1c398c3a7c3acc2b8
UHC ð??°??Ø??¸?ð??°ð?Ø??¸ 101010011010001100111111001111111010000111000110001111110011111110101000101010100011111100111111101000101010110000111111101010011010001100111111001111111010000111000110101010011010001100111111101010001010101000111111001111111010001010101100 a9a33f3fa1c63f3fa8aa3f3fa2ac3fa9a33f3fa1c6a9a33fa8aa3f3fa2ac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)