What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 êþ»é°»êþªU}êþ»é°»êþªU{^ 1110101011111110101110111110100110110000101110111110101011111110101010100101010101111101111010101111111010111011111010011011000010111011111010101111111010101010010101010111101101011110 eafebbe9b0bbeafeaa557deafebbe9b0bbeafeaa557b5e
SJIS-WIN ????°????U}????°????U{^ 00111111001111110011111100111111100000011000101100111111001111110011111100111111010101010111110100111111001111110011111100111111100000011000101100111111001111110011111100111111010101010111101101011110 3f3f3f3f818b3f3f3f3f557d3f3f3f3f818b3f3f3f3f557b5e
EUC-JP êþ?é°?êþªU}êþ?é°?êþªU{^ 10001111101010111011010010001111101010011101000000111111100011111010101110110001101000011110101100111111100011111010101110110100100011111010100111010000100011111010001011101100010101010111110110001111101010111011010010001111101010011101000000111111100011111010101110110001101000011110101100111111100011111010101110110100100011111010100111010000100011111010001011101100010101010111101101011110 8fabb48fa9d03f8fabb1a1eb3f8fabb48fa9d08fa2ec557d8fabb48fa9d03f8fabb1a1eb3f8fabb48fa9d08fa2ec557b5e
UTF-8 êþ»é°»êþªU}êþ»é°»êþªU{^ 1100001110101010110000111011111011000010101110111100001110101001110000101011000011000010101110111100001110101010110000111011111011000010101010100101010101111101110000111010101011000011101111101100001010111011110000111010100111000010101100001100001010111011110000111010101011000011101111101100001010101010010101010111101101011110 c3aac3bec2bbc3a9c2b0c2bbc3aac3bec2aa557dc3aac3bec2bbc3a9c2b0c2bbc3aac3bec2aa557b5e
UHC ?þ??°??þªU}?þ??°??þªU{^ 00111111101010011010110100111111001111111010000111000110001111110011111110101001101011011010100010100011010101010111110100111111101010011010110100111111001111111010000111000110001111110011111110101001101011011010100010100011010101010111101101011110 3fa9ad3f3fa1c63f3fa9ada8a3557d3fa9ad3f3fa1c63f3fa9ada8a3557b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
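
The conversion in the table above can be roughly reproduced offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `cp949`) is an assumption, and the helper names `encode_row` and `convert` are made up for illustration. Characters a charset cannot represent are replaced with `?` (byte `3f`), as in the table. Mapping tables for the legacy charsets differ between implementations, so those bytes may not match the table exactly.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("é°U").items():
        print(label, bits, hex_string)
```

Each byte is printed as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.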