What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ì°»þìøªì¨˜ì°»þìøªì¨˜^ 111011001011000010111011111111101110110011111000101010101110110010101000100110001110110010110000101110111111111011101100111110001010101011101100101010001001100001011110 ecb0bbfeecf8aaeca898ecb0bbfeecf8aaeca8985e
SJIS-WIN ?°??????¨??°??????¨?^ 00111111100000011000101100111111001111110011111100111111001111110011111110000001010011100011111100111111100000011000101100111111001111110011111100111111001111110011111110000001010011100011111101011110 3f818b3f3f3f3f3f3f814e3f3f818b3f3f3f3f3f3f814e3f5e
EUC-JP ì°?þìøªì¨?ì°?þìøªì¨?^ 10001111101010111100000010100001111010110011111110001111101010011101000010001111101010111100000010001111101010011100110010001111101000101110110010001111101010111100000010100001101011110011111110001111101010111100000010100001111010110011111110001111101010011101000010001111101010111100000010001111101010011100110010001111101000101110110010001111101010111100000010100001101011110011111101011110 8fabc0a1eb3f8fa9d08fabc08fa9cc8fa2ec8fabc0a1af3f8fabc0a1eb3f8fa9d08fabc08fa9cc8fa2ec8fabc0a1af3f5e
UTF-8 ì°»þìøªì¨˜ì°»þìøªì¨˜^ 1100001110101100110000101011000011000010101110111100001110111110110000111010110011000011101110001100001010101010110000111010110011000010101010001100001010011000110000111010110011000010101100001100001010111011110000111011111011000011101011001100001110111000110000101010101011000011101011001100001010101000110000101001100001011110 c3acc2b0c2bbc3bec3acc3b8c2aac3acc2a8c298c3acc2b0c2bbc3bec3acc3b8c2aac3acc2a8c2985e
UHC ?°?þ?øª?¨??°?þ?øª?¨?^ 00111111101000011100011000111111101010011010110100111111101010011010101010101000101000110011111110100001101001110011111100111111101000011100011000111111101010011010110100111111101010011010101010101000101000110011111110100001101001110011111101011110 3fa1c63fa9ad3fa9aaa8a33fa1a73f3fa1c63fa9ad3fa9aaa8a33fa1a73f5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
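
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the function name `encode_rows` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (0x3f), which resembles the `3f` bytes in the table, but the results may differ from the tool's for some characters because the underlying mapping tables are not necessarily identical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (charset, binary bit string, hexadecimal string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Each byte becomes 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("^"):
        print(label, bits, hex_string)
```

For an ASCII character such as `^`, every charset listed here produces the single byte `5e`, which is why each row of the table ends in `5e`.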