To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 æø·Wznfæø·Wzn^}Yæø·Wznfæø·Wzn^}bE 111001101111100010110111010101110111101001101110011001101110011011111000101101110101011101111010011011100101111001111101010110011110011011111000101101110101011101111010011011100110011011100110111110001011011101010111011110100110111001011110011111010110001001000101 e6f8b7577a6e66e6f8b7577a6e5e7d59e6f8b7577a6e66e6f8b7577a6e5e7d6245
SJIS-WIN ???Wznf???Wzn^}Y???Wznf???Wzn^}bE 001111110011111100111111010101110111101001101110011001100011111100111111001111110101011101111010011011100101111001111101010110010011111100111111001111110101011101111010011011100110011000111111001111110011111101010111011110100110111001011110011111010110001001000101 3f3f3f577a6e663f3f3f577a6e5e7d593f3f3f577a6e663f3f3f577a6e5e7d6245
EUC-JP æø?Wznfæø?Wzn^}Yæø?Wznfæø?Wzn^}bE 10001111101010011100000110001111101010011100110000111111010101110111101001101110011001101000111110101001110000011000111110101001110011000011111101010111011110100110111001011110011111010101100110001111101010011100000110001111101010011100110000111111010101110111101001101110011001101000111110101001110000011000111110101001110011000011111101010111011110100110111001011110011111010110001001000101 8fa9c18fa9cc3f577a6e668fa9c18fa9cc3f577a6e5e7d598fa9c18fa9cc3f577a6e668fa9c18fa9cc3f577a6e5e7d6245
UTF-8 æø·Wznfæø·Wzn^}Yæø·Wznfæø·Wzn^}bE 110000111010011011000011101110001100001010110111010101110111101001101110011001101100001110100110110000111011100011000010101101110101011101111010011011100101111001111101010110011100001110100110110000111011100011000010101101110101011101111010011011100110011011000011101001101100001110111000110000101011011101010111011110100110111001011110011111010110001001000101 c3a6c3b8c2b7577a6e66c3a6c3b8c2b7577a6e5e7d59c3a6c3b8c2b7577a6e66c3a6c3b8c2b7577a6e5e7d6245
UHC æø·Wznfæø·Wzn^}Yæø·Wznfæø·Wzn^}bE 101010011010000110101001101010101010000110100100010101110111101001101110011001101010100110100001101010011010101010100001101001000101011101111010011011100101111001111101010110011010100110100001101010011010101010100001101001000101011101111010011011100110011010101001101000011010100110101010101000011010010001010111011110100110111001011110011111010110001001000101 a9a1a9aaa1a4577a6e66a9a1a9aaa1a4577a6e5e7d59a9a1a9aaa1a4577a6e66a9a1a9aaa1a4577a6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)