To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??移?鷹??怨烽∧??移?鷹??怨烽∧^ 00111111001111111000100011011010001111111001000111101001001111110011111110001001100001011110000010000010100000011100100000111111001111111000100011011010001111111001000111101001001111110011111110001001100001011110000010000010100000011100100001011110 3f3f88da3f91e93f3f8985e08281c83f3f88da3f91e93f3f8985e08281c85e
EUC-JP ??移?鷹??怨烽∧??移?鷹??怨烽∧^ 00111111001111111011000011011100001111111100001011101011001111110011111110110001111001011101111111100010101000101100101000111111001111111011000011011100001111111100001011101011001111110011111110110001111001011101111111100010101000101100101001011110 3f3fb0dc3fc2eb3f3fb1e5dfe2a2ca3f3fb0dc3fc2eb3f3fb1e5dfe2a2ca5e
UTF-8 欌렪移렊鷹꿴떵怨烽∧欌렪移렊鷹꿴떵怨烽∧^ 11100110101011001000110011101011101000001010101011100111101001111011101111101011101000001000101011101001101101111011100111101010101111111011010011101011100101101011010111100110100000001010100011100111100000111011110111100010100010001010011111100110101011001000110011101011101000001010101011100111101001111011101111101011101000001000101011101001101101111011100111101010101111111011010011101011100101101011010111100110100000001010100011100111100000111011110111100010100010001010011101011110 e6ac8ceba0aae7a7bbeba08ae9b7b9eabfb4eb96b5e680a8e783bde288a7e6ac8ceba0aae7a7bbeba08ae9b7b9eabfb4eb96b5e680a8e783bde288a75e
UHC 欌렪移렊鷹꿴떵怨烽∧欌렪移렊鷹꿴떵怨烽∧^ 1110110111101011100011101011100011101100101110011000111010100001111010111110110110110010111010011011011010111010111010101011001111011100111010111010000111111100111011011110101110001110101110001110110010111001100011101010000111101011111011011011001011101001101101101011101011101010101100111101110011101011101000011111110001011110 edeb8eb8ecb98ea1ebedb2e9b6baeab3dceba1fcedeb8eb8ecb98ea1ebedb2e9b6baeab3dceba1fc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)