What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 祭頭??庄?└??祭頭??庄?└??^ 100011011101010110010011101010100011111100111111100011111010111100111111100001001010010000111111001111111000110111010101100100111010101000111111001111111000111110101111001111111000010010100100001111110011111101011110 8dd593aa3f3f8faf3f84a43f3f8dd593aa3f3f8faf3f84a43f3f5e
EUC-JP 祭頭??庄?└頊?祭頭??庄?└頊?^ 10111010110101111100011010101100001111110011111110111110101100010011111110101000101001101000111111100111111101000011111110111010110101111100011010101100001111110011111110111110101100010011111110101000101001101000111111100111111101000011111101011110 bad7c6ac3f3fbeb13fa8a68fe7f43fbad7c6ac3f3fbeb13fa8a68fe7f43f5e
UTF-8 祭頭렗렗庄흙└頊텡祭頭렗렗庄흙└頊텝^ 11100111101001011010110111101001101000001010110111101011101000001001011111101011101000001001011111100101101110101000010011101101100111011001100111100010100101001001010011101001101000001000101011101101100001011010000111100111101001011010110111101001101000001010110111101011101000001001011111101011101000001001011111100101101110101000010011101101100111011001100111100010100101001001010011101001101000001000101011101101100001011001110101011110 e7a5ade9a0adeba097eba097e5ba84ed9d99e29494e9a08aed85a1e7a5ade9a0adeba097eba097e5ba84ed9d99e29494e9a08aed859d5e
UHC 祭頭렗렗庄흙└頊텡祭頭렗렗庄흙└頊텝^ 11110000101011101101010011101001100011101010110010001110101011001110110111100100110010001110101110100110101001101110100111110101110001011101111011110000101011101101010011101001100011101010110010001110101011001110110111100100110010001110101110100110101001101110100111110101110001011101110001011110 f0aed4e98eac8eacede4c8eba6a6e9f5c5def0aed4e98eac8eacede4c8eba6a6e9f5c5dc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
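
The conversion shown in the table can be approximated with a short Python sketch. This page's own implementation is not shown, so the code below is an illustration only, not the tool's actual method. The codec names (cp932 for SJIS-Win, cp949 for UHC) and the helper names encode_row and encode_table are assumptions made for the example. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These names are this sketch's choice, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("祭^").items():
        print(label, bits, hex_string)
```

Because "^" is plain ASCII, it encodes to the single byte 5e in every charset listed, which is why each row of the table ends in 5e.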