To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???????二??怨??翁??誼??鷹?? 001111110011111100111111001111110011111100111111001111111001001111110001001111110011111110001001100001010011111100111111100010011010010100111111001111111000101101100010001111110011111110010001111010010011111100111111 3f3f3f3f3f3f3f93f13f3f89853f3f89a53f3f8b623f3f91e93f3f
EUC-JP ???????二??怨??翁??誼??鷹?? 001111110011111100111111001111110011111100111111001111111100011011110011001111110011111110110001111001010011111100111111101100101010011100111111001111111011010111000011001111110011111111000010111010110011111100111111 3f3f3f3f3f3f3fc6f33f3fb1e53f3fb2a73f3fb5c33f3fc2eb3f3f
UTF-8 閱뤣꾨큵樂낅뿩二당윯怨살돟翁띾쓧誼껅븭鷹됱뒳 111010011001011010110001111010111010010010100011111010101011111010101000111011011000000110110101111011111010011010111111111010111000001010000101111010111011111110101001111001001011101010001100111010111000101110111001111011001001110010101111111001101000000010101000111011001000001010110100111010111000111110011111111001111011111110000001111010111001110110111110111011001001001110100111111010001010101010111100111010101011101110000101111010111011100010101101111010011011011110111001111010111001000010110001111010111001001010110011 e996b1eba4a3eabea8ed81b5efa6bfeb8285ebbfa9e4ba8ceb8bb9ec9cafe680a8ec82b4eb8f9fe7bf81eb9dbeec93a7e8aabceabb85ebb8ade9b7b9eb90b1eb92b3
UHC 閱뤣꾨큵樂낅뿩二당윯怨살돟翁띾쓧誼껅븭鷹됱뒳 1110011011110011100011111101000110000100111010111011010010000100111010001111100110000101111010111001011110101001111011001010001110110100111001111001111110101110111010101011001110111011111011001000100110100101111010001011101010001101111010111001110110001000111010111111111010000011111001101001010110010110111010111110110110001001111011001000101010101100 e6f38fd184ebb484e8f985eb97a9eca3b4e79faeeab3bbec89a5e8ba8deb9d88ebfe83e69596ebed89ec8aac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)