To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 筌??巍??應у?隘?????宥??縊e?^ 111000101010001100111111001111111001101111011001001111110011111110011100111001001000010010000101001111111110100010100101001111110011111100111111001111110011111110010111010001110011111100111111111000110110111110000010100001010011111101011110 e2a33f3f9bd93f3f9ce484853fe8a53f3f3f3f3f97473f3fe36f82853f5e
EUC-JP 筌??巍??應у?隘?????宥??縊e?^ 111001001010010100111111001111111101011011011011001111110011111111011000111001101010011111100101001111111111000010100111001111110011111100111111001111110011111111001101101010000011111100111111111001011101000010100011111001010011111101011110 e4a53f3fd6db3f3fd8e6a7e53ff0a73f3f3f3f3fcda83f3fe5d0a3e53f5e
UTF-8 筌잙젾巍띾떧應у옡隘뷂쭫溜곈냽宥븐뜫縊e삍^ 111001111010110110001100111011001001111010011001111011001010000010111110111001011011011110001101111010111001110110111110111010111001011010100111111001101000011110001001110100011000001111101100100110001010000111101001100110101001100011101011101101111000001011101100101011011010101111101111101001111000101111101010101100111000100011101011100000111011110111100101101011101010010111101011101110001001000011101011100111001010101111100111101110001000101011101111101111011000010111101100100000101000110101011110 e7ad8cec9e99eca0bee5b78deb9dbeeb96a7e68789d183ec98a1e99a98ebb782ecadabefa78beab388eb83bde5aea5ebb890eb9cabe7b88aefbd85ec828d5e
UHC 筌잙젾巍띾떧應у옡隘뷂쭫溜곈냽宥븐뜫縊e삍^ 11101111101001111001111111101011101000001011000011101000111001001000110111101011100010111011101011101011111010111010110011100101100111101010001111100100111101101001010011101111101001111001111111101010111111101011000011101001100001101000110111101010111010011011101011101100100011011010110011100100111111001010001111100101100110001001010001011110 efa79feba0b0e8e48deb8bbaebebace59ea3e4f694efa79feafeb0e9868deae9baec8dace4fca3e598945e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)