What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 筌??巍??應ц。??????宥??烏f?^ 111000101010001100111111001111111001101111011001001111110011111110011100111001001000010010001000100000010100001000111111001111110011111100111111001111110011111110010111010001110011111100111111100010010100011110000010100001100011111101011110 e2a33f3f9bd93f3f9ce4848881423f3f3f3f3f3f97473f3f894782863f5e
EUC-JP 筌??巍??應ц。濚?????宥??烏f?^ 1110010010100101001111110011111111010110110110110011111100111111110110001110011010100111111010001010000110100011100011111100100110100001001111110011111100111111001111110011111111001101101010000011111100111111101100011010100010100011111001100011111101011110 e4a53f3fd6db3f3fd8e6a7e8a1a38fc9a13f3f3f3f3fcda83f3fb1a8a3e63f5e
UTF-8 筌잙젾巍띾떧應ц。濚껓쭫溜곈냽宥븐뜫烏f짎^ 111001111010110110001100111011001001111010011001111011001010000010111110111001011011011110001101111010111001110110111110111010111001011010100111111001101000011110001001110100011000011011100011100000001000001011100110101111111001101011101010101110111001001111101100101011011010101111101111101001111000101111101010101100111000100011101011100000111011110111100101101011101010010111101011101110001001000011101011100111001010101111100111100000111000111111101111101111011000011011101100101001111000111001011110 e7ad8cec9e99eca0bee5b78deb9dbeeb96a7e68789d186e38082e6bf9aeabb93ecadabefa78beab388eb83bde5aea5ebb890eb9cabe7838fefbd86eca78e5e
UHC 筌잙젾巍띾떧應ц。濚껓쭫溜곈냽宥븐뜫烏f짎^ 11101111101001111001111111101011101000001011000011101000111001001000110111101011100010111011101011101011111010111010110011101000101000011010001111100111101110011000001111101111101001111001111111101010111111101011000011101001100001101000110111101010111010011011101011101100100011011010110011101000101000011010001111100110101000111001101001011110 efa79feba0b0e8e48deb8bbaebebace8a1a3e7b983efa79feafeb0e9868deae9baec8dace8a1a3e6a39a5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
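
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions chosen for illustration, and Python's mapping tables may differ from the tool's for a few characters. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is similar to the 3f runs visible in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Encode text with one codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in the table."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("あ^").items():
        print(label, bits, hexstr)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.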