To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???韋??管幼??誘?????臾??懿??^ 0011111100111111001111111110100011101000001111110011111110001010110001111001011101100011001111110011111110010111010101010011111100111111001111110011111100111111111001000110101100111111001111111001110011110010001111110011111101011110 3f3f3fe8e83f3f8ac797633f3f97553f3f3f3f3fe46b3f3f9cf23f3f5e
EUC-JP ???韋??管幼??誘?????臾??懿??^ 0011111100111111001111111111000011101010001111110011111110110100110010011100110111000100001111110011111111001101101101100011111100111111001111110011111100111111111001111100110000111111001111111101100011110100001111110011111101011110 3f3f3ff0ea3f3fb4c9cdc43f3fcdb63f3f3f3f3fe7cc3f3fd8f43f3f5e
UTF-8 捻곌풝韋껅틦管幼싨콨誘⑹젵廬믩객臾묊춯懿몄젟^ 11101111101001101010010011101010101100111000110011101101100100101001110111101001100111111000101111101010101110111000010111101101100010111010011011100111101011101010000111100101101110011011110011101100100010111010100011101100101111011010100011101000101010101001100011100010100100011011100111101100101000001011010111101111101001101000001011101011101011111010100111101010101100001001110111101000100001111011111011101011101011001000101011101100101101101010111111100110100001111011111111101011101010101000010011101100101000001001111101011110 efa6a4eab38ced929de99f8beabb85ed8ba6e7aea1e5b9bcec8ba8ecbda8e8aa98e291b9eca0b5efa682ebafa9eab09de887beebac8aecb6afe687bfebaa84eca09f5e
UHC 捻곌풝韋껅틦管幼싨콨誘⑹젵廬믩객臾묊춯懿몄젟^ 111001101111011110110000111010101011111010100000111010101101111110000011111001101011101010010000110011101011011111101010111010101001101011100110101100011001110111101011101011111010100111101100101000001010100111100101111111101001001011101011101100001011010011101011101011001001000111100111101011011000110011101011111100111011100011101100101000001001100101011110 e6f7b0eabea0eadf83e6ba90ceb7eaea9ae6b19debafa9eca0a9e5fe92ebb0b4ebac91e7ad8cebf3b8eca0995e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)