To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 櫻??韋??潁??倚э?蹂μ?筌??誼→?猷?? 1001111101001110001111110011111111101000111010000011111100111111100111111111000100111111001111111001100011011111100001001000111100111111111001101111100010000011110010100011111111100010101000110011111100111111100010110110001010000001101010000011111110010111010100010011111100111111 9f4e3f3fe8e83f3f9ff13f3f98df848f3fe6f883ca3fe2a33f3f8b6281a83f97513f3f
EUC-JP 櫻??韋??潁??倚э?蹂μ?筌??誼→?猷?? 1101110110101111001111110011111111110000111010100011111100111111110111101111001100111111001111111101000011100001101001111110111100111111111011001111101010100110110011000011111111100100101001010011111100111111101101011100001110100010101010100011111111001101101100100011111100111111 ddaf3f3ff0ea3f3fdef33f3fd0e1a7ef3fecfaa6cc3fe4a53f3fb5c3a2aa3fcdb23f3f
UTF-8 櫻뗰퐙韋귞맦潁뺚돦倚э쭓蹂μ졄筌껉퉭誼→떁猷몄젶 11100110101010111011101111101011100101111011000011101101100100001001100111101001100111111000101111101010101101111001111011101011101001111010011011100110101111011000000111101011101110101001101011101011100011111010011011100101100000001001101011010001100011011110110010101101100100111110100010111001100000101100111010111100111011001010000110000100111001111010110110001100111010101011101110001001111011011000100110101101111010001010101010111100111000101000011010010010111010111001011010000001111001111000110010110111111010111010101010000100111011001010000010110110 e6abbbeb97b0ed9099e99f8beab79eeba7a6e6bd81ebba9aeb8fa6e5809ad18decad93e8b982cebceca184e7ad8ceabb89ed89ade8aabce28692eb9681e78cb7ebaa84eca0b6
UHC 櫻뗰퐙韋귞맦潁뺚돦倚э쭓蹂μ졄筌껉퉭誼→떁猷몄젶 111001011010000110001011111011111011110110000011111010101101111110000010111001111001000010101111111001111011100010010101111000101000100110101010111010111110111110101100111011111010011110001011111010111011001110100101111011001010000010110101111011111010011110000011111010101011100110000101111010111111111010100001111001101000101110010111111010111010001110111000111011001010000010101010 e5a18befbd83eadf82e790afe7b895e289aaebefacefa78bebb3a5eca0b5efa783eab985ebfea1e68b97eba3b8eca0aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)