What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????r[?????????r[^ 0011111100111111001111110011111100111111001111110011111100111111001111110111001001011011001111110011111100111111001111110011111100111111001111110011111100111111011100100101101101011110 3f3f3f3f3f3f3f3f3f725b3f3f3f3f3f3f3f3f3f725b5e
SJIS-WIN ???瀛??嘶??r[???瀛??嘶??r[^ 001111110011111100111111111000000110100100111111001111111001101001111100001111110011111101110010010110110011111100111111001111111110000001101001001111110011111110011010011111000011111100111111011100100101101101011110 3f3f3fe0693f3f9a7c3f3f725b3f3f3fe0693f3f9a7c3f3f725b5e
EUC-JP ???瀛??嘶??r[???瀛??嘶??r[^ 001111110011111100111111110111111100101000111111001111111101001111011101001111110011111101110010010110110011111100111111001111111101111111001010001111110011111111010011110111010011111100111111011100100101101101011110 3f3f3fdfca3f3fd3dd3f3f725b3f3f3fdfca3f3fd3dd3f3f725b5e
UTF-8 拾굝뜪瀛귦뢝嘶꼷뢞r[拾굝뜪瀛귦뢝嘶꼷뢞r[^ 1110111110100101101100111110101010110101100111011110101110011100101010101110011110000000100110111110101010110111101001101110101110100010100111011110010110011000101101101110101010111100101101111110101110100010100111100111001001011011111011111010010110110011111010101011010110011101111010111001110010101010111001111000000010011011111010101011011110100110111010111010001010011101111001011001100010110110111010101011110010110111111010111010001010011110011100100101101101011110 efa5b3eab59deb9caae7809beab7a6eba29de598b6eabcb7eba29e725befa5b3eab59deb9caae7809beab7a6eba29de598b6eabcb7eba29e725b5e
UHC 拾굝뜪瀛귦뢝嘶꼷뢞r[拾굝뜪瀛귦뢝嘶꼷뢞r[^ 1110010010101001100000101000010110001101101010111110011110111010100000101110110110001111010110001110001110110110100001001000111110001111010110010111001001011011111001001010100110000010100001011000110110101011111001111011101010000010111011011000111101011000111000111011011010000100100011111000111101011001011100100101101101011110 e4a982858dabe7ba82ed8f58e3b6848f8f59725be4a982858dabe7ba82ed8f58e3b6848f8f59725b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
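
The conversion shown in the table above can be approximated with a short Python sketch. This is not the page's actual implementation. The helper names (encode_row, encode_table) are hypothetical, and the codec names are Python's, assumed here to correspond to the table's labels (cp932 for SJIS-WIN, cp949 for UHC). Characters a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Map the table's charset labels to Python codec names.
# These pairings are assumptions, not taken from the original tool.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("r[^").items():
        print(label, bits, hex_string)
```

ASCII characters such as "r[^" produce the same bytes (725b5e) in all five charsets, which is why every row of the table ends with the same bits.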