What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????m}?????????m{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110110101111101001111110011111100111111001111110011111100111111001111110011111100111111011011010111101101011110 3f3f3f3f3f3f3f3f3f6d7d3f3f3f3f3f3f3f3f3f6d7b5e
SJIS-WIN ???瀛??嘶??m}???瀛??嘶??m{^ 001111110011111100111111111000000110100100111111001111111001101001111100001111110011111101101101011111010011111100111111001111111110000001101001001111110011111110011010011111000011111100111111011011010111101101011110 3f3f3fe0693f3f9a7c3f3f6d7d3f3f3fe0693f3f9a7c3f3f6d7b5e
EUC-JP ???瀛??嘶??m}???瀛??嘶??m{^ 001111110011111100111111110111111100101000111111001111111101001111011101001111110011111101101101011111010011111100111111001111111101111111001010001111110011111111010011110111010011111100111111011011010111101101011110 3f3f3fdfca3f3fd3dd3f3f6d7d3f3f3fdfca3f3fd3dd3f3f6d7b5e
UTF-8 拾굝뜪瀛귦뢝嘶꼷뢞m}拾굝뜪瀛귦뢝嘶꼷뢞m{^ 1110111110100101101100111110101010110101100111011110101110011100101010101110011110000000100110111110101010110111101001101110101110100010100111011110010110011000101101101110101010111100101101111110101110100010100111100110110101111101111011111010010110110011111010101011010110011101111010111001110010101010111001111000000010011011111010101011011110100110111010111010001010011101111001011001100010110110111010101011110010110111111010111010001010011110011011010111101101011110 efa5b3eab59deb9caae7809beab7a6eba29de598b6eabcb7eba29e6d7defa5b3eab59deb9caae7809beab7a6eba29de598b6eabcb7eba29e6d7b5e
UHC 拾굝뜪瀛귦뢝嘶꼷뢞m}拾굝뜪瀛귦뢝嘶꼷뢞m{^ 1110010010101001100000101000010110001101101010111110011110111010100000101110110110001111010110001110001110110110100001001000111110001111010110010110110101111101111001001010100110000010100001011000110110101011111001111011101010000010111011011000111101011000111000111011011010000100100011111000111101011001011011010111101101011110 e4a982858dabe7ba82ed8f58e3b6848f8f596d7de4a982858dabe7ba82ed8f58e3b6848f8f596d7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table.
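
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Unencodable characters are replaced with "?" (0x3f), which appears to match what the table shows for ISO-8859-1.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hex bit string.
    for name, (binary, hexadecimal) in encode_table("m{^").items():
        print(name, binary, hexadecimal)
```

For plain ASCII input such as `m{^`, every charset above yields the same bytes (`6d7b5e`); the rows only diverge for characters outside ASCII.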