What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 曜???ы??る?節?????荏??幼??^ 10010111011010100011111100111111001111111000010010001101001111110011111110000010111010010011111110010000110111110011111100111111001111110011111100111111100010010110000000111111001111111001011101100011001111110011111101011110 976a3f3f3f848d3f3f82e93f90df3f3f3f3f3f89603f3f97633f3f5e
EUC-JP 曜???ы??る?節?????荏??幼??^ 11001101110010110011111100111111001111111010011111101101001111110011111110100100111010110011111111000000111000010011111100111111001111110011111100111111101100011100000100111111001111111100110111000100001111110011111101011110 cdcb3f3f3fa7ed3f3fa4eb3fc0e13f3f3f3f3fb1c13f3fcdc43f3f5e
UTF-8 曜쏅젒吳ы쓳溜る뙌節싮슋溜묋뫊荏묐젶幼묕퐯^ 111001101001101110011100111011001000111110000101111011001010000010010010111001011001000010110011110100011000101111101100100100111011001111101111101001111000101111100011100000101000101111101011100110011000110011100111101011111000000011101100100010111010111011101100100010101000101111101111101001111000101111101011101011001000101111101011101010111000101011101000100011011000111111101011101011001001000011101100101000001011011011100101101110011011110011101011101011001001010111101101100100001010111101011110 e69b9cec8f85eca092e590b3d18bec93b3efa78be3828beb998ce7af80ec8baeec8a8befa78bebac8bebab8ae88d8febac90eca0b6e5b9bcebac95ed90af5e
UHC 曜쏅젒吳ы쓳溜る뙌節싮슋溜묋뫊荏묐젶幼묕퐯^ 11101000111110001001101111101011101000001001000111100111111011111010110011101101100111011001000111101010111111101010101011101011100011001001000111101111101111011001101011101001100110101001101111101010111111101001000111101000100100011010110011101100111110111001000111101011101000001010101011101010111010101001000111101111101111011001100001011110 e8f89beba091e7efaced9d91eafeaaeb8c91efbd9ae99a9beafe91e891acecfb91eba0aaeaea91efbd985e
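A table like the one above can be approximated with a few lines of Python. This is only a minimal sketch, not necessarily how this page's converter works: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents of the charset labels, and `encode_all` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in encode_all("曜^").items():
        print(label, bits, hexa)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal one.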

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).