To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鈺而妺質鈺而甯爾鈺而妺質鈺而甯爾^ 111110111100010010001110101001111111101010100101100011101011111111111011110001001000111010100111111110101010100010001110101000101111101111000100100011101010011111111010101001011000111010111111111110111100010010001110101001111111101010101000100011101010001001011110 fbc48ea7faa58ebffbc48ea7faa88ea2fbc48ea7faa58ebffbc48ea7faa88ea25e
EUC-JP 鈺而妺質鈺而甯爾鈺而妺質鈺而甯爾^ 1000111111100011110101011011110010101001100011111011100110110111101111001100000110001111111000111101010110111100101010011000111111001101101010101011110010100100100011111110001111010101101111001010100110001111101110011011011110111100110000011000111111100011110101011011110010101001100011111100110110101010101111001010010001011110 8fe3d5bca98fb9b7bcc18fe3d5bca98fcdaabca48fe3d5bca98fb9b7bcc18fe3d5bca98fcdaabca45e
UTF-8 鈺而妺質鈺而甯爾鈺而妺質鈺而甯爾^ 11101001100010001011101011101000100000001000110011100101101001101011101011101000101100111010101011101001100010001011101011101000100000001000110011100111100101001010111111100111100010001011111011101001100010001011101011101000100000001000110011100101101001101011101011101000101100111010101011101001100010001011101011101000100000001000110011100111100101001010111111100111100010001011111001011110 e988bae8808ce5a6bae8b3aae988bae8808ce794afe788bee988bae8808ce5a6bae8b3aae988bae8808ce794afe788be5e
UHC 鈺而?質鈺而?爾鈺而?質鈺而?爾^ 1110100010101101111011001011101100111111111100101111010111101000101011011110110010111011001111111110110010110011111010001010110111101100101110110011111111110010111101011110100010101101111011001011101100111111111011001011001101011110 e8adecbb3ff2f5e8adecbb3fecb3e8adecbb3ff2f5e8adecbb3fecb35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)