What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????^ | 00111111001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN | ?ш?旬??寃??^ | 00111111100001001000101000111111100011110111101100111111001111111001101110000011001111110011111101011110 | 3f848a3f8f7b3f3f9b833f3f5e
EUC-JP | ?ш?旬??寃??^ | 00111111101001111110101000111111101111011101110000111111001111111101010111100011001111110011111101011110 | 3fa7ea3fbddc3f3fd5e33f3f5e
UTF-8 | 輦ш꼈旬루넭寃쇱뿳^ | 111011111010011010011000110100011000100011101010101111001000100011100110100101111010110011101011101000111010100011101011100001001010110111100101101011111000001111101100100001111011000111101011101111111011001101011110 | efa698d188eabc88e697aceba3a8eb84ade5af83ec87b1ebbfb35e
UHC | 輦ш꼈旬루넭寃쇱뿳^ | 11100110111001001010110011101010101100101011110011100010111000101011011111100111100001101010110011101010101100101011110011101100100101111011001101011110 | e6e4aceab2bce2e2b7e786aceab2bcec97b35e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
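
The conversion shown in the table can be approximated with the following minimal Python sketch. It is not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions about which Python codecs correspond to the charsets listed, and the helper names `encode_row` and `convert` are hypothetical. Replacing unencodable characters with `?` (0x3f) is also an assumption, inferred from the `3f` bytes in the table; individual character mappings may differ from the ones this page uses.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced with "?" (0x3f).
    data = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits and concatenate them.
    bits = "".join(f"{byte:08b}" for byte in data)
    # Decoding the bytes again shows what survived the conversion.
    return data.decode(codec), bits, data.hex()


def convert(text):
    """Build one row per charset, keyed by the charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in convert("ш^").items():
        print(label, shown, bits, hexa)
```

Running the script prints one line per charset with the same four columns as the table above.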