To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 要ο?澳??節o?晤??撓??節????? 1001011101110110100000111100110100111111111000000101001100111111001111111001000011011111100000101000111100111111100111011110101100111111001111111001110110011010001111110011111110010000110111110011111100111111001111110011111100111111 977683cd3fe0533f3f90df828f3f9deb3f3f9d9a3f3f90df3f3f3f3f3f
EUC-JP 要ο?澳??節o?晤??撓??節??縕?? 11001101110101111010011011001111001111111101111110110100001111110011111111000000111000011010001111101111001111111101101011101101001111110011111111011001111110100011111100111111110000001110000100111111001111111000111111010100110000100011111100111111 cdd7a6cf3fdfb43f3fc0e1a3ef3fdaed3f3fd9fa3f3fc0e13f3f8fd4c23f3f
UTF-8 要ο쉭澳묈돭節o슝晤볩슘撓껅솄節ㅿ스縕귨슝 1110100010100110100000011100111010111111111011001000100110101101111001101011111010110011111010111010110010001000111010111000111110101101111001111010111110000000111011111011110110001111111011001000101010011101111001101001100110100100111010111011001110101001111011001000101010011000111001101001001010010011111010101011101110000101111011001000011010000100111001111010111110000000111000111000010110111111111011001000101010100100111001111011100010010101111010101011011110101000111011001000101010011101 e8a681cebfec89ade6beb3ebac88eb8fade7af80efbd8fec8a9de699a4ebb3a9ec8a98e69293eabb85ec8684e7af80e385bfec8aa4e7b895eab7a8ec8a9d
UHC 要ο쉭澳묈돭節o슝晤볩슘撓껅솄節ㅿ스縕귨슝 111010011010100110100101111011111011110110101101111001111111111010010001111001011000100110110000111011111011110110100011111011111011110110111001111001111111101110010011111011111011110110110111111010001111010110000011111001101001100110001001111011111011110110100100111011111011110110111010111010001011001010000010111011111011110110111001 e9a9a5efbdade7fe91e589b0efbda3efbdb9e7fb93efbdb7e8f583e69989efbda4efbdbae8b282efbdb9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)