To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 澳?????營??恂??節??松х????B 11100000010100110011111100111111001111110011111100111111100110100111101000111111001111111001110010010110001111110011111110010000110111110011111100111111100011111011110010000100100001110011111100111111001111110011111101000010 e0533f3f3f3f3f9a7a3f3f9c963f3f90df3f3f8fbc84873f3f3f3f42
EUC-JP 澳?????營??恂??節??松х????B 11011111101101000011111100111111001111110011111100111111110100111101101100111111001111111101011111110110001111110011111111000000111000010011111100111111101111101011111010100111111001110011111100111111001111110011111101000010 dfb43f3f3f3f3fd3db3f3fd7f63f3fc0e13f3fbebea7e73f3f3f3f42
UTF-8 澳묕쉼樂됤닾營먬썛恂삭젒節곈썖松х퐧樂롢꺕B 111001101011111010110011111010111010110010010101111011001000100110111100111011111010011010111111111010111001000010100100111010111000101110111110111001111000011110011111111010111010100010101100111011001000110110011011111001101000000110000010111011001000001010101101111011001010000010010010111001111010111110000000111010101011001110001000111011001000110110010110111001101001110110111110110100011000010111101101100100001010011111101111101001101011111111101011101000011010001011101010101110101001010101000010 e6beb3ebac95ec89bcefa6bfeb90a4eb8bbee7879feba8acec8d9be68182ec82adeca092e7af80eab388ec8d96e69dbed185ed90a7efa6bfeba1a2eaba9542
UHC 澳묕쉼樂됤닾營먬썛恂삭젒節곈썖松х퐧樂롢꺕B 11100111111111101001000111101111101111011011000011101000111110011000100111100010100010001010110011100111101111011001000011101001100110111000111011100010111000011011101111101000101000001001000111101111101111011011000011101001100110111000100111100001111001101010110011100111101111011001000011101000111110011000111011100011100000111011101101000010 e7fe91efbdb0e8f989e288ace7bd90e99b8ee2e1bbe8a091efbdb0e99b89e1e6ace7bd90e8f98ee383bb42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)