To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 晤o?邀??節??穩??晤??娃????? 10011101111010111000001010001111001111111110011110110001001111110011111110010000110111110011111100111111111000100111001000111111001111111001110111101011001111110011111110001000101000010011111100111111001111110011111100111111 9deb828f3fe7b13f3f90df3f3fe2723f3f9deb3f3f88a13f3f3f3f3f
EUC-JP 晤o?邀??節??穩??晤??娃????? 11011010111011011010001111101111001111111110111010110011001111110011111111000000111000010011111100111111111000111101001100111111001111111101101011101101001111110011111110110000101000110011111100111111001111110011111100111111 daeda3ef3feeb33f3fc0e13f3fe3d33f3fdaed3f3fb0a33f3f3f3f3f
UTF-8 晤o슈邀섋뙣節쏙슈穩쇽슉晤볩슘娃띰스樂됵쉠 111001101001100110100100111011111011110110001111111011001000101010001000111010011000001010000000111011001000010010001011111010111001100110100011111001111010111110000000111011001000111110011001111011001000101010001000111001111010100110101001111011001000011110111101111011001000101010001001111001101001100110100100111010111011001110101001111011001000101010011000111001011010100010000011111010111001110110110000111011001000101010100100111011111010011010111111111010111001000010110101111011001000100110100000 e699a4efbd8fec8a88e98280ec848beb99a3e7af80ec8f99ec8a88e7a9a9ec87bdec8a89e699a4ebb3a9ec8a98e5a883eb9db0ec8aa4efa6bfeb90b5ec89a0
UHC 晤o슈邀섋뙣節쏙슈穩쇽슉晤볩슘娃띰스樂됵쉠 111001111111101110100011111011111011110110110100111010011010110110011000111010001000110010101000111011111011110110111101111011111011110110110100111010001011000110111100111011111011110110110101111001111111101110010011111011111011110110110111111010001101111110110110111011111011110110111010111010001111100110001001111011111011110110101010 e7fba3efbdb4e9ad98e88ca8efbdbdefbdb4e8b1bcefbdb5e7fb93efbdb7e8dfb6efbdbae8f989efbdaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)