To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN ???澳?????}v???澳?????}vB 00111111001111110011111111100000010100110011111100111111001111110011111100111111011111010111011000111111001111110011111111100000010100110011111100111111001111110011111100111111011111010111011001000010 3f3f3fe0533f3f3f3f3f7d763f3f3fe0533f3f3f3f3f7d7642
EUC-JP 縕??澳?????}v縕??澳?????}vB 1000111111010100110000100011111100111111110111111011010000111111001111110011111100111111001111110111110101110110100011111101010011000010001111110011111111011111101101000011111100111111001111110011111100111111011111010111011001000010 8fd4c23f3fdfb43f3f3f3f3f7d768fd4c23f3fdfb43f3f3f3f3f7d7642
UTF-8 縕됵슴澳묉닅娛곤쉴}v縕됵슴澳묉닅娛곤쉴}vB 1110011110111000100101011110101110010000101101011110110010001010101101001110011010111110101100111110101110101100100010011110101110001011100001011110010110101000100110111110101010110011101001001110110010001001101101000111110101110110111001111011100010010101111010111001000010110101111011001000101010110100111001101011111010110011111010111010110010001001111010111000101110000101111001011010100010011011111010101011001110100100111011001000100110110100011111010111011001000010 e7b895eb90b5ec8ab4e6beb3ebac89eb8b85e5a89beab3a4ec89b47d76e7b895eb90b5ec8ab4e6beb3ebac89eb8b85e5a89beab3a4ec89b47d7642
UHC 縕됵슴澳묉닅娛곤쉴}v縕됵슴澳묉닅娛곤쉴}vB 1110100010110010100010011110111110111101101111111110011111111110100100011110011010001000100011101110011111110100101100001110111110111101101011110111110101110110111010001011001010001001111011111011110110111111111001111111111010010001111001101000100010001110111001111111010010110000111011111011110110101111011111010111011001000010 e8b289efbdbfe7fe91e6888ee7f4b0efbdaf7d76e8b289efbdbfe7fe91e6888ee7f4b0efbdaf7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)