To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????K}?????????K{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100101101111101001111110011111100111111001111110011111100111111001111110011111100111111010010110111101101011110 3f3f3f3f3f3f3f3f3f4b7d3f3f3f3f3f3f3f3f3f4b7b5e
SJIS-WIN 晤??晤??晤??K}晤??晤??晤??K{^ 1001110111101011001111110011111110011101111010110011111100111111100111011110101100111111001111110100101101111101100111011110101100111111001111111001110111101011001111110011111110011101111010110011111100111111010010110111101101011110 9deb3f3f9deb3f3f9deb3f3f4b7d9deb3f3f9deb3f3f9deb3f3f4b7b5e
EUC-JP 晤??晤??晤??K}晤??晤??晤??K{^ 1101101011101101001111110011111111011010111011010011111100111111110110101110110100111111001111110100101101111101110110101110110100111111001111111101101011101101001111110011111111011010111011010011111100111111010010110111101101011110 daed3f3fdaed3f3fdaed3f3f4b7ddaed3f3fdaed3f3fdaed3f3f4b7b5e
UTF-8 晤쀥삻晤쀥뿽晤쀥삏K}晤쀥삻晤쀥뿽晤쀥삏K{^ 1110011010011001101001001110110010000000101001011110110010000010101110111110011010011001101001001110110010000000101001011110101110111111101111011110011010011001101001001110110010000000101001011110110010000010100011110100101101111101111001101001100110100100111011001000000010100101111011001000001010111011111001101001100110100100111011001000000010100101111010111011111110111101111001101001100110100100111011001000000010100101111011001000001010001111010010110111101101011110 e699a4ec80a5ec82bbe699a4ec80a5ebbfbde699a4ec80a5ec828f4b7de699a4ec80a5ec82bbe699a4ec80a5ebbfbde699a4ec80a5ec828f4b7b5e
UHC 晤쀥삻晤쀥뿽晤쀥삏K}晤쀥삻晤쀥뿽晤쀥삏K{^ 1110011111111011100101111110010110011000101100101110011111111011100101111110010110010111101111011110011111111011100101111110010110011000100101100100101101111101111001111111101110010111111001011001100010110010111001111111101110010111111001011001011110111101111001111111101110010111111001011001100010010110010010110111101101011110 e7fb97e598b2e7fb97e597bde7fb97e598964b7de7fb97e598b2e7fb97e597bde7fb97e598964b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)