To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????jon}????????jon{^ 00111111001111110011111100111111001111110011111100111111001111110110101001101111011011100111110100111111001111110011111100111111001111110011111100111111001111110110101001101111011011100111101101011110 3f3f3f3f3f3f3f3f6a6f6e7d3f3f3f3f3f3f3f3f6a6f6e7b5e
SJIS-WIN 錚貊???絅??jon}錚貊???絅??jon{^ 11101000010000101110011010111011001111110011111100111111111000110100010000111111001111110110101001101111011011100111110111101000010000101110011010111011001111110011111100111111111000110100010000111111001111110110101001101111011011100111101101011110 e842e6bb3f3f3fe3443f3f6a6f6e7de842e6bb3f3f3fe3443f3f6a6f6e7b5e
EUC-JP 錚貊???絅??jon}錚貊???絅??jon{^ 11101111101000111110110010111101001111110011111100111111111001011010010100111111001111110110101001101111011011100111110111101111101000111110110010111101001111110011111100111111111001011010010100111111001111110110101001101111011011100111101101011110 efa3ecbd3f3f3fe5a53f3f6a6f6e7defa3ecbd3f3f3fe5a53f3f6a6f6e7b5e
UTF-8 錚貊렍렖狀絅렪렟jon}錚貊렍렖狀絅렪렟jon{^ 111010011000110010011010111010001011001010001010111010111010000010001101111010111010000010010110111011111010011110111010111001111011010110000101111010111010000010101010111010111010000010011111011010100110111101101110011111011110100110001100100110101110100010110010100010101110101110100000100011011110101110100000100101101110111110100111101110101110011110110101100001011110101110100000101010101110101110100000100111110110101001101111011011100111101101011110 e98c9ae8b28aeba08deba096efa7bae7b585eba0aaeba09f6a6f6e7de98c9ae8b28aeba08deba096efa7bae7b585eba0aaeba09f6a6f6e7b5e
UHC 錚貊렍렖狀絅렪렟jon}錚貊렍렖狀絅렪렟jon{^ 1110111010110110110110001110011110001110101000111000111010101011111011011110111011001100111001111000111010111000100011101011000001101010011011110110111001111101111011101011011011011000111001111000111010100011100011101010101111101101111011101100110011100111100011101011100010001110101100000110101001101111011011100111101101011110 eeb6d8e78ea38eabedeecce78eb88eb06a6f6e7deeb6d8e78ea38eabedeecce78eb88eb06a6f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)