To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????BN}?????????BN{^ 00111111001111110011111100111111001111110011111100111111001111110011111101000010010011100111110100111111001111110011111100111111001111110011111100111111001111110011111101000010010011100111101101011110 3f3f3f3f3f3f3f3f3f424e7d3f3f3f3f3f3f3f3f3f424e7b5e
SJIS-WIN ??????也??BN}??????也??BN{^ 001111110011111100111111001111110011111100111111100101101110011100111111001111110100001001001110011111010011111100111111001111110011111100111111001111111001011011100111001111110011111101000010010011100111101101011110 3f3f3f3f3f3f96e73f3f424e7d3f3f3f3f3f3f96e73f3f424e7b5e
EUC-JP ??????也??BN}??????也??BN{^ 001111110011111100111111001111110011111100111111110011001110100100111111001111110100001001001110011111010011111100111111001111110011111100111111001111111100110011101001001111110011111101000010010011100111101101011110 3f3f3f3f3f3fcce93f3f424e7d3f3f3f3f3f3fcce93f3f424e7b5e
UTF-8 曆뤿젳獵밸젙也싲젾BN}曆뤿젳獵밸젙也싲젾BN{^ 11101111101001101000101111101011101001001011111111101100101000001011001111101111101001101010011111101011101100001011100011101100101000001001100111100100101110011001111111101100100010111011001011101100101000001011111001000010010011100111110111101111101001101000101111101011101001001011111111101100101000001011001111101111101001101010011111101011101100001011100011101100101000001001100111100100101110011001111111101100100010111011001011101100101000001011111001000010010011100111101101011110 efa68beba4bfeca0b3efa6a7ebb0b8eca099e4b99fec8bb2eca0be424e7defa68beba4bfeca0b3efa6a7ebb0b8eca099e4b99fec8bb2eca0be424e7b5e
UHC 曆뤿젳獵밸젙也싲젾BN}曆뤿젳獵밸젙也싲젾BN{^ 11100110101101111000111111101011101000001010011111100111101001101011100111101011101000001001010111100101101001011001101011101011101000001011000001000010010011100111110111100110101101111000111111101011101000001010011111100111101001101011100111101011101000001001010111100101101001011001101011101011101000001011000001000010010011100111101101011110 e6b78feba0a7e7a6b9eba095e5a59aeba0b0424e7de6b78feba0a7e7a6b9eba095e5a59aeba0b0424e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)