What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???溢??揄?????溢??揄??勇??B 001111110011111100111111100010001110110000111111001111111001110110001001001111110011111100111111001111110011111110001000111011000011111100111111100111011000100100111111001111111001011101000101001111110011111101000010 3f3f3f88ec3f3f9d893f3f3f3f3f88ec3f3f9d893f3f97453f3f42
EUC-JP ???溢??揄?????溢??揄??勇??B 001111110011111100111111101100001110111000111111001111111101100111101001001111110011111100111111001111110011111110110000111011100011111100111111110110011110100100111111001111111100110110100110001111110011111101000010 3f3f3fb0ee3f3fd9e93f3f3f3f3fb0ee3f3fd9e93f3fcda63f3f42
UTF-8 列룸벝溢당뙴揄덇탿列룸벝溢당뙴揄먭쉐勇싳콇B 11101111101001101001110011101011101000111011100011101011101100101001110111100110101110101010001011101011100010111011100111101011100110011011010011100110100011111000010011101011100011011000011111101101100000111011111111101111101001101001110011101011101000111011100011101011101100101001110111100110101110101010001011101011100010111011100111101011100110011011010011100110100011111000010011101011101010001010110111101100100010011001000011100101100010111000011111101100100010111011001111101100101111011000011101000010 efa69ceba3b8ebb29de6baa2eb8bb9eb99b4e68f84eb8d87ed83bfefa69ceba3b8ebb29de6baa2eb8bb9eb99b4e68f84eba8adec8990e58b87ec8bb3ecbd8742
UHC 列룸벝溢당뙴揄덇탿列룸벝溢당뙴揄먭쉐勇싳콇B 11100110111010101011011111101011100100111011100011101100111011101011010011100111100011001011011111101010111100011000100011101010101101011001101111100110111010101011011111101011100100111011100011101100111011101011010011100111100011001011011111101010111100011001000011101010101111011010011011101001101110001001101011101100101100011000001101000010 e6eab7eb93b8eceeb4e78cb7eaf188eab59be6eab7eb93b8eceeb4e78cb7eaf190eabda6e9b89aecb18342

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
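
The rows of the table can be approximated with a short script. This is a minimal sketch in Python, not the converter's actual implementation. The helper name `encode_row` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "溢B"  # hypothetical input: one kanji plus one ASCII letter
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, sample, bits, hexa)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.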