To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???溢??揄?????溢??揄??鼇??竊 00111111001111110011111110001000111011000011111100111111100111011000100100111111001111110011111100111111001111111000100011101100001111110011111110011101100010010011111100111111111010101000011100111111001111111110001010000110 3f3f3f88ec3f3f9d893f3f3f3f3f88ec3f3f9d893f3fea873f3fe286
EUC-JP ???溢??揄?????溢??揄??鼇??竊 00111111001111110011111110110000111011100011111100111111110110011110100100111111001111110011111100111111001111111011000011101110001111110011111111011001111010010011111100111111111100111110011100111111001111111110001111100110 3f3f3fb0ee3f3fd9e93f3f3f3f3fb0ee3f3fd9e93f3ff3e73f3fe3e6
UTF-8 列룸벝溢당뙴揄덇탿列룸벝溢당뙴揄먭쉐鼇앹궠竊 111011111010011010011100111010111010001110111000111010111011001010011101111001101011101010100010111010111000101110111001111010111001100110110100111001101000111110000100111010111000110110000111111011011000001110111111111011111010011010011100111010111010001110111000111010111011001010011101111001101011101010100010111010111000101110111001111010111001100110110100111001101000111110000100111010111010100010101101111011001000100110010000111010011011110010000111111011001001010110111001111010101011011010100000111001111010101110001010 efa69ceba3b8ebb29de6baa2eb8bb9eb99b4e68f84eb8d87ed83bfefa69ceba3b8ebb29de6baa2eb8bb9eb99b4e68f84eba8adec8990e9bc87ec95b9eab6a0e7ab8a
UHC 列룸벝溢당뙴揄덇탿列룸벝溢당뙴揄먭쉐鼇앹궠竊 1110011011101010101101111110101110010011101110001110110011101110101101001110011110001100101101111110101011110001100010001110101010110101100110111110011011101010101101111110101110010011101110001110110011101110101101001110011110001100101101111110101011110001100100001110101010111101101001101110100010101000100111011110110010000010101100111110111110111100 e6eab7eb93b8eceeb4e78cb7eaf188eab59be6eab7eb93b8eceeb4e78cb7eaf190eabda6e8a89dec82b3efbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)