To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??貫竊??猿??藥〓?竊??諛??? 00111111001111110011111110001011100000110011111100111111100010101101000111100010100001100011111100111111100010011000111000111111001111111110010101011010100000011010110000111111111000101000011000111111001111111110011010000111001111110011111100111111 3f3f3f8b833f3f8ad1e2863f3f898e3f3fe55a81ac3fe2863f3fe6873f3f3f
EUC-JP ???泣??貫竊??猿??藥〓?竊??諛??? 00111111001111110011111110110101111000110011111100111111101101001101001111100011111001100011111100111111101100011110111000111111001111111110100110111011101000101010111000111111111000111110011000111111001111111110101111100111001111110011111100111111 3f3f3fb5e33f3fb4d3e3e63f3fb1ee3f3fe9bba2ae3fe3e63f3febe73f3f3f
UTF-8 列룸똾泣껅걗貫竊뺠뙲猿곴맘藥〓낄竊뗥넇諛ㅺ텓列 111011111010011010011100111010111010001110111000111010111001100010111110111001101011001110100011111010101011101110000101111010101011000110010111111010001011001010101011111001111010101110001010111010111011101010100000111010111001100110110010111001111000110010111111111010101011001110110100111010111010011110011000111010001001011110100101111000111000000010010011111010111000001010000100111001111010101110001010111010111001011110100101111010111000010010000111111010001010101110011011111000111000010110111010111011011000010110010011111011111010011010011100 efa69ceba3b8eb98bee6b3a3eabb85eab197e8b2abe7ab8aebbaa0eb99b2e78cbfeab3b4eba798e897a5e38093eb8284e7ab8aeb97a5eb8487e8ab9be385baed8593efa69c
UHC 列룸똾泣껅걗貫竊뺠뙲猿곴맘藥〓낄竊뗥넇諛ㅺ텓列 11100110111010101011011111101011100011001000010011101011111010001000001111100110100000011000001011001110101110111110111110111100100101011110100010001100101101011110101010111011100000011110101010111000101111101110010110110111101000011110101110110011101001011110111110111100100010111110010110000110100101111110101110110000101001001110101010110110100011011110011011101010 e6eab7eb8c84ebe883e68182cebbefbc95e88cb5eabb81eab8bee5b7a1ebb3a5efbc8be58697ebb0a4eab68de6ea

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table.
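
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation. The helper name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions made for illustration. Unencodable characters are replaced with "?" (0x3f), which appears to match the table's behavior.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Hypothetical helper: encode text in each charset and return
    {label: (binary string, hexadecimal string)}."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in to_bit_strings("あ").items():
        print(label, binary, hexadecimal)
```

For the single character "あ", this sketch yields `82a0` under SJIS-WIN, `a4a2` under EUC-JP, `e38182` under UTF-8, and `3f` under ISO-8859-1, since Latin-1 has no hiragana.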