To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????º????????º@B 0011111100111111001111110011111100111111001111110011111100111111101110100011111100111111001111110011111100111111001111110011111100111111101110100100000001000010 3f3f3f3f3f3f3f3fba3f3f3f3f3f3f3f3fba4042
SJIS-WIN 厓??宥??揄??厓??愉?????@B 11111010100011010011111100111111100101110100011100111111001111111001110110001001001111110011111111111010100011010011111100111111100101101111100100111111001111110011111100111111001111110100000001000010 fa8d3f3f97473f3f9d893f3ffa8d3f3f96f93f3f3f3f3f4042
EUC-JP 厓??宥??揄?º厓??愉??洧?º@B 100011111011010011000111001111110011111111001101101010000011111100111111110110011110100100111111100011111010001011101011100011111011010011000111001111110011111111001100111110110011111100111111100011111100011110110100001111111000111110100010111010110100000001000010 8fb4c73f3fcda83f3fd9e93f8fa2eb8fb4c73f3fccfb3f3f8fc7b43f8fa2eb4042
UTF-8 厓쀢뼰宥썹뙴揄몄º厓쀢뼰愉꾤뙴洧곗º@B 111001011000111010010011111011001000000010100010111010111011110010110000111001011010111010100101111011001000110110111001111010111001100110110100111001101000111110000100111010111010101010000100110000101011101011100101100011101001001111101100100000001010001011101011101111001011000011100110100001001000100111101010101111101010010011101011100110011011010011100110101101001010011111101010101100111001011111000010101110100100000001000010 e58e93ec80a2ebbcb0e5aea5ec8db9eb99b4e68f84ebaa84c2bae58e93ec80a2ebbcb0e68489eabea4eb99b4e6b4a7eab397c2ba4042
UHC 厓쀢뼰宥썹뙴揄몄º厓쀢뼰愉꾤뙴洧곗º@B 1110010011101101100101111110001010010110101100111110101011101001101111011110011110001100101101111110101011110001101110001110110010101000101011001110010011101101100101111110001010010110101100111110101011110000100001001110011110001100101101111110101011111011101100001110110010101000101011000100000001000010 e4ed97e296b3eae9bde78cb7eaf1b8eca8ace4ed97e296b3eaf084e78cb7eafbb0eca8ac4042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)