To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???臾??揄???????Ъ??????B 00111111001111110011111111100100011010110011111100111111100111011000100100111111001111110011111100111111001111110011111100111111100001000101101100111111001111110011111100111111001111110011111101000010 3f3f3fe46b3f3f9d893f3f3f3f3f3f3f845b3f3f3f3f3f3f42
EUC-JP ???臾??揄?????洧?Ъ???洧??B 0011111100111111001111111110011111001100001111110011111111011001111010010011111100111111001111110011111100111111100011111100011110110100001111111010011110111100001111110011111100111111100011111100011110110100001111110011111101000010 3f3f3fe7cc3f3fd9e93f3f3f3f3f8fc7b43fa7bc3f3f3f8fc7b43f3f42
UTF-8 列룸쓷臾쇘뙴揄쒕쓡梨덄뙴洧붾Ъ列룸뱶洧귥콌B 111011111010011010011100111010111010001110111000111011001001001110110111111010001000011110111110111011001000011110011000111010111001100110110100111001101000111110000100111011001001001010010101111011001001001110100001111011111010011110100010111010111000110110000100111010111001100110110100111001101011010010100111111010111011011010111110110100001010101011101111101001101001110011101011101000111011100011101011101100011011011011100110101101001010011111101010101101111010010111101100101111011000110001000010 efa69ceba3b8ec93b7e887beec8798eb99b4e68f84ec9295ec93a1efa7a2eb8d84eb99b4e6b4a7ebb6bed0aaefa69ceba3b8ebb1b6e6b4a7eab7a5ecbd8c42
UHC 列룸쓷臾쇘뙴揄쒕쓡梨덄뙴洧붾Ъ列룸뱶洧귥콌B 11100110111010101011011111101011100111011001010011101011101011001011110011100111100011001011011111101010111100011001110011101011100111011000001011101100101100011000100011100111100011001011011111101010111110111001010011101011101011001011110011100110111010101011011111101011100100111001110011101010111110111000001011101100101100011000100001000010 e6eab7eb9d94ebacbce78cb7eaf19ceb9d82ecb188e78cb7eafb94ebacbce6eab7eb939ceafb82ecb18842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)