To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????▼?愿η???ぁ孃り?竊??邑κ? 001111110011111100111111001111111000000110100101001111111001110011000011100000111100010100111111001111110011111110000010100111111001101101101111100000101110100000111111111000101000011000111111001111111001011101010111100000111100100000111111 3f3f3f3f81a53f9cc383c53f3f3f829f9b6f82e83fe2863f3f975783c83f
EUC-JP ????▼?愿η?洧?ぁ孃り?竊??邑κ? 0011111100111111001111110011111110100010101001110011111111011000110001011010011011000111001111111000111111000111101101000011111110100100101000011101010111010000101001001110101000111111111000111110011000111111001111111100110110111000101001101100101000111111 3f3f3f3fa2a73fd8c5a6c73f8fc7b43fa4a1d5d0a4ea3fe3e63f3fcdb8a6ca3f
UTF-8 列룸씈履▼푻愿η뙴洧좎ぁ孃り퉭竊뽨펺邑κ뎌 11101111101001101001110011101011101000111011100011101100100101001000100011101111101001111001111111100010100101101011110011101101100100011011101111100110100001001011111111001110101101111110101110011001101101001110011010110100101001111110110010100010100011101110001110000001100000011110010110101101100000111110001110000010100010101110110110001001101011011110011110101011100010101110101110111101101010001110110110001110101110101110100110000010100100011100111010111010111010111000111010001100 efa69ceba3b8ec9488efa79fe296bced91bbe684bfceb7eb99b4e6b4a7eca28ee38181e5ad83e3828aed89ade7ab8aebbda8ed8ebae98291cebaeb8e8c
UHC 列룸씈履▼푻愿η뙴洧좎ぁ孃り퉭竊뽨펺邑κ뎌 111001101110101010110111111010111001110110100000111011001010101010100001111001011011111010000111111010101011010010100101111001111000110010110111111010101111101110100000111011001010101010100001111001011011111010101010111010101011100110000101111011111011110010010110111001001011110010001010111010111110100110100101111010101011010110101110 e6eab7eb9da0ecaaa1e5be87eab4a5e78cb7eafba0ecaaa1e5beaaeab985efbc96e4bc8aebe9a5eab5ae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)