To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????cn}???????cn{^ 001111110011111100111111001111110011111100111111001111110110001101101110011111010011111100111111001111110011111100111111001111110011111101100011011011100111101101011110 3f3f3f3f3f3f3f636e7d3f3f3f3f3f3f3f636e7b5e
SJIS-WIN 鮏竺霖ホ闔ク汐cn}鮏竺霖ホ闔ク汐cn{^ 11111100010000111000111010110001111010001100000111001110111010001000111010111000100011101010110001100011011011100111110111111100010000111000111010110001111010001100000111001110111010001000111010111000100011101010110001100011011011100111101101011110 fc438eb1e8c1cee88eb88eac636e7dfc438eb1e8c1cee88eb88eac636e7b5e
EUC-JP 鮏竺霖ホ闔ク汐cn}鮏竺霖ホ闔ク汐cn{^ 10001111111010101101101110111100101100111111000011000011100011101100111011101111111011101000111010111000101111001010111001100011011011100111110110001111111010101101101110111100101100111111000011000011100011101100111011101111111011101000111010111000101111001010111001100011011011100111101101011110 8feadbbcb3f0c38eceefee8eb8bcae636e7d8feadbbcb3f0c38eceefee8eb8bcae636e7b5e
UTF-8 鮏竺霖ホ闔ク汐cn}鮏竺霖ホ闔ク汐cn{^ 11101001101011101000111111100111101010111011101011101001100111001001011011101111101111101000111011101001100101111001010011101111101111011011100011100110101100011001000001100011011011100111110111101001101011101000111111100111101010111011101011101001100111001001011011101111101111101000111011101001100101111001010011101111101111011011100011100110101100011001000001100011011011100111101101011110 e9ae8fe7abbae99c96efbe8ee99794efbdb8e6b190636e7de9ae8fe7abbae99c96efbe8ee99794efbdb8e6b190636e7b5e
UHC ?竺霖?闔?汐cn}?竺霖?闔?汐cn{^ 0011111111110101111001111101011111111101001111111111100111101111001111111110000010110001011000110110111001111101001111111111010111100111110101111111110100111111111110011110111100111111111000001011000101100011011011100111101101011110 3ff5e7d7fd3ff9ef3fe0b1636e7d3ff5e7d7fd3ff9ef3fe0b1636e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)