What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????n}v???????n}vB 001111110011111100111111001111110011111100111111001111110110111001111101011101100011111100111111001111110011111100111111001111110011111101101110011111010111011001000010 3f3f3f3f3f3f3f6e7d763f3f3f3f3f3f3f6e7d7642
SJIS-WIN 鮏竺霖ホ闔ク汐n}v鮏竺霖ホ闔ク汐n}vB 11111100010000111000111010110001111010001100000111001110111010001000111010111000100011101010110001101110011111010111011011111100010000111000111010110001111010001100000111001110111010001000111010111000100011101010110001101110011111010111011001000010 fc438eb1e8c1cee88eb88eac6e7d76fc438eb1e8c1cee88eb88eac6e7d7642
EUC-JP 鮏竺霖ホ闔ク汐n}v鮏竺霖ホ闔ク汐n}vB 10001111111010101101101110111100101100111111000011000011100011101100111011101111111011101000111010111000101111001010111001101110011111010111011010001111111010101101101110111100101100111111000011000011100011101100111011101111111011101000111010111000101111001010111001101110011111010111011001000010 8feadbbcb3f0c38eceefee8eb8bcae6e7d768feadbbcb3f0c38eceefee8eb8bcae6e7d7642
UTF-8 鮏竺霖ホ闔ク汐n}v鮏竺霖ホ闔ク汐n}vB 11101001101011101000111111100111101010111011101011101001100111001001011011101111101111101000111011101001100101111001010011101111101111011011100011100110101100011001000001101110011111010111011011101001101011101000111111100111101010111011101011101001100111001001011011101111101111101000111011101001100101111001010011101111101111011011100011100110101100011001000001101110011111010111011001000010 e9ae8fe7abbae99c96efbe8ee99794efbdb8e6b1906e7d76e9ae8fe7abbae99c96efbe8ee99794efbdb8e6b1906e7d7642
UHC ?竺霖?闔?汐n}v?竺霖?闔?汐n}vB 0011111111110101111001111101011111111101001111111111100111101111001111111110000010110001011011100111110101110110001111111111010111100111110101111111110100111111111110011110111100111111111000001011000101101110011111010111011001000010 3ff5e7d7fd3ff9ef3fe0b16e7d763ff5e7d7fd3ff9ef3fe0b16e7d7642

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
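
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. It assumes that the page's charset labels correspond to the Python codecs listed in `CODECS` (in particular, that SJIS-WIN matches `cp932` and UHC matches `cp949`), and that characters a charset cannot represent are replaced with "?" (0x3f), as the ISO-8859-1 row suggests. The names `CODECS`, `to_bit_strings`, and `convert` are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("汐n}v").items():
        print(label, binary, hexadecimal)
```

Results for rarely used characters may differ from the table, because codec tables with the same name are not always identical across platforms.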