To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 垓耕???奚?蟹??羈垓耕???奚?蟹??羈^ 100110101011010010001101011010110011111100111111001111111001101011110110001111111000101001001001001111110011111111100011101100011001101010110100100011010110101100111111001111110011111110011010111101100011111110001010010010010011111100111111111000111011000101011110 9ab48d6b3f3f3f9af63f8a493f3fe3b19ab48d6b3f3f3f9af63f8a493f3fe3b15e
EUC-JP 垓耕?瀣?奚?蟹?瀣羈垓耕?瀣?奚?蟹?瀣羈^ 1101010010110110101110011100110000111111100011111100100110110001001111111101010011111000001111111011001110101010001111111000111111001001101100011110011010110011110101001011011010111001110011000011111110001111110010011011000100111111110101001111100000111111101100111010101000111111100011111100100110110001111001101011001101011110 d4b6b9cc3f8fc9b13fd4f83fb3aa3f8fc9b1e6b3d4b6b9cc3f8fc9b13fd4f83fb3aa3f8fc9b1e6b35e
UTF-8 垓耕볶瀣렞奚렊蟹렋瀣羈垓耕볶瀣렞奚렊蟹렋瀣羈^ 11100101100111101001001111101000100000001001010111101011101100111011011011100111100000001010001111101011101000001001111011100101101001011001101011101011101000001000101011101000100111111011100111101011101000001000101111100111100000001010001111100111101111101000100011100101100111101001001111101000100000001001010111101011101100111011011011100111100000001010001111101011101000001001111011100101101001011001101011101011101000001000101011101000100111111011100111101011101000001000101111100111100000001010001111100111101111101000100001011110 e59e93e88095ebb3b6e780a3eba09ee5a59aeba08ae89fb9eba08be780a3e7be88e59e93e88095ebb3b6e780a3eba09ee5a59aeba08ae89fb9eba08be780a3e7be885e
UHC 垓耕볶瀣렞奚렊蟹렋瀣羈垓耕볶瀣렞奚렊蟹렋瀣羈^ 111110101010011111001100111010011011101010111010111110101010111010001110101011111111101010101000100011101010000111111010101011111000111010100010111110101010111011010001101111001111101010100111110011001110100110111010101110101111101010101110100011101010111111111010101010001000111010100001111110101010111110001110101000101111101010101110110100011011110001011110 faa7cce9babafaae8eaffaa88ea1faaf8ea2faaed1bcfaa7cce9babafaae8eaffaa88ea1faaf8ea2faaed1bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)