To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 韻?新?悼韻?新??????赫孑???赫?^ 100010010100001100111111100100000101011000111111100100111000100110001001010000110011111110010000010101100011111100111111001111110011111100111111001111111000101001110001100110110111001000111111001111110011111110001010011100010011111101011110 89433f90563f938989433f90563f3f3f3f3f3f8a719b723f3f3f8a713f5e
EUC-JP 韻?新?悼韻?新?????堧赫孑??堧赫?^ 10110001101001000011111110111111101101110011111111000101111010011011000110100100001111111011111110110111001111110011111100111111001111110011111110001111101110001010100010110011110100101101010111010011001111110011111110001111101110001010100010110011110100100011111101011110 b1a43fbfb73fc5e9b1a43fbfb73f3f3f3f3f8fb8a8b3d2d5d33f3f8fb8a8b3d23f5e
UTF-8 韻렮新저悼韻렮新저磊列쨴삠堧赫孑쨴삠堧赫곫^ 11101001100111111011101111101011101000001010111011100110100101101011000011101100101000001000000011100110100000101011110011101001100111111011101111101011101000001010111011100110100101101011000011101100101000001000000011101111101001011000011111101111101001101001110011101100101010001011010011101100100000101010000011100101101000001010011111101000101101011010101111100101101011011001000111101100101010001011010011101100100000101010000011100101101000001010011111101000101101011010101111101010101100111010101101011110 e99fbbeba0aee696b0eca080e682bce99fbbeba0aee696b0eca080efa587efa69ceca8b4ec82a0e5a0a7e8b5abe5ad91eca8b4ec82a0e5a0a7e8b5abeab3ab5e
UHC 韻렮新저悼韻렮新저磊列쨴삠堧赫孑쨴삠堧赫곫^ 11101010101001001000111010111011111000111110011011000000111110101101001111111010111010101010010010001110101110111110001111100110110000001111101011010010110111111110011011101010101001001000111010111011111000111110011011000000111110101101001111111010111010101010010010001110101110111110001111100110110000001111101011010011100000011110011001011110 eaa48ebbe3e6c0fad3faeaa48ebbe3e6c0fad2dfe6eaa48ebbe3e6c0fad3faeaa48ebbe3e6c0fad381e65e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
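
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example "SJIS-WIN" to cp932 and "UHC" to cp949) and the helper names to_bit_strings and convert are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("新^").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.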