To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 症?藏?郵?錠???椅?碇?沮???止 100011111100011100111111111001010101010100111111100101110101100000111111100011111111100100111111001111110011111110001000110101100011111110010010111101000011111110011111100111000011111100111111001111111000111001111110 8fc73fe5553f97583f8ff93f3f3f88d63f92f43f9f9c3f3f3f8e7e
EUC-JP 症?藏?郵?錠???椅?碇?沮???止 101111101100100100111111111010011011011000111111110011011011100100111111101111101111101100111111001111110011111110110000110110000011111111000100111101100011111111011101111111000011111100111111001111111011101111011111 bec93fe9b63fcdb93fbefb3f3f3fb0d83fc4f63fddfc3f3f3fbbdf
UTF-8 症렜藏렜郵렎錠쾰렩렫椅렟碇렢沮렖렺렍止 111001111001011110000111111010111010000010011100111010001001011110001111111010111010000010011100111010011000001110110101111010111010000010001110111010011000110010100000111011001011111010110000111010111010000010101001111010111010000010101011111001101010010010000101111010111010000010011111111001111010001010000111111010111010000010100010111001101011001010101110111010111010000010010110111010111010000010111010111010111010000010001101111001101010110110100010 e79787eba09ce8978feba09ce983b5eba08ee98ca0ecbeb0eba0a9eba0abe6a485eba09fe7a287eba0a2e6b2aeeba096eba0baeba08de6ada2
UHC 症렜藏렜郵렎錠쾰렩렫椅렟碇렢沮렖렺렍止 1111000111111000100011101010111011101101111110101000111010101110111010011110100010001110101001001110111111111100110001001110101110001110101101111000111010111001111010111111010110001110101100001110111111101101100011101011001111101110110000011000111010101011100011101100001010001110101000111111001010101101 f1f88eaeedfa8eaee9e88ea4effcc4eb8eb78eb9ebf58eb0efed8eb3eec18eab8ec28ea3f2ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)