To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????b[?????????b[^ 0011111100111111001111110011111100111111001111110011111100111111001111110110001001011011001111110011111100111111001111110011111100111111001111110011111100111111011000100101101101011110 3f3f3f3f3f3f3f3f3f625b3f3f3f3f3f3f3f3f3f625b5e
SJIS-WIN 蜈??円??汚ц?b[蜈??円??汚ц?b[^ 11100101100001010011111100111111100010010111111000111111001111111000100110011000100001001000100000111111011000100101101111100101100001010011111100111111100010010111111000111111001111111000100110011000100001001000100000111111011000100101101101011110 e5853f3f897e3f3f899884883f625be5853f3f897e3f3f899884883f625b5e
EUC-JP 蜈??円??汚ц?b[蜈??円??汚ц?b[^ 11101001111001010011111100111111101100011101111100111111001111111011000111111000101001111110100000111111011000100101101111101001111001010011111100111111101100011101111100111111001111111011000111111000101001111110100000111111011000100101101101011110 e9e53f3fb1df3f3fb1f8a7e83f625be9e53f3fb1df3f3fb1f8a7e83f625b5e
UTF-8 蜈욘누円녔옄汚ц닃b[蜈욘누円녔옄汚ц닃b[^ 111010001001110010001000111011001001101010011000111010111000100010000100111001011000011010000110111010111000010110010100111011001001100010000100111001101011000110011010110100011000011011101011100010111000001101100010010110111110100010011100100010001110110010011010100110001110101110001000100001001110010110000110100001101110101110000101100101001110110010011000100001001110011010110001100110101101000110000110111010111000101110000011011000100101101101011110 e89c88ec9a98eb8884e58686eb8594ec9884e6b19ad186eb8b83625be89c88ec9a98eb8884e58686eb8594ec9884e6b19ad186eb8b83625b5e
UHC 蜈욘누円녔옄汚ц닃b[蜈욘누円녔옄汚ц닃b[^ 1110100010100101101111111110011010110100101010011110010111110111101100111110011010011110100100001110011111111101101011001110100010001000100011000110001001011011111010001010010110111111111001101011010010101001111001011111011110110011111001101001111010010000111001111111110110101100111010001000100010001100011000100101101101011110 e8a5bfe6b4a9e5f7b3e69e90e7fdace8888c625be8a5bfe6b4a9e5f7b3e69e90e7fdace8888c625b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)