To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 躁?ε闇???鬱???躁?ε闇???鬱???^ 11100111010011100011111110000011110000111000100011000101001111110011111100111111100111110101010000111111001111110011111111100111010011100011111110000011110000111000100011000101001111110011111100111111100111110101010000111111001111110011111101011110 e74e3f83c388c53f3f3f9f543f3f3fe74e3f83c388c53f3f3f9f543f3f3f5e
EUC-JP 躁?ε闇???鬱???躁?ε闇???鬱???^ 11101101101011110011111110100110110001011011000011000111001111110011111100111111110111011011010100111111001111110011111111101101101011110011111110100110110001011011000011000111001111110011111100111111110111011011010100111111001111110011111101011110 edaf3fa6c5b0c73f3f3fddb53f3f3fedaf3fa6c5b0c73f3f3fddb53f3f3f5e
UTF-8 躁ㅸε闇쇠렩렯鬱ㅸ렢렕躁ㅸε闇쇠렩렯鬱ㅸ렢렕^ 1110100010111010100000011110001110000101101110001100111010110101111010011001011110000111111011001000011110100000111010111010000010101001111010111010000010101111111010011010110010110001111000111000010110111000111010111010000010100010111010111010000010010101111010001011101010000001111000111000010110111000110011101011010111101001100101111000011111101100100001111010000011101011101000001010100111101011101000001010111111101001101011001011000111100011100001011011100011101011101000001010001011101011101000001001010101011110 e8ba81e385b8ceb5e99787ec87a0eba0a9eba0afe9acb1e385b8eba0a2eba095e8ba81e385b8ceb5e99787ec87a0eba0a9eba0afe9acb1e385b8eba0a2eba0955e
UHC 躁ㅸε闇쇠렩렯鬱ㅸ렢렕躁ㅸε闇쇠렩렯鬱ㅸ렢렕^ 111100001110001010100100111010001010010111100101111001001110000110111100111010001000111010110111100011101011110011101010101001101010010011101000100011101011001110001110101010101111000011100010101001001110100010100101111001011110010011100001101111001110100010001110101101111000111010111100111010101010011010100100111010001000111010110011100011101010101001011110 f0e2a4e8a5e5e4e1bce88eb78ebceaa6a4e88eb38eaaf0e2a4e8a5e5e4e1bce88eb78ebceaa6a4e88eb38eaa5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)