Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 躁?ε闇???鬱???躁?ε闇???鬱???^ 11100111010011100011111110000011110000111000100011000101001111110011111100111111100111110101010000111111001111110011111111100111010011100011111110000011110000111000100011000101001111110011111100111111100111110101010000111111001111110011111101011110 e74e3f83c388c53f3f3f9f543f3f3fe74e3f83c388c53f3f3f9f543f3f3f5e
EUC-JP 躁?ε闇???鬱???躁?ε闇???鬱???^ 11101101101011110011111110100110110001011011000011000111001111110011111100111111110111011011010100111111001111110011111111101101101011110011111110100110110001011011000011000111001111110011111100111111110111011011010100111111001111110011111101011110 edaf3fa6c5b0c73f3f3fddb53f3f3fedaf3fa6c5b0c73f3f3fddb53f3f3f5e
UTF-8 躁ㅸε闇쇘렏렍鬱ㅸ렢렕躁ㅸε闇쇘렏렍鬱ㅸ렢렕^ 1110100010111010100000011110001110000101101110001100111010110101111010011001011110000111111011001000011110011000111010111010000010001111111010111010000010001101111010011010110010110001111000111000010110111000111010111010000010100010111010111010000010010101111010001011101010000001111000111000010110111000110011101011010111101001100101111000011111101100100001111001100011101011101000001000111111101011101000001000110111101001101011001011000111100011100001011011100011101011101000001010001011101011101000001001010101011110 e8ba81e385b8ceb5e99787ec8798eba08feba08de9acb1e385b8eba0a2eba095e8ba81e385b8ceb5e99787ec8798eba08feba08de9acb1e385b8eba0a2eba0955e
UHC 躁ㅸε闇쇘렏렍鬱ㅸ렢렕躁ㅸε闇쇘렏렍鬱ㅸ렢렕^ 111100001110001010100100111010001010010111100101111001001110000110111100111001111000111010100101100011101010001111101010101001101010010011101000100011101011001110001110101010101111000011100010101001001110100010100101111001011110010011100001101111001110011110001110101001011000111010100011111010101010011010100100111010001000111010110011100011101010101001011110 f0e2a4e8a5e5e4e1bce78ea58ea3eaa6a4e88eb38eaaf0e2a4e8a5e5e4e1bce78ea58ea3eaa6a4e88eb38eaa5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
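
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the helper name `to_bits` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These codec choices are assumptions; the original tool may use different tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Unencodable characters become '?' (byte 0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "鬱^"  # illustrative input, not taken from the page's form
    for label, codec in CHARSETS.items():
        binary, hexa = to_bits(sample, codec)
        print(label, sample, binary, hexa)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.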