To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 耀??泣ο┸柔λ?耀??泣ο┸柔??^ 100101110111001100111111001111111000101110000011100000111100110110000100101111011000111101011111100000111100100100111111100101110111001100111111001111111000101110000011100000111100110110000100101111011000111101011111001111110011111101011110 97733f3f8b8383cd84bd8f5f83c93f97733f3f8b8383cd84bd8f5f3f3f5e
EUC-JP 耀??泣ο┸柔λ?耀??泣ο┸柔??^ 110011011101010000111111001111111011010111100011101001101100111110101000101111111011110111000000101001101100101100111111110011011101010000111111001111111011010111100011101001101100111110101000101111111011110111000000001111110011111101011110 cdd43f3fb5e3a6cfa8bfbdc0a6cb3fcdd43f3fb5e3a6cfa8bfbdc03f3f5e
UTF-8 耀믩낑泣ο┸柔λ젡耀믩낑泣ο┸柔⑸섞^ 11101000100000001000000011101011101011111010100111101011100000101001000111100110101100111010001111001110101111111110001010010100101110001110011010011111100101001100111010111011111011001010000010100001111010001000000010000000111010111010111110101001111010111000001010010001111001101011001110100011110011101011111111100010100101001011100011100110100111111001010011100010100100011011100011101100100001001001111001011110 e88080ebafa9eb8291e6b3a3cebfe294b8e69f94cebbeca0a1e88080ebafa9eb8291e6b3a3cebfe294b8e69f94e291b8ec849e5e
UHC 耀믩낑泣ο┸柔λ젡耀믩낑泣ο┸柔⑸섞^ 11101001101001011001001011101011101100111010100111101011111010001010010111101111101001101011111111101010111101011010010111101011101000001001101011101001101001011001001011101011101100111010100111101011111010001010010111101111101001101011111111101010111101011010100111101011101111001010111101011110 e9a592ebb3a9ebe8a5efa6bfeaf5a5eba09ae9a592ebb3a9ebe8a5efa6bfeaf5a9ebbcaf5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)