To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淼・譁懦罠螟ア跏オ驟醐讌譁懦罠螟ア跏オ轣シ^ 11111011010001011010010111100110100101101001110011101101111000111010100111100101101001001011000111100110111001101011010111101001100001011000110011101101111001101010010111100110100101101001110011101101111000111010100111100101101001001011000111100110111001101011010111100111100000011011110001011110 fb45a5e6969cede3a9e5a4b1e6e6b5e9858cede6a5e6969cede3a9e5a4b1e6e6b5e781bc5e
EUC-JP 淼・譁懦罠螟ア跏オ驟醐讌譁懦罠螟ア跏オ轣シ^ 1000111111000111111001101000111010100101111010111111011011011000111011111110011010101011111010101010011010001110101100011110110011101000100011101011010111110001111001011011100011101111111011001010011111101011111101101101100011101111111001101010101111101010101001101000111010110001111011001110100010001110101101011110110111100001100011101011110001011110 8fc7e68ea5ebf6d8efe6abeaa68eb1ece88eb5f1e5b8efeca7ebf6d8efe6abeaa68eb1ece88eb5ede18ebc5e
UTF-8 淼・譁懦罠螟ア跏オ驟醐讌譁懦罠螟ア跏オ轣シ^ 11100110101101111011110011101111101111011010010111101000101011011000000111100110100001111010011011100111101111011010000011101000100111101001111111101111101111011011000111101000101101111000111111101111101111011011010111101001101010011001111111101001100001101001000011101000101011101000110011101000101011011000000111100110100001111010011011100111101111011010000011101000100111101001111111101111101111011011000111101000101101111000111111101111101111011011010111101000101111011010001111101111101111011011110001011110 e6b7bcefbda5e8ad81e687a6e7bda0e89e9fefbdb1e8b78fefbdb5e9a99fe98690e8ae8ce8ad81e687a6e7bda0e89e9fefbdb1e8b78fefbdb5e8bda3efbdbc5e
UHC ??譁懦?螟?跏?驟??譁懦?螟?跏???^ 00111111001111111111110010100110110100011101011100111111110110011010110100111111110010101011101000111111111101101010111000111111001111111111110010100110110100011101011100111111110110011010110100111111110010101011101000111111001111110011111101011110 3f3ffca6d1d73fd9ad3fcaba3ff6ae3f3ffca6d1d73fd9ad3fcaba3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)