To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????日?????獄??掖⑤????? 0011111100111111001111110011111100111111001111111001001111111010001111110011111100111111001111110011111110001101100101100011111100111111100111010111010010000111010001000011111100111111001111110011111100111111 3f3f3f3f3f3f93fa3f3f3f3f3f8d963f3f9d7487443f3f3f3f3f
EUC-JP ??????日?????獄??掖?????? 00111111001111110011111100111111001111110011111111000110111111000011111100111111001111110011111100111111101110011111011000111111001111111101100111010101001111110011111100111111001111110011111100111111 3f3f3f3f3f3fc6fc3f3f3f3f3fb9f63f3fd9d53f3f3f3f3f3f
UTF-8 嶪륁캈痢뚨텧日덈젿硫며뵦獄몃젶掖⑤㈇溜쒙쪖溜 111001011011011010101010111010111010010110000001111011001011101010001000111011111010011110100101111010111001101010101000111011011000010110100111111001101001011110100101111010111000110110001000111011001010000010111111111011111010011110001110111010111010100110110000111010111011010110100110111001111000110110000100111010111010101010000011111011001010000010110110111001101000111010010110111000101001000110100100111000111000100010000111111011111010011110001011111011001001001010011001111011001010101010010110111011111010011110001011 e5b6aaeba581ecba88efa7a5eb9aa8ed85a7e697a5eb8d88eca0bfefa78eeba9b0ebb5a6e78d84ebaa83eca0b6e68e96e291a4e38887efa78bec9299ecaa96efa78b
UHC 嶪륁캈痢뚨텧日덈젿硫며뵦獄몃젶掖⑤㈇溜쒙쪖溜 1110010111110101100011111110110010101111100101001110110010111000100011001110011110110110100111001110110011101101100010001110101110100000101100011110101110101001101110001110011110010100101001011110100010101011101110001110101110100000101010101110010011111010101010001110101110101001101110001110101011111110100111001110111110100101100100001110101011111110 e5f58fecaf94ecb88ce7b69ceced88eba0b1eba9b8e794a5e8abb8eba0aae4faa8eba9b8eafe9cefa590eafe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)