To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 墻??帳??慂??辱?墻??帳??慂??辱?^ 10011010110101000011111100111111100100101010000000111111001111111001110011001000001111110011111110010000010010100011111110011010110101000011111100111111100100101010000000111111001111111001110011001000001111110011111110010000010010100011111101011110 9ad43f3f92a03f3f9cc83f3f904a3f9ad43f3f92a03f3f9cc83f3f904a3f5e
EUC-JP 墻??帳??慂??辱?墻??帳??慂??辱?^ 11010100110101100011111100111111110001001010001000111111001111111101100011001010001111110011111110111111101010110011111111010100110101100011111100111111110001001010001000111111001111111101100011001010001111110011111110111111101010110011111101011110 d4d63f3fc4a23f3fd8ca3f3fbfab3fd4d63f3fc4a23f3fd8ca3f3fbfab3f5e
UTF-8 墻욥눈帳쒙푽慂딉슭辱쳘墻욥눈帳쒙푽慂딉슭辱쳘^ 11100101101000101011101111101100100110101010010111101011100010001000100011100101101110001011001111101100100100101001100111101101100100011011110111100110100001011000001011101011100101001000100111101100100010101010110111101000101111101011000111101100101100111001100011100101101000101011101111101100100110101010010111101011100010001000100011100101101110001011001111101100100100101001100111101101100100011011110111100110100001011000001011101011100101001000100111101100100010101010110111101000101111101011000111101100101100111001100001011110 e5a2bbec9aa5eb8888e5b8b3ec9299ed91bde68582eb9489ec8aade8beb1ecb398e5a2bbec9aa5eb8888e5b8b3ec9299ed91bde68582eb9489ec8aade8beb1ecb3985e
UHC 墻욥눈帳쒙푽慂딉슭辱쳘墻욥눈帳쒙푽慂딉슭辱쳘^ 111011011101111110111111111010011011010010101011111011011110001110011100111011111011111010001000111010011011110110001010111011111011110110111110111010011011010010101011011110001110110111011111101111111110100110110100101010111110110111100011100111001110111110111110100010001110100110111101100010101110111110111101101111101110100110110100101010110111100001011110 eddfbfe9b4abede39cefbe88e9bd8aefbdbee9b4ab78eddfbfe9b4abede39cefbe88e9bd8aefbdbee9b4ab785e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)