To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??怨峰舞??醍??拒??怨峰舞??醍??居^ 001111110011111110001001100001011001010111110100100101011001000100111111001111111001000111100111001111110011111110001011100100010011111100111111100010011000010110010101111101001001010110010001001111110011111110010001111001110011111100111111100010111000111101011110 3f3f898595f495913f3f91e73f3f8b913f3f898595f495913f3f91e73f3f8b8f5e
EUC-JP ??怨峰舞??醍??拒??怨峰舞??醍??居^ 001111110011111110110001111001011100101011110110110010011111000100111111001111111100001011101001001111110011111110110101111100010011111100111111101100011110010111001010111101101100100111110001001111110011111111000010111010010011111100111111101101011110111101011110 3f3fb1e5caf6c9f13f3fc2e93f3fb5f13f3fb1e5caf6c9f13f3fc2e93f3fb5ef5e
UTF-8 欌렪怨峰舞欌렪醍닿렋拒欌렪怨峰舞欌렪醍닿렋居^ 11100110101011001000110011101011101000001010101011100110100000001010100011100101101100111011000011101000100010001001111011100110101011001000110011101011101000001010101011101001100001101000110111101011100010111011111111101011101000001000101111100110100010111001001011100110101011001000110011101011101000001010101011100110100000001010100011100101101100111011000011101000100010001001111011100110101011001000110011101011101000001010101011101001100001101000110111101011100010111011111111101011101000001000101111100101101100011000010101011110 e6ac8ceba0aae680a8e5b3b0e8889ee6ac8ceba0aae9868deb8bbfeba08be68b92e6ac8ceba0aae680a8e5b3b0e8889ee6ac8ceba0aae9868deb8bbfeba08be5b1855e
UHC 欌렪怨峰舞欌렪醍닿렋拒欌렪怨峰舞欌렪醍닿렋居^ 111011011110101110001110101110001110101010110011110111001110100011011001111100011110110111101011100011101011100011110000101101011011010011101010100011101010001011001011110111101110110111101011100011101011100011101010101100111101110011101000110110011111000111101101111010111000111010111000111100001011010110110100111010101000111010100010110010111101110001011110 edeb8eb8eab3dce8d9f1edeb8eb8f0b5b4ea8ea2cbdeedeb8eb8eab3dce8d9f1edeb8eb8f0b5b4ea8ea2cbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)