To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??怨封?耕?醍??拒??怨封?耕?醍??居^ 001111110011111110001001100001011001010110010101001111111000110101101011001111111001000111100111001111110011111110001011100100010011111100111111100010011000010110010101100101010011111110001101011010110011111110010001111001110011111100111111100010111000111101011110 3f3f898595953f8d6b3f91e73f3f8b913f3f898595953f8d6b3f91e73f3f8b8f5e
EUC-JP ??怨封?耕?醍??拒??怨封?耕?醍??居^ 001111110011111110110001111001011100100111110101001111111011100111001100001111111100001011101001001111110011111110110101111100010011111100111111101100011110010111001001111101010011111110111001110011000011111111000010111010010011111100111111101101011110111101011110 3f3fb1e5c9f53fb9cc3fc2e93f3fb5f13f3fb1e5c9f53fb9cc3fc2e93f3fb5ef5e
UTF-8 欌렪怨封렮耕떵醍닿렋拒欌렪怨封렮耕떵醍닿렋居^ 11100110101011001000110011101011101000001010101011100110100000001010100011100101101100001000000111101011101000001010111011101000100000001001010111101011100101101011010111101001100001101000110111101011100010111011111111101011101000001000101111100110100010111001001011100110101011001000110011101011101000001010101011100110100000001010100011100101101100001000000111101011101000001010111011101000100000001001010111101011100101101011010111101001100001101000110111101011100010111011111111101011101000001000101111100101101100011000010101011110 e6ac8ceba0aae680a8e5b081eba0aee88095eb96b5e9868deb8bbfeba08be68b92e6ac8ceba0aae680a8e5b081eba0aee88095eb96b5e9868deb8bbfeba08be5b1855e
UHC 欌렪怨封렮耕떵醍닿렋拒欌렪怨封렮耕떵醍닿렋居^ 111011011110101110001110101110001110101010110011110111001110011010001110101110111100110011101001101101101011101011110000101101011011010011101010100011101010001011001011110111101110110111101011100011101011100011101010101100111101110011100110100011101011101111001100111010011011011010111010111100001011010110110100111010101000111010100010110010111101110001011110 edeb8eb8eab3dce68ebbcce9b6baf0b5b4ea8ea2cbdeedeb8eb8eab3dce68ebbcce9b6baf0b5b4ea8ea2cbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)