To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????U}??????????U{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111010101010111110100111111001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 中???雨?茁???U}中???雨?茁???U{^ 10010010100001100011111100111111001111111000100101001010001111111111101110010011001111110011111100111111010101010111110110010010100001100011111100111111001111111000100101001010001111111111101110010011001111110011111100111111010101010111101101011110 92863f3f3f894a3ffb933f3f3f557d92863f3f3f894a3ffb933f3f3f557b5e
EUC-JP 中???雨?茁???U}中???雨?茁???U{^ 110000111110011000111111001111110011111110110001101010110011111110001111110101111101111000111111001111110011111101010101011111011100001111100110001111110011111100111111101100011010101100111111100011111101011111011110001111110011111100111111010101010111101101011110 c3e63f3f3fb1ab3f8fd7de3f3f3f557dc3e63f3f3fb1ab3f8fd7de3f3f3f557b5e
UTF-8 中꿱렭렩雨렍茁찔렰렚U}中꿱렭렩雨렍茁찔렰렚U{^ 1110010010111000101011011110101010111111101100011110101110100000101011011110101110100000101010011110100110011011101010001110101110100000100011011110100010001100100000011110110010110000100101001110101110100000101100001110101110100000100110100101010101111101111001001011100010101101111010101011111110110001111010111010000010101101111010111010000010101001111010011001101110101000111010111010000010001101111010001000110010000001111011001011000010010100111010111010000010110000111010111010000010011010010101010111101101011110 e4b8adeabfb1eba0adeba0a9e99ba8eba08de88c81ecb094eba0b0eba09a557de4b8adeabfb1eba0adeba0a9e99ba8eba08de88c81ecb094eba0b0eba09a557b5e
UHC 中꿱렭렩雨렍茁찔렰렚U}中꿱렭렩雨렍茁찔렰렚U{^ 111100011110100110110010111010001000111010111010100011101011011111101001111010111000111010100011111100011110100011000010111100011000111010111101100011101010110101010101011111011111000111101001101100101110100010001110101110101000111010110111111010011110101110001110101000111111000111101000110000101111000110001110101111011000111010101101010101010111101101011110 f1e9b2e88eba8eb7e9eb8ea3f1e8c2f18ebd8ead557df1e9b2e88eba8eb7e9eb8ea3f1e8c2f18ebd8ead557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)