To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???U}???U{^ 0011111100111111001111110101010101111101001111110011111100111111010101010111101101011110 3f3f3f557d3f3f3f557b5e
SJIS-WIN 脱谷即U}脱谷即U{^ 1001001001000101100100100100101010010001101001100101010101111101100100100100010110010010010010101001000110100110010101010111101101011110 9245924a91a6557d9245924a91a6557b5e
EUC-JP 脱谷即U}脱谷即U{^ 1100001110100110110000111010101111000010101010000101010101111101110000111010011011000011101010111100001010101000010101010111101101011110 c3a6c3abc2a8557dc3a6c3abc2a8557b5e
UTF-8 脱谷即U}脱谷即U{^ 1110100010000100101100011110100010110000101101111110010110001101101100110101010101111101111010001000010010110001111010001011000010110111111001011000110110110011010101010111101101011110 e884b1e8b0b7e58db3557de884b1e8b0b7e58db3557b5e
UHC ?谷?U}?谷?U{^ 00111111110011011101101100111111010101010111110100111111110011011101101100111111010101010111101101011110 3fcddb3f557d3fcddb3f557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)