To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 張θ?橈??辱??U}張θ?橈??辱??U{^ 10010010101000111000001111000110001111111001111011110100001111110011111110010000010010100011111100111111010101010111110110010010101000111000001111000110001111111001111011110100001111110011111110010000010010100011111100111111010101010111101101011110 92a383c63f9ef43f3f904a3f3f557d92a383c63f9ef43f3f904a3f3f557b5e
EUC-JP 張θ?橈??辱??U}張θ?橈??辱??U{^ 11000100101001011010011011001000001111111101110011110110001111110011111110111111101010110011111100111111010101010111110111000100101001011010011011001000001111111101110011110110001111110011111110111111101010110011111100111111010101010111101101011110 c4a5a6c83fdcf63f3fbfab3f3f557dc4a5a6c83fdcf63f3fbfab3f3f557b5e
UTF-8 張θ뻗橈롳풕辱껓쉼U}張θ뻗橈롳풕辱껓쉼U{^ 111001011011110010110101110011101011100011101011101110111001011111100110101010011000100011101011101000011011001111101101100100101001010111101000101111101011000111101010101110111001001111101100100010011011110001010101011111011110010110111100101101011100111010111000111010111011101110010111111001101010100110001000111010111010000110110011111011011001001010010101111010001011111010110001111010101011101110010011111011001000100110111100010101010111101101011110 e5bcb5ceb8ebbb97e6a988eba1b3ed9295e8beb1eabb93ec89bc557de5bcb5ceb8ebbb97e6a988eba1b3ed9295e8beb1eabb93ec89bc557b5e
UHC 張θ뻗橈롳풕辱껓쉼U}張θ뻗橈롳풕辱껓쉼U{^ 1110110111100101101001011110100010111011101110001110100011111010100011101110111110111110100110001110100110110100100000111110111110111101101100000101010101111101111011011110010110100101111010001011101110111000111010001111101010001110111011111011111010011000111010011011010010000011111011111011110110110000010101010111101101011110 ede5a5e8bbb8e8fa8eefbe98e9b483efbdb0557dede5a5e8bbb8e8fa8eefbe98e9b483efbdb0557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)