Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????G????R}????G????R{^ 0011111100111111001111110011111101000111001111110011111100111111001111110101001001111101001111110011111100111111001111110100011100111111001111110011111100111111010100100111101101011110 3f3f3f3f473f3f3f3f527d3f3f3f3f473f3f3f3f527b5e
SJIS-WIN 諸???G諸???R}諸???G諸???R{^ 100011111001010000111111001111110011111101000111100011111001010000111111001111110011111101010010011111011000111110010100001111110011111100111111010001111000111110010100001111110011111100111111010100100111101101011110 8f943f3f3f478f943f3f3f527d8f943f3f3f478f943f3f3f527b5e
EUC-JP 諸???G諸???R}諸???G諸???R{^ 101111011111010000111111001111110011111101000111101111011111010000111111001111110011111101010010011111011011110111110100001111110011111100111111010001111011110111110100001111110011111100111111010100100111101101011110 bdf43f3f3f47bdf43f3f3f527dbdf43f3f3f47bdf43f3f3f527b5e
UTF-8 諸골렰렗G諸골렰렗R}諸골렰렗G諸골렰렗R{^ 11101000101010111011100011101010101100111010100011101011101000001011000011101011101000001001011101000111111010001010101110111000111010101011001110101000111010111010000010110000111010111010000010010111010100100111110111101000101010111011100011101010101100111010100011101011101000001011000011101011101000001001011101000111111010001010101110111000111010101011001110101000111010111010000010110000111010111010000010010111010100100111101101011110 e8abb8eab3a8eba0b0eba09747e8abb8eab3a8eba0b0eba097527de8abb8eab3a8eba0b0eba09747e8abb8eab3a8eba0b0eba097527b5e
UHC 諸골렰렗G諸골렰렗R}諸골렰렗G諸골렰렗R{^ 111100001011001110110000111100011000111010111101100011101010110001000111111100001011001110110000111100011000111010111101100011101010110001010010011111011111000010110011101100001111000110001110101111011000111010101100010001111111000010110011101100001111000110001110101111011000111010101100010100100111101101011110 f0b3b0f18ebd8eac47f0b3b0f18ebd8eac527df0b3b0f18ebd8eac47f0b3b0f18ebd8eac527b5e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
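
The conversion shown in the table can be reproduced roughly as follows. This is only a sketch in Python, not the code behind this page: the function name `to_bit_strings` and the `CHARSETS` mapping are made up for illustration, and it assumes that Python's `cp932` and `cp949` codecs correspond to the SJIS-WIN and UHC labels used here. Characters that a charset cannot represent are replaced with "?" (byte 3f), which would explain the runs of 3f in the table.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN is treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is treated as CP949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("諸G", codec)
    print(label, binary, hexadecimal)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.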