To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 諸?弛?修諸?弛?τ[諸?弛?修諸?弛?τ[^ 1000111110010100001111111001001001101111001111111000111101000011100011111001010000111111100100100110111100111111100000111101000101011011100011111001010000111111100100100110111100111111100011110100001110001111100101000011111110010010011011110011111110000011110100010101101101011110 8f943f926f3f8f438f943f926f3f83d15b8f943f926f3f8f438f943f926f3f83d15b5e
EUC-JP 諸?弛?修諸?弛?τ[諸?弛?修諸?弛?τ[^ 1011110111110100001111111100001111010000001111111011110110100100101111011111010000111111110000111101000000111111101001101101001101011011101111011111010000111111110000111101000000111111101111011010010010111101111101000011111111000011110100000011111110100110110100110101101101011110 bdf43fc3d03fbda4bdf43fc3d03fa6d35bbdf43fc3d03fbda4bdf43fc3d03fa6d35b5e
UTF-8 諸렪弛쇘修諸렪弛쇠τ[諸렪弛쇘修諸렪弛쇠τ[^ 11101000101010111011100011101011101000001010101011100101101111001001101111101100100001111001100011100100101111111010111011101000101010111011100011101011101000001010101011100101101111001001101111101100100001111010000011001111100001000101101111101000101010111011100011101011101000001010101011100101101111001001101111101100100001111001100011100100101111111010111011101000101010111011100011101011101000001010101011100101101111001001101111101100100001111010000011001111100001000101101101011110 e8abb8eba0aae5bc9bec8798e4bfaee8abb8eba0aae5bc9bec87a0cf845be8abb8eba0aae5bc9bec8798e4bfaee8abb8eba0aae5bc9bec87a0cf845b5e
UHC 諸렪弛쇘修諸렪弛쇠τ[諸렪弛쇘修諸렪弛쇠τ[^ 11110000101100111000111010111000111011001010110010111100111001111110000111110011111100001011001110001110101110001110110010101100101111001110100010100101111100110101101111110000101100111000111010111000111011001010110010111100111001111110000111110011111100001011001110001110101110001110110010101100101111001110100010100101111100110101101101011110 f0b38eb8ecacbce7e1f3f0b38eb8ecacbce8a5f35bf0b38eb8ecacbce7e1f3f0b38eb8ecacbce8a5f35b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)