To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 諸?弛?τ諸?弛?τ[諸?弛?τ諸?弛?τ[^ 1000111110010100001111111001001001101111001111111000001111010001100011111001010000111111100100100110111100111111100000111101000101011011100011111001010000111111100100100110111100111111100000111101000110001111100101000011111110010010011011110011111110000011110100010101101101011110 8f943f926f3f83d18f943f926f3f83d15b8f943f926f3f83d18f943f926f3f83d15b5e
EUC-JP 諸?弛?τ諸?弛?τ[諸?弛?τ諸?弛?τ[^ 1011110111110100001111111100001111010000001111111010011011010011101111011111010000111111110000111101000000111111101001101101001101011011101111011111010000111111110000111101000000111111101001101101001110111101111101000011111111000011110100000011111110100110110100110101101101011110 bdf43fc3d03fa6d3bdf43fc3d03fa6d35bbdf43fc3d03fa6d3bdf43fc3d03fa6d35b5e
UTF-8 諸렪弛쇠τ諸렪弛쇠τ[諸렪弛쇠τ諸렪弛쇠τ[^ 1110100010101011101110001110101110100000101010101110010110111100100110111110110010000111101000001100111110000100111010001010101110111000111010111010000010101010111001011011110010011011111011001000011110100000110011111000010001011011111010001010101110111000111010111010000010101010111001011011110010011011111011001000011110100000110011111000010011101000101010111011100011101011101000001010101011100101101111001001101111101100100001111010000011001111100001000101101101011110 e8abb8eba0aae5bc9bec87a0cf84e8abb8eba0aae5bc9bec87a0cf845be8abb8eba0aae5bc9bec87a0cf84e8abb8eba0aae5bc9bec87a0cf845b5e
UHC 諸렪弛쇠τ諸렪弛쇠τ[諸렪弛쇠τ諸렪弛쇠τ[^ 11110000101100111000111010111000111011001010110010111100111010001010010111110011111100001011001110001110101110001110110010101100101111001110100010100101111100110101101111110000101100111000111010111000111011001010110010111100111010001010010111110011111100001011001110001110101110001110110010101100101111001110100010100101111100110101101101011110 f0b38eb8ecacbce8a5f3f0b38eb8ecacbce8a5f35bf0b38eb8ecacbce8a5f3f0b38eb8ecacbce8a5f35b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)