To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????^???????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010111100011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e3f3f3f3f3f3f3f3f
SJIS-WIN ??????????????^???????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010111100011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e3f3f3f3f3f3f3f3f
EUC-JP ??????????????^???????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010111100011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e3f3f3f3f3f3f3f3f
UTF-8 혧쨩첫혦짧혧쨩첫횎챤혦째혦쩍^혧쨩첫혦짧혧쨩첫 11101101100110001010011111101100101010001010100111101100101100101010101111101101100110001010011011101100101001111010011111101101100110001010011111101100101010001010100111101100101100101010101111101101100110101000111011101100101100011010010011101101100110001010011011101100101001111011100011101101100110001010011011101100101010011000110101011110111011011001100010100111111011001010100010101001111011001011001010101011111011011001100010100110111011001010011110100111111011011001100010100111111011001010100010101001111011001011001010101011 ed98a7eca8a9ecb2abed98a6eca7a7ed98a7eca8a9ecb2abed9a8eecb1a4ed98a6eca7b8ed98a6eca98d5eed98a7eca8a9ecb2abed98a6eca7a7ed98a7eca8a9ecb2ab
UHC 혧쨩첫혦짧혧쨩첫횎챤혦째혦쩍^혧쨩첫혦짧혧쨩첫 110000101000111111000010101110111100001110111001110000101000111011000010101010101100001010001111110000101011101111000011101110011100001110001010110000111010111011000010100011101100001010110000110000101000111011000010101111010101111011000010100011111100001010111011110000111011100111000010100011101100001010101010110000101000111111000010101110111100001110111001 c28fc2bbc3b9c28ec2aac28fc2bbc3b9c38ac3aec28ec2b0c28ec2bd5ec28fc2bbc3b9c28ec2aac28fc2bbc3b9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)