What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????z??????????z????^ 0011111100111111001111110011111100111111001111110111101000111111001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111101011110 3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f3f7a3f3f3f3f5e
SJIS-WIN ??????z??????????z????^ 0011111100111111001111110011111100111111001111110111101000111111001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111101011110 3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f3f7a3f3f3f3f5e
EUC-JP ??????z??????????z????^ 0011111100111111001111110011111100111111001111110111101000111111001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111101011110 3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f3f7a3f3f3f3f5e
UTF-8 셔샴셔렾렽섕z셔샹셍섐셔샴셔렾렽섕z셔샹셍섐^ 111011001000010110010100111011001000001110110100111011001000010110010100111010111010000010111110111010111010000010111101111011001000010010010101011110101110110010000101100101001110110010000011101110011110110010000101100011011110110010000100100100001110110010000101100101001110110010000011101101001110110010000101100101001110101110100000101111101110101110100000101111011110110010000100100101010111101011101100100001011001010011101100100000111011100111101100100001011000110111101100100001001001000001011110 ec8594ec83b4ec8594eba0beeba0bdec84957aec8594ec83b9ec858dec8490ec8594ec83b4ec8594eba0beeba0bdec84957aec8594ec83b9ec858dec84905e
UHC 셔샴셔렾렽섕z셔샹셍섐셔샴셔렾렽섕z셔샹셍섐^ 10111100110001011011110010100100101111001100010110001110110001101000111011000101101111001010110001111010101111001100010110111100101001111011110011000100101111001010101110111100110001011011110010100100101111001100010110001110110001101000111011000101101111001010110001111010101111001100010110111100101001111011110011000100101111001010101101011110 bcc5bca4bcc58ec68ec5bcac7abcc5bca7bcc4bcabbcc5bca4bcc58ec68ec5bcac7abcc5bca7bcc4bcab5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
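
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which would be consistent with the ISO-8859-1, SJIS-WIN, and EUC-JP rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# These pairings are illustrative guesses, not taken from the converter itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = raw.decode(codec)
    # Eight bits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Build one row per charset label, in the order listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}
```

For instance, encode_row("셔", "utf-8") yields the hex string ec8594, which matches the first three bytes of the UTF-8 row, and encode_row("셔z^", "iso-8859-1") yields 3f7a5e because the Hangul syllable has no ISO-8859-1 representation.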