To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???????疾??????????疾???^ 00111111001111110011111100111111001111110011111100111111100011101011111000111111001111110011111100111111001111110011111100111111001111110011111100111111100011101011111000111111001111110011111101011110 3f3f3f3f3f3f3f8ebe3f3f3f3f3f3f3f3f3f3f8ebe3f3f3f5e
EUC-JP ???????疾??????????疾???^ 00111111001111110011111100111111001111110011111100111111101111001100000000111111001111110011111100111111001111110011111100111111001111110011111100111111101111001100000000111111001111110011111101011110 3f3f3f3f3f3f3fbcc03f3f3f3f3f3f3f3f3f3fbcc03f3f3f5e
UTF-8 렯렎렯롈센셈셈疾섣센섞렯렎렯롈센셈셈疾섣센샷^ 11101011101000001010111111101011101000001000111011101011101000001010111111101011101000011000100011101100100001001011110011101100100001011000100011101100100001011000100011100111100101101011111011101100100001001010001111101100100001001011110011101100100001001001111011101011101000001010111111101011101000001000111011101011101000001010111111101011101000011000100011101100100001001011110011101100100001011000100011101100100001011000100011100111100101101011111011101100100001001010001111101100100001001011110011101100100000111011011101011110 eba0afeba08eeba0afeba188ec84bcec8588ec8588e796beec84a3ec84bcec849eeba0afeba08eeba0afeba188ec84bcec8588ec8588e796beec84a3ec84bcec83b75e
UHC 렯렎렯롈센셈셈疾섣센섞렯렎렯롈센셈셈疾섣센샷^ 100011101011110010001110101001001000111010111100100011101100111010111100101111101011110011000000101111001100000011110010111100001011110010110010101111001011111010111100101011111000111010111100100011101010010010001110101111001000111011001110101111001011111010111100110000001011110011000000111100101111000010111100101100101011110010111110101111001010011001011110 8ebc8ea48ebc8ecebcbebcc0bcc0f2f0bcb2bcbebcaf8ebc8ea48ebc8ecebcbebcc0bcc0f2f0bcb2bcbebca65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)