To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 午??踰??????源??癲?????B 100011001101111100111111001111111110011011111010001111110011111100111111001111110011111100111111100011001011100100111111001111111110000110011111001111110011111100111111001111110011111101000010 8cdf3f3fe6fa3f3f3f3f3f3f8cb93f3fe19f3f3f3f3f3f42
EUC-JP 午??踰??????源??癲?????B 101110001110000100111111001111111110110011111100001111110011111100111111001111110011111100111111101110001011101100111111001111111110001010100001001111110011111100111111001111110011111101000010 b8e13f3fecfc3f3f3f3f3f3fb8bb3f3fe2a13f3f3f3f3f42
UTF-8 午닿퓥踰곮린栒맡곤쭖源낆젛癲용겇硫명떒B 11100101100011011000100011101011100010111011111111101101100100111010010111101000101110001011000011101010101100111010111011101011101001101011000011100110101000001001001011101011101001111010000111101010101100111010010011101100101011011001011011100110101110101001000011101011100000101000011011101100101000001001101111100111100110011011001011101100100110101010100111101010101100101000011111101111101001111000111011101011101010101000010111101011100101101001001001000010 e58d88eb8bbfed93a5e8b8b0eab3aeeba6b0e6a092eba7a1eab3a4ecad96e6ba90eb8286eca09be799b2ec9aa9eab287efa78eebaa85eb969242
UHC 午닿퓥踰곮린栒맡곤쭖源낆젛癲용겇硫명떒B 111001111110110110110100111010101011111110001110111010111011001010000001111010001011100010110000111000101110001110111000110000111011000011101111101001111000111011101010101110011000010111101100101000001001011111101111101001101011111111101011100000011010010011101011101010011011100011101101100010111010100001000010 e7edb4eabf8eebb281e8b8b0e2e3b8c3b0efa78eeab985eca097efa6bfeb81a4eba9b8ed8ba842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)