To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 å‡ääfå‡ää^}Yå‡ääfå‡ää^}bE 11100101100001111110010011100100011001101110010110000111111001001110010001011110011111010101100111100101100001111110010011100100011001101110010110000111111001001110010001011110011111010110001001000101 e587e4e466e587e4e45e7d59e587e4e466e587e4e45e7d6245
SJIS-WIN ????f????^}Y????f????^}bE 00111111001111110011111100111111011001100011111100111111001111110011111101011110011111010101100100111111001111110011111100111111011001100011111100111111001111110011111101011110011111010110001001000101 3f3f3f3f663f3f3f3f5e7d593f3f3f3f663f3f3f3f5e7d6245
EUC-JP å?ääfå?ää^}Yå?ääfå?ää^}bE 10001111101010111010100100111111100011111010101110100011100011111010101110100011011001101000111110101011101010010011111110001111101010111010001110001111101010111010001101011110011111010101100110001111101010111010100100111111100011111010101110100011100011111010101110100011011001101000111110101011101010010011111110001111101010111010001110001111101010111010001101011110011111010110001001000101 8faba93f8faba38faba3668faba93f8faba38faba35e7d598faba93f8faba38faba3668faba93f8faba38faba35e7d6245
UTF-8 å‡ääfå‡ää^}Yå‡ääfå‡ää^}bE 1100001110100101110000101000011111000011101001001100001110100100011001101100001110100101110000101000011111000011101001001100001110100100010111100111110101011001110000111010010111000010100001111100001110100100110000111010010001100110110000111010010111000010100001111100001110100100110000111010010001011110011111010110001001000101 c3a5c287c3a4c3a466c3a5c287c3a4c3a45e7d59c3a5c287c3a4c3a466c3a5c287c3a4c3a45e7d6245
UHC ????f????^}Y????f????^}bE 00111111001111110011111100111111011001100011111100111111001111110011111101011110011111010101100100111111001111110011111100111111011001100011111100111111001111110011111101011110011111010110001001000101 3f3f3f3f663f3f3f3f5e7d593f3f3f3f663f3f3f3f5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)