To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | færB | 100000010110011010000001111001100111001001000010 | 816681e67242 |
SJIS-WIN | ?f??rB | 001111110110011000111111001111110111001001000010 | 3f663f3f7242 |
EUC-JP | ?f?ærB | 0011111101100110001111111000111110101001110000010111001001000010 | 3f663f8fa9c17242 |
UTF-8 | færB | 110000101000000101100110110000101000000111000011101001100111001001000010 | c28166c281c3a67242 |
UHC | ?f?ærB | 00111111011001100011111110101001101000010111001001000010 | 3f663fa9a17242 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)