Which bit string is a character (or a short string of characters) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Characters | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | éâÒç|· | 1110100110000110111000101101001011100111011111001001011010110111 | e986e2d2e77c96b7 |
SJIS-WIN | ?????|?? | 0011111100111111001111110011111100111111011111000011111100111111 | 3f3f3f3f3f7c3f3f |
EUC-JP | é?âÒç|?? | 10001111101010111011000100111111100011111010101110100100100011111010101011010010100011111010101110101110011111000011111100111111 | 8fabb13f8faba48faad28fabae7c3f3f |
UTF-8 | éâÒç|· | 110000111010100111000010100001101100001110100010110000111001001011000011101001110111110011000010100101101100001010110111 | c3a9c286c3a2c392c3a77cc296c2b7 |
UHC | ?????|?· | 001111110011111100111111001111110011111101111100001111111010000110100100 | 3f3f3f3f3f7c3fa1a4 |
SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
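
The conversion shown in the table can be sketched in Python as follows. This is a minimal illustration, not the converter's actual implementation: the codec names in `CODECS` are Python's nearest equivalents to the charsets listed above (an assumption), and the helper `to_bit_strings` is a hypothetical name. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table, though the exact result for unmappable characters may differ between implementations.

```python
# Python codec names assumed to correspond to the charsets in the table.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one table-like row per charset for a short sample string.
    for name, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("é|", codec)
        print(f"{name} | {binary} | {hexadecimal}")
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.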