To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????B | 001111110011111100111111001111110011111101000010 | 3f3f3f3f3f42 |
SJIS-WIN | 逾賀ウ∝響B | 11100111101001011000100111101010101100111000000111100101100010111011111101000010 | e7a589eab381e58bbf42 |
EUC-JP | 逾賀ウ∝響B | 1110111010100111101100101110110010001110101100111010001011100111101101101100000101000010 | eea7b2ec8eb3a2e7b6c142 |
UTF-8 | 逾賀ウ∝響B | 11101001100000001011111011101000101100111000000011101111101111011011001111100010100010001001110111101001100111111011111101000010 | e980bee8b380efbdb3e2889de99fbf42 |
UHC | 逾賀?∝響B | 11101011101101011111100111000101001111111010000111110000111110101100001001000010 | ebb5f9c53fa1f0fac242 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)