Which bit string does a character (or a short string) encode to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | V§OJO | 10010010010101101001000110100111100100100100111110010010010010101001001001001111 | 925691a7924f924a924f |
SJIS-WIN | ?V?§?O?J?O | 0011111101010110001111111000000110011000001111110100111100111111010010100011111101001111 | 3f563f81983f4f3f4a3f4f |
EUC-JP | ?V?§?O?J?O | 0011111101010110001111111010000111111000001111110100111100111111010010100011111101001111 | 3f563fa1f83f4f3f4a3f4f |
UTF-8 | V§OJO | 11000010100100100101011011000010100100011100001010100111110000101001001001001111110000101001001001001010110000101001001001001111 | c29256c291c2a7c2924fc2924ac2924f |
UHC | ?V?§?O?J?O | 0011111101010110001111111010000111010111001111110100111100111111010010100011111101001111 | 3f563fa1d73f4f3f4a3f4f |
SJIS-WIN, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
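
The conversion shown in the table can be approximated in a few lines of Python. This is a minimal sketch, not this page's own implementation: the function name `encode_bits` and the `CHARSETS` mapping are hypothetical, and the Python codec names `cp932` (for SJIS-WIN) and `cp949` (for UHC) are assumed to be the closest equivalents. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which appears to be what the `3f` bytes in the table reflect.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the matching Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is the matching Python codec
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "V\u00a7OJO"  # the visible characters from the table
    for label, codec in CHARSETS.items():
        bits, hexa = encode_bits(sample, codec)
        print(f"{label} | {bits} | {hexa}")
```

Using `errors="replace"` keeps the output the same length per input character set regardless of coverage, at the cost of losing the original character; `errors="strict"` would raise `UnicodeEncodeError` instead.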