To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
SJIS-WIN | ロ・ミ瞻・マ濘 | 110110111010010111010000111000011101101110100101110011111110000001011110 | dba5d0e1dba5cfe05e |
EUC-JP | ロ・ミ瞻・マ濘 | 1000111011011011100011101010010110001110110100001110001011011101100011101010010110001110110011111101111110111111 | 8edb8ea58ed0e2dd8ea58ecfdfbf |
UTF-8 | ロ・ミ瞻・マ濘 | 111011111011111010011011111011111011110110100101111011111011111010010000111001111001111010111011111011111011110110100101111011111011111010001111111001101011111110011000 | efbe9befbda5efbe90e79ebbefbda5efbe8fe6bf98 |
UHC | ???瞻??? | 0011111100111111001111111111010011010111001111110011111100111111 | 3f3f3ff4d73f3f3f |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)