To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??Vh??V | 00111111001111110101011001101000001111110011111101010110 | 3f3f56683f3f56 |
SJIS-WIN | ツ嘶Vhツ嘶V | 110000101001101001111100010101100110100011000010100110100111110001010110 | c29a7c5668c29a7c56 |
EUC-JP | ツ嘶Vhツ嘶V | 1000111011000010110100111101110101010110011010001000111011000010110100111101110101010110 | 8ec2d3dd56688ec2d3dd56 |
UTF-8 | ツ嘶Vhツ嘶V | 111011111011111010000010111001011001100010110110010101100110100011101111101111101000001011100101100110001011011001010110 | efbe82e598b65668efbe82e598b656 |
UHC | ?嘶Vh?嘶V | 001111111110001110110110010101100110100000111111111000111011011001010110 | 3fe3b656683fe3b656 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)