To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??}Y??}bE | 001111110011111101111101010110010011111100111111011111010110001001000101 | 3f3f7d593f3f7d6245 |
SJIS-WIN | 順?}Y順?}bE | 1000111110000111001111110111110101011001100011111000011100111111011111010110001001000101 | 8f873f7d598f873f7d6245 |
EUC-JP | 順?}Y順?}bE | 1011110111100111001111110111110101011001101111011110011100111111011111010110001001000101 | bde73f7d59bde73f7d6245 |
UTF-8 | 順몒}Y順몒}bE | 1110100110100000100001101110101110101010100100100111110101011001111010011010000010000110111010111010101010010010011111010110001001000101 | e9a086ebaa927d59e9a086ebaa927d6245 |
UHC | 順몒}Y順몒}bE | 11100010111101111001000101111010011111010101100111100010111101111001000101111010011111010110001001000101 | e2f7917a7d59e2f7917a7d6245 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)