To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?[?l???U? | 001111110101101100111111011011000011111100111111001111110101010100111111 | 3f5b3f6c3f3f3f553f |
SJIS-WIN | ?[?l???U? | 001111110101101100111111011011000011111100111111001111110101010100111111 | 3f5b3f6c3f3f3f553f |
EUC-JP | ?[?l???U? | 001111110101101100111111011011000011111100111111001111110101010100111111 | 3f5b3f6c3f3f3f553f |
UTF-8 | 혡[챗l혗쨋혶U혲 | 111011011001100010100001010110111110110010110001100101110110110011101101100110001001011111101100101010001000101111101101100110001011011001010101111011011001100010110010 | ed98a15becb1976ced9897eca88bed98b655ed98b2 |
UHC | 혡[챗l혗쨋혶U혲 | 110000101000101001011011110000111010101001101100110000101000001011000010101101101100001010011101010101011100001010011001 | c28a5bc3aa6cc282c2b6c29d55c299 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)