To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?蝎?燿??溢??? 00111111111001011001100100111111111000001010000000111111001111111000100011101100001111110011111100111111 3fe5993fe0a03f3f88ec3f3f3f
EUC-JP 塼蝎?燿??溢?雩? 1000111110111000101110011110100111111001001111111110000010100010001111110011111110110000111011100011111110001111111001101111101000111111 8fb8b9e9f93fe0a23f3fb0ee3f8fe6fa3f
UTF-8 塼蝎섦燿띔혁溢렡雩뮈 111001011010000110111100111010001001110110001110111011001000010010100110111001111000011110111111111010111001110110010100111011011001100010000001111001101011101010100010111010111010000010100001111010011001101110101001111010111010111010001000 e5a1bce89d8eec84a6e787bfeb9d94ed9881e6baa2eba0a1e99ba9ebae88
UHC 塼蝎섦燿띔혁溢렡雩뮈 1110111011110100110010101110100110111100101101001110100011111100101101101110101011000111111101011110110011101110100011101011001011101001111011001011100110111111 eef4cae9bcb4e8fcb6eac7f5ecee8eb2e9ecb9bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)