To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 絲の残随Р褸視牡茯 1110011110110101101100101110001110000001101011101110011010101110100010111110100110011010100011111101000010100000111010001010010010111000111010001010011010010110111001111000100110100001111010001000110010101111 e7b5b2e381aee6ae8be99a8fd0a0e8a4b8e8a696e789a1e88caf
SJIS-WIN ?????????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ç??ã?®æ®?é????褸è¦?ç?¡è?¯ 1000111110101011101011100011111100111111100011111010101110101010001111111000111110100010111011101000111110101001110000011000111110100010111011100011111110001111101010111011000100111111001111110011111100111111100011111010101110110010100011111010001011110000100011111010001010110001100011111010101110110010100011111010001011000011001111111000111110101011101011100011111110001111101000101100001010001111101010111011001000111111100011111010001010110100 8fabae3f3f8fabaa3f8fa2ee8fa9c18fa2ee3f8fabb13f3f3f3f8fabb28fa2f08fa2b18fabb28fa2c33f8fabae3f8fa2c28fabb23f8fa2b4
UTF-8 絲の残随Р褸視牡茯 11000011101001111100001010110101110000101011001011000011101000111100001010000001110000101010111011000011101001101100001010101110110000101000101111000011101010011100001010011010110000101000111111000011100100001100001010100000110000111010100011000010101001001100001010111000110000111010100011000010101001101100001010010110110000111010011111000010100010011100001010100001110000111010100011000010100011001100001010101111 c3a7c2b5c2b2c3a3c281c2aec3a6c2aec28bc3a9c29ac28fc390c2a0c3a8c2a4c2b8c3a8c2a6c296c3a7c289c2a1c3a8c28cc2af
UHC ??²??®æ®????Ð??¤¸?????¡??? 00111111001111111010100111110111001111110011111110100010111001111010100110100001101000101110011100111111001111110011111100111111101010001010001000111111001111111010001010110100101000101010110000111111001111110011111100111111001111111010001010101110001111110011111100111111 3f3fa9f73f3fa2e7a9a1a2e73f3f3f3fa8a23f3fa2b4a2ac3f3f3f3f3fa2ae3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)