To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 沃??而?ぜ怨??n}沃??而?ぜ怨??n{^ 10010111100000000011111100111111100011101010011100111111100000101011101010001001100001010011111100111111011011100111110110010111100000000011111100111111100011101010011100111111100000101011101010001001100001010011111100111111011011100111101101011110 97803f3f8ea73f82ba89853f3f6e7d97803f3f8ea73f82ba89853f3f6e7b5e
EUC-JP 沃??而?ぜ怨??n}沃??而?ぜ怨??n{^ 11001101111000000011111100111111101111001010100100111111101001001011110010110001111001010011111100111111011011100111110111001101111000000011111100111111101111001010100100111111101001001011110010110001111001010011111100111111011011100111101101011110 cde03f3fbca93fa4bcb1e53f3f6e7dcde03f3fbca93fa4bcb1e53f3f6e7b5e
UTF-8 沃쇨낫而욆ぜ怨⑹젂n}沃쇨낫而욆ぜ怨⑹젂n{^ 1110011010110010100000111110110010000111101010001110101110000010101010111110100010000000100011001110110010011010100001101110001110000001100111001110011010000000101010001110001010010001101110011110110010100000100000100110111001111101111001101011001010000011111011001000011110101000111010111000001010101011111010001000000010001100111011001001101010000110111000111000000110011100111001101000000010101000111000101001000110111001111011001010000010000010011011100111101101011110 e6b283ec87a8eb82abe8808cec9a86e3819ce680a8e291b9eca0826e7de6b283ec87a8eb82abe8808cec9a86e3819ce680a8e291b9eca0826e7b5e
UHC 沃쇨낫而욆ぜ怨⑹젂n}沃쇨낫而욆ぜ怨⑹젂n{^ 1110100010101010101111001110101010110011101101001110110010111011100111101110100010101010101111001110101010110011101010011110110010100000100001100110111001111101111010001010101010111100111010101011001110110100111011001011101110011110111010001010101010111100111010101011001110101001111011001010000010000110011011100111101101011110 e8aabceab3b4ecbb9ee8aabceab3a9eca0866e7de8aabceab3b4ecbb9ee8aabceab3a9eca0866e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)