To which bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????u^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111010101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f755e
SJIS-WIN ???衡①?艶g????衡①?艶{》u^ 0011111100111111001111111000110101110100100001110100000000111111100010011001000010000010100001110011111100111111001111110011111110001101011101001000011101000000001111111000100110010000100000010110111110000001011101000111010101011110 3f3f3f8d7487403f899082873f3f3f3f8d7487403f8990816f8174755e
EUC-JP ???衡??艶g????衡??艶{》u^ 001111110011111100111111101110011101010100111111001111111011000111110000101000111110011100111111001111110011111100111111101110011101010100111111001111111011000111110000101000011101000010100001110101010111010101011110 3f3f3fb9d53f3fb1f0a3e73f3f3f3fb9d53f3fb1f0a1d0a1d5755e
UTF-8 怜붺윢衡①컮艶g왃怜붺윢衡①컮艶{》u^ 1110111110100110101011001110101110110110101110101110110010011100101000101110100010100001101000011110001010010001101000001110110010111011101011101110100010001001101101101110111110111101100001111110110010011001100000111110111110100110101011001110101110110110101110101110110010011100101000101110100010100001101000011110001010010001101000001110110010111011101011101110100010001001101101101110111110111101100110111110001110000000100010110111010101011110 efa6acebb6baec9ca2e8a1a1e291a0ecbbaee889b6efbd87ec9983efa6acebb6baec9ca2e8a1a1e291a0ecbbaee889b6efbd9be3808b755e
UHC 怜붺윢衡①컮艶g왃怜붺윢衡①컮艶{》u^ 1110011110110000100101001110011110011111101000111111101110101100101010001110011110110000100101001110011011111101101000111110011110011110101101101110011110110000100101001110011110011111101000111111101110101100101010001110011110110000100101001110011011111101101000111111101110100001101101110111010101011110 e7b094e79fa3fbaca8e7b094e6fda3e79eb6e7b094e79fa3fbaca8e7b094e6fda3fba1b7755e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
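
A conversion like the one in the table can be approximated with Python's built-in codecs. This is only a minimal sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names bit_string and encode_row. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_string(data: bytes) -> str:
    """Render each byte as 8 binary digits and concatenate them."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple:
    """Return (binary bit string, hexadecimal string) for text in codec.

    Unencodable characters are replaced with b"?" instead of raising.
    """
    data = text.encode(codec, errors="replace")
    return bit_string(data), data.hex()


if __name__ == "__main__":
    sample = "衡u^"  # hypothetical input, chosen only for illustration
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

The same character produces different byte sequences depending on the charset: "衡" is 8d74 in CP932, b9d5 in EUC-JP, and e8a1a1 in UTF-8, matching the corresponding bytes in the table rows.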