To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 憶?∥誼?┸轅??憶?∥誼?┸轅??^ 1000100110101111001111111000000101100001100010110110001000111111100001001011110111100111011101100011111100111111100010011010111100111111100000010110000110001011011000100011111110000100101111011110011101110110001111110011111101011110 89af3f81618b623f84bde7763f3f89af3f81618b623f84bde7763f3f5e
EUC-JP 憶?‖誼?┸轅??憶?‖誼?┸轅??^ 1011001010110001001111111010000111000010101101011100001100111111101010001011111111101101110101110011111100111111101100101011000100111111101000011100001010110101110000110011111110101000101111111110110111010111001111110011111101011110 b2b13fa1c2b5c33fa8bfedd73f3fb2b13fa1c2b5c33fa8bfedd73f3f5e
UTF-8 憶귣∥誼억┸轅롫씩憶귣∥誼억┸轅롫퓞^ 11100110100001101011011011101010101101111010001111100010100010001010010111101000101010101011110011101100100101101011010111100010100101001011100011101000101111011000010111101011101000011010101111101100100101001010100111100110100001101011011011101010101101111010001111100010100010001010010111101000101010101011110011101100100101101011010111100010100101001011100011101000101111011000010111101011101000011010101111101101100100111001111001011110 e686b6eab7a3e288a5e8aabcec96b5e294b8e8bd85eba1abec94a9e686b6eab7a3e288a5e8aabcec96b5e294b8e8bd85eba1abed939e5e
UHC 憶귣∥誼억┸轅롫씩憶귣∥誼억┸轅롫퓞^ 11100101111000111000001011101011101000011010101111101011111111101011111011101111101001101011111111101010101111111000111011101011101111101011111111100101111000111000001011101011101000011010101111101011111111101011111011101111101001101011111111101010101111111000111011101011101111111000100001011110 e5e382eba1abebfebeefa6bfeabf8eebbebfe5e382eba1abebfebeefa6bfeabf8eebbf885e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)