Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????鴉??^ 0011111100111111001111110011111100111111001111111110100111101011001111110011111101011110 3f3f3f3f3f3fe9eb3f3f5e
EUC-JP ??????鴉??^ 0011111100111111001111110011111100111111001111111111001011101101001111110011111101011110 3f3f3f3f3f3ff2ed3f3f5e
UTF-8 了욂펹歷잓돒鴉됪툨^ 11101111101001101011101011101100100110101000001011101101100011101011100111101111101001101000110011101100100111101001001111101011100011111001001011101001101101001000100111101011100100001010101011101101100010001010100001011110 efa6baec9a82ed8eb9efa68cec9e93eb8f92e9b489eb90aaed88a85e
UHC 了욂펹歷잓돒鴉됪툨^ 11101000111001111001111011100100101111001000100111100110101110001001111111101001100010011001111011100100101111001000100111100110101110001001111101011110 e8e79ee4bc89e6b89fe9899ee4bc89e6b89f5e

SJIS-Win, EUC-JP: legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
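
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not this page's actual implementation. The function name `encode_row`, the `CODECS` mapping from the table's labels to Python codec names (in particular SJIS-WIN to `cp932` and UHC to `cp949`), and the choice to replace unmappable characters with "?" are all assumptions made for illustration.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    # Decoding the bytes again shows what actually survived the conversion.
    return raw.decode(codec), bits, raw.hex()


if __name__ == "__main__":
    sample = "鴉^"
    for label, codec in CODECS.items():
        shown, bits, hexed = encode_row(sample, codec)
        print(label, shown, bits, hexed)
```

With this sketch, a character that a charset cannot represent shows up as "?" with the byte 3f, which matches the pattern in the ISO-8859-1 row above. Results for other converters may differ where their mapping tables differ from Python's.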