To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 悟??唯??亦???????????音??^ 1000110011100101001111110011111110010111010000100011111100111111100101101001001000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000100110111001001111110011111101011110 8ce53f3f97423f3f96923f3f3f3f3f3f3f3f3f3f3f89b93f3f5e
EUC-JP 悟??唯??亦??嫄??洧??絪??音??^ 1011100011100111001111110011111111001101101000110011111100111111110010111111001000111111001111111000111110111010101000010011111100111111100011111100011110110100001111110011111110001111110100111110110000111111001111111011001010111011001111110011111101011110 b8e73f3fcda33f3fcbf23f3f8fbaa13f3f8fc7b43f3f8fd3ec3f3fb2bb3f3f5e
UTF-8 悟귣씈唯뽱걫亦껋눘嫄숃쁻洧좊닑絪붹쾬音깃콡^ 11100110100000101001111111101010101101111010001111101100100101001000100011100101100101001010111111101011101111011011000111101010101100011010101111100100101110101010011011101010101110111000101111101011100010001001100011100101101010111000010011101100100010001000001111101100100000011011101111100110101101001010011111101100101000101000101011101011100010111001000111100111101101011010101011101011101101101011100111101100101111101010110011101001100111111011001111101010101110011000001111101100101111011010000101011110 e6829feab7a3ec9488e594afebbdb1eab1abe4baa6eabb8beb8898e5ab84ec8883ec81bbe6b4a7eca28aeb8b91e7b5aaebb6b9ecbeace99fb3eab983ecbda15e
UHC 悟귣씈唯뽱걫亦껋눘嫄숃쁻洧좊닑絪붹쾬音깃콡^ 11100111111101101000001011101011100111011010000011101010111001101001011011101101100000011001010011100110101100101000001111101100100001111011000111101010101100011001100111101000100110001000001011101010111110111010000011101011100010001001011011101100110111111001010011100110101100101000001111101011111001011011000111101010101100011001100101011110 e7f682eb9da0eae696ed8194e6b283ec87b1eab199e89882eafba0eb8896ecdf94e6b283ebe5b1eab1995e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)