To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?⑤?昻??洵℡???????鴉??昻??^ 00111111100001110100010000111111111110101101000000111111001111111001111110101011100001111000010000111111001111110011111100111111001111110011111100111111111010011110101100111111001111111111101011010000001111110011111101011110 3f87443ffad03f3f9fab87843f3f3f3f3f3f3fe9eb3f3ffad03f3f5e
EUC-JP ??????洵????????鴉?????^ 001111110011111100111111001111110011111100111111110111101010110100111111001111110011111100111111001111110011111100111111001111111111001011101101001111110011111100111111001111110011111101011110 3f3f3f3f3f3fdead3f3f3f3f3f3f3f3ff2ed3f3f3f3f3f5e
UTF-8 曆⑤젨昻뽨펯洵℡떀令뜻랬溜졾떀鴉곥돚昻뽫㈅^ 11101111101001101000101111100010100100011010010011101100101000001010100011100110100110001011101111101011101111011010100011101101100011101010111111100110101101001011010111100010100001001010000111101011100101101000000011101111101001101010100011101011100111001011101111101011100111101010110011101111101001111000101111101100101000011011111011101011100101101000000011101001101101001000100111101010101100111010010111101011100011111001101011100110100110001011101111101011101111011010101111100011100010001000010101011110 efa68be291a4eca0a8e698bbebbda8ed8eafe6b4b5e284a1eb9680efa6a8eb9cbbeb9eacefa78beca1beeb9680e9b489eab3a5eb8f9ae698bbebbdabe388855e
UHC 曆⑤젨昻뽨펯洵℡떀令뜻랬溜졾떀鴉곥돚昻뽫㈅^ 11100110101101111010100011101011101000001010000011100100111010011001011011100100101111001000000111100010111001111010001011100101100010111001011011100111101010011011011011100110101101111010100011101010111111101010000011100101100010111001011011100100101111001000000111100011100010011010001011100100111010011001011011100111101010011011011001011110 e6b7a8eba0a0e4e996e4bc81e2e7a2e58b96e7a9b6e6b7a8eafea0e58b96e4bc81e389a2e4e996e7a9b65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)