Into what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癲?????宥?????援?┸韋??語≪?^ 1110000110011111001111110011111100111111001111110011111110010111010001110011111100111111001111110011111100111111100010011000011100111111100001001011110111101000111010000011111100111111100011001110101010000001111000010011111101011110 e19f3f3f3f3f3f97473f3f3f3f3f89873f84bde8e83f3f8cea81e13f5e
EUC-JP 癲??馹??宥??饔??援?┸韋??語≪?^ 111000101010000100111111001111111000111111101001101000010011111100111111110011011010100000111111001111111000111111101000111011110011111100111111101100011110011100111111101010001011111111110000111010100011111100111111101110001110110010100010111000110011111101011110 e2a13f3f8fe9a13f3fcda83f3f8fe8ef3f3fb1e73fa8bff0ea3f3fb8eca2e33f5e
UTF-8 癲쒕쓷馹띈린宥밸눧饔끸뫗援쏉┸韋밴틓語≪풊^ 11100111100110011011001011101100100100101001010111101100100100111011011111101001101001101011100111101011100111011000100011101011101001101011000011100101101011101010010111101011101100001011100011101011100010001010011111101001101001011001010011101011100000011011100011101011101010111001011111100110100011111011010011101100100011111000100111100010100101001011100011101001100111111000101111101011101100001011010011101101100010111001001111101000101010101001111011100010100010011010101011101101100100101000101001011110 e799b2ec9295ec93b7e9a6b9eb9d88eba6b0e5aea5ebb0b8eb88a7e9a594eb81b8ebab97e68fb4ec8f89e294b8e99f8bebb0b4ed8b93e8aa9ee289aaed928a5e
UHC 癲쒕쓷馹띈린宥밸눧饔끸뫗援쏉┸韋밴틓語≪풊^ 11101111101001101001110011101011100111011001010011101100111100011011011011101000101110001011000011101010111010011011100111101011100001111011111011101000101111011000010111100010100100011011100111101010101101011001101111101111101001101011111111101010110111111011100111101010101110101000001011100101110111101010000111101100101111101001000001011110 efa69ceb9d94ecf1b6e8b8b0eae9b9eb87bee8bd85e291b9eab59befa6bfeadfb9eaba82e5dea1ecbe905e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
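
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The codec names are assumptions: `cp932` stands in for SJIS-Win (as the note above equates them), and `cp949` is assumed to correspond to UHC. The helper name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to match the behavior visible in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Windows code page 949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    rows = []
    for label, codec in CODECS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("あ"):
        print(label, bits, hexa)
```

Calling `encode_table` with a single character makes the differences between charsets easy to compare, since each row then holds only one to three bytes.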