To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???鉛???f?湲???←?????リ?^ 001111110011111100111111100010011001010000111111001111110011111110000010100001100011111110011111110100010011111100111111001111111000000110101001001111110011111100111111001111110011111110000011100010100011111101011110 3f3f3f89943f3f3f82863f9fd13f3f3f81a93f3f3f3f3f838a3f5e
EUC-JP ???鉛???f?湲???←?孼???リ?^ 0011111100111111001111111011000111110100001111110011111100111111101000111110011000111111110111101101001100111111001111110011111110100010101010110011111110001111101110101100001100111111001111110011111110100101111010100011111101011110 3f3f3fb1f43f3f3fa3e63fded33f3f3fa2ab3f8fbac33f3f3fa5ea3f5e
UTF-8 女앮젩鉛앮뤃溜f략湲됪옗廬←돑孼대젫曆リ툈^ 11101111101001101000000111101100100101011010111011101100101000001010100111101001100010011001101111101100100101011010111011101011101001001000001111101111101001111000101111101111101111011000011011101011100111101011010111100110101110011011001011101011100100001010101011101100100110001001011111101111101001101000001011100010100001101001000011101011100011111001000111100101101011011011110011101011100011001000000011101100101000001010101111101111101001101000101111100011100000111010101011101101100010001000100001011110 efa681ec95aeeca0a9e9899bec95aeeba483efa78befbd86eb9eb5e6b9b2eb90aaec9897efa682e28690eb8f91e5adbceb8c80eca0abefa68be383aaed88885e
UHC 女앮젩鉛앮뤃溜f략湲됪옗廬←돑孼대젫曆リ툈^ 11100101111111001001110111100110101000001010000111100110111001111001110111100110100011111011010011101010111111101010001111100110101101111010101111101010101110001000100111100110100111101001110111100101111111101010000111100111100010011001110111100101111011011011010011101011101000001010001111100110101101111010101111101010101110001000000101011110 e5fc9de6a0a1e6e79de68fb4eafea3e6b7abeab889e69e9de5fea1e7899de5edb4eba0a3e6b7abeab8815e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)