To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???惟??有??淫??懿??筍ル?疫??^ 0011111100111111001111111000100011010010001111110011111110010111010011000011111100111111100010001111101000111111001111111001110011110010001111110011111111100010101000011000001110001011001111111000100101110101001111110011111101011110 3f3f3f88d23f3f974c3f3f88fa3f3f9cf23f3fe2a1838b3f89753f3f5e
EUC-JP ???惟??有??淫??懿??筍ル?疫??^ 0011111100111111001111111011000011010100001111110011111111001101101011010011111100111111101100001111110000111111001111111101100011110100001111110011111111100100101000111010010111101011001111111011000111010110001111110011111101011110 3f3f3fb0d43f3fcdad3f3fb0fc3f3fd8f43f3fe4a3a5eb3fb1d63f3f5e
UTF-8 僚녹븮惟롢렖有잍묾淫들뒽懿쀫릮筍ル븶疫뀁꽪^ 11101111101001101011101111101011100001011011100111101011101110001010111011100110100000111001111111101011101000011010001011101011101000001001011011100110100111001000100111101100100111101000110111101011101011001011111011100110101101111010101111101011100100111010010011101011100100101011110111100110100001111011111111101100100000001010101111101011101001101010111011100111101011011000110111100011100000111010101111101011101110001011011011100111100101101010101111101011100000001000000111101010101111011010101001011110 efa6bbeb85b9ebb8aee6839feba1a2eba096e69c89ec9e8debacbee6b7abeb93a4eb92bde687bfec80abeba6aee7ad8de383abebb8b6e796abeb8081eabdaa5e
UHC 僚녹븮惟롢렖有잍묾淫들뒽懿쀫릮筍ル븶疫뀁꽪^ 11101000111010001011001111101100100101011001011111101010111011101000111011100011100011101010101111101010111100111001111111100110101110011011001011101011111000101011010111101001100010101011001111101011111100111001011111101011100100001000111011100010111011001010101111101011100101011001111111100110101110011011001011101100100001001011010101011110 e8e8b3ec9597eaee8ee38eabeaf39fe6b9b2ebe2b5e98ab3ebf397eb908ee2ecabeb959fe6b9b2ec84b55e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
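
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not this page's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def encode_row(text, codec):
    """Encode text with one codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("有^").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.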