What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲??宜??循???ル?魏??諛?????B 11100001100111110011111100111111100010110101100000111111001111111000111101111010001111110011111100111111100000111000101100111111111010011011000000111111001111111110011010000111001111110011111100111111001111110011111101000010 e19f3f3f8b583f3f8f7a3f3f3f838b3fe9b03f3fe6873f3f3f3f3f42
EUC-JP 癲??宜??循???ル?魏??諛?????B 11100010101000010011111100111111101101011011100100111111001111111011110111011011001111110011111100111111101001011110101100111111111100101011001000111111001111111110101111100111001111110011111100111111001111110011111101000010 e2a13f3fb5b93f3fbddb3f3f3fa5eb3ff2b23f3febe73f3f3f3f3f42
UTF-8 癲덈챶宜방쨫循됰젧曆ル봿魏멨넇諛댄뜑燎룹뼢B 11100111100110011011001011101011100011011000100011101100101100011011011011100101101011101001110011101011101100001010100111101100101010001010101111100101101111101010101011101011100100001011000011101100101000001010011111101111101001101000101111100011100000111010101111101011101101001011111111101001101011011000111111101011101010011010100011101011100001001000011111101000101010111001101111101011100011001000010011101011100111001001000111101111101001111000000011101011101000111011100111101011101111001010001001000010 e799b2eb8d88ecb1b6e5ae9cebb0a9eca8abe5beaaeb90b0eca0a7efa68be383abebb4bfe9ad8feba9a8eb8487e8ab9beb8c84eb9c91efa780eba3b9ebbca242
UHC 癲덈챶宜방쨫循됰젧曆ル봿魏멨넇諛댄뜑燎룹뼢B 11101111101001101000100011101011101010101000001111101011111100011011100111100110101001001000010111100010111000001000100111101011101000001001111111100110101101111010101111101011100101001000011011101010111000001011100011100101100001101001011111101011101100001011010011101101100011011001010011101000111110111011011111101100100101101010010101000010 efa688ebaa83ebf1b9e6a485e2e089eba09fe6b7abeb9486eae0b8e58697ebb0b4ed8d94e8fbb7ec96a542

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
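
A conversion like the one in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page. The function name encode_bits and the codec mapping are assumptions for illustration: cp932 stands in for SJIS-WIN (as the note above says), cp949 is assumed to correspond to UHC, and errors="replace" is assumed to match the way unencodable characters appear as "?" (0x3f) in the table. Python's codec tables may differ from this page's in edge cases.

```python
# Map the charset labels used in the table to Python codec names.
# The cp949 <-> UHC pairing is an assumption made for this sketch.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "B"  # any single character or short string
    print("Charset", "Character", "Binary", "Hex")
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the ASCII letter "B", every charset listed yields the same single byte, 01000010 (hex 42), which matches the last byte of each row in the table. Non-ASCII input produces different byte sequences per charset, or "?" where the charset has no mapping for the character.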