To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8諭ゅ???エ碍?―愿??猷??鴉?‥逸 11100001100111110011111110000010010101111001011101000000100000101110001100111111001111110011111110000011010001111000101001010110001111111000000101011100100111001100001100111111001111111001011101010001001111110011111111101001111010110011111110000001011001001000100011101101 e19f3f8257974082e33f3f3f83478a563f815c9cc33f3f97513f3fe9eb3f816488ed
EUC-JP 癲?8諭ゅ???エ碍?―愿??猷??鴉?‥逸 11100010101000010011111110100011101110001100110110100001101001001110010100111111001111110011111110100101101010001011001110110111001111111010000110111101110110001100010100111111001111111100110110110010001111110011111111110010111011010011111110100001110001011011000011101111 e2a13fa3b8cda1a4e53f3f3fa5a8b3b73fa1bdd8c53f3fcdb23f3ff2ed3fa1c5b0ef
UTF-8 癲쒕8諭ゅ쳞戮녹エ碍⑸―愿뚳쬅猷붽뭬鴉딅‥逸 111001111001100110110010111011001001001010010101111011111011110010011000111010001010101110101101111000111000001010000101111011001011001110011110111011111010011110010010111010111000010110111001111000111000001010101000111001111010001010001101111000101001000110111000111000101000000010010101111001101000010010111111111010111001101010110011111011001010110010000101111001111000110010110111111010111011011010111101111010111010110110101100111010011011010010001001111010111001010010000101111000101000000010100101111010011000000010111000 e799b2ec9295efbc98e8abade38285ecb39eefa792eb85b9e382a8e7a28de291b8e28095e684bfeb9ab3ecac85e78cb7ebb6bdebadace9b489eb9485e280a5e980b8
UHC 癲쒕8諭ゅ쳞戮녹エ碍⑸―愿뚳쬅猷붽뭬鴉딅‥逸 1110111110100110100111001110101110100011101110001110101110110001101010101110010110101011100001001110101110111101101100111110110010101011101010001110010011110100101010011110101110100001101010101110101010110100100011001110111110100110100111001110101110100011100101001110101010111001101111101110010010111100100010101110101110100001101001011110110011101111 efa69ceba3b8ebb1aae5ab84ebbdb3ecaba8e4f4a9eba1aaeab48cefa69ceba394eab9bee4bc8aeba1a5ecef

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)