To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊?キ油??域??維??怨k??λ?應 00111111001111110011111111100010100001100011111110000011010011001001011011111011001111110011111110001000111001100011111100111111100010001101101100111111001111111000100110000101100000101000101100111111001111111000001111001001001111111001110011100100 3f3f3fe2863f834c96fb3f3f88e63f3f88db3f3f8985828b3f3f83c93f9ce4
EUC-JP ???竊?キ油??域??維??怨k??λ?應 00111111001111110011111111100011111001100011111110100101101011011100110011111101001111110011111110110000111010000011111100111111101100001101110100111111001111111011000111100101101000111110101100111111001111111010011011001011001111111101100011100110 3f3f3fe3e63fa5adccfd3f3fb0e83f3fb0dd3f3fb1e5a3eb3f3fa6cb3fd8e6
UTF-8 捻뀁뮆竊섋キ油삳눤域뱀룇維볡넭怨k쳟嶪λ뿩應 1110111110100110101001001110101110000000100000011110101110101110100001101110011110101011100010101110110010000100100010111110001110000010101011011110011010110010101110011110110010000010101100111110101110001000101001001110010110011111100111111110101110110001100000001110101110100011100001111110011110110110101011011110101110110011101000011110101110000100101011011110011010000000101010001110111110111101100010111110110010110011100111111110010110110110101010101100111010111011111010111011111110101001111001101000011110001001 efa6a4eb8081ebae86e7ab8aec848be382ade6b2b9ec82b3eb88a4e59f9febb180eba387e7b6adebb3a1eb84ade680a8efbd8becb39fe5b6aacebbebbfa9e68789
UHC 捻뀁뮆竊섋キ油삳눤域뱀룇維볡넭怨k쳟嶪λ뿩應 1110011011110111101100101110110010010010100101011110111110111100100110001110100010101011101011011110101011111010101110111110101110000111101110111110011010110100101110011110110010001111100001101110101110101011100100111110011110000110101011001110101010110011101000111110101110101011100001011110010111110101101001011110101110010111101010011110101111101011 e6f7b2ec9295efbc98e8abadeafabbeb87bbe6b4b9ec8f86ebab93e786aceab3a3ebab85e5f5a5eb97a9ebeb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)