To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊?キ油??域??維??怨???λ?弛 001111110011111100111111111000101000011000111111100000110100110010010110111110110011111100111111100010001110011000111111001111111000100011011011001111110011111110001001100001010011111100111111001111111000001111001001001111111001001001101111 3f3f3fe2863f834c96fb3f3f88e63f3f88db3f3f89853f3f3f83c93f926f
EUC-JP ???竊?キ油??域??維??怨???λ?弛 001111110011111100111111111000111110011000111111101001011010110111001100111111010011111100111111101100001110100000111111001111111011000011011101001111110011111110110001111001010011111100111111001111111010011011001011001111111100001111010000 3f3f3fe3e63fa5adccfd3f3fb0e83f3fb0dd3f3fb1e53f3f3fa6cb3fc3d0
UTF-8 捻뀁뮆竊섋キ油삳눤域뱀룇維볡넭怨몄삌嶪λ톪弛 1110111110100110101001001110101110000000100000011110101110101110100001101110011110101011100010101110110010000100100010111110001110000010101011011110011010110010101110011110110010000010101100111110101110001000101001001110010110011111100111111110101110110001100000001110101110100011100001111110011110110110101011011110101110110011101000011110101110000100101011011110011010000000101010001110101110101010100001001110110010000010100011001110010110110110101010101100111010111011111011011000011010101010111001011011110010011011 efa6a4eb8081ebae86e7ab8aec848be382ade6b2b9ec82b3eb88a4e59f9febb180eba387e7b6adebb3a1eb84ade680a8ebaa84ec828ce5b6aacebbed86aae5bc9b
UHC 捻뀁뮆竊섋キ油삳눤域뱀룇維볡넭怨몄삌嶪λ톪弛 1110011011110111101100101110110010010010100101011110111110111100100110001110100010101011101011011110101011111010101110111110101110000111101110111110011010110100101110011110110010001111100001101110101110101011100100111110011110000110101011001110101010110011101110001110110010011000100100111110010111110101101001011110101110110111100000101110110010101100 e6f7b2ec9295efbc98e8abadeafabbeb87bbe6b4b9ec8f86ebab93e786aceab3b8ec9893e5f5a5ebb782ecac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)