To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??純??域??有∽?矣??塋?И 0011111100111111001111111000101110000011001111110011111110001111100000110011111100111111100010001110011000111111001111111001011101001100100000011110010000111111111000011110000100111111001111111001101011001000001111111000010001001001 3f3f3f8b833f3f8f833f3f88e63f3f974c81e43fe1e13f3f9ac83f8449
EUC-JP ???泣??純??域??有∽?矣??塋?И 0011111100111111001111111011010111100011001111110011111110111101111000110011111100111111101100001110100000111111001111111100110110101101101000101110011000111111111000101110001100111111001111111101010011001010001111111010011110101010 3f3f3fb5e33f3fbde33f3fb0e83f3fcdada2e63fe2e33f3fd4ca3fa7aa
UTF-8 捻꿔꺂泣볠궇純놁춸域뱀쉸有∽쫳矣곴틓塋딅И 1110111110100110101001001110101010111111100101001110101010111010100000101110011010110011101000111110101110110011101000001110101010110110100001111110011110110100100101001110101110000110100000011110110010110110101110001110010110011111100111111110101110110001100000001110110010001001101110001110011010011100100010011110001010001000101111011110110010101011101100111110011110011111101000111110101010110011101101001110110110001011100100111110010110100001100010111110101110010100100001011101000010011000 efa6a4eabf94eaba82e6b3a3ebb3a0eab687e7b494eb8681ecb6b8e59f9febb180ec89b8e69c89e288bdecabb3e79fa3eab3b4ed8b93e5a18beb9485d098
UHC 捻꿔꺂泣볠궇純놁춸域뱀쉸有∽쫳矣곴틓塋딅И 111001101111011110110010111000111000001110101011111010111110100010010011111001101000001010100000111000101110110110000110111011001010110110010100111001101011010010111001111011001001101010001110111010101111001110100001111011111010011010001011111010111111100010000001111010101011101010000010111001111010101110001010111010111010110010101010 e6f7b2e383abebe893e682a0e2ed86ecad94e6b4b9ec9a8eeaf3a1efa68bebf881eaba82e7ab8aebacaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)