What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘖??揖??瑜?????泣??擬?????泣 10011111010100000011111100111111100101110100101100111111001111111110000011101111001111110011111100111111001111110011111110001011100000110011111100111111100010110101101100111111001111110011111100111111001111111000101110000011 9f503f3f974b3f3fe0ef3f3f3f3f3f8b833f3f8b5b3f3f3f3f3f8b83
EUC-JP 蘖??揖??瑜?????泣??擬?????泣 11011101101100010011111100111111110011011010110000111111001111111110000011110001001111110011111100111111001111110011111110110101111000110011111100111111101101011011110000111111001111110011111100111111001111111011010111100011 ddb13f3fcdac3f3fe0f13f3f3f3f3fb5e33f3fb5bc3f3f3f3f3fb5e3
UTF-8 蘖뽰눦揖썲쮦瑜낆돪嶪용뵃泣곫콢擬듭뒳嶪용뜉泣 111010001001100010010110111010111011110110110000111010111000100010100110111001101000111110010110111011001000110110110010111011001010111010100110111001111001000110011100111010111000001010000110111010111000111110101010111001011011011010101010111011001001101010101001111010111011010110000011111001101011001110100011111010101011001110101011111011001011110110100010111001101001001110101100111010111001001110101101111010111001001010110011111001011011011010101010111011001001101010101001111010111001110010001001111001101011001110100011 e89896ebbdb0eb88a6e68f96ec8db2ecaea6e7919ceb8286eb8faae5b6aaec9aa9ebb583e6b3a3eab3abecbda2e693aceb93adeb92b3e5b6aaec9aa9eb9c89e6b3a3
UHC 蘖뽰눦揖썲쮦瑜낆돪嶪용뵃泣곫콢擬듭뒳嶪용뜉泣 1110010111101110100101101110110010000111101111011110101111100111101111011110010110101000100000111110101110100101100001011110110010001001101011011110010111110101101111111110101110010100100010011110101111101000100000011110011010110001100110101110101111110100101101011110110010001010101011001110010111110101101111111110101110001101100011001110101111101000 e5ee96ec87bdebe7bde5a883eba585ec89ade5f5bfeb9489ebe881e6b19aebf4b5ec8aace5f5bfeb8d8cebe8

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
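
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation: the names CHARSETS, encode_row and convert are hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 0x3f), which appears to be what produces the runs of 3f in the table above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: SJIS-WIN corresponds to cp932 and UHC to cp949.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset for a single sample character.
    for name, (bits, hex_string) in convert("蘖").items():
        print(name, bits, hex_string)
```

Each byte is printed as exactly eight bits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.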