To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?λ??裔??諭???k????揖??矣 0011111110000011110010010011111100111111111001011110000100111111001111111001011101000000001111110011111100111111100000101000101100111111001111110011111100111111100101110100101100111111001111111110000111100001 3f83c93f3fe5e13f3f97403f3f3f828b3f3f3f3f974b3f3fe1e1
EUC-JP ?λ??裔??諭???k????揖??矣 0011111110100110110010110011111100111111111010101110001100111111001111111100110110100001001111110011111100111111101000111110101100111111001111110011111100111111110011011010110000111111001111111110001011100011 3fa6cb3f3feae33f3fcda13f3f3fa3eb3f3f3f3fcdac3f3fe2e3
UTF-8 筽λ쓹흢裔됰뛽諭싧꼺類k샨閱묐벊揖ⓧ슭矣 1110011110101101101111011100111010111011111011001001001110111001111011011001110110100010111010001010001110010100111010111001000010110000111010111001101110111101111010001010101110101101111011001000101110100111111010101011110010111010111011111010011110010000111011111011110110001011111011001000001110101000111010011001011010110001111010111010110010010000111010111011001010001010111001101000111110010110111000101001001110100111111011001000101010101101111001111001111110100011 e7adbdcebbec93b9ed9da2e8a394eb90b0eb9bbde8abadec8ba7eabcbaefa790efbd8bec83a8e996b1ebac90ebb28ae68f96e293a7ec8aade79fa3
UHC 筽λ쓹흢裔됰뛽諭싧꼺類k샨閱묐벊揖ⓧ슭矣 11101000101001001010010111101011100111011001010111000101100000101110011111100000100010011110101110001101100000111110101110110001100110101110010110000100100100101110101110111010101000111110101110111100101000101110011011110011100100011110101110010011101011011110101111100111101010001110010010111101101111101110101111111000 e8a4a5eb9d95c582e7e089eb8d83ebb19ae58492ebbaa3ebbca2e6f391eb93adebe7a8e4bdbeebf8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)