To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???逾??矣??筌??悠??惟??癲??殉? 001111110011111100111111111001111010010100111111001111111110000111100001001111110011111111100010101000110011111100111111100101110100100100111111001111111000100011010010001111110011111111100001100111110011111100111111100011110111110100111111 3f3f3fe7a53f3fe1e13f3fe2a33f3f97493f3f88d23f3fe19f3f3f8f7d3f
EUC-JP ???逾??矣??筌??悠??惟??癲??殉? 001111110011111100111111111011101010011100111111001111111110001011100011001111110011111111100100101001010011111100111111110011011010101000111111001111111011000011010100001111110011111111100010101000010011111100111111101111011101111000111111 3f3f3feea73f3fe2e33f3fe4a53f3fcdaa3f3fb0d43f3fe2a13f3fbdde3f
UTF-8 麗몃쓹逾껅룚矣쒕눧筌롮뇴悠끾뵺惟깅뤍癲숈옓殉딟 111011111010011010001000111010111010101010000011111011001001001110111001111010011000000010111110111010101011101110000101111010111010001110011010111001111001111110100011111011001001001010010101111010111000100010100111111001111010110110001100111010111010000110101110111010111000011110110100111001101000001010100000111010111000000110111110111010111011010110111010111001101000001110011111111010101011100110000101111010111010010010001101111001111001100110110010111011001000100010001000111011001001100010010011111001101010111010001001111010111001010010011111 efa688ebaa83ec93b9e980beeabb85eba39ae79fa3ec9295eb88a7e7ad8ceba1aeeb87b4e682a0eb81beebb5bae6839feab985eba48de799b2ec8888ec9893e6ae89eb949f
UHC 麗몃쓹逾껅룚矣쒕눧筌롮뇴悠끾뵺惟깅뤍癲숈옓殉딟 11100110101100001011100011101011100111011001010111101011101101011000001111100110100011111001011011101011111110001001110011101011100001111011111011101111101001111000111011101100100001111001100011101010111011011000010111100110100101001011100011101010111011101011000111101011100011111011110111101111101001101001100111101100100111101001100111100010111001101000101101000010 e6b0b8eb9d95ebb583e68f96ebf89ceb87beefa78eec8798eaed85e694b8eaeeb1eb8fbdefa699ec9e99e2e68b42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)