To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 億?????怨??藥??誼?┸柔レ?狎 100010011010110100111111001111110011111100111111001111111000100110000101001111110011111111100101010110100011111100111111100010110110001000111111100001001011110110001111010111111000001110001100001111111110000010111110 89ad3f3f3f3f3f89853f3fe55a3f3f8b623f84bd8f5f838c3fe0be
EUC-JP 億?????怨??藥??誼?┸柔レ?狎 101100101010111100111111001111110011111100111111001111111011000111100101001111110011111111101001101110110011111100111111101101011100001100111111101010001011111110111101110000001010010111101100001111111110000011000000 b2af3f3f3f3f3fb1e53f3fe9bb3f3fb5c33fa8bfbdc0a5ec3fe0c0
UTF-8 億쏅맟利억쭓怨뺤젔藥띾씭誼숋┸柔レ젌狎 111001011000010010000100111011001000111110000101111010111010011110011111111011111010011110011101111011001001011010110101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010010100111010001001011110100101111010111001110110111110111011001001010010101101111010001010101010111100111011001000100010001011111000101001010010111000111001101001111110010100111000111000001110101100111011001010000010001100111001111000101110001110 e58484ec8f85eba79fefa79dec96b5ecad93e680a8ebbaa4eca094e897a5eb9dbeec94ade8aabcec888be294b8e69f94e383aceca08ce78b8e
UHC 億쏅맟利억쭓怨뺤젔藥띾씭誼숋┸柔レ젌狎 1110010111100010100110111110101110010000101011001110110010100110101111101110111110100111100010111110101010110011100101011110110010100000100100101110010110110111100011011110101110011101101111101110101111111110100110011110111110100110101111111110101011110101101010111110110010100000100011011110010011100100 e5e29beb90aceca6beefa78beab395eca092e5b78deb9dbeebfe99efa6bfeaf5abeca08de4e4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)