To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?b?泣η?矣?㎝?b?泣η?矣?㎝B 0011111110000010100000100011111110001011100000111000001111000101001111111110000111100001001111111000011101110000001111111000001010000010001111111000101110000011100000111100010100111111111000011110000100111111100001110111000001000010 3f82823f8b8383c53fe1e13f87703f82823f8b8383c53fe1e13f877042
EUC-JP 渶b?泣η?矣??渶b?泣η?矣??B 10001111110001111110110110100011111000100011111110110101111000111010011011000111001111111110001011100011001111110011111110001111110001111110110110100011111000100011111110110101111000111010011011000111001111111110001011100011001111110011111101000010 8fc7eda3e23fb5e3a6c73fe2e33f3f8fc7eda3e23fb5e3a6c73fe2e33f3f42
UTF-8 渶b뫅泣η윯矣낅㎝渶b뫅泣η윯矣낅㎝B 1110011010111000101101101110111110111101100000101110101110101011100001011110011010110011101000111100111010110111111011001001110010101111111001111001111110100011111010111000001010000101111000111000111010011101111001101011100010110110111011111011110110000010111010111010101110000101111001101011001110100011110011101011011111101100100111001010111111100111100111111010001111101011100000101000010111100011100011101001110101000010 e6b8b6efbd82ebab85e6b3a3ceb7ec9cafe79fa3eb8285e38e9de6b8b6efbd82ebab85e6b3a3ceb7ec9cafe79fa3eb8285e38e9d42
UHC 渶b뫅泣η윯矣낅㎝渶b뫅泣η윯矣낅㎝B 11100111101101111010001111100010100100011010100011101011111010001010010111100111100111111010111011101011111110001000010111101011101001111010111111100111101101111010001111100010100100011010100011101011111010001010010111100111100111111010111011101011111110001000010111101011101001111010111101000010 e7b7a3e291a8ebe8a5e79faeebf885eba7afe7b7a3e291a8ebe8a5e79faeebf885eba7af42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)