To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 褥??肉??臾ч????燁???ε?喩??^ 1110010111110001001111110011111110010011111101110011111100111111111001000110101110000100100010010011111100111111001111110011111111111011010110010011111100111111001111111000001111000011001111111001101001100111001111110011111101011110 e5f13f3f93f73f3fe46b84893f3f3f3ffb593f3f3f83c33f9a673f3f5e
EUC-JP 褥?ł肉??臾ч????燁???ε?喩??^ 1110101011110011001111111000111110101001110010001100011011111001001111110011111111100111110011001010011111101001001111110011111100111111001111111000111111001010101100110011111100111111001111111010011011000101001111111101001111001000001111110011111101011110 eaf33f8fa9c8c6f93f3fe7cca7e93f3f3f3f8fcab33f3f3fa6c53fd3c83f3f5e
UTF-8 褥띕ł肉덄㎣臾ч낭硫⑸퓞燁㏓뀞杻εㄾ喩볥뼥^ 11101000101001001010010111101011100111011001010111000101100000101110100010000010100010011110101110001101100001001110001110001110101000111110100010000111101111101101000110000111111010111000001010101101111011111010011110001110111000101001000110111000111011011001001110011110111001111000011110000001111000111000111110010011111010111000000010011110111011111010011110001000110011101011010111100011100001001011111011100101100101101010100111101011101100111010010111101011101111001010010101011110 e8a4a5eb9d95c582e88289eb8d84e38ea3e887bed187eb82adefa78ee291b8ed939ee78781e38f93eb809eefa788ceb5e384bee596a9ebb3a5ebbca55e
UHC 褥띕ł肉덄㎣臾ч낭硫⑸퓞燁㏓뀞杻εㄾ喩볥뼥^ 11101001101100111011011011101011101010011010100111101011101111111000100011100111101001111010011111101011101011001010110011101001101100111011011011101011101010011010100111101011101111111000100011100111101001111010011111101011100001011001010111101010111101001010010111100101101001001010111011101010111001111001001111101011100101101010100001011110 e9b3b6eba9a9ebbf88e7a7a7ebacace9b3b6eba9a9ebbf88e7a7a7eb8595eaf4a5e5a4aeeae793eb96a85e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)