To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 晤??有???↑??ヨ?宥????????^ 100111011110101100111111001111111001011101001100001111110011111100111111100000011010101000111111001111111000001110001000001111111001011101000111001111110011111100111111001111110011111100111111001111110011111101011110 9deb3f3f974c3f3f3f81aa3f3f83883f97473f3f3f3f3f3f3f3f5e
EUC-JP 晤??有???↑??ヨ?宥????????^ 110110101110110100111111001111111100110110101101001111110011111100111111101000101010110000111111001111111010010111101000001111111100110110101000001111110011111100111111001111110011111100111111001111110011111101011110 daed3f3fcdad3f3f3fa2ac3f3fa5e83fcda83f3f3f3f3f3f3f3f5e
UTF-8 晤대떩有껆뫆溜↑솦列ヨ씠宥삳븗烈⑸젩惡잍씟^ 11100110100110011010010011101011100011001000000011101011100101101010100111100110100111001000100111101010101110111000011011101011101010111000011011101111101001111000101111100010100001101001000111101100100001101010011011101111101001101001110011100011100000111010100011101100100101001010000011100101101011101010010111101100100000101011001111101011101110001001011111101111101001101001111111100010100100011011100011101100101000001010100111101111101001101011100111101100100111101000110111101100100101001001111101011110 e699a4eb8c80eb96a9e69c89eabb86ebab86efa78be28691ec86a6efa69ce383a8ec94a0e5aea5ec82b3ebb897efa69fe291b8eca0a9efa6b9ec9e8dec949f5e
UHC 晤대떩有껆뫆溜↑솦列ヨ씠宥삳븗烈⑸젩惡잍씟^ 11100111111110111011010011101011100010111011101111101010111100111000001111100111100100011010100111101010111111101010000111101000100110011001111111100110111010101010101111101000100111011011010011101010111010011011101111101011100101011000001111100110111011111010100111101011101000001010000111100111111101111001111111100110100111011011001101011110 e7fbb4eb8bbbeaf383e791a9eafea1e8999fe6eaabe89db4eae9bbeb9583e6efa9eba0a1e7f79fe69db35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)