What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 翁??毅?.宥???ル?愉??源??孃 100010011010010100111111001111111000101101000010001111111000000101000100100101110100011100111111001111110011111110000011100010110011111110010110111110010011111100111111100011001011100100111111001111111001101101101111 89a53f3f8b423f814497473f3f3f838b3f96f93f3f8cb93f3f9b6f
EUC-JP 翁??毅?.宥???ル?愉??源??孃 101100101010011100111111001111111011010110100011001111111010000110100101110011011010100000111111001111110011111110100101111010110011111111001100111110110011111100111111101110001011101100111111001111111101010111010000 b2a73f3fb5a33fa1a5cda83f3f3fa5eb3fccfb3f3fb8bb3f3fd5d0
UTF-8 翁띾끃毅싮.宥룸럡曆ル낌愉뗦를源놁쁼孃 111001111011111110000001111010111001110110111110111010111000000110000011111001101010111110000101111011001000101110101110111011111011110010001110111001011010111010100101111010111010001110111000111010111001111110100001111011111010011010001011111000111000001110101011111010111000001010001100111001101000010010001001111010111001011110100110111010111010010110111100111001101011101010010000111010111000011010000001111011001000000110111100111001011010110110000011 e7bf81eb9dbeeb8183e6af85ec8baeefbc8ee5aea5eba3b8eb9fa1efa68be383abeb828ce68489eb97a6eba5bce6ba90eb8681ec81bce5ad83
UHC 翁띾끃毅싮.宥룸럡曆ル낌愉뗦를源놁쁼孃 1110100010111010100011011110101110000101101110011110101111110110100110101110100110100011101011101110101011101001101101111110101110001110100001001110011010110111101010111110101110110011101001101110101011110000100010111110011010111000101001101110101010111001100001101110110010011000100000111110010110111110 e8ba8deb85b9ebf69ae9a3aeeae9b7eb8e84e6b7abebb3a6eaf08be6b8a6eab986ec9883e5be

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
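
A conversion like the one in the table can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page: the codec names on the right-hand side of CHARSETS are Python's own and are assumed to be close equivalents of the labels used here (in particular, "cp932" for SJIS-WIN and "cp949" for UHC), and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Map the table's charset labels to Python codec names.
# Assumption: these codecs are close equivalents of the page's charsets;
# edge-case mappings may differ between implementations.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("翁").items():
        print(label, bits, hexa)
```

For the single character 翁, this sketch yields 89a5 for SJIS-WIN, b2a7 for EUC-JP and e7bf81 for UTF-8, which agrees with the leading bytes of the corresponding rows in the table.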