Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鬱警?聾??乙???鬱警?聾??乙???^ 1001111101010100100011000111100000111111100110000101011100111111001111111000100110110011001111110011111100111111100111110101010010001100011110000011111110011000010101110011111100111111100010011011001100111111001111110011111101011110 9f548c783f98573f3f89b33f3f3f9f548c783f98573f3f89b33f3f3f5e
EUC-JP 鬱警?聾??乙???鬱警?聾??乙???^ 1101110110110101101101111101100100111111110011111011100000111111001111111011001010110101001111110011111100111111110111011011010110110111110110010011111111001111101110000011111100111111101100101011010100111111001111110011111101011110 ddb5b7d93fcfb83f3fb2b53f3f3fddb5b7d93fcfb83f3fb2b53f3f3f5e
UTF-8 鬱警렭聾렦렎乙어렢룁鬱警렭聾렦렎乙어렢뢸^ 11101001101011001011000111101000101011011010011011101011101000001010110111101000100000011011111011101011101000001010011011101011101000001000111011100100101110011001100111101100100101101011010011101011101000001010001011101011101000111000000111101001101011001011000111101000101011011010011011101011101000001010110111101000100000011011111011101011101000001010011011101011101000001000111011100100101110011001100111101100100101101011010011101011101000001010001011101011101000101011100001011110 e9acb1e8ada6eba0ade881beeba0a6eba08ee4b999ec96b4eba0a2eba381e9acb1e8ada6eba0ade881beeba0a6eba08ee4b999ec96b4eba0a2eba2b85e
UHC 鬱警렭聾렦렎乙어렢룁鬱警렭聾렦렎乙어렢뢸^ 1110101010100110110011001110110110001110101110101101011011101100100011101011010110001110101001001110101111100000101111101110111010001110101100111011011111011110111010101010011011001100111011011000111010111010110101101110110010001110101101011000111010100100111010111110000010111110111011101000111010110011101101111101110001011110 eaa6cced8ebad6ec8eb58ea4ebe0beee8eb3b7deeaa6cced8ebad6ec8eb58ea4ebe0beee8eb3b7dc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
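
The table above can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation. It makes two assumptions: that the Python codecs `cp932` and `cp949` are close equivalents of SJIS-WIN and UHC, and that the converter substitutes `?` (0x3f) for characters a charset cannot represent, as the table rows suggest. The names `CHARSETS`, `encode_row`, and `conversion_table` are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the labels shown in the table to Python codec names.
# cp932 and cp949 are Python's nearest equivalents of SJIS-WIN and UHC;
# individual code points may differ from the original converter.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?',
    which is what errors="replace" does when encoding.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def conversion_table(text):
    """Build one (bits, hex) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in conversion_table("^").items():
        print(label, bits, hexa)
```

An ASCII character such as `^` encodes to the same single byte (5e) in all five charsets, which is why every row in the table ends in 5e.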