To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 而?莊?而?醍???μ??蹙??????^ 100011101010011100111111111001001011010100111111100011101010011100111111100100011110011100111111001111110011111110000011110010100011111100111111111001110100010100111111001111110011111100111111001111110011111101011110 8ea73fe4b53f8ea73f91e73f3f3f83ca3f3fe7453f3f3f3f3f3f5e
EUC-JP 而?莊?而?醍???μ??蹙??????^ 101111001010100100111111111010001011011100111111101111001010100100111111110000101110100100111111001111110011111110100110110011000011111100111111111011011010011000111111001111110011111100111111001111110011111101011110 bca93fe8b73fbca93fc2e93f3f3fa6cc3f3feda63f3f3f3f3f3f5e
UTF-8 而렲莊렱而렲醍렕뤗횓μ퍗우蹙얠퍗욹탮₃렍^ 111010001000000010001100111010111010000010110010111010001000111010001010111010111010000010110001111010001000000010001100111010111010000010110010111010011000011010001101111010111010000010010101111010111010010010010111111011011001101010010011110011101011110011101101100011011001011111101100100110101011000011101000101110011001100111101100100101101010000011101101100011011001011111101100100110101011100111101101100000111010111011100010100000101000001111101011101000001000110101011110 e8808ceba0b2e88e8aeba0b1e8808ceba0b2e9868deba095eba497ed9a93cebced8d97ec9ab0e8b999ec96a0ed8d97ec9ab9ed83aee28283eba08d5e
UHC 而렲莊렱而렲醍렕뤗횓μ퍗우蹙얠퍗욹탮₃렍^ 1110110010111011100011101011111111101101111101101000111010111110111011001011101110001110101111111111000010110101100011101010101010001111110001111100001110001110101001011110110010111011100011101011111111101100111101011110110010111110111011001011101110001110101111111111000010110101100011101010100111111101100011101010001101011110 ecbb8ebfedf68ebeecbb8ebff0b58eaa8fc7c38ea5ecbb8ebfecf5ecbeecbb8ebff0b58ea9fd8ea35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)