To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 醤ヤ磁鋐ホ鼾貍レ贖賞贅磁鋐ホ齊セロワ^ 100011111101110111010100100011101010010111111011110100101100111011101010100011011110011010111100110110101110011011011100100011111101110011100110110100101000111010100101111110111101001011001110111010101000111010111110110110111101110001011110 8fddd48ea5fbd2ceea8de6bcdae6dc8fdce6d28ea5fbd2ceea8ebedbdc5e
EUC-JP 醤ヤ磁鋐ホ鼾貍レ贖賞贅磁鋐ホ齊セロワ^ 101111101101111110001110110101001011110010100111100011111110010010111110100011101100111011110011111011011110110010111110100011101101101011101100110111101011111011011110111011001101010010111100101001111000111111100100101111101000111011001110111100111110111010001110101111101000111011011011100011101101110001011110 bedf8ed4bca78fe4be8ecef3edecbe8edaecdebedeecd4bca78fe4be8ecef3ee8ebe8edb8edc5e
UTF-8 醤ヤ磁鋐ホ鼾貍レ贖賞贅磁鋐ホ齊セロワ^ 11101001100001101010010011101111101111101001010011100111101000111000000111101001100010111001000011101111101111101000111011101001101111001011111011101000101100101000110111101111101111101001101011101000101101001001011011101000101100111001111011101000101101001000010111100111101000111000000111101001100010111001000011101111101111101000111011101001101111011000101011101111101111011011111011101111101111101001101111101111101111101001110001011110 e986a4efbe94e7a381e98b90efbe8ee9bcbee8b28defbe9ae8b496e8b39ee8b485e7a381e98b90efbe8ee9bd8aefbdbeefbe9befbe9c5e
UHC ??磁?????贖賞贅磁??齊???^ 00111111001111111110110110111000001111110011111100111111001111110011111111100001110110111101111111011011111101101010000111101101101110000011111100111111111100001011101000111111001111110011111101011110 3f3fedb83f3f3f3f3fe1dbdfdbf6a1edb83f3ff0ba3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)