To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????W}??????????W{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111010101110111110100111111001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 而?駿???遵?腫?W}而?駿???遵?腫?W{^ 100011101010011100111111100011110111100000111111001111110011111110001111100001010011111110001110111011100011111101010111011111011000111010100111001111111000111101111000001111110011111100111111100011111000010100111111100011101110111000111111010101110111101101011110 8ea73f8f783f3f3f8f853f8eee3f577d8ea73f8f783f3f3f8f853f8eee3f577b5e
EUC-JP 而?駿???遵?腫?W}而?駿???遵?腫?W{^ 101111001010100100111111101111011101100100111111001111110011111110111101111001010011111110111100111100000011111101010111011111011011110010101001001111111011110111011001001111110011111100111111101111011110010100111111101111001111000000111111010101110111101101011110 bca93fbdd93f3f3fbde53fbcf03f577dbca93fbdd93f3f3fbde53fbcf03f577b5e
UTF-8 而렲駿계렫렲遵랬腫렣W}而렲駿계렫렲遵랬腫렣W{^ 1110100010000000100011001110101110100000101100101110100110100111101111111110101010110011100001001110101110100000101010111110101110100000101100101110100110000001101101011110101110011110101011001110100010000101101010111110101110100000101000110101011101111101111010001000000010001100111010111010000010110010111010011010011110111111111010101011001110000100111010111010000010101011111010111010000010110010111010011000000110110101111010111001111010101100111010001000010110101011111010111010000010100011010101110111101101011110 e8808ceba0b2e9a7bfeab384eba0abeba0b2e981b5eb9eace885abeba0a3577de8808ceba0b2e9a7bfeab384eba0abeba0b2e981b5eb9eace885abeba0a3577b5e
UHC 而렲駿계렫렲遵랬腫렣W}而렲駿계렫렲遵랬腫렣W{^ 111011001011101110001110101111111111000111100111101100001110100010001110101110011000111010111111111100011110010110110111101010001111000011111110100011101011010001010111011111011110110010111011100011101011111111110001111001111011000011101000100011101011100110001110101111111111000111100101101101111010100011110000111111101000111010110100010101110111101101011110 ecbb8ebff1e7b0e88eb98ebff1e5b7a8f0fe8eb4577decbb8ebff1e7b0e88eb98ebff1e5b7a8f0fe8eb4577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)