To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 汚ц?沃よ???お[汚ц?沃よ???お[^ 10001001100110001000010010001000001111111001011110000000100000101110011000111111001111110011111110000010101010000101101110001001100110001000010010001000001111111001011110000000100000101110011000111111001111110011111110000010101010000101101101011110 899884883f978082e63f3f3f82a85b899884883f978082e63f3f3f82a85b5e
EUC-JP 汚ц?沃よ???お[汚ц?沃よ???お[^ 10110001111110001010011111101000001111111100110111100000101001001110100000111111001111110011111110100100101010100101101110110001111110001010011111101000001111111100110111100000101001001110100000111111001111110011111110100100101010100101101101011110 b1f8a7e83fcde0a4e83f3f3fa4aa5bb1f8a7e83fcde0a4e83f3f3fa4aa5b5e
UTF-8 汚ц댒沃よ윭隸배お[汚ц댒沃よ윭隸배お[^ 11100110101100011001101011010001100001101110101110001100100100101110011010110010100000111110001110000010100010001110110010011100101011011110111110100110101110001110101110110000101100001110001110000001100010100101101111100110101100011001101011010001100001101110101110001100100100101110011010110010100000111110001110000010100010001110110010011100101011011110111110100110101110001110101110110000101100001110001110000001100010100101101101011110 e6b19ad186eb8c92e6b283e38288ec9cadefa6b8ebb0b0e3818a5be6b19ad186eb8c92e6b283e38288ec9cadefa6b8ebb0b0e3818a5b5e
UHC 汚ц댒沃よ윭隸배お[汚ц댒沃よ윭隸배お[^ 111001111111110110101100111010001000100010111001111010001010101010101010111010001001111110101100111001111110011010111001111010001010101010101010010110111110011111111101101011001110100010001000101110011110100010101010101010101110100010011111101011001110011111100110101110011110100010101010101010100101101101011110 e7fdace888b9e8aaaae89face7e6b9e8aaaa5be7fdace888b9e8aaaae89face7e6b9e8aaaa5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)