What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 闖エ寀ヽ闖エ寀/[闖エ寀ヽ闖エ寀/[^ 11101000100011111011010011111010101001111000000101010010111010001000111110110100111110101010011110000001010111100101101111101000100011111011010011111010101001111000000101010010111010001000111110110100111110101010011110000001010111100101101101011110 e88fb4faa78152e88fb4faa7815e5be88fb4faa78152e88fb4faa7815e5b5e
EUC-JP 闖エ寀ヽ闖エ寀/[闖エ寀ヽ闖エ寀/[^ 111011111110111110001110101101001000111110111010110110111010000110110011111011111110111110001110101101001000111110111010110110111010000110111111010110111110111111101111100011101011010010001111101110101101101110100001101100111110111111101111100011101011010010001111101110101101101110100001101111110101101101011110 efef8eb48fbadba1b3efef8eb48fbadba1bf5befef8eb48fbadba1b3efef8eb48fbadba1bf5b5e
UTF-8 闖エ寀ヽ闖エ寀/[闖エ寀ヽ闖エ寀/[^ 111010011001011110010110111011111011110110110100111001011010111110000000111000111000001110111101111010011001011110010110111011111011110110110100111001011010111110000000111011111011110010001111010110111110100110010111100101101110111110111101101101001110010110101111100000001110001110000011101111011110100110010111100101101110111110111101101101001110010110101111100000001110111110111100100011110101101101011110 e99796efbdb4e5af80e383bde99796efbdb4e5af80efbc8f5be99796efbdb4e5af80e383bde99796efbdb4e5af80efbc8f5b5e
UHC 闖?寀?闖?寀/[闖?寀?闖?寀/[^ 1111011111100110001111111111001111110010001111111111011111100110001111111111001111110010101000111010111101011011111101111110011000111111111100111111001000111111111101111110011000111111111100111111001010100011101011110101101101011110 f7e63ff3f23ff7e63ff3f2a3af5bf7e63ff3f23ff7e63ff3f2a3af5b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
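
The binary and hexadecimal columns above can be approximated offline with a short script. The following is a minimal Python sketch, not the converter's own implementation. The codec names in CHARSETS are assumed to be Python's closest equivalents of the charsets listed (for example cp932 for SJIS-Win and cp949 for UHC), and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches what the ISO-8859-1 row shows. Some characters may still map differently than on this page.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# These are the closest standard-library codecs, not necessarily identical
# to the tables used by the converter.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset of CHARSETS, keyed by charset label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


# Example: the two ASCII characters that end the sample input above.
for name, (bits, hex_string) in encode_table("[^").items():
    print(name, bits, hex_string)
```

ASCII characters such as "[" and "^" produce the same bytes (5b 5e) in all five charsets, which is why every row of the table ends in 5b5e.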