To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f
SJIS-WIN | 髮穂サ呵。キ霎キ | 111010011001101110010101111001001011101110011001111010001010000110110111111010001011111010110111 | e99b95e4bb99e8a1b7e8beb7
EUC-JP | 髮穂サ呵。キ霎キ | 11110001111110111100101011100110100011101011101111010010111010101000111010100001100011101011011111110000110000001000111010110111 | f1fbcae68ebbd2ea8ea18eb7f0c08eb7
UTF-8 | 髮穂サ呵。キ霎キ | 111010011010101110101110111001111010100110000010111011111011110110111011111001011001000110110101111011111011110110100001111011111011110110110111111010011001110010001110111011111011110110110111 | e9abaee7a982efbdbbe591b5efbda1efbdb7e99c8eefbdb7
UHC | 髮??呵???? | 11011011101001010011111100111111110010101010011100111111001111110011111100111111 | dba53f3fcaa73f3f3f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
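
The kind of conversion shown in the table above can be sketched in a few lines of Python. This is only an illustration, not the converter behind this page. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is the pattern visible in the ISO-8859-1 and UHC rows. Python's codecs may differ from other implementations for some characters.

```python
# Page label -> Python codec name. This mapping is an assumption made for
# the sketch; other converters may use slightly different conversion tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("あ").items():
        print(label, binary, hexadecimal)
```

Passing `errors="replace"` keeps the function from raising on unsupported characters, at the cost of losing the original character in that row.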